Which Problems Is It Hard to Design AI for?

The less data there is, or the lower the quality of the available data, the more difficult it is to build AI based on statistical learning. For data-scarce domains, the only way to design AI is to elicit knowledge from experts, design rules that represent that knowledge, and parameterize those rules so that they apply to more cases.

The more data there is to train an AI, and the higher the quality of that data, the more expensive AI based on expert rules becomes to design relative to AI based on statistical learning. An extreme example: suppose someone wanted to build an AI system comparable to any version of ChatGPT, had no access to large-scale crawled Internet data, and instead had to elicit information from people. This is impossible, as it would mean eliciting all the information that has accumulated online over the last few decades. Put another way, many of the currently interesting AI systems available to consumers, and those based on Large Language Models in particular, depend heavily on Internet content, and on that content being cheap to use for training.

This reliance on Internet content, and on its scale in particular, has an important implication: in most knowledge or problem domains where content from which to derive patterns is scarce, AI systems will not perform at a level of sophistication comparable to that of systems trained on general purpose Internet content.

A few hypotheses, then, about the enterprise AI market, or the market for AI systems trained on enterprise data:

Adoption of enterprise AI will be lower than the adoption of general purpose AI: the percentage of staff in an organization who use that organization’s enterprise AI is likely to be lower than the percentage of people using general purpose AI systems built on Internet data.
Lower adoption of enterprise AI will lead to a lower impact of AI on headcount, in particular for jobs that involve making impactful decisions.
General purpose AI, such as ChatGPT and similar systems, will have more impact on headcount than enterprise AI trained on enterprise data and content. The mechanism for that impact will be the application of general purpose AI to repetitive information management tasks, in particular tasks involving search and synthesis of information that is not specific to the given organization.

I look forward to being proven wrong about these, as that is the more interesting outcome.
