How Data Availability and Cost Relate to AI Differentiation?

When someone pitches me an #ai/#MachineLearning idea, I always (also) ask about #data availability, data cost, and how they relate to their product differentiation and #aitechnology. Here’s how I see them, roughly speaking. #strategy #AIstrategy #AIeconomics pic.twitter.com/v6yb8JOHwi
— ivanjureta (@ivanjureta) February 19, 2018
If any text can be training data for a Large Language Model, then any text is a training dataset that can be valued through a market for training data. Which datasets have high value? Wikipedia, StackOverflow, Reddit, Quora are examples that have value for different reasons, that is, because they can be used to train…
Is it one that led to the best outcome? Or one that integrates all the relevant and available information? Maybe one that is liked by a majority? If decision governance is followed to the letter, will that guarantee a high quality decision? The quality of a decision depends on the following: The reason a decision…
If AI is made for profit, then should its design be confidential? This choice is part of AI product strategy. The decision on this depends on the following at least. What is the relationship of each of these to AI confidentiality? Correctness: The more likely the AI / algorithm is to make errors, the more…
The short answer is “No”, and the reasons for it are interesting. An AI system is opaque if it is impossible or costly for it (or people auditing it) to explain why it gave some specific outputs. Opacity is undesirable in general – see my note here. So this question applies for both those outputs…
Just like l’art pour l’art, or art for the sake of art was the bohemian creed in the 19th century, it looks like there’s an “AI for the sake of AI” creed now when building general-purpose AI systems based on Large Language Models. Let’s say that the aim for a sustainable business are happy, paying,…
Opacity, complexity, bias, and unpredictability are key negative nonfunctional requirements to address when designing AI systems. Negative means that if you have a design that reduces opacity, for example, relative to another design, the former is preferred, all else being equal. The first thing is to understand what each term refers to in general, that…