Valuation of an AI Training Dataset
|

Valuation of an AI Training Dataset

If there is a market for AI training datasets, then the price will be determined by supply and demand. How does the supplier set the price, and how does the buyer evaluate if the price is right? The question behind both of these is this: how to estimate the value of a training dataset? We…

AI Growth through Expert Communities
| | |

AI Growth through Expert Communities

In the creator economy, the creative individual sells content. The more attention the content captures, the more valuable it is. The incentive for the creator is status and payment for consumption of their content. Distribution channels are Internet platforms, where content is delivered as intended by the author, the platform does not transform it (other…

What Does a Training Data Market Mean for Authors?
| |

What Does a Training Data Market Mean for Authors?

If any text can be training data for a Large Language Model, then any text is a training dataset that can be valued through a market for training data.  Which datasets have high value? Wikipedia, StackOverflow, Reddit, Quora are examples that have value for different reasons, that is, because they can be used to train…

Preconditions for a Market for High Quality AI Training Data
| |

Preconditions for a Market for High Quality AI Training Data

There is no high quality AI without high quality training data. A large language model (LLM) AI system, for example, may seem to deliver accurate and relevant information, but verifying that may be very hard – hence the effort into explainable AI, among others.  If I wanted accurate and relevant legal advice, how much risk…

AI Compliance at Scale via Embedded Data Governance
| |

AI Compliance at Scale via Embedded Data Governance

There are, roughly speaking, three problems to solve for an Artificial Intelligence system to comply with AI regulations in China (see the note here) and likely future regulation in the USA (see the notes on the Algorithmic Accountability Act, starting here):  Using available, large-scale crawled web/Internet data is a low-cost (it’s all relative) approach to…

Can an Artificial Intelligence Trained on Large-Scale Crawled Web Data Comply with the Algorithmic Accountability Act?
| | | |

Can an Artificial Intelligence Trained on Large-Scale Crawled Web Data Comply with the Algorithmic Accountability Act?

If an artificial intelligence system is trained on large-scale crawled web/Internet data, can it comply with the Algorithmic Accountability Act?  For the sake of discussion, I assume below that (1) the Act is passed, which it is not at the time of writing, and (2) the Act applies to the system (for more on applicability,…

Does the EU AI Act apply to most software?
| | |

Does the EU AI Act apply to most software?

Does the EU AI Act apply to most, if not all software? It is probably not what was intended, but it may well be the case.  The EU AI Act, here, applies to “artificial intelligence systems” (AI system), and defines AI systems as follows: ‘artificial intelligence system’ (AI system) means software that is developed with…

What is AI Governance for?
|

What is AI Governance for?

If an AI is not predictable by design, then the purpose of governing it is to ensure that it gives the right answers (actions) most of the time, and that when it fails, the consequences are negligible, or that it can only fail on inconsequential questions, goals, or tasks.

| |

Decreasing the Odds of Misunderstanding

A requirements model is, in simplest terms, a set of labeled propositions: most of it is natural language text. If so, how can you reduce the odds of it being misunderstood? Natural language is vague, ambiguous, unclear, while systems/products/services we make to solve requirements tend to be well defined, at least when they’re made; hence…

|

Conditions for Incomplete Requirements Models

When is a requirements model incomplete? The answer depends on the requirements modeling language (RML) used to make the model. Therefore, when you choose an RML, you are also choosing its own definition of when a model is incomplete.  The reason that conditions for model incompleteness are important, is that you cannot claim that you…

|

A Trigger for Requirements Change

There is a simple condition, called “fitness improvement” that triggers (i.e., is both necessary and sufficient for) the change of a requirements model. The problem with it is that it is simple to define, but expensive to check when it verifies in practice. What is that condition, and why is checking it expensive? In discussing…

|

Are Refinement and Decomposition Equivalent?

In requirements modeling languages, refinement and decomposition show up as two relationships over requirements. Both terms are also, somewhat confusingly, used to refer to processes for changing the information in a requirements model. Although they have different origins, and appear in different modeling languages, they are actually not independent relationships. I will argue below that…

|

Requirements Lifecycle & the DevOps Loop

It requires paraconsistent reasoning and involves cognitive dissonance to think at the same time about requirements in the way promoted in mainstream requirements engineering, and then use the DevOps loop (and the broader model), a method that has been demonstrated to work (and I’ve seen it applied in a team I was part of in…

| |

Requirements Satisfaction ≠ Customer Satisfaction

There is engineering quality of a product or service, which is fitness to the specification, and there is perceived quality, or subjective quality, which is proportional to the distance between expectations and experience of the person asked. What is the relationship between these, between specification, requirements, expectations, and experience? This is a longstanding question in…

| |

Choice in Absence of Utility and Probability Estimates

In expected utility models, utility quantifies preferences, probability quantifies uncertainty. Sounds simple, elegant, but tends to be expensive. What if options can be identified, but there is no information about preferences or uncertainty in a format that can be translated into, respectively, utility and probability? What is an alternative decision process, which is still structured…

| |

When Is a Requirement Accurate?

What conditions should a requirement satisfy, to be considered accurate? It turns out that this is a very complicated question. Two easy, yet unsatisfying ways out are (i) to think the question is irrelevant, and (ii) to claim that the validation of a requirement answers that question (i.e., if a stakeholder A gave the requirement…

|

Approaches to Requirements Satisfaction

Given a set of requirements to satisfy, and assuming they can be satisfied together, are we always aiming to find the solution that maximizes the level of satisfaction of all these requirements? Maximization of satisfaction, or more generally, finding an optimal solution to given requirements, is a common way of thinking about what we want…

|

Requirements as Decision Criteria

Assuming two or more alternative solutions are available, to make a decision means to pick only one of these, or, equivalently here, to commit to one and ignore others. What role do requirements play in such decision making situations?  In classical decision theory [1], the best solution is the one that yields the highest expected…

|

Unavoidable Frictions to Transparency

There is transparency when “activities are done in an open way without secrets, so that people can trust that they are fair and honest” [1]. If there is consensus that transparency is desirable, can it be practically achieved? Or, why couldn’t it be achieved? For this to be the case, it should be possible to…

Specialization versus Transparency
| |

Specialization versus Transparency

In a firm, what is the relationship between transparency of information and specialization of work?  Increasing specialization means that individuals over time deepen a relatively narrow set of skills and knowledge required for these, in response to the opportunities and problems they are responsible for. There are organizational structures that evidently encourage specialization, such as…

|

Specialization Costs in Functional Organization

In a functional organizational structure, each team is responsible for a set of something called functions. An essential property of a functional team is homogeneity of knowledge within the team: people in it usually share similar educational background, similar expertise, similar career development paths. A clear benefit of functional organization is that it allows a…

|

Nurtured Choice

What can you do to influence someone’s decision, if you cannot give them advice? In short, a possible approach is to take actions that satisfy two conditions: I call this the nurturing of choice. Instead of providing advice that is clearly directed at the elements of the decision problem (as I discussed in my book…

| |

Limits of Decentralized Autonomous Organizations (DAO)

The crypto glossary at Andreessen Horowitz [1] gives the following definition of a Decentralized Autonomous Organization, or DAO:  “Decentralized autonomous organizations” or DAOs represent exactly what they are called; they are: (i) decentralized so, the rules cannot be changed by a single individual or centralized party; (ii) autonomous, so they operate based on logic written…

|

The Firm as a Network of Teams

What determines the distribution of knowledge and information flow between teams in a firm? Why are some bigger than others? Why do some teams collaborate more with others? Many small choices made by different people, inside and outside a team, accumulate to hard-to-reverse distribution of expertise, decision authority, and resources, that is, into the set…

Economics of the Acceptability of an Argument
| |

Economics of the Acceptability of an Argument

An argumentation framework [1] is a graph of nodes called arguments, and edges called attacks. If arguments are propositions, and “p1 attacks p2” reads “if you believe p1 then you shouldn’t believe p2”, then an argumentation framework looks like something you can use to represent the relationship between arguments and counterarguments in, say, a debate….

Linguistic Causes of Distracting Disagreement
| |

Linguistic Causes of Distracting Disagreement

There is disagreement which leads to constructive revision of definitions (see Plastic Definitions and Define/Destroy method), i.e., the improvement of definitions during innovation, and then there is disagreement which is distracting, useless, wastes time, and takes focus and attention away from improvements. Distracting disagreement comes from ambiguity, synonymy, and vagueness, what I call linguistic causes…

Unpacking Disagreement over New Ideas
|

Unpacking Disagreement over New Ideas

”Why is it a problem to have stops? Stops are common. We should be able to add them to a live load.” He was insisting. This made no sense to me.  ”You mean a shipment, right? The load becomes a shipment once matched.” I waited for his confirmation. It wasn’t happening.  This got me thinking…

How New Ideas Use Old Terms
| |

How New Ideas Use Old Terms

One of the observations made in the discussion of conceptual leaps is that new ideas rely on old. Even if new ideas mark discontinuity with old, they also built on old ideas. By old ideas, I mean ideas which are established, non-controversial for a given group of people working together (so they may be controversial…