Opt-out Vocabulary
Last updated: 2025-03-26
This section defines the controlled vocabulary used to express rights or usage reservations (hereafter “opt-out declarations”) under the TDM·AI Protocol. The vocabulary enables machine-readable communication of rights reservations regarding the use of digital content for text and data mining (TDM), AI training, and generative AI training.
The vocabulary is based on the proposal by the Open Future Foundation: "A Vocabulary for Opting Out of AI Training and Other Forms of TDM" (07 March 2025) Available at: .
Also see the 'Active Internet-Draft' of the Internet Engineering Task Force (IETF) aipref WG:
The vocabulary is structured hierarchically to reflect legal and technical relationships between different types of uses. Each category can be declared independently, but an opt-out declared at a higher level also applies to its subordinate categories, as described below.
This vocabulary is designed to provide a standardised, machine-readable means for rightsholders to communicate reservations of rights concerning the use of protected content for text and data mining (TDM), artificial intelligence (AI) training, and generative AI training.
This vocabulary operates in the context of ongoing legal debate regarding the scope and applicability of statutory TDM exceptions – particularly whether such exceptions, as provided for in EU and national copyright laws, extend to the training of generative AI systems. Current academic and legal discourse, including the work of Dornis and Stober (2024), indicates divergent interpretations and unresolved questions in this area (Dornis, T.W. & Stober, S. (2024). Urheberrecht und Training generativer KI-Modelle – Technologische und juristische Grundlagen, Recht und Digitalisierung, Nomos Verlag; also available at SSRN:).
The inclusion of terms and categories in this vocabulary does not constitute a legal determination of whether any given use is permitted or prohibited under applicable law. Rather, it reflects the intention of rightsholders to reserve rights to the fullest extent permitted by law and to provide clear signals to users and AI developers in light of legal uncertainty.
Implementers and users of this vocabulary are advised to seek legal counsel regarding the specific application of copyright exceptions and limitations in their jurisdiction. The use of this vocabulary does not substitute for legal advice nor does it imply endorsement of any particular legal interpretation.
TDM
The act of using assets in the context of any automated analytical technique aimed at analysing text and data in digital form in order to generate information, including but not limited to patterns, trends, and correlations.
A reservation of TDM means that AITraining and genAITraining are also reserved.
AITraining
The act of training AI models.
A reservation of AITraining leaves TDM unrestricted but means that genAITraining is also reserved.
genAITraining
The act of training general-purpose AI models, improving their capacity to generate text, images, or other forms of synthetic content, or training other types of AI models that have the purpose of generating text, images, or other forms of synthetic content.
A reservation of genAITraining does not imply any restriction on TDM or AITraining.
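The implication rules between the three categories can be sketched in a few lines of Python. This is only an illustrative sketch and not part of the protocol: the names `Use`, `IMPLIED`, `effective_reservations`, and `is_reserved` are hypothetical, and no particular serialisation of opt-out declarations is assumed.

```python
from enum import Enum


class Use(Enum):
    """The three vocabulary categories (enum member names are illustrative)."""
    TDM = "TDM"
    AI_TRAINING = "AITraining"
    GEN_AI_TRAINING = "genAITraining"


# For each category, the subordinate categories that a reservation also covers.
# Hypothetical lookup table derived from the definitions above.
IMPLIED = {
    Use.TDM: {Use.AI_TRAINING, Use.GEN_AI_TRAINING},
    Use.AI_TRAINING: {Use.GEN_AI_TRAINING},
    Use.GEN_AI_TRAINING: set(),
}


def effective_reservations(declared):
    """Expand declared reservations to include all implied subordinate ones."""
    effective = set()
    for use in declared:
        effective.add(use)
        effective |= IMPLIED[use]
    return effective


def is_reserved(use, declared):
    """Return True if `use` is covered by the declared reservations."""
    return use in effective_reservations(declared)
```

For example, under these assumptions `is_reserved(Use.GEN_AI_TRAINING, {Use.AI_TRAINING})` is true, while `is_reserved(Use.TDM, {Use.AI_TRAINING})` is false, matching the rule that a reservation of AITraining leaves TDM unrestricted.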