TDM·AI
Liccium.comContact us
  • Home
  • Introduction
    • What is the TDM·AI Protocol?
  • Recommendations for Opt-out
  • Metadata Binding
    • Issues of Domain-Based Opt-out
    • Issues of Asset-Based Optout
    • Advantages of Opt-Out Registries
  • Benefits of TDM·AI
  • Opt-out, opt-in and content licensing
  • Federated Registries
    • Registries for Training Preferences
  • Federated Registries Explained
  • Technical Specification
    • Usage Preferences Vocabulary
    • JSON Structure and Declaration Examples
    • Revocation
    • JSON Schema Definition
  • Legal Aspects
    • Legal Basis
  • Publications
    • Our Position Paper
  • Contact and Imprint Information
  • Liccium.com
Powered by GitBook
On this page
  • Disclaimer on Legal Applicability
  • 1. Usage Reservation: TDM (TDM)
  • 2. Usage Reservation: AI Training (AiTraining)
  • 3. Usage Reservation: Generative AI Training (genAiTraining)
  1. Technical Specification

Usage Preferences Vocabulary

2025-03-26

PreviousFederated Registries ExplainedNextJSON Structure and Declaration Examples

Last updated 18 days ago

​​This section defines the controlled vocabulary used to express usage permissions or usage reservations under the TDM·AI Protocol. The vocabulary enables machine-readable communication of usage preferences regarding the use of digital content for text and data mining (TDM), AI training, and generative AI training.

The vocabulary is based on the proposal by the Open Future Foundation: "A Vocabulary for Opting Out of AI Training and Other Forms of TDM" (07 March 2025) Available at: .

Also see the 'Active Internet-Draft' of the Internet Engineering Task Force (IETF) aipref WG:

The vocabulary is structured hierarchically to reflect legal and technical relationships between different types of uses. Each category can be declared independently, but opt-outs at a higher level also cause restrictions on subordinate categories, as described below.

Disclaimer on Legal Applicability

This vocabulary is designed to provide a standardised, machine-readable means for rightsholders to communicate usage preferences concerning the use of protected content for text and data mining (TDM), artificial intelligence (AI) training, and generative AI training.

This vocabulary operates in the context of ongoing legal debate regarding the scope and applicability of statutory TDM exceptions – particularly whether such exceptions, as provided for in EU and national copyright laws, extend to the training of generative AI systems. Current academic and legal discourse, including the work of Dornis and Stober (2024), indicates divergent interpretations and unresolved questions in this area (Dornis, T.W. & Stober, S. (2024). Urheberrecht und Training generativer KI-Modelle – Technologische und juristische Grundlagen, Recht und Digitalisierung, Nomos Verlag.; also available at SSRN:).

The inclusion of terms and categories in this vocabulary does not constitute a legal determination of whether any given use is permitted or prohibited under applicable law. Rather, it reflects the intention of rightsholders to express usage rights, e.g. reserve rights, to the fullest extent permitted by law and to provide clear signals to users and AI developers in light of legal uncertainty.

Implementers and users of this vocabulary are advised to seek legal counsel regarding the specific application of copyright exceptions and limitations in their jurisdiction. The use of this vocabulary does not substitute for legal advice nor does it imply endorsement of any particular legal interpretation.

1. Usage Reservation: TDM (TDM)

The act of using assets in the context of any automated analytical technique aimed at analysing text and data in digital form in order to generate information, including but not limited to patterns, trends, and correlations.

A reservation of TDM means that AiTraining and genAiTraining is also reserved.

2. Usage Reservation: AI Training (AiTraining)

The act of training AI models.

AI models can be training general-purpose AI models or other types of AI models capable of performing a wide range of tasks, including labeling, classifying, recognising patterns, making decisions, and semantically understanding content.

A reservation of AiTraining allows TDM but means that genAiTraining is also reserved.

3. Usage Reservation: Generative AI Training (genAiTraining)

The act of training general-purpose AI models, improving their capacity to generate text, images, or other forms of synthetic content, or training other types of AI models that have the purpose of generating text, images, or other forms of synthetic content.

A reservation of genAiTraining does not imply any restriction on TDM or AiTraining.

https://openfuture.eu/publication/a-vocabulary-for-opting-out-of-ai-training-and-other-forms-of-tdm
https://www.ietf.org/archive/id/draft-keller-aipref-vocab-01.html
Open Access version
https://ssrn.com/abstract=4946214