/terms/aipref · 5 min read · advanced

AIPREF (AI usage preferences)

AIPREF is the IETF AI Preferences working group's effort to standardize a machine-readable way for content owners to express how their content may be used by AI systems. The preference is carried by a Content-Usage signal, attached as an HTTP response header or a robots.txt rule, using a small vocabulary (currently the categories train-ai and search, each set to y or n). AIPREF declares a usage preference; it does not authenticate the requester (out of scope) and does not enforce compliance.

Citation status

ChatGPTPerplexityClaudeCopilotGemini

Last checked 2026-06-04

AIPREF (AI Preferences) is the IETF working group's effort to standardize a machine-readable way for content owners to express how their content may be used by AI systems1. The preference is carried by a Content-Usage signal, attached either as an HTTP response header or as a rule in robots.txt, and it draws on a small controlled vocabulary: as of the current drafts, the categories are train-ai (use in AI model training) and search (use in search-style applications that direct users back to the source), each set to y (allow) or n (disallow)23.

The point of AIPREF is to replace the confusing array of non-standard, vendor-specific signals (assorted noai meta tags, custom robots directives, terms-of-service clauses) with one interoperable vocabulary. It sits at a specific layer of the stack: it declares a usage preference, not the requester's identity and not an enforcement guarantee. The working group charter puts authenticating or authorizing crawlers explicitly out of scope1, which is exactly the boundary that Web Bot Auth occupies. AIPREF answers "what may be done with this content"; Web Bot Auth answers "who is asking."

Status in 2026

AIPREF is genuinely emerging, and both of its halves are adopted working-group documents that are still pre-standardization. An Internet-Draft is work in progress that may change, expire, or be replaced before it becomes an RFC, and a draft revision auto-expires six months after posting; that expiry is a routine part of the process, not a signal the work has been abandoned. The vocabulary draft, draft-ietf-aipref-vocab, is a current working-group document on the Proposed Standard track (revision 06, expiring October 2026); it defines the train-ai and search categories and their y/n values, and notes in its own text that it does not yet have full consensus2. The attachment draft, draft-ietf-aipref-attach, which defines the Content-Usage header and the robots.txt Content-Usage rule, is also an adopted working-group document, carrying an August 2026 milestone to send a standards-track specification to the IESG; its latest published revision (-04, October 28, 2025) has lapsed under that six-month rule, with no newer revision posted yet3. Several adjacent individual drafts (real-time bindings, content-bound consumption preferences, display-based preferences, exclusions) are tracked but not adopted.

So the working group is active on both halves, but neither is finalized: the vocabulary lacks full consensus, and the most recent header specification sits in a lapsed revision. The syntax can still change before it standardizes, so the practical state is "watch and prepare," not "deploy."

A second honest caveat: AIPREF is a preference, not an obligation. Like robots.txt before it, it expresses what a content owner would like automated systems to do; whether a given AI company honors train-ai=n is a function of that company's policy and the surrounding legal and regulatory environment, none of which the signal itself can compel. The drafts define the preference vocabulary; they do not establish that major AI-training or AI-search products will honor these values. Setting search=y or search=n records an intent whose effect depends entirely on voluntary adoption by those products.

How to apply

The useful framing is to place AIPREF correctly among the signals you already control, and to avoid over-implementing an unfinished spec:

  • Locate it as the preference layer, not access or identity. A site can carry several independent signals: robots.txt for crawl access, llms.txt for AI-readable guidance, AIPREF for usage preference, and Web Bot Auth for requester identity. They answer different questions and coexist; AIPREF can even ride inside robots.txt as a Content-Usage rule.
  • Track the working group rather than ship an unfinished header. The Content-Usage header's most recent specification sits in a lapsed draft revision, so it is not a stable target yet. If you experiment today, treat any robots.txt Content-Usage line (using the current train-ai/search vocabulary) as a non-stable, experimental signal, and monitor the working group for syntax changes before relying on it.
  • Separate preference from enforcement in your own planning. Setting train-ai=n records an intent; it does not stop training. Treat it as a documented, machine-readable statement of preference, and rely on contracts, licensing, or technical access control where you need an actual barrier rather than a request.

What to skip:

  • Implementing the Content-Usage header as though it were a finished standard. Its most recent specification sits in a lapsed draft revision; the syntax may still change before publication.
  • Treating AIPREF as an opt-out that guarantees your content is excluded from AI training. It is a preference signal whose effect depends on voluntary compliance.
  • Conflating AIPREF with crawler verification. AIPREF does not tell anyone who the requester is; that is a different problem with a different (also emerging) standard.

How it relates to other concepts

  • Complementary to Web Bot Auth: AIPREF is the preference half (what you may do with the content), Web Bot Auth is the identity half (cryptographically verifying which crawler is asking). The AIPREF charter putting crawler authentication out of scope is what makes the two cleanly complementary rather than overlapping.
  • A complementary preference layer alongside legacy robots.txt and llms.txt: robots.txt governs whether a resource may be fetched; llms.txt offers a curated AI-readable map; AIPREF expresses what the fetched content may be used for. AIPREF is not a successor to robots.txt (it reuses robots.txt as one of its carriers); it answers a different question alongside it.
  • Part of the AI access, identity, discovery, and usage-preference signal family alongside AI crawler bots (the agents these signals are aimed at) and discovery-side protocols like IndexNow. Together these form the surface a publisher has over how AI systems discover, fetch, identify against, and use a site. AIPREF itself is the usage-preference signal in that family, not an access-control or enforcement mechanism.
  • Upstream of generative engine optimization decisions: a site that signals train-ai=n but search=y is making a deliberate choice to stay visible in AI search while declaring a preference against training use, a distinction GEO strategy increasingly has to reason about explicitly.

Footnotes

  1. IETF AI Preferences (aipref) working group. datatracker.ietf.org/wg/aipref/about. Co-chairs Mark Nottingham and Suresh Krishnan. The charter's scope covers standardizing methods for expressing preferences about how content is collected and processed for AI, and attaching those preferences to content via metadata or protocol signaling. Verbatim out-of-scope item: "Application layer protocols for authenticating or authorizing clients and/or crawlers." This is the boundary that distinguishes AIPREF (preference) from Web Bot Auth (identity). 2

  2. Paul Keller (Open Future) and Martin Thomson (Mozilla, editor). "A Vocabulary For Expressing AI Usage Preferences." draft-ietf-aipref-vocab, revision 06, April 28, 2026; active working-group Internet-Draft on the Proposed Standard track, expiring October 2026. Defines two usage categories: train-ai ("using an asset in the production or refinement of an AI model that can generate content in one or more modalities") and search (applications whose "primary purpose ... is to select assets and direct users to the location of those assets," with references to the original source). Each category takes y (allow), n (disallow), or an unstated/unknown default. Example notation: train-ai=y, search=n. The draft states it "does not yet have consensus." 2

  3. Gary Illyes (Google) and Martin Thomson. "Associating AI Usage Preferences with Content in HTTP." draft-ietf-aipref-attach. An adopted working-group document with an August 2026 milestone to send a standards-track specification to the IESG; its latest published revision, -04 (October 28, 2025), lapsed under the IETF's routine six-month Internet-Draft expiry, and no newer revision has been posted as of mid-2026. Automatic expiry of a revision is a normal part of the process and is not in itself a sign the work has stopped. Defines the Content-Usage HTTP response header as a structured-field dictionary (per RFC 9651, which obsoletes RFC 8941), example Content-Usage: train-ai=n, applying the vocabulary and processing rules from the vocabulary draft. Defines the same preference as a robots.txt Content-Usage rule with an optional path prefix, example Content-Usage: /ai-ok/ train-ai=y, using longest-prefix matching consistent with Allow/Disallow. Both carriers transmit identical preference statements. Because the work is still pre-standardization and the latest revision has lapsed without replacement, the header and robots.txt syntax it specifies are not yet a stable standard. 2

Part of Infrastructure· editorial cluster, not a semantic link

Cluster pillar: AI access control

Also in this cluster: AI access control · AI crawler bots · IndexNow Protocol · LLMS.txt · Web Bot Auth

Mentioned in· auto-generated from other terms' related lists

FAQ

What is AIPREF and the Content-Usage header?
AIPREF is the IETF AI Preferences working group, which is working to standardize a machine-readable way for content owners to express how their content may be used by AI systems. The preference is carried by a Content-Usage signal, attached either as an HTTP response header (for example, Content-Usage: train-ai=n) or as a Content-Usage rule in robots.txt. The signal draws on a small controlled vocabulary: as of the current drafts, the two categories are train-ai (use in AI model training) and search (use in search-style applications that direct users back to the source), each set to y (allow) or n (disallow).
Is AIPREF a finished standard I can rely on?
Not yet. The vocabulary draft (draft-ietf-aipref-vocab) is an active working-group document on the Proposed Standard track, and the attachment-mechanism draft that defines the Content-Usage header and robots.txt rule (draft-ietf-aipref-attach) is also an adopted working-group document, carrying an August 2026 milestone toward a standards-track specification. Both are still pre-standardization: the vocabulary does not yet have full consensus, and the attachment draft's latest published revision has lapsed under the routine six-month Internet-Draft auto-expiry (a normal part of the process, not abandonment), with no newer revision posted yet. So the syntax can still change before it standardizes. Implementing the current header syntax today is premature; tracking the working group is the appropriate response.
Does AIPREF stop AI companies from using my content?
No. AIPREF is a declaration of preference, not an enforcement mechanism. Like robots.txt, it expresses what you would like automated systems to do; whether a given AI company honors that preference is a matter of that company's policy, not something the signal can compel. AIPREF also explicitly does not authenticate the requester: verifying that a crawler is who it claims to be is out of scope for the working group and is the problem Web Bot Auth addresses. AIPREF answers what may be done with the content; it does not answer who is asking or force anyone to comply.
How is AIPREF different from robots.txt and llms.txt?
All three are signals a site places for automated consumers, but they answer different questions. robots.txt is legacy crawl access (may you fetch this URL at all). llms.txt is guidance (here is a curated, AI-readable map of the site's important content). AIPREF is usage preference (given that you have the content, here is what you may use it for, such as training versus search). AIPREF can even ride inside robots.txt as a new Content-Usage rule, so the two coexist rather than compete. None of the three authenticate the requester; that is Web Bot Auth's role.

Sources & further reading

Get the monthly digest

New terms shipped that week, plus one observation from the AI-citation tracker.

More about what you'll get