Machine-readable · agent-discoverable

AI Licensing Terms

This page describes how AI crawlers, retrievers, and training pipelines may use content on davidtofan.com. It is the target of the site's Link: …; rel="license" response header and the corresponding entry in robots.txt.

Last updated:

TL;DR

  • Search indexing — allowed.
  • AI retrieval with citation (RAG, agent answers that link back) — allowed.
  • AI training and fine-tuning — not granted. Requires a written license.
  • Bulk scraping without attribution — not allowed.

Content Signals

This site publishes Content Signals (IETF AIPREF draft) via robots.txt. The current declaration is:

User-Agent: *
Content-Signal: search=yes, ai-input=yes, ai-train=no

These are machine-readable preferences. Where the relevant law applies, in particular Article 4(3) of European Union (EU) Directive 2019/790 on Copyright and Related Rights in the Digital Single Market (DSM), they also constitute an express opt-out of Text and Data Mining (TDM) for commercial purposes.
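As an illustration of how a crawler might read the declaration above, here is a minimal Python sketch. It assumes the simple `key=value` comma-separated form shown on this page; the function name `parse_content_signals` is hypothetical, and the sketch does not attempt to cover everything the Content Signals draft may allow. Signals that are not declared are left out of the result rather than guessed.

```python
def parse_content_signals(robots_txt: str) -> dict[str, dict[str, bool]]:
    """Map each User-Agent group to its declared Content-Signal preferences."""
    signals: dict[str, dict[str, bool]] = {}
    current_agents: list[str] = []
    in_agent_run = False  # True while reading consecutive User-Agent lines

    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()

        if field == "user-agent":
            if not in_agent_run:
                current_agents = []  # a new group starts here
            current_agents.append(value.lower())
            in_agent_run = True
            continue

        in_agent_run = False
        if field == "content-signal":
            prefs: dict[str, bool] = {}
            for item in value.split(","):
                key, sep, val = item.partition("=")
                if sep:  # ignore malformed items without "="
                    prefs[key.strip().lower()] = val.strip().lower() == "yes"
            for agent in current_agents:
                signals.setdefault(agent, {}).update(prefs)
    return signals


# The declaration published on this page.
ROBOTS = """\
User-Agent: *
Content-Signal: search=yes, ai-input=yes, ai-train=no
"""

signals = parse_content_signals(ROBOTS)
print(signals["*"])  # {'search': True, 'ai-input': True, 'ai-train': False}
```

A training pipeline using this sketch would skip the site whenever `signals["*"].get("ai-train")` is `False`.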

License

Unless a specific article or project states otherwise, original content (text, diagrams, code samples) on this website is published under the following terms:

  • Search crawlers may index and display short excerpts with attribution and a back-link to the canonical URL.
  • AI agents performing user-initiated retrieval (e.g. answering a prompt with a citation) may fetch, quote briefly, and summarize content, provided the response links back to the canonical URL.
  • Training, fine-tuning, or model distillation on this content — whether directly or via a derived dataset — is not granted by default. Contact the author for a commercial license.
  • Bulk mirroring, republishing, or redistribution of full articles without attribution and a back-link is not permitted.

Text and Data Mining (TDM) opt-out reservation

Rights under Article 4(3) of the European Union (EU) Directive 2019/790 on Copyright and Related Rights in the Digital Single Market (DSM) are expressly reserved. This reservation is repeated machine-readably via the ai-train=no Content Signal in robots.txt.

Enforcement

This website is behind Cloudflare. Requests identifying as AI crawlers may be throttled, challenged, blocked, or charged via AI Crawl Control and Pay Per Crawl regardless of their declared preferences, according to the operator's policy at the time of the request.

Commercial licensing

For a commercial training or redistribution license, reach out via LinkedIn. Please include: the content you want to license, the intended use (training, fine-tuning, product), and the expected scale.

RSL 1.0 license document

These terms are also published as a Really Simple Licensing (RSL) 1.0 document at /rsl.xml, advertised via the License: directive in robots.txt and a Link: …; rel="license"; type="application/rsl+xml" response header on every HTML page. RSL is an emerging standard backed by Reddit, Yahoo, Quora, Medium, People Inc. and others, and is intended as a machine-readable successor to prose-only licensing.
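For agents that discover the license through the response header, the following Python sketch shows one way to pull license targets out of a Link header value. The helper name `find_license_links` is hypothetical, the parser is deliberately simplified (it does not handle every edge case of the Web Linking syntax), and the sample header value is an assumption: this page elides the actual link target, so the example uses the /rsl.xml location named above, plus an unrelated made-up entry.

```python
import re
from typing import Optional


def find_license_links(link_header: str, media_type: Optional[str] = None) -> list[str]:
    """Return target URLs of links with rel="license", optionally filtered by type."""
    urls: list[str] = []
    # Split between link-values: a comma followed by the next "<target>".
    for part in re.split(r",\s*(?=<)", link_header):
        match = re.match(r"\s*<([^>]*)>(.*)", part, re.S)
        if not match:
            continue
        url, params_str = match.groups()

        # Collect ;name="value" or ;name=value parameters.
        params: dict[str, str] = {}
        for p in re.finditer(r';\s*([\w-]+)\s*=\s*(?:"([^"]*)"|([^;,\s]+))', params_str):
            value = p.group(2) if p.group(2) is not None else p.group(3)
            params[p.group(1).lower()] = value

        # rel may hold several space-separated relation types.
        if "license" not in params.get("rel", "").lower().split():
            continue
        if media_type and params.get("type", "").lower() != media_type:
            continue
        urls.append(url)
    return urls


# Assumed header value for illustration only.
SAMPLE_HEADER = (
    '<https://davidtofan.com/rsl.xml>; rel="license"; type="application/rsl+xml", '
    '<https://example.com/other>; rel="alternate"'
)

rsl_links = find_license_links(SAMPLE_HEADER, "application/rsl+xml")
print(rsl_links)  # ['https://davidtofan.com/rsl.xml']
```

Filtering on the `type` parameter lets a client distinguish the RSL document from a human-readable license page that is also advertised with rel="license".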

Machine-readable summary

For agents that prefer a single JSON blob over fetching /rsl.xml:

{
  "site": "https://davidtofan.com",
  "content_signals": {
    "search": "yes",
    "ai-input": "yes",
    "ai-train": "no"
  },
  "tdm_opt_out": {
    "basis": "EU Directive 2019/790 Article 4(3)",
    "applies_to": "all original content"
  },
  "license": {
    "default": "all-rights-reserved",
    "grants": {
      "search_indexing": "allowed-with-attribution",
      "ai_retrieval_with_citation": "allowed-with-backlink",
      "ai_training": "requires-written-license",
      "bulk_redistribution": "not-allowed"
    },
    "commercial_contact": "https://www.linkedin.com/in/davidtofan/",
    "rsl": "https://davidtofan.com/rsl.xml"
  }
}
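A consumer of this summary could map an intended use to a decision along the following lines. This is a minimal sketch, not a normative interpretation: the function name `check_use` is hypothetical, and two rules are assumptions of the sketch rather than statements from this page, namely that any grant value starting with "allowed" counts as permitted subject to the stated condition, and that a use missing from "grants" falls back to the "default" value.

```python
import json

# The summary published above, embedded verbatim for the example.
SUMMARY_JSON = """
{
  "site": "https://davidtofan.com",
  "content_signals": {"search": "yes", "ai-input": "yes", "ai-train": "no"},
  "tdm_opt_out": {
    "basis": "EU Directive 2019/790 Article 4(3)",
    "applies_to": "all original content"
  },
  "license": {
    "default": "all-rights-reserved",
    "grants": {
      "search_indexing": "allowed-with-attribution",
      "ai_retrieval_with_citation": "allowed-with-backlink",
      "ai_training": "requires-written-license",
      "bulk_redistribution": "not-allowed"
    },
    "commercial_contact": "https://www.linkedin.com/in/davidtofan/",
    "rsl": "https://davidtofan.com/rsl.xml"
  }
}
"""


def check_use(summary: dict, use: str) -> tuple[bool, str]:
    """Return (permitted, grant_value) for an intended use.

    Assumption: unknown uses fall back to the license default, and only
    values beginning with "allowed" are treated as permitted.
    """
    license_info = summary["license"]
    grant = license_info["grants"].get(use, license_info["default"])
    return grant.startswith("allowed"), grant


summary = json.loads(SUMMARY_JSON)
for use in ("search_indexing", "ai_training", "model_distillation"):
    permitted, grant = check_use(summary, use)
    print(f"{use}: permitted={permitted} ({grant})")
```

Returning the raw grant value alongside the boolean keeps the attached condition (attribution, back-link) visible to the caller instead of collapsing it into a bare yes or no.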