Machine-readable · agent-discoverable
AI Licensing Terms
This page describes how AI crawlers, retrievers, and training pipelines may use content
on davidtofan.com. It is the target of the site's
Link: …; rel="license" response header and the corresponding
entry in robots.txt.
Last updated:
TL;DR
- Search indexing — allowed.
- AI retrieval with citation (RAG, agent answers that link back) — allowed.
- AI training and fine-tuning — not granted. Requires a written license.
- Bulk scraping without attribution — not allowed.
Content Signals
This site publishes Content Signals (IETF AIPREF draft) via robots.txt. The current declaration is:
User-Agent: *
Content-Signal: search=yes, ai-input=yes, ai-train=no These are machine-readable preferences. Where jurisdiction applies — in particular the European Union (EU) Directive 2019/790 on Copyright and Related Rights in the Digital Single Market (DSM) — Article 4(3) — they also constitute an express opt-out of Text and Data Mining (TDM) for commercial purposes.
License
Unless a specific article or project states otherwise, original content (text, diagrams, code samples) on this website is published under the following terms:
- Search crawlers may index and display short excerpts with attribution and a back-link to the canonical URL.
- AI agents performing user-initiated retrieval (e.g. answering a prompt with a citation) may fetch, quote briefly, and summarize content, provided the response links back to the canonical URL.
- Training, fine-tuning, or model distillation on this content — whether directly or via a derived dataset — is not granted by default. Contact the author for a commercial license.
- Bulk mirroring, republishing, or redistribution of full articles without attribution and a back-link is not permitted.
Text and Data Mining (TDM) opt-out reservation
Rights under Article 4(3) of the
European Union (EU) Directive 2019/790 on Copyright and Related Rights in the Digital Single Market (DSM)
are expressly reserved. This reservation is repeated machine-readably via the
ai-train=no Content Signal in robots.txt.
Enforcement
This website is behind Cloudflare. Requests identifying as AI crawlers may be throttled, challenged, blocked, or charged via AI Crawl Control and Pay Per Crawl regardless of their declared preferences, according to the operator's policy at the time of the request.
Commercial licensing
For a commercial training or redistribution license, reach out via LinkedIn. Please include: the content you want to license, the intended use (training, fine-tuning, product), and the expected scale.
RSL 1.0 license document
These terms are also published as a
Really Simple Licensing (RSL) 1.0
document at /rsl.xml, advertised via the
License: directive in robots.txt and a
Link: …; rel="license"; type="application/rsl+xml" response header
on every HTML page. RSL is the emerging standard backed by Reddit, Yahoo, Quora,
Medium, People and others — the natural successor to prose-only licensing for
machine consumers.
Machine-readable summary
For agents that prefer a single JSON blob over fetching /rsl.xml:
{
"site": "https://davidtofan.com",
"content_signals": {
"search": "yes",
"ai-input": "yes",
"ai-train": "no"
},
"tdm_opt_out": {
"basis": "EU Directive 2019/790 Article 4(3)",
"applies_to": "all original content"
},
"license": {
"default": "all-rights-reserved",
"grants": {
"search_indexing": "allowed-with-attribution",
"ai_retrieval_with_citation": "allowed-with-backlink",
"ai_training": "requires-written-license",
"bulk_redistribution": "not-allowed"
},
"commercial_contact": "https://www.linkedin.com/in/davidtofan/",
"rsl": "https://davidtofan.com/rsl.xml"
}
}