The issue is that Markdown was designed for lightweight human writing, not for deterministic document systems.
The real failure modes of Markdown
Markdown looks simple because it hides structure inside visual conventions. That is comfortable for humans, but fragile for tools. When documents become inputs for build systems, validators, translators, PINA pipelines, or technical publishing workflows, hidden structure becomes a liability.
Ambiguous structure
Spacing, nesting, and extensions can change the meaning of the same text.
Mixed concerns
Content, semantics, layout, and rendering hints often live in the same file.
Renderer dependence
The same source may behave differently across GitHub, CommonMark, static sites, and custom tools.
For example, this looks harmless:
# Title
- Item
- Nested item?
- Third level?
But a machine now needs to answer questions that the author probably never meant to specify formally. How many levels are valid? Is this nesting intentional? Is indentation semantic? Which Markdown dialect is being used?
Another common example:
## Warning > ⚠️ Important note
Is this a warning block, a quote, an icon convention, a visual style, or real semantic metadata? Markdown does not know. The renderer, plugin, or human reader must guess.
DITA: correct principle, heavy execution
DITA goes in the opposite direction. It takes structure seriously. It gives documentation a strong semantic model, supports reuse, and fits enterprise-scale technical publishing.
The cost is complexity. XML verbosity, strict tooling, and a steep learning curve make DITA powerful but difficult to adopt for small teams, developers, and fast engineering workflows.
In short: DITA is right about structure, but often too heavy for everyday writing.
Docusaurus: great platform, same underlying problem
Docusaurus is excellent for developer-facing documentation websites. It gives a project navigation, routing, theming, search, versioning, and a modern publishing pipeline.
But Docusaurus is a documentation platform, not a clean document model. In many workflows it still relies on Markdown or MDX underneath. That means it organizes Markdown very well, but it does not fully solve Markdown's ambiguity as a source format.
Text2Doc: separate the layers
Text2Doc starts from a different assumption: the document source should remain human-readable, but it should not carry every responsibility at once.
| Layer | Responsibility |
|---|---|
| TEXT | Human narrative, natural sections, paragraphs, and lists. |
| SIDECAR | Semantic structure, injections, figures, tables, warnings, and metadata-driven behavior. |
| THEME | Visual design, layout, CSS, templates, and rendering decisions. |
The TEXT layer stays clean: no HTML, no Markdown syntax, no styling hints, no embedded directives. It is narrative content with strict structural rules. Everything else belongs to SIDECAR or the renderer.
It is trying to make documents deterministic enough that machines can trust them.
Comparison: Markdown, DITA, Docusaurus, Text2Doc
| Capability | Markdown | DITA | Docusaurus | Text2Doc |
|---|---|---|---|---|
| Human-readable source | Yes | No | Yes | Yes |
| Deterministic parsing | Weak | Strong | Depends | Strong |
| Machine-friendly structure | Limited | Strong | Partial | Strong |
| Separation of content and rendering | Weak | Strong | Partial | Strong |
| Low adoption friction | Very low | High | Medium | Low |
| Best use case | README files, notes, simple docs | Enterprise technical publishing | Developer documentation websites | Deterministic, machine-first documentation workflows |
The real problem: outputs should become inputs
The future of documentation is not only about nicer pages. It is about documents becoming reliable inputs for other systems: renderers, validators, translators, search engines, PINA tools, build pipelines, and product workflows.
Today, too many documents are written for humans first, then repeatedly reinterpreted by machines. That creates fragile pipelines and hidden manual work.
Text2Doc flips the direction: write clean human text, attach explicit semantics in SIDECAR, and let machines consume the result with confidence.
Where Text2Doc fits
Text2Doc is especially interesting for technical manuals, product documentation, specification documents, API-style documentation, hardware and FPGA engineering notes, translation workflows, and PINA-assisted document processing.
It is not meant to replace every writing format. Markdown remains excellent for casual writing. DITA remains powerful for enterprise publishing. Docusaurus remains great for documentation sites.
Text2Doc aims at the gap between them: simple enough to write, strict enough to parse, and structured enough to become a dependable machine input.