When ChatGPT entered the stage, the industry cheered: "Now AI can finally write our technical documentation!" But this hope was (and still is) a dangerous misconception – especially in the regulated world of medical technology. As tempting as it sounds to generate complex technical documentation with just a few prompts, anyone who believes this will work with PDFs, Word files, or Excel sheets as input is building a ticking time bomb.
Or to put it another way: You’re mounting a Formula 1 engine onto a horse-drawn carriage – and wondering why it crashes in the first corner.
Large language models (LLMs) like GPT are impressively powerful. But they’re not magicians. They’re only as good as the data you feed them.
And here's the problem: Most technical documentation today still exists in unstructured formats.
That means:
These documents might be readable (sort of) for humans. But for AI, they’re a maze. Facts live side-by-side with redundancies. Versions aren’t clearly separated. Terminology varies across pages. The result? AI starts guessing. And in the best case, a reviewer spots the mistake.
In the worst case? The errors go undetected – and influence critical product decisions.
In regulated environments that is no minor issue: incorrect or inconsistent information can be legally and clinically significant (see DocBench 2024, Microsoft Research 2025).
Our own work at meddevo – along with numerous studies – shows one thing clearly:
Structured, content-based data models are key to using AI in technical documentation both effectively and safely.
Here’s a quick comparison:
A 2023 study by RWS (Tridion Docs) found that LLMs provided significantly higher factual accuracy and more relevant responses when fed content from structured, modular databases instead of raw PDFs or Word files. Similarly, Fluid Topics 2024 reported that AI assistants based on DITA XML content outperformed PDF-based approaches in answer quality and speed.
As one industry colleague aptly put it: "Garbage in, garbage out – but structured gold becomes real value."
Paper-based formats (PDF, Word, Excel) were never made for machines. They’re passive. They lack semantics, structure, and true metadata. They may look nice – but they’re not machine-readable in a meaningful way. AI needs context, clarity, and modularity.
Content-based data models provide exactly that:
Information organized by product components, intended use, or regulatory requirements. Versioned, referenceable, and traceable content.
And – most importantly – content that’s understandable not just by humans, but also by machines.
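To make that less abstract, here is a minimal sketch of what such a content module could look like in code. It is an illustration under assumptions, not a description of any particular product or standard: the class `ContentModule`, its field names, and the helper functions are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContentModule:
    """One self-contained unit of documentation (all field names are hypothetical)."""
    module_id: str   # stable identifier, so other content can reference it
    version: int     # explicit version number instead of a file-name suffix
    component: str   # product component the content belongs to
    topic: str       # e.g. "intended_use" or "risk_control"
    text: str        # the human-readable content itself
    requirements: tuple[str, ...] = ()  # placeholder IDs of requirements it supports


def latest_versions(modules):
    """Keep only the newest version of each module, keyed by module_id."""
    newest = {}
    for module in modules:
        current = newest.get(module.module_id)
        if current is None or module.version > current.version:
            newest[module.module_id] = module
    return newest


def find_modules(modules, component=None, topic=None):
    """Select current modules by metadata rather than by searching free text."""
    return [
        m for m in latest_versions(modules).values()
        if (component is None or m.component == component)
        and (topic is None or m.topic == topic)
    ]


# Placeholder content, invented purely for this example.
modules = [
    ContentModule("IU-1", 1, "pump", "intended_use", "Outdated statement."),
    ContentModule("IU-1", 2, "pump", "intended_use", "Current statement.", ("REQ-001",)),
    ContentModule("RC-1", 1, "sensor", "risk_control", "Risk control text."),
]

for m in find_modules(modules, component="pump"):
    print(m.module_id, m.version, m.text)
```

The point of the sketch: the question "what is the current intended-use statement for the pump?" is answered by a metadata lookup, with the outdated version filtered out explicitly. Nobody, human or machine, has to guess which paragraph in which file is the valid one.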
Microsoft's KBLaM (Knowledge Base Language Model, 2025) demonstrated that LLMs connected to structured knowledge sources were more accurate and less likely to hallucinate – even refusing to answer when reliable content wasn’t available. That’s a level of trustworthiness unstructured content simply can’t offer.
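The "refuse rather than guess" behavior can be illustrated with a deliberately simple sketch. To be clear, this is not how KBLaM works internally; it is a plain dictionary lookup that only mimics the outcome. The knowledge base contents, the function `grounded_answer`, and the refusal message are assumptions made up for this example.

```python
# Hypothetical structured knowledge base:
# (component, attribute) -> (content, source reference)
KNOWLEDGE = {
    ("pump", "intended_use"): ("Example intended-use statement.", "IU-1 v2"),
}

REFUSAL = "No reliable content available for this question."


def grounded_answer(component, attribute, knowledge=KNOWLEDGE):
    """Answer only from the structured source; refuse when nothing matches."""
    entry = knowledge.get((component, attribute))
    if entry is None:
        # No matching entry: say so instead of producing a plausible guess.
        return REFUSAL
    content, source = entry
    # Every answer carries its source reference, so a reviewer can trace it.
    return f"{content} [source: {source}]"


print(grounded_answer("pump", "intended_use"))
print(grounded_answer("pump", "sterilization_method"))
```

With a folder of PDFs there is no clean way to tell "the answer is not in here" apart from "the answer is in here somewhere, phrased differently". With keyed, structured content, that distinction is a simple lookup.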
The concern that AI poses a risk to regulated documentation is understandable – but misleading. AI isn’t the risk. The real risk is feeding it the wrong kind of input.
Today we already see this clearly:
A content-based eTD model without AI saves time, prevents errors, and improves quality. The same model with AI amplifies those benefits – because the machine no longer guesses, it delivers with precision.
According to ChatBees 2023 and Webex Developer Blog 2025, AI chatbots trained on structured documentation outperform those trained on freeform documents in nearly every metric: speed, relevance, and user satisfaction.
The future of technical documentation lies in modular, version-controlled, semantically enriched data models. Ideally, these are harmonized across the EU, interoperable, and AI-ready.
What does it take to get there?
As summed up in the Agrawal et al. 2024 Knowledge Graph Survey: "The combination of LLMs and structured ontologies is not optional – it is the natural evolution of AI-based decision support."
Anyone hoping AI will turn legacy Word documents into perfect technical files is bound to be disappointed. But those willing to invest in structured content models will be rewarded – with greater efficiency, higher quality, and regulatory peace of mind. The carriage is obsolete. The engine is ready.
What’s missing is the right chassis – and it’s definitely not made of paper.