Three terms that sound alike, three categories that do entirely different work. The buyer's decoder for KM, Knowledge AI and DKP — and why conflating them…

Knowledge AI vs. Knowledge Management vs. Document Knowledge Platform: untangling the 3 categories before you sabotage your enterprise AI project

Most of the useful knowledge inside a large enterprise does not live in a database. It lives in PDFs, slide decks, Confluence pages, emails, meeting notes. Databricks puts the share of enterprise knowledge that is “functionally invisible” — trapped in unstructured documents that classic analytical pipelines never touch — at around 80 % (Databricks, 2026). Atlan reports that unstructured volume is growing at 40-60 % per year inside its customer base (Atlan, 2026). At the same time, three terms are now competing for the label that supposedly describes the answer: Knowledge Management, Knowledge AI, Document Knowledge Platform. Five major vendors — Glean, Sinequa, Squirro, Writer, Workday/Sana — have each claimed one of these phrases in their communications over the last two months. None of them really untangles what separates them.

This confusion is not a semantic detail. When a CIO or CDO buys “Knowledge AI” hoping to fix the corpus problem, they have bought the wrong thing. When a Head of Knowledge Management is sold a “Document Knowledge Platform” thinking they are looking at a new Confluence, they have missed the point. This note tries to lay out a decoder — clarifying who does what, and where K-AI fits, while acknowledging that we are neither the first nor the only player on our patch.

Three words that sound alike do not point to the same thing

The reason the noise is loud is that all three categories deal with the same raw material: an enterprise’s documentary knowledge. But they do not act on the same layer, do not speak to the same buyer, and are not measured against the same KPIs.

A short version of the decoder before unpacking it: Knowledge Management is an organizational discipline — a set of practices and tools designed to give humans inside an organization the right information at the right time. Knowledge AI is a usage layer — an assistant or an agent that answers a question, drafts a brief, executes a task, drawing on the enterprise corpus. Document Knowledge Platform (DKP) is an upstream layer — the infrastructure that makes a corpus usable by AI: auditing, deduplication, detection of contradictions, obsolescence flagging, continuous monitoring. Three jobs. Three budget lines. Three different vendors when the organization is mature, sometimes one when it starts out — but with conscious clarity about what is being bought and what is not.

This is not a theoretical view. It is exactly the grid that we see crystallizing in RFPs when a serious large enterprise compares its three concurrent proposals: what do we solve first? Which layer is missing? Who do we speak to internally?

Category 1 — Knowledge Management: an organizational discipline, not a tech stack

Knowledge Management is the elder. It is a discipline that has structured the handling of knowledge inside the enterprise since the 1990s — first around intranets, then wikis (SharePoint, Confluence, Notion), then support knowledge bases (Document360, Zendesk Guide, Salesforce Knowledge). Its yardstick is not an AI accuracy score. It is a usage rate, a ticket resolution time, an onboarding reduction.

The sponsor of a KM project is rarely a CTO. It tends to be a Chief People Officer, a VP Customer Care, a Head of Knowledge Management, a Chief Operating Officer. Market tools — Sinequa, Lucidworks, Atlassian Confluence, Document360 — have for three years added an AI veneer (federated search, tag suggestion, automatic summarization). Sinequa has structured part of its 2026 messaging around “Empowering Enterprise Knowledge Management with AI” (Sinequa, January 2026) and publishes direct ROI figures reaching 21.7 % for unified information access projects (Sinequa, April 2026).

What KM does not do — and has never set out to do — is clean the corpus upstream. A Confluence whose documentary quality has decayed is still a Confluence. KM makes knowledge findable and navigable; it does not make it coherent, non-contradictory and up-to-date. It is a discipline of organization and access, not an infrastructure of quality.

Category 2 — Knowledge AI: the usage layer that answers questions

Knowledge AI is the category that has emerged fastest since 2023 and is crystallizing in 2026. It refers to the software layer that interprets an intent, formulates a natural language answer, or triggers an action by drawing on the enterprise corpus. Glean is the most visible figure. The vendor positions itself as a Work AI Platform, announced in May 2026 an AI coworker that no longer just answers but executes, and on May 12 published an Enterprise Agent Development Lifecycle in seven stages, codifying how enterprises ship agents to production (Glean, May 12 2026).

The territory around Glean is crowded. Writer pushes its enterprise Knowledge Graph and Control Panel, framed as the single place to govern, secure and scale knowledge sources (Writer, 2026). Squirro communicates on GraphRAG, claiming extreme accuracy on complex queries (Squirro, 2026). Workday acquired Sana Labs in November 2025 for $1.1 billion and positions it as the full AI platform for enterprise-wide knowledge, agents, and automation (Workday, March 17 2026). Microsoft Copilot, Notion AI, Atlassian Rovo occupy adjacent slots depending on the vertical.

The sponsor of a Knowledge AI project is, by contrast, a very different person: a CTO, a Chief Innovation Officer, a Chief Digital Officer. The yardstick is usage adoption, an answer relevance rate, a cost per task. Pinecone — long-known as a vector-database vendor — made a striking admission on May 5, 2026 when launching its Nexus knowledge engine: 85 % of an agent’s effort goes into retrieval, and fragmented RAG pipelines collapse task completion to 50-60 % (Pinecone, May 5 2026). In other words: a player everyone had filed under “infrastructure” is now saying that the AI usage layer is not enough on its own, and that knowledge has to be composed upstream. That is the bridge to the third category.

Category 3 — Document Knowledge Platform: the upstream layer that makes a corpus AI-ready

The Document Knowledge Platform (DKP) is the category that is crystallizing in 2026 to handle what neither KM nor Knowledge AI handles: the state of the corpus before an AI uses it. A DKP audits, deduplicates, detects contradictions, flags obsolescence, and exposes a semantic layer that downstream AI systems — chatbots, agents, copilots, search — can consume.

This is the slot K-AI occupies, and we are not alone in it. Iris.ai talks about an AI knowledge foundation (Iris.ai, 2026). NetDocuments unveiled a Legal Context Graph on May 5, 2026, with the same logic verticalized to legal (LawNext, May 5 2026). Pryon pushes a semantic variant under the AI memory label. On the data catalog side, BigID and Atlan announced on March 9, 2026 the first unified structured and unstructured data catalog for AI governance (BigID + Atlan, March 9 2026). The word platform is not settled, but the underlying movement is unambiguous: a new category is forming between IDP (document extraction) and Knowledge AI (AI usage).

At K-AI, our take on a DKP is articulated in two phases — Start Clean then Stay Clean — and rests on a Neural Semantic Graph that materializes relationships between documents (not a simple vector index). On a first diagnostic over a single documentary repository at a client organization, we routinely surface more than one thousand anomalies — procedures that contradict each other, divergent duplicates, obsolete information that was never retired. And that repository is just one among many across the organization. It is this repeated observation that motivates the existence of the DKP as a distinct layer.

The sponsor of a DKP project, in our deployments, is most often a CDO / Head of Knowledge Management pair, sometimes a CIO operating in AI Readiness mode. The KPIs are neither usage rates nor cost per task: they are measurable reductions in conflicts, duplicates and obsolete passages inside the corpus, and a lift in the answer accuracy of the downstream AI layers. The promise is that these upstream metrics condition every downstream metric.

Document Knowledge Platform: transposing the data catalog and data mesh to the unstructured world

For a CDO, the most direct decoder is probably this: a Document Knowledge Platform is to documents what a data catalog or a data mesh is to structured data. This is not a marketing analogy, it is a discipline transposition. A data catalog inventories, classifies, tracks lineage and applies policy on tables and columns. A DKP does the same on paragraphs, sections, versions, authors, cross-document contradictions — with the specificity that the useful information is not in a schema but in the sentence.

BigID and Atlan, with their March 9, 2026 unified data catalog, made the move by stating that unstructured data should be addressed by data governance tools. That is a legitimate motion. But two complementary gestures should be distinguished: a catalog inventories the data, exposing metadata and lineage; a DKP cleans and maintains the data, acting on the matter itself. Atlan puts it in their own words in a reference post: “unstructured data isn’t a storage problem, it’s an AI lineage problem” (Atlan, 2026). The DKP adds: “neither just a storage nor only a lineage problem — it is also a quality problem, which has to be measured and fixed upstream”.

This transposition is not innocent from a budget standpoint. Where Knowledge AI is typically funded by an innovation or digital workplace line, and KM by an operations or HR line, the DKP naturally lands on the data and data governance budget — the one that, in 2026, is receiving the rising share of investment in every organization serious about AI. Gartner observes that organizations whose AI initiatives are successful invest up to four times more, as a percentage of revenue, in data foundations, governance and AI-ready people, compared to those experiencing poor outcomes (Gartner, April 16 2026). The DKP is the mechanism by which that upstream investment produces downstream results.

Five categorical mistakes that wreck enterprise AI projects

Untangling the categories is not a doctrinal exercise. These are five very concrete mistakes we see resurfacing every quarter in the RFPs that reach us.

The first is to buy a Knowledge AI layer without addressing the upstream corpus quality. It is the costliest mistake because it is the most invisible: the project technically succeeds (the assistant answers), but the answers are wrong, incoherent or contradictory. Users lose trust within six weeks. The Knowledge AI layer takes the commercial blame for a problem that is not its own.

The second is to mistake KM SaaS for a DKP. A Confluence with AI search is not a DKP. It becomes findable, not clean. The useful RFP question: if we unplug your AI layer, does the corpus remain in the same shape? If yes, this is not a DKP.

The third is to assume that Microsoft Copilot or Glean are by construction a DKP because they touch the corpus. They are the downstream usage layer. Pinecone, when stating publicly on May 5 that retrieval consumes 85 % of an agent’s effort, implicitly says the same thing: that 85 % is solved upstream, not inside the agent.

The fourth is to take a unified data catalog (BigID, Atlan, Collibra extended to unstructured) for a DKP. It is a legitimate motion by the catalogs — but inventorying is not cleaning. Complementarity works extremely well (a DKP feeds the catalog with quality metadata). Substitution does not.

The fifth is more subtle: confusing AI memory (Pryon) or enterprise knowledge graph (Writer, Squirro) with a DKP. Memory is a useful component; a knowledge graph is a representation. Neither, on its own, handles audit and continuous monitoring of corpus quality. A DKP may, and at K-AI does, expose a semantic graph — but the graph is the output, not the category.

Assembling the stack correctly

Once the grid is laid out, the target architecture for a mature organization looks like this: a Document Knowledge Platform upstream, auditing and maintaining the corpus; a Knowledge AI layer querying or acting on that corpus; a Knowledge Management discipline governing the whole, structuring human roles, measuring adoption. The three layers do not have to be procured at the same time or from the same vendor. But they should be sequenced in this order: if the DKP is not in place, the Knowledge AI layer will underperform, and KM will have nothing of quality to lift up.

K-AI’s bet is to hold the upstream layer — that’s it, and that is the hardest job. We leave the AI usage layer to Glean, Sana, Writer and others, and the KM layer to Sinequa, Lucidworks, Confluence. Our conviction is that the absence of a DKP is what tips most enterprise AI projects into the quiet failure category in 2026 — the ones that ship a successful demo and a disappointing deployment.

Frequently asked questions (FAQ)

What is the difference between Knowledge AI and Knowledge Management?

Knowledge Management is a decades-old organizational discipline whose purpose is to make enterprise knowledge accessible to the humans who need it — via wikis, support knowledge bases, intranets, federated search tools. Knowledge AI is a more recent software layer that adds a conversational or agentic intelligence on top of that knowledge: an assistant that answers, drafts, executes. KM measures success by usage rate, onboarding reduction, ticket resolution time. Knowledge AI measures success by answer accuracy, volume of automated tasks, cost per task. The two coexist and complement each other — but they are distinct disciplines that should not be bought from the same vendor, are not measured with the same KPIs, and do not address the same internal sponsor.

What is a Document Knowledge Platform and how is it different from a Data Catalog?

A Document Knowledge Platform (DKP) is the infrastructure that makes a documentary corpus usable by AI: it audits the corpus, eliminates duplicates, detects cross-document contradictions, flags obsolete passages and exposes a semantic layer onto which downstream AI systems plug in. A data catalog (Atlan, BigID, Collibra, Alation) inventories data, exposes its metadata, tracks its lineage and enforces policy. The gesture is different: a catalog inventories and governs, a DKP cleans and maintains. The two are complementary — a quality DKP feeds a catalog with reliable metadata — but they do not substitute for each other. The joint BigID + Atlan announcement of March 9, 2026 about a unified structured and unstructured data catalog extends catalog coverage to unstructured data, but does not cover the cleaning operation itself.

Why does Copilot or Glean not find my SharePoint documents, or find them poorly?

Several causes coexist. Microsoft documents four standard technical reasons: misconfigured permissions, incomplete indexing, mismatched authentication scopes, file size above the cap. But there is a fifth cause that AI vendors rarely document: the document is found but unusable because the corpus is polluted. Two versions of a procedure contradict each other, an obsolete document was never retired, divergent duplicates exist between SharePoint and Teams. The assistant then surfaces findable but wrong content — which is worse than surfacing nothing at all. That is what a Document Knowledge Platform addresses upstream.

How do I assess the quality of my corpus before launching an AI project?

A structured corpus audit measures six axes: internal anomalies (coherence breaks within a document), cross-document conflicts, divergent duplicates, untagged obsolescence, traceability (author, date, validation, source) and per-segment freshness. Each axis admits a measurable KPI, an alert threshold and a remediation procedure. We describe the K-AI method in detail in our May 15 method note. The typical deliverable of a first audit is a per-repository contextualized report — not an aggregated global score, which would mask disparities between repositories.

Why do so many RAG projects never reach production?

Several converging studies have made this case. A reference post on Medium in February 2026 traces 80 % of RAG failures to the ingestion layer — duplicates inflating embeddings, undetected stale records, inconsistent metadata (Medium, February 2026). Pinecone, on May 5 2026, measures that 85 % of an agent’s effort is consumed by retrieval and that task completion rates collapse to 50-60 % on fragmented RAG pipelines. Both figures point in the same direction: the bottleneck is not the model, it is the quality of the layer upstream of the model. That is precisely the object of a DKP.

My company has 1,000-10,000 employees and a lot of unstructured data. Where should I start?

The recommended sequence is: (1) scope a pilot documentary perimeter — typically a critical repository (quality procedures, support knowledge base, product documentation); (2) audit that perimeter on the six axes above; (3) prioritize remediation on anomalies that carry significant business impact; (4) only then plug in the Knowledge AI layer (chatbot, copilot, agent). This order avoids the costliest mistake — investing in the usage layer before having cleaned the raw material. On a first diagnostic over a single client repository, K-AI routinely surfaces more than 1,300 anomalies, and documentary volume drops by an average of 32 % within a week of initial cleanup (duplicates and obsolete documents removed). These numbers only apply to the audited perimeter, but they give a realistic order of magnitude for a starting point.

Do I have to buy a DKP, a Knowledge AI and a KM tool separately?

Not necessarily. An organization starting out can buy a vendor that covers several slots, as long as it does so with clear eyes. The useful RFP question is: does this vendor explicitly handle upstream corpus quality, or does it assume the corpus is already clean? If the answer is “we assume the corpus is clean”, this is a Knowledge AI only, and the DKP layer is still to be acquired. If the answer is “we audit and maintain the corpus upstream, and expose a semantic layer consumable by downstream AI usage”, the vendor is playing a DKP role. Most mature organizations will eventually separate the layers for governance, lifecycle and vendor-specialization reasons.

Going further

If you are scoping an enterprise AI project and the category question came up in your last procurement or IT review, we would be happy to discuss it. You can reach us at contact@k-ai.ai. The first conversation is not about K-AI but about your perimeter — that is more useful for you and for us.

Sources cited

Databricks Blog — PDFs to Production: Announcing state-of-the-art document intelligence on Databricks — 2026. databricks.com
Atlan — Unstructured Data Isn’t a Storage Problem. It’s an AI Lineage Problem. — 2026. atlan.com
Sinequa — Empowering Enterprise Knowledge Management with AI — January 5, 2026. sinequa.com
Sinequa — How to Measure Enterprise AI Search and Agentic AI ROI 2026 — April 1, 2026. sinequa.com
Glean Press — Glean Introduces the Enterprise Agent Development Lifecycle — May 12, 2026. glean.com
Writer — Starter guide: Graph-based RAG for enterprise — 2026. writer.com
Squirro Blog — Roadmap to Knowledge Graph-Powered GenAI — 2025-2026. squirro.com
Workday Newsroom — Introducing Sana from Workday: Superintelligence for Work — March 17, 2026. workday.com
Pinecone Blog — Pinecone Nexus: The Knowledge Engine for Agents — May 5, 2026. pinecone.io
PRNewswire — BigID & Atlan Introduce the First Unified Structured & Unstructured Data Catalog for AI Governance — March 9, 2026. prnewswire.com
Gartner Newsroom — Gartner Says Organizations with Successful AI Initiatives Invest Up to Four Times More in Data and Analytics Foundations — April 16, 2026. gartner.com
LawNext — NetDocuments unveils Legal Context Graph to map legal knowledge — May 5, 2026. lawnext.com
A. Byakod, Medium — Why RAG Systems Fail in Production — Part 2 — February 2026. medium.com
Iris.ai — AI knowledge foundation for regulated enterprises. iris.ai

AI Act, Day-82: why a “dirty” documentary corpus makes your high-risk AI indefensible — the regulatory risk of running Knowledge AI without a DKP.
You think your RAG hallucinates because of embeddings? Look at your corpus. — the technical case that a DKP layer is required.
Auditing a documentary corpus for AI — the K-AI 6-axis method — the operational how-to of a DKP.

K-AI already partners with CMA CGM, Veolia, PwC, BNP Paribas, TotalEnergies and CEVA Logistics. Strategic partners: AWS, Snowflake, Microsoft, Wavestone, Devoteam.

Knowledge AI vs. Knowledge Management vs. DKP: untangling 3 enterprise AI categories

Knowledge AI vs. Knowledge Management vs. Document Knowledge Platform: untangling the 3 categories before you sabotage your enterprise AI project

Three words that sound alike do not point to the same thing

Category 1 — Knowledge Management: an organizational discipline, not a tech stack

Category 2 — Knowledge AI: the usage layer that answers questions

Category 3 — Document Knowledge Platform: the upstream layer that makes a corpus AI-ready

Document Knowledge Platform: transposing the data catalog and data mesh to the unstructured world

Five categorical mistakes that wreck enterprise AI projects

Assembling the stack correctly

Frequently asked questions (FAQ)

Going further

Sources cited

And in your organization, what does your document estate look like?

Knowledge AI vs. Knowledge Management vs. DKP: untangling 3 enterprise AI categories

Knowledge AI vs. Knowledge Management vs. Document Knowledge Platform: untangling the 3 categories before you sabotage your enterprise AI project

Three words that sound alike do not point to the same thing

Category 1 — Knowledge Management: an organizational discipline, not a tech stack

Category 2 — Knowledge AI: the usage layer that answers questions

Category 3 — Document Knowledge Platform: the upstream layer that makes a corpus AI-ready

Document Knowledge Platform: transposing the data catalog and data mesh to the unstructured world

Five categorical mistakes that wreck enterprise AI projects

Assembling the stack correctly

Frequently asked questions (FAQ)

Going further

Sources cited

Related reading

And in your organization, what does your document estate look like?