Skip to main content

Evidence, Not Terminology: How to Tell Whether a System Really Uses an Ontology

· 7 min read
John Beverley
President, National Center for Ontological Research

Semantic Infrastructure · Part 1

Evidence, Not Terminology

How to tell whether a system really uses an ontology — and why labels like “knowledge graph,” “semantic layer,” and “AI-ready” are not enough.

Core claim

A system should not receive credit for meaningful ontology use because it contains domain labels, graph nodes, schemas, dashboards, workflows, AI summaries, or an internal model called an “ontology.” It should receive credit only when it can show how meaning is represented, governed, tested, exported, and reconstructed.

Every few years, a new platform category promises to solve interoperability.

Sometimes the phrase is “ontology.” Sometimes it is “knowledge graph.” Sometimes it is “semantic layer,” “data fabric,” “AI-ready knowledge infrastructure,” or “enterprise knowledge model.”

The labels change. The evaluation problem remains the same.

A system does not meaningfully use an ontology merely because it says it does.

It may contain domain labels. It may have entity types. It may expose a graph. It may generate AI summaries. It may organize data into objects, workflows, dashboards, reports, APIs, schemas, and metadata. It may even have an internal model called an “ontology.”

Those things may be useful.

They are not, by themselves, evidence that meaning is preserved.

The problem with terminology-first evaluation

Terminology is cheap.

A vendor can call a model an ontology. A product team can call a graph a knowledge graph. A platform can describe its data layer as semantic. An AI system can claim to be grounded in domain knowledge.

But serious semantic evaluation cannot stop at the label.

The real question is whether the system makes important meanings explicit, governed, testable, inspectable, reusable, exportable, and reconstructable outside the implementation environment.

Weak evidence

“We have an ontology.”

The system uses the right word, but the evaluator still does not know where meaning resides or how it is governed.

Stronger evidence

“Here is the governed model.”

The system identifies the authoritative model, its identifiers, definitions, relations, constraints, mappings, provenance, validation rules, and export path.

The difference matters because semantic claims often hide inside operational success.

A platform may integrate data. It may power dashboards. It may run workflows. It may enable search, analytics, visualization, AI-assisted discovery, and application delivery.

That operational value may be real.

But operational value is not the same as semantic authority.

A system can be operationally useful and still fail to preserve the meanings needed for validation, inference, reuse, migration, audit, or independent implementation.

What should evaluators ask?

A better evaluation begins with a simple discipline:

Do not ask what the system calls its model. Ask what the system can prove about meaning.

Any system claiming meaningful ontology use should be able to answer six basic questions.

1

Where do the relevant semantic meanings reside?

2

How are those meanings represented?

3

Who governs them?

4

How are changes approved?

5

How can the meanings be validated, exported, and reconstructed outside the platform?

6

What semantic commitments are lost, approximated, or embedded in implementation logic?

These questions turn vague semantic claims into concrete engineering obligations.

They also prevent a common failure mode: treating the visible user-facing model as if it were the authoritative semantic model.

The user interface may show useful labels. The platform may expose useful objects. The graph may contain useful nodes and edges. The AI system may generate useful explanations.

But the evaluator still needs to know whether the underlying meanings are governed and recoverable.

Useful artifacts are not always semantic authority

Many representation artifacts can support semantic work without being authoritative semantic models.

A controlled vocabulary can standardize names.
A taxonomy can organize categories.
A schema can specify fields and data shapes.
A property graph can support traversal and analytics.
A workflow can encode operational behavior.
An AI pipeline can retrieve, summarize, classify, or recommend.

All of these can be useful.

None of them automatically establishes meaningful ontology use.

Useful does not mean authoritative

Controlled vocabularyTaxonomySchemaData modelObject modelProperty graphKnowledge graphAI pipeline

The question is whether these artifacts preserve the relevant semantic commitments: identifiers, definitions, relations, axioms, constraints, mappings, provenance, validation behavior, inference expectations, and query-answer explanations.

If they do, they should show how.

If they do not, they may still be acceptable for operational use, but they should not be treated as the authoritative home of meaning.

The evidence package

Evidence-based evaluation requires artifacts that can be inspected.

The point is to make meaning testable.

A serious evidence package should show the model, the mappings, the governance process, the validation procedure, and the known semantic gaps.

Authoritative model

The approved semantic model for the relevant scope.

Standards-aligned export

A representation that can be inspected outside the implementation environment.

Identifier policy

Rules for stable identifiers, namespaces, deprecation, replacement, and versioning.

Mapping matrix

Explicit correspondences between authoritative semantics and platform-specific artifacts.

Traceability matrix

Evidence that workflows, code, outputs, and AI behavior trace back to governed meaning.

Validation profile

Tests showing which data, mappings, or derived products satisfy semantic constraints.

Semantic gap report

Disclosure of loss, approximation, unsupported constructs, or implementation-specific reinterpretation.

Round-trip fidelity report

Evidence that meaning survives ingestion, use, export, comparison, and reconstruction.

The practical standard is simple: if the system claims semantic authority, the evaluator should be able to inspect the evidence without relying on the platform’s internal interpretation.

Operational acceptance is not semantic acceptance

A system may be good at operations and weak at semantics.

That distinction is important.

Operational acceptance asks whether the system is useful, performant, secure, usable, cost-effective, and mission-effective for its intended purpose.

Semantic acceptance asks a different set of questions. Does the system preserve governed meaning? Are identifiers stable? Are definitions explicit? Are relations represented correctly? Are mappings versioned? Are constraints testable? Is provenance preserved? Can semantic loss be identified and accepted? Can the model be exported and reconstructed outside the platform?

Operational acceptance

Can the system do useful work?

Data integration, workflows, dashboards, analytics, applications, access control, and AI-enabled support may all be valuable.

Semantic acceptance

Does the system preserve meaning?

Semantic authority requires governed meaning that is inspectable, testable, traceable, exportable, reconstructable, and reusable.

A system can pass the first test and fail the second.

That does not mean the system is useless. It means the system should not be credited as the authoritative semantic layer unless the relevant criteria are satisfied or the residual risk is explicitly accepted.

The rule

The rule for evaluation is straightforward:

Evaluation rule

Vendor assertions, platform labels, AI demonstrations, dashboards, and internal terminology should be evaluated against evidence artifacts.

This rule lets vendors compete on operational excellence while preventing operational convenience from silently becoming semantic authority.

It lets platform teams build useful tools while keeping the organization’s meaning governable.

It lets AI systems consume semantic structure without pretending that model behavior replaces authoritative definitions, identifiers, constraints, or provenance.

Most importantly, it gives decision-makers a way to separate marketing language from architectural fact.

If a system really uses an ontology, it should be able to prove it.