Lossy, Never Divergent: The Rule Every Semantic Architecture Needs

June 7, 2026 · 8 min read

President, National Center for Ontological Research

Semantic Infrastructure · Part 2

Lossy, Never Divergent

Operational systems can simplify meaning for performance, usability, and exchange. But they must not contradict, redefine, or silently alter the governed semantic model.

Core claim

A derived product may omit semantic detail when the target format cannot faithfully carry it. But omission is not permission to redefine meaning. The rule is simple: lossy is sometimes acceptable; divergent is not.

Every serious data architecture produces derived products.

An ontology may be projected into a schema. A semantic model may be mapped into a property graph. A governed vocabulary may appear inside an API. A relation may be implemented through code. A validation rule may become a SHACL profile, a database constraint, or an application check. A model may be transformed into JSON, tables, dashboards, workflow objects, vector indexes, or AI-ready data products.

This is normal.

No serious architecture should expect every downstream system to carry every semantic commitment in its richest form.

But that does not mean downstream systems can silently change what things mean.

That is where the rule matters.

The rule

Lossy, never divergent.

A derived product may omit semantic detail when the target format cannot faithfully carry it.

It must not contradict, redefine, alter, or silently deviate from the authoritative semantic model.

Loss is not the same as divergence

Semantic loss and semantic divergence are different problems.

Semantic loss occurs when a representation leaves something out. A table may not carry all the axioms from an ontology. A dashboard may not expose all provenance. A JSON message may not represent the full class hierarchy. A property graph may flatten some logical commitments into labels and edges.

That can be acceptable when the loss is known, documented, and appropriate for the use case.

Semantic divergence is different.

Divergence occurs when a derived product contradicts or silently changes the authoritative meaning. The downstream artifact no longer merely omits detail. It becomes a competing source of meaning.

Lossy

Some meaning is omitted

A target format cannot carry every semantic commitment, so the architecture documents what was omitted, approximated, or simplified.

Divergent

Meaning is changed

A target artifact contradicts, redefines, or silently alters governed meaning, becoming an unauthorized semantic authority.

The distinction is the difference between simplification and corruption.

A simplified map is still useful if you know what it leaves out.

A misleading map is dangerous because it tells you the wrong thing.

Why downstream products are often lossy

Different technologies are good at different jobs.

Relational tables are good at structured storage and query. JSON is good for exchange. Property graphs are good for traversal. Dashboards are good for presentation. Workflow engines are good for action. Vector stores are good for similarity search. AI pipelines are good for retrieval, classification, summarization, and recommendation.

None of those runtime forms should be expected to carry every semantic commitment from a formal ontology.

Tables

Useful for storage and reporting, but often weak at preserving rich relation meaning.

JSON

Useful for exchange, but often leaves definitions, constraints, and provenance elsewhere.

Property graphs

Useful for traversal, but may flatten logical commitments into local graph patterns.

Dashboards

Useful for decision support, but often hide assumptions behind filters and metrics.

Workflows

Useful for action, but may embed meaning in code, state transitions, or permissions.

AI-ready products

Useful for retrieval and generation, but cannot replace authoritative definitions and constraints.

The goal is not to force every operational representation to be maximally expressive.

The goal is to make every operational representation accountable to the authoritative model.

The derive-then-distribute pattern

The healthiest semantic architecture does not start with the platform’s internal model.

It starts with an authoritative semantic model. From that model, governed profiles, mappings, projections, validation artifacts, and operational products can be derived.

Those derived products can then be distributed to platforms, applications, workflows, dashboards, analytics tools, and AI systems.

Derive, then distribute

1Govern authoritative meaning

Classes, relations, identifiers, definitions, constraints, mappings, and provenance patterns.

2Create derived products

Profiles, mappings, schemas, JSON contexts, graph projections, validation artifacts, and workflow models.

3Check for non-divergence

Confirm that simplifications do not contradict, redefine, or silently alter governed meaning.

4Distribute to systems

Operational platforms consume approved artifacts while remaining traceable to the semantic baseline.

This pattern supports both governance and innovation.

The authoritative model remains stable, inspectable, and governed. Platforms remain free to optimize for performance, user experience, scale, security, deployment, and mission needs.

The boundary is non-divergence.

Operational optimization is permitted.

Ungoverned redefinition is not.

What must be documented

If a target format cannot carry the full semantic model, the architecture should document what happened.

What was preserved?
What was omitted?
What was approximated?
What was moved into code?
What was moved into workflow logic?
What was moved into a mapping?
What was validated?
What remains traceable?
What residual risk was accepted?

Minimum documentation for semantic loss

Source semantic commitment
Target representation
Transformation rule
Semantic justification
Known loss or approximation
Responsible authority
Version and release status
Validation or conformance result

This allows downstream products to remain usable without becoming independent authorities over meaning.

Traceability is the control point

The central control point is traceability.

A platform object should trace back to the semantic commitment it implements. A workflow rule should trace back to the meaning it depends on. A dashboard metric should trace back to the definition it summarizes. An AI output used for serious decisions should trace back to governed concepts, sources, constraints, and provenance.

Traceability keeps simplification honest

Platform object → semantic class

Workflow rule → governed constraint

Dashboard metric → approved definition

API field → stable identifier

Graph edge → governed relation

AI output → semantic grounding

Without traceability, a derived product becomes a local interpretation.

With traceability, it remains an implementation of a governed meaning.

The failure mode: silent divergence

Silent divergence is dangerous because systems can continue to look correct at the interface while their meanings drift underneath.

The labels still display. The dashboards still render. The queries still return results. The workflows still execute. The AI system still produces fluent responses.

But the underlying representation may no longer preserve the authoritative semantic commitments required for validation, inference, migration, reuse, or independent implementation.

The interface can look right while the semantics are wrong.

Silent divergence occurs when implementation behavior continues to appear operationally useful even though the system has altered the meaning it was supposed to preserve.

This creates a special kind of technical debt.

Over time, workflows, queries, dashboards, alerts, AI tools, reports, and user practices begin to depend on the local meaning. The cost of changing the meaning may eventually exceed the cost of changing the software.

That is semantic debt.

The acceptance rule

Some semantic loss is unavoidable.

A mission application does not need to expose every axiom. A dashboard does not need to display every provenance record. A data product may be intentionally scoped. A graph projection may be optimized for a specific query pattern. An AI pipeline may use a simplified representation for retrieval.

But any loss that matters should be visible, governed, and accepted.

Acceptance rule

Semantic loss may be acceptable when it is documented, justified, mitigated, and formally accepted. Semantic divergence should not be accepted unless the authoritative model itself is changed or a scoped variation is explicitly approved.

This rule gives teams room to build.

It also prevents local implementation choices from becoming hidden semantic authorities.

Why this matters for AI

AI makes non-divergence more urgent.

AI systems often consume derived products: vectorized documents, extracted entities, graph slices, metadata records, summaries, prompts, embeddings, and retrieval indexes. These artifacts are almost always lossy.

That is not automatically a problem.

The problem begins when lossy AI-ready artifacts become semantically divergent — when they redefine terms, collapse distinctions, drop provenance, blur assertion status, or treat similarity as meaning.

AI-ready does not mean meaning-preserving

Embeddings, prompts, summaries, retrieved passages, and generated answers can support semantic workflows. They do not replace authoritative definitions, identifiers, relations, constraints, provenance, validation, or governance.

If the AI system acts on governed meaning, recommends decisions based on governed meaning, or explains results using governed meaning, then its artifacts need traceability back to the semantic model.

The rule remains the same: