The Ontology issue: From knowledge to graphs and back again. The Year of the Graph Newsletter Vol. 29, Winter 2025 – 2026

The Ontology issue: From knowledge to graphs and back again. The Year of the Graph Newsletter Vol. 29, Winter 2025 – 2026

Enterprise and data architects, data modelers, GenAI adopters, analysts, thought leaders, Graph RAG application builders, Microsoft, Palantir – everyone is talking about ontologies. Why, what is an ontology actually, and how is it related to graphs?

By George Anadiotis

An ontology is an explicit specification of a conceptualization which is, in turn, the objects, concepts, and other entities that are presumed to exist in some area of interest and the relationships that hold among them.

Ontology introduces the semantic foundation that connects people, processes, systems, actions, rules and data into a unified ontology [sic]. By binding real-world data to these ontologies, raw tables and events are elevated into rich business entities and relationships, giving people and AI a higher-level, structured view of the business to think, reason, and act with confidence.

The first paragraph is taken from the entry for Ontology in the Encyclopedia of Database Systems, written by Tom Gruber in 2009. Gruber is credited with the most widely cited definition of ontology in the context of computer and information sciences, introduced circa 1993.

The second paragraph is taken from the blog post introducing Fabric IQ, a new semantic foundation within Microsoft Fabric. It was written by Chafia Aouissi in November 2025. Aouissi is Sr. Director Product Management Fabric IQ at Microsoft.

As usually happens when terms become part of the mainstream discourse, there is a certain tension between rigor and adoption. Case in point – not just Microsoft Fabric IQ, but also Palantir.

Palantir has been floating the term “ontology” as the cornerstone of its architecture for a while. While this has helped popularize the term, as Juan Sequeda notes, at the same time it has also ignited confusion as to what ontology really is.

When Palantir’s CEO, Alex Karp, lashed out at Michael Burry – “Big Short” investor who bet against Palantir and Nvidia – he wasn’t just defending his balance sheet, but  also an idea. In Karp’s words, ontology now sits beside chips as the engine of AI. But if we take ontology seriously, Burry’s short might not be wrong, J Bittner notes.

In this issue of the Year of the Graph, we identify ontology and knowledge graph definitions, applications, tools, and educational resources.


This issue of the Year of the Graph is brought to you by metaphacts, Process Tempo, Linkurious, RDFox, Tentris, Connected Data London and State of the Graph.

If you want to be featured in an upcoming issue and support this work, reach out!


Neuro-symbolic AI: The key to truly intelligent systems

Just as you wouldn’t bring half your brain to work, enterprises shouldn’t bring half of artificial intelligence’s capabilities to their architectures. Neuro-symbolic AI combines neural-network technology like LLMs with symbolic technology like knowledge graphs. This integration, also known as ‘knowledge-driven AI’, delivers significant advantages:
● Trustworthy & explainable insights grounded in explicit facts
● Reliable & transparent AI agents
● Grounded LLMs that can assist in complex modeling

If you’re not exploring how knowledge graphs and symbolic AI can augment your
organization’s intelligence—both artificial and actual—now is a good time to start.

Read the full article.


The “O” word

If you talk to people working with data, AI, or enterprise architecture and ask, “what is an ontology?”, you’ll get different answers. For some, ontology is a kind of clever data schema. For others, it’s a business glossary. For others still, the heart of a knowledge graph. They’re all right, and that’s the problem as per Juha-Pekka Joutsenlahti.

In “Demystifying ontologies“, Joutsenlahti gives a brief history of the concept of ontologies in IT and knowledge representation. He explains that different communities adopted “ontology” and bent it slightly towards their own needs, resulting in confusion.

The key to reducing the confusion is to always ask: What is this ontology for? Is it meant to clarify meaning or to define data structure (or both)? Once we make that distinction explicit, much of the mystery starts to disappear.

The O word: do you really need an ontology?” was published in 2019. Before GenAI was a thing, Mark Hall made a compelling case for ontologies and offered an explanation as to why isn’t everyone doing this.

Today, as Ole Olesen-Bagneux notes, ontologies are once again hot because they are key to succeeding with AI: ontologies provide context for AI to perform better. Thus, we are seeing the re-introduction of knowledge engineering as if it were new.

How Data Models and Ontologies Connect to Build Semantic Foundations

Knowledge Management and the Library Sciences, from which taxonomies, ontologies, and knowledge graphs were born, are well-established disciplines, as Juha Korpela notes in “How Data Models and Ontologies Connect to Build Your Semantic Foundations“.

Korpela points out that people who have traditionally worked with ontologies and knowledge graphs have not been communicating much with domains such as data modeling, but the exchange would be meaningful.

Data modelers focus on the technical implementation of data solutions, thus following a path from Conceptual to Logical to Physical modeling. Even if concept models and ontologies are different, as Jessica Talisman notes, there is overlap. Conceptual modeling may be used to build an ontological foundation that acts as the context provider for agents and chatbots as well as humans.


Bring your graph ideas to life!

Building a knowledge graph? You’re going to want people to use it. We can help!

Process Tempo Jupiter makes it easy to build beautiful, production-ready applications on top of your graph without writing code. Now you can bring your graph ideas to life using skills you already have.

Jupiter supports multi-modal, read and write, between graph databases, relational databases and Rest-based APIs. No ETL required!

✅Neo4j, ✅Memgraph, ✅PuppyGraph, ✅OpenCypher

✅Databricks, ✅Snowflake, ✅Big Query, ✅MySQL, ✅Postgres, ✅ Dremio

Ready to get started? Get on our calendar to learn more: calendly.com/processtempo


From knowledge to graphs and back again

Another term that is overloaded, being used in different contexts to mean different things, is “knowledge graph”. For an elaboration of the definition of a knowledge graph, the connection to ontology, and how they become the essential truth layer for Pragmatic AI, check this conversation between Tony Seale and George Anadiotis.

Like ontologies, what has largely contributed both to popularizing and creating confusion around knowledge graphs is their use for GenAI, specifically to support LLMs in Graph RAG. As Fanghua (Joshua) Yu notes, even when talking about knowledge graphs in Graph RAG use cases, there are different KG types to consider.

But is using knowledge graphs with GenAI always a good idea? The complexity may not always be worth it, Dave Bechberger argues. You may not always need Graph RAG. While graph-based enhancements improve information organization & reasoning, they also come with a heavy token cost.

Graph RAG is a data engineering challenge, not just an LLM trick, Fanghua (Joshua) Yu notes. Plus, Graph RAG does not always outperform “vanilla” RAG. If you are going to be using Graph RAG, however, there are some useful frameworks, applications and analyses to consider.


The shortest path between you and graph insights

Graph visualization and analytics just got a lot easier!
Introducing Linkurious Enterprise Cloud: An online solution that brings powerful graph exploration to anyone, right from a browser.

Create an account, connect your graph database, start the exploration of your data, collaborate with your teammates and share your results, all before lunch.

What else?
• Compatibility with leading graph databases
• Zero IT bottlenecks or infrastructure tasks
• Flexible plans that adapt to your needs

The fastest way to start a graph project today — and the easiest way to scale it tomorrow.

👉 Sign up now for a 30-day free trial.


Knowledge graph applications at scale

Today, knowledge graph applications in production are scaling up. Walmart uses its People.AI knowledge graph to power recommendation tools. This knowledge graph includes 1.6 million nodes and 83 million edges, representing entities such as job titles, openings, associates, applicants, and skills.

Barclays is scaling knowledge graphs and knowledge-based AI in the financial crime and KYC domain. They are using semantic reasoning to automatically enrich knowledge graphs with new facts based on expert knowledge to deliver improved operational efficiency, reduce compliance risk and improve customer experience.

GitLab is building what they call the GitLab Knowledge Graph. It’s a framework written in Rust to turn your codebase into a live, embeddable graph database that can be used for code retrieval and navigation, impact analysis and architecture visualization.

RTVE, the Spanish national radio and television public broadcaster, has created a knowledge graph that structures information by understanding the relationships among various audiovisual content as well as any potentially linked objects. It unifies data, making it accessible and comprehensible for both machines and people.

Wiz, the cybersecurity phenomenon recently acquired by Google for $32 billion, has redefined cloud security by providing comprehensive visibility across multi-cloud environments. Their success is based on a massive, a constantly evolving security graph that spans customers’ entire cloud footprints.

Microsoft introduced the Sentinel graph to bring relationship-aware context to Microsoft Security across Defender and Purview. This way defenders and AI can see connections, understand the impact of a potential compromise, and act faster across pre-breach and post-breach scenarios.

Graph-Wide scanning is a technique for solving advanced cyber threats. In one example, a query on a 150 billion-edge graph scanned a mind-boggling 123 trillion edges. It took time, but it found fewer than 4,000 answers, highlighting the power of finding the critical few from the overwhelming many.


Ontology and knowledge graph insights, tools and education

A key finding of the State of the Graph survey for 2025 is that knowledge graphs and graph databases are driving adoption, but guidance and training are still critical. Here are some pointers to knowledge graph and ontology educational resources.

Connected Data London 2025 was a gathering of top minds and practitioners in this space, featuring use cases, innovation and educational content from the likes of Airbus, AstraZeneca, AWS, Barclays, Bloomberg, Netflix, Nvidia, SAP, ServiceNow, S&P, Vodafone and more. You can follow CDL for trip reports and teasers, or dive into the content with a Remote Access Ticket and a special 15% discount.

metaphacts recently published their Semantic Modeling Guidelines for Knowledge Engineers. These semantic modeling guidelines are designed for beginners as well as advanced modelers, offering a step-by-step introduction to semantic modeling concepts, key elements and practical techniques.

Kurt Cagle shares tips for building knowledge graphs, noting that the hard part of building a knowledge graph is not the technical aspects, but identifying the types of things that are connected, acquiring good sources for them, and figuring out how they relate to one another.

It is better to create your own knowledge graph ontology, possibly building on existing upper ontologies, than it is to try to shoehorn your knowledge graph into an ontology that wasn’t designed with your needs in mind.

In “Becoming an Ontologist“, Cagle notes there is a surge in interest in the profession of ontologist. Some of it can be attributed to the fact that people are beginning to realize the value of knowledge graphs, but there is also the opportunistic element here. Like many other fields in the past, ontology work is seen as a ticket to big money. But perceptions and reality are not necessarily aligned.

Dean Allemang shares his insights on a day in the life of a working ontologist. Building ontologies is actually the last thing on the list, as there isn’t much spent on that compared to other tasks. Allemang notes that “ontologist” is going to be a much more sought after skill in the near future.

For more in-depth education:

  • Pragmatic AI Training: A holistic AI education program, including modules on knowledge graphs, Graph RAG and ontology design
  • The Knowledge Graph Academy: learn how to build and scale knowledge graphs through a unique program led by global experts

Do you need a graph technology that scales without sacrificing performance?

RDFox® is one of the most advanced RDF triple stores on the market. Built on world-leading research from the University of Oxford, RDFox® stands out through:

● Highly performant, scalable in-memory architecture
● Advanced rule-based reasoning for dynamic graph enrichment
● An optimised memory footprint for on device, server, or cloud

RDFox® is trusted by a growing list of the world’s largest brands across financial services, manufacturing, automotive, retail, and big tech.

Powering what is possibly the world’s largest graph deployment across tens of millions of Samsung smartphones – see what all the fuss is about here.


Two meanings of “Semantic Layer” and why both matter for AI

Another concept everyone is talking about, but not necessarily meaning the same thing, is semantic layers. On the analytics side, semantic layer usually refers to something we’ve seen before. It’s the business-friendly model that sits between raw data and BI tools. A governed way to define joins, metrics, and consistent logic.

This version of semantics is about trust. It makes sure when someone (or something) asks, “What’s our total revenue last month?” the answer is accurate, governed, and consistent.

Case in point: the Open Semantic Interchange (OSI). Snowflake, Salesforce, dbt Labs and other vendors announced they are working on what they claim will be a universal standard for how business data is defined and shared across platforms.

But there’s another camp using the same phrase and they’ve been doing it far longer. For ontologists, RDF, OWL and JSON-LD are open standards for exchanging semantic data. To them, a semantic layer isn’t a metrics model, it’s a knowledge model. It’s about representing meaning, relationships, and context across systems.

This version of semantics is about understanding. It connects definitions and relationships, providing the context AI uses to make sense of information.

Case in point: Semantic Layers and GraphRAG are essential for Trustworthy AI, Andreas Blumauer points out. Agents need a semantic layer, Anthony Alcaraz chimes in.


Query more, wait less. Try Tentris today

Tentris is a new disk-based RDF graph database that delivers blazing fast querying performance with highly efficient memory usage, turning analytics that once took days into results within minutes or seconds. Built for real time workloads and complex data, it solves tasks others cannot.

Start exploring Tentris today!


Subscribe to the Year of the Graph Newsletter

Keeping track of all things Graph Year over Year


Graph databases: growing market, competition & options

Moving to graph database updates, let’s open with another piece of news from Microsoft. In Microsoft’s Q1 2026 earnings call, it was reported that Cosmos DB business grew over 50% YoY. Cosmos DB is a multi-model database, and we don’t know the extent to which graph contributed to its growth.

However, this signal is one of many pointing towards growth for graph databases. According to Fortune Business Insights, the global graph database market size is projected to grow from $2.85 billion in 2025 to $15.32 billion by 2032, exhibiting a CAGR of 27.1%.

Microsoft, AWS, Google, and Oracle were named as leaders in the 2025 Gartner® Magic Quadrant for Cloud Database Management Systems. All superscalers have a graph database offering with Cosmos DB, Amazon Neptune, Google Spanner Graph, and Oracle Graph, respectively.

LadybugDB and RyuGraph are new forks of the KuzuDB codebase aiming to pick up where KuzuDB left off. Furthermore, GraphLite emerged as a new open source graph database for embedded processes, and FalkorDB introduced FalkorDBLite – both aiming to fill in the embedded graph database void left by KuzuDB’s departure.

Challengers such as QLeverRushDBTuringDBTypeDB, and DuckDB with its DuckPGQ extension are emerging. It seems like the graph database market pie is growing, and the competition for a piece of it is intensifying.

Finally, another signal pointing in the direction of growth for the graph market: Linkurious, who just published the 2026 update of their Graph technology landscape, are getting acquired by Nuix in a €20M deal.


New tools and research

Wrapping up with a roundup of new tools and research. Focusing on graph research and innovation in NeurIPS, one of the leading AI conferences, shows that graph is a significant part of NeurIPS 2025, supporting its growing importance and market share.

GraphFrames represents the natural evolution of GraphX, the native library for cluster-based graph processing on Apache Spark – one of the most powerful engines for large-scale data processing in distributed computing. GraphFrames has been revived after being dormant for a while, with version 0.10.0 recently released.

Brahmand is an open source Graph Engine built on top of ClickHouse. It extends ClickHouse with graph modeling and OpenCypher, merging OLAP speed with graph analysis.

Gephi Lite v1.0 was recently released. Gephi Lite is a lighter, web-based version of Gephi, used for visual network analysis. Cosmograph, a single-node web-based tool used to visualize graphs, released v.2.0.

Text2KGBench-LettrIA is a refined benchmark for Text2Graph systems, attempting to address the question of whether LLMs are good at populating knowledge graphs based on a set of text documents. RAGE-KG explores the state of the art and beyond in integrating Retrieval-Augmented Generation with Knowledge Graphs as well as the synergies between Large Language Models and the Linked Open Data ecosystem.

KGGen extracts knowledge graphs from plain text with language models. Exploring Network-Knowledge graph duality is a case study in agentic supply chain risk analysis. And GraphPFN is an attempt to create general-purpose graph foundation models from tabular foundation models.

Last but not least: the Connected Data Knowledge Graph v0.1 was released – an open knowledge graph for the community by the community.


Subscribe to the Year of the Graph Newsletter

Keeping track of all things Graph Year over Year


Write a Reply or Comment

Your email address will not be published.