In progress
KG4J — Knowledge Graphs for Justice
Research project dedicated to the use of knowledge graphs in the field of justice.
Period : 2025 – 2029
Funder : BELSPO
Role : Principal Investigator
Budget : 392,798.60 €
KG4J is a project submitted in 2025 as part of the P4Science call of the Belgian Federal Science Policy (BELSPO). I am the principal investigator and the author of the proposal.
The project aims to explore the contribution of knowledge graphs for data integration at the National Institute of Criminalistics and Criminology (NICC), and to improve analytical capabilities.
The consortium brings together the following partners:
- Department of Computer Science, UMONS, Belgium
- Departement Computerwetenschappen, KU Leuven, Belgium
- Louvain Research Institute in Management and Organizations, UCLouvain, Belgium
- Department of Mathematics and Industrial Engineering, Polytechnique Montréal, Canada
- Department of Linguistics, Indiana University Bloomington, USA
- Digital Transformation Office, Federal Public Service Justice, Federal Public Service Justice, Belgium
- Elephant Bird Consulting, Elephant Bird Consulting, Belgium
Official project summary:
The National Institute of Criminalistics and Criminology (NICC) plays a scientific role in the Belgian criminal justice system. On one hand, it provides forensic expertise in areas such as DNA, toxicology, and drugs. On the other hand, it conducts criminological research, for example on recidivism and criminal careers. To do this, it processes a large amount of data. However, much of this data is fragmented or underutilized. Valuable information remains hidden.
Proposal
The Knowledge Graphs for Justice (KG4J) project aims to solve this problem. How? By building and exploiting data analysis systems based on graph theory, knowledge graphs, and artificial intelligence (AI), including large language models (LLMs). And by defining a data and AI governance policy.
What is graph theory?
Graphs are a natural way to represent connected data. Entities become nodes linked by relations. Imagine a subway map: Each station is a node, each line a relation. What is the shortest distance between two stations? Which station is the most connected? Graph theory answers these types of questions.
What is a knowledge graph?
The addition of semantic information — i.e., definitions of meaning — transforms a graph into a knowledge graph. They become machine-readable, interpretable, and searchable. Modern AI systems can then exploit them to answer questions.
What is a large language model?
An LLM is the engine of modern conversational agents. It has been trained to predict human speech from large amounts of text. It can dialogue with us and help accomplish other tasks, such as programming.
What we will do with this technology
The KG4J project will apply graph theory, knowledge graphs, and LLMs to selected use cases, in order to reveal the value of existing data and provide new capabilities.
Criminological research
The NICC has already used a graph to examine recidivism and criminal careers from disconnected data sources. The goal is now to process this graph to discover new patterns. And to transform it into a knowledge graph enriched by AI. This will allow advanced queries and simpler exploration of criminal trajectories.
Forensic expertise
The NICC generally processes forensic data tactically, without subsequent analysis linking the elements together. For example, DNA databases link DNA profiles between cases, and then stop there. Transforming this data into a graph would reveal co-offending patterns. And discover criminal networks. Linking evidence supports both ongoing investigations and long-term criminal policy.
Unstructured data
Transcripts of hearings, expert notes, recordings: the amount of unstructured data is immense. Thanks to AI — and particularly LLMs — it becomes possible to convert them into a knowledge graph. This graph can then answer questions about these contents.
Objectives
-
Build a knowledge graph infrastructure (KGI) integrating criminological and forensic data, searchable via an intuitive interface.
-
Demonstrate use cases: recidivism statistics, discovery of new patterns, and forensic applications.
-
Develop methods for processing unstructured data: extraction of graphs from text, discourse analysis, and reconstruction of criminal trajectories, with conversational capabilities of LLMs.
-
Define a data and AI governance model consistent with institutional values and legal requirements.
Conclusion
The KG4J project will enable the NICC to produce enhanced criminological and forensic intelligence. It links fragmented data, values existing data, develops new capabilities via graphs and AI, and establishes reliable practices in data and AI for justice.
✦