Preventative medicine: understanding the cause of gum disease with AI
STFC Hartree® Centre and IBM collaborated with Salient Bio to develop a machine learning-enabled pipeline to categorise the oral microbiome and to further our understanding of gum disease through the Hartree National Centre of Digital Innovation (HNCDI).

Challenge
Gum disease (periodontitis) is one of the most widespread chronic conditions worldwide, but understanding its molecular mechanisms remains a challenge. The disease is driven not by any single ‘bad bacteria’ but by a breakdown in the balance of the entire oral ecosystem. Most existing research tools used to understand this condition lack predictive power; they take the approach of checking whether specific harmful bacteria are present or absent. This overlooks the interactions between bacteria and how these dynamics impact the oral microbiome. There is a clear need for diagnostic approaches beyond traditional techniques which consider the interactions, dependencies and competitive relationships among bacteria that collectively tip the balance towards health or disease.
Approach
Researchers analysed genetic sequencing data from the oral microbiomes of around 600 individuals, collected by Salient Bio and their dental partner. The data collection also involved extensive patient metadata, which captured lifestyle factors. This enabled analysis of the relationship between an unhealthy microbiome and an individual’s behaviour and environment. Inferring co-occurrence network maps from this data revealed disease-specific community modules; tightly connected clusters of bacteria that behave as functional units and are distinctive to either healthy or diseased states. By combining network analysis with explainable machine learning, the researchers distilled the complexity of the full microbiome down to a focused panel of just 13 key bacterial species. This compact set achieved strong diagnostic accuracy, comparable to models using hundreds of features.
Benefits
The accurate and deep biological insight uncovered with this approach reveals the ecological mechanisms underpinning the cause of gum disease. By being able to identify at-risk individuals before clinical symptoms become severe, novel, targeted therapies could be used to preventatively treat individuals, reducing overall cases of gum disease and reducing our reliance on broad-spectrum antibiotics. This framework is not just limited to gum disease, it establishes a reusable blueprint for microbiome-based diagnostics and therefore could be used to support research that deepens our understanding of human microbiomes and contributes to the development of novel therapies.
“Collaborating with the Hartree Centre and IBM gave us access to ML expertise and analytical frameworks that truly accelerated what we could achieve as a team. We went from oral sample collection through metagenomic sequencing to a validated 13-species diagnostic signature, that wouldn’t have been possible without that depth of analytical expertise”
Tom Sewell, Lead Bioinformatician, Salient Bio
Join Newsletter
Provide your details to receive regular updates from the STFC Hartree Centre.