Skip to content

SINTEF Blog Gå til forsiden

  • Energy
  • Ocean
  • Digital
  • Health
  • Industry
  • Climate and environment
  • Building
  • Society
  • EN
  • NO
Digital

Predicting molecular properties with Graph Neural Networks: From smells to public health

Imagine being able to predict the scent of a molecule before it's even synthesized or assessing its toxicity without lab experiments.

Model of the DDT molecule, as found in the crystal structure. Illustration: Joel Bjervig / SINTEF
Authors
Alexander Johannes Stasik
Research Scientist
Joel Bjervig
Master of Science
Published: 10. Feb 2025 | Last edited: 2. Apr 2025
2 min. reading
Comments (0)

At SINTEF, the Analytics & Artificial intelligence group is collaborating with the Biotechnolocy & Nanomedicine group to work towards this goal, using laboratory experiments, and AI models called Graph Neural Networks (GNNs).

Figure 1: From molecular graph to prediction

Fragrant, toxic, and anti-viral chemicals

Our work focuses on predicting molecular properties using GNNs. By training these networks on datasets like the Leffingwell Odor Dataset and the Tox21 dataset, we’re exploring new ways to understand molceules via a data-driven approach.

  • The Leffingwell Odor dataset allows us to develop models that can “smell” molecules, predicting their olfactory properties.
  • The Tox21 dataset enables us to predict toxicity, such as endocrine-disrupting chemicals that interfere with hormonal systems and impact reproductive health.

Accelerating drug discovery and saving resources

Beyond predicting smells and toxicity, this research uses GNNs to predict molecular properties, accelerating drug discovery and reducing the reliance on lab experiments, that can be expensive, slow, and in the case of animal studies, also unethical.

Figure 2: Hybrid AI workflow, where the trained model is cyclically validated and fine-tuned through lab experiments, until molecular candidates are identified for production.

Our approach involves a cyclical process:

  1. Train a model on existing (offline) data.
  2. Predict properties of new molecules using the trained model.
  3. Send the best candidates for validation in the wet lab.
  4. Update the dataset and refine the model based on lab results.
  5. Repeat until satisfactory performance is achieved, leading to faster product development and drug discovery.

Preliminary results and future directions

We’ve already seen promising results on data coming from SINTEFs own laboratories, currently reducing screening time by 50%, and aim to increase this to 90%. In addition to GNNs, we plan to explore the use of sequence-based models in the future. These models has achieved significant breakthroughs in natural language processing, and offer an alternative approach by treating the molecular structure as a sequence of letters that can be translated into properties. We are actively seeking partners with interesting data and problems to further expand the applications of our models.

By mimicking the lab with AI, we can reduce the amount of resources needed. This approach has the potential to improve how we discover new drugs and assess chemical safety.

Comments

No comments yet. Be the first to comment!

Leave a comment Cancel reply

Your email address will not be published. Required fields are marked *

More about Digital

Quantum communication: What it is and why it matters now

Author Image
Author Image
2 forfattere

How acoustic signals can help to detect failure in laser welding and 3D printing

Author Image
Author Image
Author Image
Author Image
Author Image
Author Image
Author Image
7 forfattere
AI-generated image by Adobe Firefly. Prompt: CGI of a humanoid robot with an overheating brain, drinking a bottle of water.

Cooling AI: How Phase Change Materials Make a Difference

Author Image
Author Image
2 forfattere

Technology for a better society

  • About this blog
  • How to write a science blog
  • Topics and collections
  • Sign up for our newsletter
  • News from NTNU and SINTEF
  • Facebook
Gå til SINTEF.no
SINTEF logo
© 2025 SINTEF Foundation
Privacy Editorial Press contacts Website by Headspin