Data, modeling, and artificial intelligence – research for tomorrow's bioeconomy

The Institute of Bio- and Geosciences (IBG) generates high-resolution data every day from sensors, sequencing, imaging, and remote sensing. This data is used to study topics such as microorganisms in soils, material flows in landscapes, and plant growth under climate stress. It forms the basis of innovative AI processes, simulation-based models, and data-driven decisions, which greatly benefit science, society, and the bioeconomy. These processes play a central role in enhancing our understanding of biological, environmental, and biotechnological stystems and processes across scales, from molecules to agroecosytems, and in designing them sustainably.

Simulation and data-driven modeling

As part of the Center for Advanced Simulation and Analytics (CASA) at the Jülich Supercomputing Center (JSC), IBG collaborates in the Simulation and Data Lab Digital Bioeconomy (SDL-DBE) to develop new methods for simulating and analyzing biological and biotechnological systems using data. The goal is to leverage high-performance computing, multiscale models, AI-powered simulations, and digital workflows to generate new insights in areas such as plant research, microbial systems, and circular biotechnology. he combination of experimentation, simulation, and AI enables predictive and resilient system design. A central infrastructure in this context is AgraSim, a unique IBG research infrastructure integrating multiscale agricultural system simulation across biological and environmental domains, enabling scenario analysis and data-driven decision-making under changing environmental conditions. Building on this, digital twin approaches such as ReGenFarm dynamically represent agricultural production systems.

Data Acquisition and Infrastructure

At IBG, plant, microbial, and environmental data are collected using advanced technologies such as 3D imaging (GrowScreen), sequencing, environmental sensors, drones, and satellite remote sensing (FLEX). These infrastructures operate across spatial scales, from local sensor networks to continental and global monitoring systems, including water resource assessments. The Helmholtz Sensor Management System (SMS) supports distributed sensor infrastructures, while platforms such as the Helmholtz Earth Portal enable spatiotemporal data visualization. Through integration with national research data infrastructures (NFDI) such as DataPLANT, NFDI4bioimage, NFDI4microbiota, and FAIRagro, IBG ensures FAIR-compliant data management (findable, accessible, interoperable, reusable). In addition, IBG coordinates the national infrastructure de.NBI (German Network for Bioinformatics Infrastructure) and its European branch ELIXIR Germany. These networks provide advanced tools, services, and cloud infrastructures for biological and biotechnological data analysis.

Artificial intelligence

AI-powered data analysis is a central pillar of IBG research. Across domains, AI enables the identification of complex spatiotemporal patterns and functional relationships that are not accessible through conventional methods. Applications include:

  • genome annotation (Helixer)
  • regulatory element prediction (deepCRE)
  • protein and enzyme function prediction (TopEC, OneProt, TopEnzyme)
  • microbiome data interpretation
  • environmental and water system modeling across scales

AI is also increasingly applied in remote sensing and Earth system analysis (e.g. KISTE), supporting integrated environmental monitoring from local to global scales.

Last Modified: 24.03.2026