New Model for District-Based Coronavirus Forecasts
Neuroinformatics scientists at Osnabrück University and data specialists at the Jülich Supercomputing Centre provide daily analyses of infection rates
Jülich / Osnabrück, 23 September 2020 – Neuroinformatics scientists at Osnabrück University and data specialists at Forschungszentrum Jülich are releasing new model results daily to forecast COVID-19 infections. The results include estimates updated daily of the reported new infections and a 5-day forecast for every German district, and are available at https://covid19-bayesian.fz-juelich.de. The predictions are based on data from the Robert Koch Institute, which are statistically analysed using a new model weighted by probability that was developed by Osnabrück’s neuroinformatics scientists on high-performance computers at the Jülich Supercomputing Centre (JSC).
The “COVID-19 Bayesian Modelling for Outbreak Detection”, or BSTI model for short, has two key features that distinguish it from other models: “For one thing, the new method provides a forecast horizon, which allows us to assess the reliability of the predictions. In addition, the model considers the effect of infection rates in neighbouring districts. This allows us to also assess the dynamics of the spread,” explains Prof. Gordon Pipa, head of the neuroinformatics research group on the AI campus at Osnabrück University, which only recently developed the BSTI model in a different context.
One of many challenges presented by breaking down the forecasts into individual districts is the low numbers of cases. “A single forecast trajectory can be misleading, since the reliability of the prediction cannot be assessed,” explains Prof. Pipa. “That is why the BSTI model that we use calculates not just one single likely trajectory, but instead takes many possible trajectories that comply with the data into account. This enables forecast horizons to be calculated as a measure of the distribution of probabilities. This method makes it possible to evaluate the situation including statistical uncertainties, which can provide helpful information even when case numbers are low.”
Moreover, the BSTI model calculates the effect of neighbouring regions. An interaction kernel describes how much a high or low case number in a neighbouring region impacts the infection rate in a district. The neuroinformatics research group in Osnabrück and the Robert Koch Institute successfully used the interaction kernel in 2019 to describe the progression of infections with the rotavirus, as well as Lyme disease and campylobacter bacteria.
Experts from the Jülich Supercomputing Centre (JSC) helped to adapt the method to COVID-19 data and to adjust the code for analysis on Jülich’s supercomputers. “Determining the forecast horizon is extremely compute-intensive, since we deploy many different model variations. Therefore, the statistical modelling requires much more computing time than methods that do not make use of a forecast horizon. In order to make the analyses available on a daily basis immediately after the RKI data are released, we use resources from the Jülich Supercomputing Centre – normal office computers would be completely overwhelmed by the job,” explains Jens Henrik Göbbert of the JSC.
The analyses, which are updated daily, and the option of spatial or temporal visual comparisons are free to access and shown on the website https://covid19-bayesian.fz-juelich.de in a format that is as comprehensible as possible. “We wanted to make the results available quickly and in an understandable format, so that a broad group of people can talk about the content straight away without getting bogged down in the technology,” explains Göbbert.
For example, visitors to the interactive site can freely select districts to view their 5-day forecasts, or they can compare the latest data reported by the Robert Koch Institute with the estimated actual new infections. Because of delays in the data transfer, the reported figures often, and sometimes considerably, differ from the actual number of new cases. Therefore, the “nowcast” has the goal of providing an initial assessment of the current figures using statistical analyses. A forecast, in turn, provides an estimate of the developments over the next five days.
The project team also has an idea for future challenges in pandemic research. Prof. Pipa explains: “Such interdisciplinary teams could be established as competence centres that go beyond the boundaries of institutes, for example to prepare us by simulating crisis situations for a variety of scenarios and the steps that need to be taken in terms of data acquisition and data analysis, in a similar way to procedures concerning disaster prevention.”
Current forecasts of coronavirus infections by district (in German):
Calculations of the reproduction number for German federal states (updated daily, in German)
(This analysis is based on rt.live, a project by Kevin Systrom, Mike Krieger, Thomas Vladek and others. For the application on Germany, Laura Helleckes and Michael Osthege from Forschungszentrum Jülich have extended the model by a prediction of the tests performed daily.)
Prof. Dr. Gordon Pipa, Osnabrück University
Institute of Cognitive Science
Wachsbleiche 27, 49090 Osnabrück, Germany
Tel: +49 541 969 2277
Jens Henrik Göbbert, Forschungszentrum Jülich GmbH
Jülich Supercomputing Centre
Wilhelm-Johnen-Straße, 52425 Jülich, Germany
Tel: +49 2461 61 96498
Tel.: +49 2461 61-4771