PRACE training course "Parallel I/O and Portable Data Formats"
(Course no. 1372018 in the training programme 2018 of Forschungszentrum Jülich)
Target audience: | Supercomputer users |
Contents: | |
Prerequisites: | Experience in parallel programming with MPI, and either C/C++ or Fortran in particular. |
Language: | This course is given in English. |
Duration: | 3 days |
Date: | 12-14 March 2018, 9:00-16:30 |
Venue: | Jülich Supercomputing Centre, Ausbildungsraum 2, building 16.3, room 211 |
Number of participants: | maximum 25 |
Instructors: | Sebastian Lührs, Dr. Michael Stephan, Benedikt Steinbusch, Dr. Kay Thust, JSC |
Contact: | Sebastian Lührs Phone: +49 2461 61-2863 E-mail: s.luehrs@fz-juelich.de |
Registration: | Please register until 7 March 2018 via the form at the PRACE web site
|
Numerical simulations conducted on current high-performance computing (HPC) systems face an ever growing need for scalability. Larger HPC platforms provide opportunities to push the limitations on size and properties of what can be accurately simulated. Therefore, it is needed to process larger data sets, be it reading input data or writing results. Serial approaches on handling I/O in a parallel application will dominate the performance on massively parallel systems, leaving a lot of computing resources idle during those serial application phases.
In addition to the need for parallel I/O, input and output data is often processed on different platforms. Heterogeneity of platforms can impose a high level of maintenance, when different data representations are needed. Portable, selfdescribing data formats such as HDF5 and netCDF are examples of already widely used data formats within certain communities.
This course will start with an introduction to the basics of I/O, including basic I/O-relevant terms, an overview over parallel file systems with a focus on GPFS, and the HPC hardware available at JSC. Different I/O strategies will be presented. The course will introduce the use of the HDF5, the NetCDF (NetCDF4 and PnetCDF) and the SIONlib library interfaces as well as MPI-I/O. Optimization potential and best practices are discussed.
This course is a PRACE training course.