Training course "Data analysis and data mining with Python"
(Course no. 92/2015 in the training programme of Forschungszentrum Jülich)
This course is fully booked.
Target audience: | Scientists who want to use Python for data analysis |
Contents: | |
Prerequisites: | Basic Experience with Python such as provided by the course Introduction to Pythonis assumed and required for the hands-on sessions. |
Language: | This course is given in English. |
Duration: | 3 days |
Date: | 9-11 November 2015, 9:00 - 16:30 |
Venue: | Jülich Supercomputing Centre, Ausbildungsraum 1, building 16.3, room 213a |
Number of participants: | minimum 5 |
Instructors: | Dr. Jan Meinke, Dr. Olav Zimmermann, JSC |
Contact: | Dr. Jan Meinke Phone: +49 2461 61-2315 E-mail: j.meinke@fz-juelich.de |
Registration: | The course is fully booked. |
Pandas, matplotlib, and scikit-learn make Python a powerful tool for data analysis, data mining, and visualization. All of these packages and many more can be combined with IPython to provide an interactive extensible environment.
In this course, we will explore matplotlib for visualization, pandas for time series analysis, and scikit-learn for data mining. We will use IPython to show how these and other tools can be used to facilitate interactive data analysis and exploration.
Day 1: Basic data analysis and visualization
- Introduction to IPython for interactive data analysis.
- pandas
- NumPy
- matplotlib
Day 2: Advanced data analysis visualization
- pandas
- Statsmodels
- Mayavi2
Day 3: Advanced topics
- Portable data formats
- scikit-learn
- PyMAFIA
This course is aimed at scientists who wish to explore the productivity gains made possible by Python for data analysis.