Welcome to Scientific Computing using Python - 2. High Performance Computing in Python


Description: Many research projects involve scientific computing for analyzing [big] data and/or simulating complex systems. This makes it necessary have a systematic approach to obtaining well-tested and documented code. Further, we see an increased interest in reproducible research, which allows other researchers the opportunity to dig further into research results as well as easy access to results and improving productivity by reusing code and software.

This is a course in scientific computing using the increasingly popular programming language Python. Python is gaining popularity in science due to a number of advantages such as having a rich set of libraries for computing and data visualization, excellent performance-optimizing possibilities, standard tools for simple parallel computing, fast development cycle and high productivity – just to name a few. Python is open source and as such an asset for any researcher following the reproducible research paradigm.

This part of the course covers the main area: High performance computing.

High performance computing:

  1. High-performance computing and computer architectures
  2. Performance optimization
    1. Cython (compiled Python via C-extensions)
    2. Numba (just in time compilation)
    3. f2py (inclusion of Fortran code in Python)
  3. Parallel/distributed computing
    1. Theoretical aspects (Amdahl's and Gustafson-Barsis' law)
    2. Parallel computingon one computer
    3. Distributed computing across multiple computers

Audience: The targeted audience is mainly engineers or similar with an interest in developing robust, portable and high quality code for various scientific computing purposes. By this we mean code to solve actual problems where [lots of] floating-point computations are needed.

Prerequisites: Participants must have some experience in programming Python. If not, there is an introductory course "Scientific Computing Using Python - 1. Python + Scientific Computing". Further, some basic skills in general use of a computer are expected. The tools applied work best using Linux or Mac OSX – Microsoft Windows may experience challenges when using parallel computing.

Criteria for assessment: A standard mini-project must be delivered (4-8 pages description) in addition to the developed code. The code must include testing/validation, and performance evaluation of parallel computing. An acceptable mini-project and at-least 75% participation is required to pass the course.

Learning objectives: After completing the course the participants will:

  1. Know how to use methods and software for performance optimization.
  2. know when and how to apply parallel computing for scientific computing.



Organizer and lecturer: Associate Professor Thomas Arildsen, e-mail: tha@es.aau.dk, Department of Electronic Systems 

ECTS: 2

Time: 29-30 May 2018 from 8:30 to 15:30

Place:  Niels Jernes Vej 14, room Njv 14/3-119

Zip code:
9220

City:
Aalborg Øst

Number of seats: 25

Deadline: 8 May 2018


Important information concerning PhD courses We have over some time experienced problems with no-show for both project and general courses. It has now reached a point where we are forced to take action. Therefore, the Doctoral School has decided to introduce a no-show fee of DKK 5,000 for each course where the student does not show up. Cancellations are accepted no later than 2 weeks before start of the course. Registered illness is of course an acceptable reason for not showing up on those days. Furthermore, all courses open for registration approximately three months before start. This can hopefully also provide new students a chance to register for courses during the year. We look forward to your registrations.