Welcome to Scientific Computing using Python – 1. Python + Scientific Computing (2021)

(This course is fully booked! Please contact vkns@adm.aau.dk to be placed on the waiting list)

Description:

Many research projects involve scientific computing for analyzing [big] data and/or simulating complex systems. This makes it necessary to have a systematic approach to obtaining well-tested and documented code. Further, we see an increased interest in reproducible research, which allows other researchers the opportunity to dig further into others' research results as well as easy access to results and improved productivity by reusing code and software.

This is an introductory course in scientific computing using the increasingly popular programming language Python. Python is gaining popularity in science due to a number of advantages such as having a rich set of libraries for computing and data visualization, excellent performance-optimizing possibilities, standard tools for simple parallel computing, fast development cycle and high productivity - just to name a few. Python is open source and as such an asset for any researcher following the reproducible research paradigm.

The course covers two main areas:

  1. The Python programming language itself and
  2. various aspects of scientific computing.

This specific course content is as follows:

The Python language

  1. Course introduction
    1. Historical overview of scientific computing and high performance computing
  2. Python development environment
    1. Python from above
    2. Data types, built-in functions
    3. Branching and looping
    4. Functions (definition, built-in, lambda)
    5. Modules and packages
  3. Debugging and testing
    1. Pytest
    2. Doctest
    3. Pdb (breakpoints and post-mortem debugging)
  4. Basic scientific computing packages
    1. NumPy (numerical computing - array-based - vectorization)
    2. SciPy (various tools for integration, optimization, etc.)
    3. Matplotlib (data visualization) + other visualisation packages
    4. H5py (data storage/access via HDF)
    5. Documentation using Sphinx

Scientific computing

  1. Basic issues related to computational sciences such as
    1. Floating-point representation
    2. Numerical accuracy and condition number
    3. Cancellation
    4. Algorithmic complexity
  2. Scientific software development
    1. Version control (via git)
    2. Code documentation
    3. Test procedures (what to test - and how)
    4. Code refactoring

 

Audience: The targeted audience is mainly engineers or similar with an interest in developing robust, portable and high-quality code for various scientific computing purposes. By this we mean code to solve actual problems where [lots of] floating-point computations are needed. It is not a course in object-oriented programming and we apply a procedural approach to programming in the course.

Prerequisites: Participants must have some basic experience in code development in e.g. MATLAB, C or FORTRAN. Further, some basic skills in general use of a computer are expected. The tools applied work best using Linux or Mac OSX - Microsoft Windows may experience challenges if using parallel computing (but this should not be a problem for this part of the course).

Learning objectives: After completing the course the participants will:

  1. have fundamental knowledge of important aspects of scientific computing
  2. be able to map a mathematically formulated algorithm to Python code
  3. know how to document, debug and test the developed code.
  4. know when and how to optimize Python code

Teaching methods: A combination of lectures, demonstrating examples using Jupyter notebooks, smaller exercises, and a mini-project is used to facilitate learning. The course is rich in examples and active user participation is expected - the topics covered demand a “learning by doing” approach.

Criteria for assessment: A standard mini-project must be delivered (4-8 pages) in addition to the developed code. The code must include testing/validation, and performance evaluation. An acceptable mini-project and at-least 75% participation is required to pass the course.

Key literature: We expect to use a combination of the following:

  1. Selection of a few chapters in Python books (specified at a later stage)
  2. References to Python and all relevant packages (freely available via http://python.org)
  3. A number of scientific papers relevant for specific parts of the course.

Organizer: Thomas Arildsen - tari@its.aau.dk

Lecturers:

ECTS: 2,5

Time: 14-16 June 2021

Place: Aalborg University

Zip code: 9220

City: Aalborg

Number of seats: 46 (This course is fully booked! Please contact vkns@adm.aau.dk to be placed on the waiting list)

Deadline: 5 May 2021

Important information concerning PhD courses: We have over some time experienced problems with no-show for both project and general courses. It has now reached a point where we are forced to take action. Therefore, the Doctoral School has decided to introduce a no-show fee of DKK 3.000 for each course where the student does not show up. Cancellations are accepted no later than 2 weeks before start of the course. Registered illness is of course an acceptable reason for not showing up on those days. Furthermore, all courses open for registration approximately four months before start. This can hopefully also provide new students a chance to register for courses during the year. We look forward to your registrations.