- Introducing R as a statistical programming environment for data analysis.
- Efficient data management and high-level graphics using R
- Statistical modelling in theory and practice using R.
- Learning from large datasets using R.
- Aspects of scientific computing in theory and practice.
- Reproducible research in practice.
- Programming in R.
be able to solve programming tasks with R.
know how to manage data, visualize data and fit models to data using R.
have learned enough about R to continue learning on their own.
- Q: I am from outside AAU and wish to sign up for the course. What do I do?
- A: You click on \"signup\" and fill out the form.
- Q: I can not get the steps above to work. Who can help?
- A: You will have to ask the doctoral school: aauphd@adm.aau.dk
- Q: If I participate in the course, can you then help me analyze a dataset that I work with as part of my ph.d. project.
- A: No, I am afraid that this is not possible
- Q: I am not a ph.d. student, but I would like to participate in the course anyway. Is that possible?
- A: You will have to ask the doctoral school: aauphd@adm.aau.dk
- Q: I realize that I am late for enrollment, but I would really like to participate. Is it possible.
- A: You will have to ask the doctoral school: aauphd@adm.aau.dk
Welcome to Data Science using R (2022)
Description:
Most quantitative research projects involve case-specific programming tasks as well as data analysis of a more standard nature. When working with quantitative data it is essential to be able to do so in a systematic and reproducible manner with trustworthy software. This course, Data Science using R, aims at introducing the free program R as a computational environment for integrating these tasks. R has for two decades been a leading tool for computing with data and it is the preferred language for implementation of new statistical methodology.
The course will cover the practice of scientific computing, programming, and quantitative analyses as well as the essential theoretical underpinnings. Topics of the course will include:
Prerequisites (important):
Participants must have a working knowledge of elementary statistical methods such as regression models and analysis of variance at the level taught e.g. in the "applied statistics" course described at http://asta.math.aau.dk/.
Learning objectives:
After completing the course the participants will:
Teaching methods:
A combination of instructive videos, computer practicals and lectures.
Course organization:
The course gives 4 ECTS credits which means that the workload of the course is about 120 hours. The course is organized in three blocks each consisting of two days. The blocks are two weeks apart. Prior to the first block and between the blocks there will be tasks for participants to work on, either independently or in smaller groups. After the last block, there will be time to work on the major exercise to be handed in.
Criteria for assessment:
Active participation in the practicals + approval of major exercise (to be handed in after the course).
Frequently asked questions:
Organizers: Søren Højsgaard and Ege Rubak
ECTS:
4
Time:
02, 03, 16, 17, 30, 31 March 2023
Place:
Aalborg University: Fredrik Bajers Vej 7C, room 3-204
Zip code:
9220
City:
Aalborg
Number of seats:
40
Deadline:
09 February 2023
Important information concerning PhD courses: We have over some time experienced problems with no-show for both project and general courses. It has now reached a point where we are forced to take action. Therefore, the Doctoral School has decided to introduce a no-show fee of DKK 3.000 for each course where the student does not show up. Cancellations are accepted no later than 2 weeks before start of the course. Registered illness is of course an acceptable reason for not showing up on those days. Furthermore, all courses open for registration approximately three months before start. This can hopefully also provide new students a chance to register for courses during the year. We look forward to your registrations.
- Teacher: Søren Højsgaard
- Teacher: Ege Rubak