This course gives an overview of statistical methods used for analyzing high-dimensional data sets, in which many variables (often thousands) have been measured for a limited number of subjects. This type of data arises in genomics, where genetic information is measured for many thousands of genes simultaneously, but also in functional MRI of the brain. The course covers the most important statistical issues in this field, which include: a) initial processing of the data; b) model-based differential expression analysis for Gaussian and count data (classical and Bayesian methods); c) multiple testing (family-wise error rate and false discovery rate control); d) penalized regression (lasso and ridge); and e) shrinkage. Several specific types of high-dimensional data will be discussed and used during the course. Philosophy: to teach students the adjustments to classical statistical methodology that are necessary to tackle high-dimensional data.
Students should be able to perform and understand the most common types of analysis on several types of high-dimensional data, and be familiar with the specific issues that arise in important types of high-dimensional data sets.
Mode of Instruction
The course consists of a series of lectures and practicals (partly computer practicals, partly exercises).
See the Leiden University students' website for the Statistical Science programme -> Schedules 2018-2019
Grading will be as follows:
80% exam grade
20% homework assignments
A grade of at least 5.5 is required for both parts.
Literature will be specified during the course; no books are required.
Enroll in Blackboard for the course materials and course updates.
To obtain a grade and the EC for the course, sign up for the (re-)exam in uSis ten calendar days before the (re-)exam takes place. Note that students are expected to participate actively in all activities of the programme and therefore to register for and use the first exam opportunity.
Exchange and Study Abroad students, please see the Prospective students website for information on how to apply.
mark[dot]vdwiel[at]vumc[dot]nl and w[dot]n[dot]van[dot]wieringen[at]vu[dot]nl
- This is a compulsory course of the Master Statistical Science for the Life and Behavioural sciences / Data Science.