What genes cause cancer ? Have we inherited genes from Neanderthals ? How does a single genome code for the diverse function that we see?
We can now begin to answer these fascinating questions in biology because the cost of genome sequencing has fallen faster than Moore's law. The bottleneck in answering these questions has shifted from data generation to powerful statistical models and inference algorithms that can make sense of this data. Statistical machine learning provides an important toolkit in this endeavor. Further, biological datasets offer new challenges to the field of machine learning.
We will learn about probabilistic models, inference and learning in these models, model assessment, and interpreting the inferences to address the biological questions of interest. The course aims to introduce CS/Statistics students to an important set of problems and Bioinformatics/Human Genetics students to a rich set of tools.
Familiarity with probability, statistics, linear algebra and algorithms is expected. No familiarity with biology is needed.
Instructor: Sriram Sankararaman
Office Hours: 296B Engineering VI, Thursday 11:00 am  noon
Email: sriram at cs dot ucla dot edu
Teaching assistant: Tevfik Dincer
Office hours: BSRB 301 conference room Thursday 2:00  4:00 pm
Email: dincer at g dot ucla dot edu
You are free to discuss the homework problems. However, you must write up your own solutions. You must also acknowledge all collaborators.
The course website is based on material developed by Ameet Talwalkar and Fei Sha. Some of the administrative content on the course website is adapted from material from Jenn Wortman Vaughan, Rich Korf, and Alexander Sherstov.
Date  Topics  Reading  Problem Sets 

10/01  Introduction to genomics  Problem Set 0 released. Due on Oct 8, 2018 at 11:59pm 

10/03  
10/08  
10/10  
10/15  
10/17  
10/22  
10/24  
10/29  
10/31  
11/05  
11/07  
11/12  Veteran's day  
11/14  
11/19  
11/21  
11/26  
11/28  
12/03  
12/05 