CSCI2950-C: Algorithms for Cancer Genomics
Spring 2015

Professor: Ben Raphael

Time: Tuesday/Thursday 10:30-11:50am [CIT 245]

Overview

This course seminar will explore algorithmic challenges that emerge in the analysis and interpretation of cancer genome sequencing data, with a focus on two major themes.

The mutational process of cancer evolution. The underlying algorithmic problem is to construct trees that represent the relationships between cells from mutational data. We will explore tree reconstruction algorithms using phylogenetic techniques (perfect phylogeny and Dollo parsimony) and population genetic techniques (branching processes and the coalescent).

The identification of combinations of cancer causing mutations. Such combinations typically result from biological interactions between genes, which are represented via graphs, or networks. We will examine algorithms to analyze data on graphs including random walks (e.g. PageRank), diffusion processes, community detection, and spectral methods for graph partitioning.

Course Organization

The course will be organized in seminar style where students will read and present articles and recent research papers on the topics listed above. These topics will be introduced with introductory lectures. Students will undertake a project to further study one of the topics. To the extent possible, projects will be adjusted to the background/interest of the student and could range from theoretical (e.g. designing a new algorithm and proving its correctness), to the practical (a software implementation). The project will include a written proposal, midterm report, and final presentation.

Prerequisites

Undergraduate-level knowledge of probability: random variables, distributions, etc.
Undergraduate-level knowledge of algorithms and/or statistics

No biology background is assumed. Necessary background will be introduced in lectures and reading.

Syllabus

Outline of Topics

Schedule

Assignments

Proposal:Due TBD (Specific Aims and Significance) and TBD (All).

Paper Reviews

review form

Course Credits (for Computer Science students)

PhD: Area T (Theory) [Pre-2012: Area B (Algorithms)]
ScM: " Theory " or "Practice" course. (Depending on final project chosen.)
Significant Programming can be arranged with an appropriate class project

Resources

Previous offerings of this course are available here:

CSCI2950-C: Algorithms for Cancer Genomics Spring 2015