- Ellie Pavlick
- Lorenzo De Stefani (spring '22)
- Course Home Page:
|Meeting Time:||I hr: T,Th 10:30-11:50|
|Offered this year?||Yes|
|When Offered?||Most years|
Data is, or soon will be, at the core of essentially all domains, from materials science to health care. Mastering big data requires a set of skills spanning a variety of disciplines, from distributed systems through statistics to machine learning, as well as a deep understanding of a complex ecosystem of tools and platforms. Data Science refers to the intersection of these skills and is concerned with the whole processing pipeline that transforms data into actionable knowledge. This course provides an overview of the various techniques and tools involved in Data Science and how they work together, rather than focusing on a specific aspect. Among other things, we will cover SQL and NoSQL solutions for massive data management, basic algorithms for data mining and machine learning, information retrieval techniques, and visualization methods.
Prerequisites: CSCI 160, CSCI 180, or CSCI 190. One of CSCI 330 or CSCI 320 is strongly recommended.