Instructor: Yizhou Sun
Lecture times: Tue 6 - 9 PM
Lecture location: Behrakis Health Sciences Center 310
This course introduces concepts, algorithms, and techniques of data mining on different types of datasets, including (1) matrix data, (2) set data, (3) sequence data, (4) time series, and (5) graph and network data. The class project involves hands-on practice in mining useful knowledge from large datasets. This is a graduate-level computer science course that is also a good option for senior-level computer science undergraduates interested in the field. It may also suit students from other disciplines who need to understand, develop, and use data mining systems to analyze large amounts of data.
*Note: all deadlines are at 11:59 PM (just before midnight) on the due date.
Jiawei Han, Micheline Kamber, and Jian Pei. Data Mining: Concepts and Techniques, 3rd edition. Morgan Kaufmann, 2011.
Recommended books for further reading:
This term we will be using Piazza for class discussion. The system is designed to get you help quickly and efficiently from classmates, the TA, and me. Rather than emailing questions to the teaching staff, I encourage you to post them on Piazza.
Tip: Answering other students' questions will increase your participation score.
Find our class page at: https://piazza.com/northeastern/fall2013/cs6220/home
A commitment to the principles of academic integrity is essential to the mission of Northeastern University. The promotion of independent and original scholarship ensures that students derive the most from their educational experience and their pursuit of knowledge. Academic dishonesty violates the most fundamental values of an intellectual community and undermines the achievements of the entire University.
For more information, please refer to the Academic Integrity Web page.