Instructor: Yizhou Sun
TA:
Lecture times: Mondays 6 - 9 PM
Lecture location: Forsyth Building 236
This course introduces concepts, algorithms, and techniques of data mining on different types of datasets, including (1) matrix data, (2) text data, (3) set data, (4) sequence data, (5) time series, (6) graph and network, and (7) image data. The class project involves hands-on practice of mining useful knowledge from large data sets. The course is a graduate-level computer science course, which is also a good option for senior-level computer science undergraduate students interested in the field. Also, the course may attract students from other disciplines who need to understand, develop, and use data mining systems to analyze large amounts of data.
*Note: all the deadlines are 11:59PM (midnight) of the due dates; No late submissions accepted!
Regrading Policy:
Jiawei Han, Micheline Kamber, and Jian Pei. Data Mining: Concepts and Techniques, 3rd edition, Morgan Kaufmann, 2011
Recommended books for further reading:
You are encouraged to come to the office hours of TAs and the instructor.
Peer-based Q&A via Piazza: piazza.com/northeastern/fall2015/cs622001/home
A commitment to the principles of academic integrity is essential to the mission of Northeastern University. The promotion of independent and original scholarship ensures that students derive the most from their educational experience and their pursuit of knowledge. Academic dishonesty violates the most fundamental values of an intellectual community and undermines the achievements of the entire University.
For more information, please refer to the Academic Integrity Web page.