CS 6220: Data Mining Techniques

News

[9/29/2014] Kaggle link for course project is released: http://inclass.kaggle.com/c/nu-cs6220-14f/

[09/8/2014] First day of classes


Class Schedule

(Future lectures and events are tentative.)

Week# Date Topic Slides Assignment Project Reading (Textbook or Other Materials)
2 Sep. 8 Introduction and Know Your Data 01Introduction
02Data
    Chapter 1, 2, 3
Math overview:
3 Sep.15 Course Project Introduction
Matrix Data: Prediction (linear regression); Classification (decision tree, evaluation)
Course Project Overview
03Matrix_Prediction
04Matrix_Classification_1
    Notes by Andrew Ng (Sec. 1-3 in Part 1): http://cs229.stanford.edu/notes/cs229-notes1.pdf

Chapter 8.1, 8.2, 8.5
4 Sep. 22 Matrix Data: Classification (Naive Bayes, logistic regression) 04Matrix_Classification_2 Assign#1 out Team formation due (Sep. 21) Chapter 8.3, 9.1
Notes by Tom Mitchell: http://www.cs.cmu.edu/~tom/mlbook/NBayesLogReg.pdf
Notes on derivation of P(C_j) in Naive Bayes
5 Sep. 29 Matrix Data: Classification (SVM, kNN, and other issues) 04Matrix_Classification_3     Chapter 9.3, 9.5, 8.6, 9.7
Notes on SVM by Andrew Ng: http://cs229.stanford.edu/notes/cs229-notes3.pdf
6 Oct.6 Matrix Data: Clustering (k-means, hierarchical clustering, DBSCAN, evaluation) 04Matrix_Clustering_1 Assign#1 due (Oct. 5)/ Assign#2 out   Chapter 10.1, 10.2, 10.3, 10.4, 10.6
7 Oct. 13 Columbus Day (No Class)        
8 Oct. 20 Matrix Data: Clustering (mixture model and EM algorithm, kernel k-means ) 04Matrix_Clustering_2
Midterm Report due (Oct. 19) Chapter 11.1, 11.3
Notes on mixture models and EM algorithm: http://www.stat.cmu.edu/~cshalizi/350/lectures/29/lecture-29.pdf
9 Oct. 27 Set Data: Frequent Pattern Mining (Apriori, FP-growth) 05Set_FP Assign#2 due (Oct. 26)Assign#3 out   Chapter 6
10 Nov.3 Midterm Exam        
11 Nov. 10 Sequence Data (Sequential pattern mining (GSP, PrefixSpan), HMM) 06Sequence   Reference: Chapter 8.3 in Han's Data Mining Book, Edition 2
Papers: GSP, PrefixSpan
12 Nov. 17 Time Series 07Time_Series Assign#3 due (Nov. 16) / Assign#4 out   References: DTW
13 Nov. 24 Graph / Network 08Graph     Read: Graph Mining
14 Dec. 1 No Class   Assign#4 due (Dec. 1)    
15 Dec. 8 Course Project Due Date (No class)     Final Report & Code (Dec. 8)