CS 6220: Data Mining Techniques

News

[09/24/2013]04Matrix_Classification_1 has been updated.

[09/10/2013] First day of classes


Class Schedule

(Future lectures and events are tentative.)

Week# Date Topic Slides Assignment Project Reading (Textbook or Other Materials)
2 Sep. 10 Introduction and Know Your Data 01intro
02data
    Chapter 1, 2, 3
3 Sep. 17 Course Project Introduction
Matrix Data: Prediction (linear regression); Classification (decision tree, evaluation,)
course_project_introduction
course_project_example
course_system_introduction
03Matrix_Prediction
04Matrix_Classification_1
    Notes by Andrew Ng (Sec. 1-3 in Part 1): http://cs229.stanford.edu/notes/cs229-notes1.pdf

Chapter 8.1, 8.2, 8.5
4 Sep. 24 Matrix Data: Classification (Naive Bayes, logistic regression) 04Matrix_Classification_2   Team formation due (Sep. 23) Chapter 8.3, 9.1
Notes by Tom Mitchell: http://www.cs.cmu.edu/~tom/mlbook/NBayesLogReg.pdf
5 Oct. 1 Matrix Data: Classification (SVM, kNN, and other issues) 04Matrix_Classification_3 Assign#1 out   Chapter 9.3, 9.5, 8.6, 9.7
Notes on SVM by Andrew Ng: http://cs229.stanford.edu/notes/cs229-notes3.pdf
6 Oct. 8 Matrix Data: Clustering (k-means, hierarchical clustering, DBSCAN, evaluation) 04Matrix_Clustering_1     Chapter 10.1, 10.2, 10.3, 10.4, 10.6
7 Oct. 15 Matrix Data: Clustering (mixture model and EM algorithm, kernel k-means ) 04Matrix_Clustering_2 Assign#1 due (Oct. 14)/ Assign#2 out Proposal due (Oct. 16) Chapter 11.1, 11.3
Notes on mixture models and EM algorithm: http://www.stat.cmu.edu/~cshalizi/350/lectures/29/lecture-29.pdf
8 Oct. 22 Set Data: Frequent Pattern Mining (Apriori, FP-growth) 05Set_Data_FP
  Chapter 6
9 Oct. 29 Sequence Data (Sequential pattern mining (GSP, PrefixSpan), HMM) 06Sequence_Data Assign#2 due (Oct. 28)   Reference: Chapter 8.3 in Han's Data Mining Book, Edition 2
Papers: GSP, PrefixSpan
10 Nov. 5 Midterm Exam   Assign#3 out    
11 Nov. 12 Time Series 07Time_Series Midterm check point (Nov. 11) References: DTW
12 Nov. 19 Graph / Network 08Graph_1 Assign#3 due (Nov. 18) / Assign#4 out   Read: Graph Mining
13 Nov. 26 Graph / Network (Reading by yourself, no class) 08Graph_2      
14 Dec. 3 Course Project Final Presentation (Session 1)   Assign#4 due (Dec. 2) Final Report / Code / Data (early submission, Dec. 2)  
15 Dec. 10 Course Project Final Presentation
(Session 2)
    Final Report / Code / Data (Dec. 9)