Data Mining
CSE-6412
Fall 2011
York University


Semester: Fall 2011
Course/Sect#: CSE-6412
Time: Tue 10:00am-11:30am
Thu 10:00am-11:30am
Location: BC 225
Instructor: Aijun An
Office: CSB 2048
Office Hours: Tue and Thur: 12:00pm - 1:00pm
Phone #: 416-736-2100 x44298
e-mail: aan@cse.yorku.ca


Welcome to the Data Mining course, CSE-6412, for Fall 2011. Materials, instructions, and notices for the course will accumulate here over the semester.


Message Board

December 8, 2011
Please be reminded that the final exam is scheduled for Monday December 12 at 1:00-3:30pm in CSE 3033. You can find some sample questions here.
November 19, 2011
Paper presentation schedule is posted.
November 13, 2011
The reading list for student paper presentations is posted. See the links below in the "Paper Review and Presentation" section for the reading list and requirements for the presentation.
November 10, 2011
An FAQ page for A2 is set up. Please see A2 Frequently Asked Questions.
November 3, 2011
Assignment 2 is posted. See the link below in the "Assignments" section.
September 27, 2011
Assignment 1 is posted. See the link below under "Assignments".
September 12, 2011
Please note that our classroom has been moved to BC 225, effective immediately.
September 6, 2011
This web site is set up. Welcome to the course! The first lecturer will be at 10:00 - 11:00am on Thursday September 8.


Description

Data mining or knowledge discovery from databases (KDD) is one of the most active areas of research in databases. It is at the intersection of database systems, statistics, AI/machine learning, and data visualization. In this course, we will introduce the concepts of data mining and present data mining algorithms and applications. Topics include association rule mining, sequential pattern mining, classification models, and clustering.


Prerequisites

  • Required: an introductory course on database systems and an introductory course on probability.
  • Preferred: basic knowledge on statistics.


Reference Books and Materials

  • Jiawei Han and Micheline Kamber, Data Mining -- Concepts and Techniques, Morgan Kaufmann, Second Edition, 2006.
  • Pang-Ning Tan, Michael Steinbach, Vipin Kumar, Introduction to Data Mining, Addison Wesley, 2006.
  • Ian H. Witten and Eibe Frank, Data Mining -- Practical Machine Learning Tools and Techniques (Second Edition), Morgan Kaufmann, 2005.
  • Margaret H. Dunham, Data Mining -- Introductory and Advanced Topics, Prentice Hall, 2003.
  • Some conference/journal papers (More will be posted over the semester).


Grading Scheme

  • Assignments (25%)
  • Final exam (Monday December 12 at 1:00 - 3:30pm in room CSE 3033) (30%)
  • Paper review and presentation (10%)
  • Course project (25%)
  • Participation (10%)


Lecture Notes


Assignments

  • Assignment 1 (12%) (Due Tuesday October 18 in class) Please note that you need a user name and a password to access the assignment. Please check your email for the user name and password.
  • Assignment 2 (13%) (Due Monday November 21 by 5pm)


Paper Review and Presentation


Project


Useful On-line Information