Data Mining
CSE-6412
Fall 2015
York University


Semester: Fall 2015
Course/Sect#: CSE-6412
Time: Tue 2:30pm-4:00pm
Thu 2:30pm-4:00pm
Location: TEL 0005
Instructor: Aijun An
Office: CSB 2048
Office Hours: Tue: 12:00-1:00pm and Thu: 4:00-5:00pm
Phone #: 416-736-2100 x44298
e-mail: aan@cse.yorku.ca


Welcome to the Data Mining course, CSE-6412, for Fall 2015. Materials, instructions, and notices for the course will accumulate here over the semester.


Message Board

January 14, 2016
Grades are posted. You can check yours via ePost".
December 14, 2010
Please be reminded that project presentations will take place on tomorrow December 15 at 2:00-4:30pm in room LAS 3033. Each projet has about 10 minutes for the presentation.
December 10, 2015
Please be reminded that the final exam is scheduled for tomorrow December 11 at 9:30am-11:30pm in CB 129. You can find some sample questions here.
December 3, 2015
Assignment 2 marks are posted. You can check yours using ePost. You need to log in with your eecs account to see the marks. For the feedbacks from the TA on Q1-Q5, please log into Web submit system and select 6412 for Course and A2-remark for Assignment. The TA's comments are in the pdf file that you submitted for Q1-Q5. I will bring the feedback on Q6 on a hard copy for you to the class today.
November 12, 2015
A list of potential course projects is posted. Please see the link to it in the Project section below. Also, in that section, you should see project requirements and some sample course projects from previous years.
November 5, 2015
Paper presentation schedule is posted.
October 29, 2015
The reading list for student paper presentations is posted. See the links below in the "Paper Review and Presentation" section for the reading list and requirements for the presentation.
October 28, 2015
An FAQ page for A2 is set up. Please see A2 Frequently Asked Questions.
October 21, 2015
Assignment 2 is posted. See the link below in the "Assignments" section.
September 25, 2014
Assignment 1 is posted. See the link below under "Assignments".
September 15, 2015
Lecture notes have been put under password protection. Credentials for accessing the lecture notes have been emailed to your cse or yorku account. Please check your email.
September 9, 2015
This web site is set up. Welcome to the course! The first lecturer will be at 2:30 - 4:00pm on Thursday September 10.


Description

Data mining or knowledge discovery from databases (KDD) is one of the most active areas of research in databases. It is at the intersection of database systems, statistics, AI/machine learning, and data visualization. In this course, we will introduce the concepts of data mining and present data mining algorithms and applications. Topics include association rule mining, sequential pattern mining, classification models, and clustering.


Prerequisites

  • Required: an introductory course on database systems and an introductory course on probability.
  • Preferred: basic knowledge on statistics.


Reference Books and Materials

  • Jiawei Han, Micheline Kamber and Jian Pei, Data Mining -- Concepts and Techniques, Morgan Kaufmann, Third Edition, 2011.
  • Pang-Ning Tan, Michael Steinbach, Vipin Kumar, Introduction to Data Mining, Addison Wesley, 2006.
  • Ian H. Witten and Eibe Frank, Data Mining -- Practical Machine Learning Tools and Techniques (Second Edition), Morgan Kaufmann, 2005.
  • S.M. Weiss and N. Indurkhya, Predictive Data Mining, Morgan Kaufmann, 1998.
  • Margaret H. Dunham, Data Mining -- Introductory and Advanced Topics, Prentice Hall, 2003.
  • Some conference/journal papers
  • More books can be found here


Grading Scheme

  • Assignments (25%)
  • Final exam (30%) (Time: 9:30am-11:30am on Friday December 11. Location: CB129)
  • Paper review and presentation (10%)
  • Course project (25%) (Due: Tuesday December 22 by 4:00pm)
  • Participation (10%)


Lecture Notes


Assignments

  • Assignment 1 (12%) (Due Tuesday October 13 in class) Please note that you need a user name and a password (that you use to download the lecture notes) to access the assignment.
  • Assignment 2 (13%) (Due Wednesday November 4 by 10pm)


Paper Review and Presentation


Project


Useful On-line Information