CSG230: Data Mining Techniques (Spring 2008)


NEWS

Instructor

Donghui Zhang
Office Hours: Tu W 4-5:30pm
Office: 478 WVH
Phone: x2177
Email: donghui AT ccs.neu.edu
Lecture: W 6:00-9:00pm. 102WVG.

TA

Jian Wen <jarodwen AT ccs.neu.edu>, office hour Tu 4-7pm, 472WVH.
Tianhua Zheng <tianhua AT ccs.neu.edu> office hour F 1:30-3, 5:30-7pm, 472WVH.

Overview

This course introduces basic data mining techniques. We will cover four aspects of data mining including association rules, cluster Analysis, classification, and data warehousing. The class projects involve solving a real industrial problem, and implementing some fundamental data mining algorithm.

Prerequisite

Knowledge of Java and algorithms.

Grading

Textbook

Data Mining -- Concepts and Techniques, Jiawei Han and Micheline Kamber, 2nd edition, published by Morgan Kaufmann Publishers, ISBN 1-55860-901-6.

Projects

Class Schedule

 

Date

Lecture

Quiz & Milestone

Week 1

Jan 9

Introduction, project.

 

Week 2

Jan 16

Chp 5: Association.

 

Week 3

Jan 23

Chp 5.

 

Week 4

Jan 30

Chp 5.

 

Week 5

Feb 6

Chp 6: Classification.

Quiz 1

Week 6

Feb 13

Chp 6.

 

Week 7

Feb 20

Chp 6

 

Week 8

Feb 27

Chp 2: Data Preprocessing.

Quiz 2

Week 9

Mar 5

No class (Spring break)

 

Week 10

Mar 12

Chp 3. Data Warehousing.

 

Week 11

Mar 19

Chp 4. Data Cube.

 

Week 12

Mar 26

Chp 7: Clustering.

Quiz 3

Week 13

Apr 2

Chp 7

 

Week 14

Apr 9

No class (instructor travels)

 

Week 15

Apr 16

Chp 8 & 9 Advanced topics.

 

Final Exam

Apr 23

 

 

Academic Honesty

Northeastern University is committed to the principles of intellectual honesty and integrity. All members of the Northeastern Community are expected to maintain complete honesty in all academic work, presenting only that which is their own work in tests and assignments. If you have any questions regarding the proper attribution of the work of others, contact your professor prior to submitting the work for evaluation.