DATA MINING

[4740-6740]

Fall 2017

DM_ClassGuidelines.pdf

Student Evaluation of this Class >

Course Details




TEXTBOOK








DATA MINING: Concepts and Techniques

3rd Edition

Jiawei Han | Micheline Kamber | Jian Pei

> Table of Contents | > Why this book?


SUPPLEMENTARY TEXTBOOK



Introduction to DATA MINING

1st Edition

Michael Steinbach | Pang-Ning Tan | Vipin Kumar

> Table of Contents


SUPPLEMENTARY TEXTBOOK


The Elements of Statistical Learning

2nd Edition

Trevor Hastie | Robert Tibshirani | Jerome Friedman

> The full PDF of this book is generously made available online for free by its authors.

  • For password-protected material: "dm46"




SLIDES







MY SLIDES

These slides follow the subjects and the proposed order of the book "DATA MINING: Concepts and Techniques", with extensive changes based on a variety of sources.


  • For all quizzes and projects, make sure you write your 4-digit ID at the top of your papers or scripts.



PROJECTS & QUIZZES






FINAL PROJECT DESCRIPTION


This is a guide to your final project. It explains the major steps you should consider in your project and how you should present it.

The file includes useful links to help you find data and learn what a poster should include and how it should be formatted.

* Poster: PDF, 24 x 36 inches




DM_FinalProject_Description.pdf

Oct. 26, 2017

A PDF file containing the direct link to the data you have chosen for your final project, and a short paragraph about the data.

Nov. 2, 2017

A PDF file with 1-3 paragraphs explaining your data and what you want to do with it: the problem you want to solve and the knowledge you hope to extract from the data.

Nov. 9, 2017

A PDF file with 1-3 paragraphs explaining the problems you are dealing with. Please avoid listing general problems. I would like to know the exact problems you have to tackle and your initial ideas for attacking them.

Nov. 16, 2017

A PDF file with 1-3 paragraphs (once again) explaining the problems you are dealing with now. Please avoid listing general problems. I would like to know the exact problems you have to tackle this time and your initial ideas for attacking them.

Nov. 19, 2017

Final project submission deadline. You need to submit:

  • Your data (Raw data)

  • Your data (Processed)

  • Your code (R script)

  • Poster (PDF, 24 x 36 inches)

Dec. 1, 2017

This is your presentation day. You MUST be present on this day, without exception. Each of you will give a 5-minute presentation on what you have done, based on the poster you submitted.

Students' Posters




OTHER SLIDES








Slides prepared for the textbook

"DATA MINING: Concepts and Techniques"

by:


Jiawei Han | Micheline Kamber | Jian Pei




R







R Cheat-sheet

from KDnuggets


> For more helpful links about Data Mining in R and Python visit this link.



TALKS




Your Talks

> Saurabh Gupta

Your grades are here:

  • For confidentiality reasons, your name and pantherID are not included. Use your 4-digit ID to keep track of your grades.

  • The comments in the sheet below are the work of the TA, Max Hostetter, who patiently reviewed all students' code and quizzes and provided useful feedback.

DM_Grades_Shared
