DATA MINING
[4740-6740]
Fall 2017
TEXTBOOK
DATA MINING: Concepts and Techniques
3rd Edition
Jiawei Han | Micheline Kamber | Jian Pei
Supplementary
TEXTBOOK
Introduction to DATA MINING
1st Edition
Pang-Ning Tan | Michael Steinbach | Vipin Kumar
Supplementary
TEXTBOOK
The Elements of Statistical Learning
2nd Edition
Trevor Hastie | Robert Tibshirani | Jerome Friedman
> The authors have generously made the full PDF of this book available online for free.
For password-protected material: "dm46"
For all quizzes and projects, make sure you enter your 4-digit ID at the top of your papers or scripts.
FINAL PROJECT DESCRIPTION
This is a guide to your final project. Here I explain the major steps you should consider in your project, and how you should present it.
This file contains some useful links to help you find data and learn what a poster should include and how it should be formatted.
*Poster: PDF, 24 x 36 inches
Oct. 26, 2017
A PDF file containing the direct link to the data you have chosen for your final project, and a short paragraph about the data.
Nov. 2, 2017
A PDF file with 1-3 paragraphs explaining your data and what you want to do with it: what problem do you want to solve, and what knowledge are you hoping to extract from the data?
Nov. 9, 2017
A PDF file with 1-3 paragraphs explaining the problems you are dealing with. Please avoid listing general problems. I would like to know the exact problems you have to tackle, and your initial ideas for attacking them.
Nov. 16, 2017
A PDF file with 1-3 paragraphs (once again) explaining the problems you are dealing with now. Please avoid listing general problems. I would like to know the exact problems you have to tackle this time, and your initial ideas for attacking them.
Nov. 19, 2017
Your final project submission deadline. You need to submit:
Your data (Raw data)
Your data (Processed)
Your code (R script)
Poster (PDF, 24 x 36 inches)
Dec. 1, 2017
This is your presentation day. You MUST be present on this day, without exception. I want each of you to give a 5-minute presentation on what you have done, based on the poster you submitted.
Students' Posters
R
R Cheat-sheet
from KDnuggets
> For more helpful links about Data Mining in R and Python visit this link.
Your grades are here:
For confidentiality reasons, your name and pantherID are not included. Use your 4-digit ID to keep track of your grades.
The comments in the sheet below are the great work of the TA, Max Hostetter, who patiently looked into all students' code/quizzes and provided them with useful feedback.
What is Data Mining? [◦Computerphile◦]
What is Data Mining? [◦Wikipedia◦]
A documentary about Big Data, with some great examples of Data Mining applications. [◦Video◦]
Data Mining | Big Data | Data Science | Data Warehousing? [◦investopedia◦]
Some polls about this subject: [◦KDnuggets◦]
Another book + slides? [Introduction to Data Mining, Tan, Steinbach, Kumar◦]
A rather short intro to Data Mining [◦Thearling◦]
Data Mining lectures by Ram Akella and Ray Larson [Videos] @ University of California, Santa Cruz
Wow! Today (2017/09/06) a massive solar flare occurred. [Top-50-List][Video]
Academic Calendar 2017 - 2018 [Here]
Contact me:
Contact TA:
aahmadzadeh1@cs.gsu.edu | 645, 25 Park Place