INFO 523: Data Mining & Discovery

Dr. Greg Chism

This page contains an outline of the topics, content, and assignments for the semester. Note that this schedule will be updated as the semester progresses and the timeline of topics and assignments might be updated throughout the semester.

Week Date Topic Video Prepare Due
1 Mon, Jul 7

Lec 0

Welcome + Intro to Python

Panopto 🎥

📚 Python Data Analysis - Chp 2
📚 Python Data Analysis - Chp 3
🎥 Getting Started with Quarto
🎥 Quarto + VSCode
🎥 Git + VSCode
🎥 Git + GitHub in RStudio - just watch conceptual stuff


Fri, Jul 11

Lec 1

Introduction to Data Mining
Intro to Numpy


📚 Data Mining Concept - Chp 1
📚 Python Data Analysis - Chp 4

RQ 1

2 Mon, Jul 14

Lec 2

Sampling + conclusions
Intro to Pandas

Panopto 🎥

📚 Prac Stats for DS - Chp 2
📚 Python Data Analysis - Chp 5
📚 Python Data Analysis - Chp 6


Fri, Jul 18

Lec 3

Exploratory Data Analysis + Data Viz


📚 Prac Stats for DS - Chp 1
📚 Python Data Analysis - Chp 9

📝 HW 1
RQ 2

3 Mon, Jul 21

Lec 4

Data Preprocessing

Panopto 🎥

📚 Python Data Analysis - Chp 7
📚 Python Data Analysis - Chp 8


Fri, Jul 25

Lec 5

Classification I


📃 Data Preprocessing in Python

📝 HW 2

4 Mon, Jul 28

Lec 5 Cont.

Classification I Cont.

Panopto 🎥

📚 ISL - Chp 4



Lec 6

Classification II
Model Evaluation


📚 ISL - Chp 8.1
📚 ISL - Chp 5


Fri, Aug 1

Lec 7
Peer Review

Regression I
Final Project Peer Review


📚 ISL - Chp 3
📚 ISL - Chp 6, up to regularization

📑 Project proposals for peer review
📝 HW 3
RQ 3

5 Mon, Aug 4

Lec 8

Regression II

Panopto 🎥 - IP

📚 ISL - Chp 6, second half

📑 Project proposals for instructor review


Lec 9

Regression III


📚 ISL - Chp 7
📚 ISL - Chp 8.2


Fri, Aug 8

Lec 10

Support Vector Machines


📚 ISL - Chp 9

📝 HW 4
RQ 4

6 Mon, Aug 11

Lec 11

Unsupervised Learning Methods I

Panopto 🎥 - IP

📚 Prac Stats for DS - Chp 7 (pgs 294-302)


Fri, Aug 15

Lec 12

Unsupervised Learning Methods II


📚 Prac Stats for DS - Chp 7 (pgs 302- 325)

📝 HW 5
RQ 5

7 Mon, Aug 18

Lec 13

Association Rules

Panopto 🎥 - IP

📚 Data Mining Concept - Chp 4
📃 Data Mining in Python


Finals Wed, Aug 20

Final Project Presentations



📑 Final Project