COMP 135
Intro to Machine Learning and Data Mining

Department of Computer Science

Tufts University

Fall 2009

 

TR 12:00-1:15

Halligan Hall 106

 

Instructor: Carla Brodley

Office hours: Tu/Th 1:15-1:45 and by appt.

 

TA: Andrew Fox

Office hours: Mo/Tu 3-4pm H111b

 

Assignments

Assignment 1

Here is the specification for Assignment 1
Here are the training and test datasets.

Assignment 2

Here is the specification for Assignment 2
Here are the training, balanced test and skewed test datasets.

Assignment 3

Here is the specification for Assignment 3
Here are the iris-prime.arff, veh-prime.arff datasets.
Here is a small sample of only 200 examples from the veh-prime dataset: veh-prime-sample.arff for use with testing and debugging your code.
Link to Pearson Correlation Coefficient pseudocode.

Assignment 4

Here is the specification for Assignment 4
Here is the segment.arff dataset.

Assignment 5

Here is the specification for Assignment 5
Here is the bupa.arff dataset.

Assignment 6

Here is the specification for Assignment 6
Here is the KS.arff dataset.


Class Material

Course Introduction

Class Slides

Class 1 Slides


Useful Links

Weka
UCI data archive

FAQs

I am NOT a computer scientist. Should I take this course?

You will learn to use and extend publically-available tools for performing clustering, classification, and anomaly detection.