Homework 3
CS522, Winter 2011


Due: Wednesday, February 23

Please upload your files to CSNS. The files should include all the source code, documentation (optional), and a text file hw3.txt, which contains detailed instructions on how to compile and run your program on the CS3 server. Note that file uploading will be disabled automatically after 11:59PM of the due date, so please turn in your work on time.


For this assignment, you are going to implement a Decision Tree classifier and a Naive Bayesian classifier, and compare their accuracy on the Car Evaluation Dataset using 10-fold cross validation and t-test. You may use any programming language of your choice, as long as your program can be compiled and run on CS3.

The input to your program is simply the data file name. For example:

java Classify car.data

Your program should output to the console the following (and nothing else):

Note that use of existing classification code found online or from other sources will be considered cheating, and your program may not rely on any external tools or databases.

To receive full credit, your Decision Tree implementation must achieve at least 60% accuracy. The top three most accurate Decision Tree classifiers will receive up to 15% extra credit, and the bottom three will receive up to 20% extra penalty.