CSC7442 Class website 

  Data Mining and Knowledge Discovery

     Class Schedule:  Monday and Wednesday 3:30pm to 4:50pm, 1110 Patrick Taylor Hall  

   

   Instructor: Dr. Jianhua Chen

         Contact info:   E-mail: jianhua@csc.lsu.edu
                                  Tel: 225-578-4340
                                  Office:  3122-C Patrick Taylor  Hall
        
         Office Hours:  M, T, TH 2:00 pm to 3:00 pm
 
         Other times:  By appointment
         
         

      Grader:  Mr. Sudip Biswas

                       Office:   Room 3116, Patrick Taylor Hall
                       Office Hours:  Tuesday 11am to 1pm
                       Tel:   
                       Email:  sbiswa7@tigers.lsu.edu

         
         
          More info. about the course:  class announcement    and the syllabus   guidelines for group projects

 

      Text Book: Introduction to Data Mining by Tan, Steinbach and Kumar

             web site of the text book

   

       Homework Assignments  

                  

           Homework1         Please note that the due date for homework 1 is postponed to Wednesday 9-19-2012

             Homework2            Please note that the training data set for Q.1 in homework 2 has been revised slightly

           Homework3

       Reminder

            Final  Exam:   Thursday December 6, 2012 from 12:30 pm to 2:30 pm

            Quiz Dates: September 26, 2012 and October 31, 2012
           
         
         

        Other links

             some interesting papers on association rule mining:  survey paper on subtree mining      sequential rule mining                                         original paper on association rule mining     paper on tree mining    paper on dbscan - density based clustering        link to lecture notes on SVM by Dr. Andrew Ng at Stanford

link to modified slides for Chapter 5            link to the chameleon clustering paper

          link to the adaboost paper        link to the paper on madaboost    link to the paper on bagging   link to the random forest paper      fp-tree           tree projection paper
    
             fuzzy clustering             link to presentation on text-mining
                              
              link to ICML10 proceedings

              link to ICML11 proceedings

              link to ICML12 proceedings

              link to ICDM conference website

         Group Presentations:

               Group 1      Members             Topic:   Predicting Automobile Gas Consumption (MPG) with DT and SVM

               Group 2      Members             Topic: Learning to Predict Game Winners for the Game of Arimaa      slides

               Group 3      Members             Topic:  Mining disease symptoms from  medicine articles
 
               Group 4     Members             Topic:   Rule Learning for Sleep-stage classification using 2-channel EEG and EOG Data   slides

               Group 5     Members              Topic:  A PCA-based Large-scale Face Recognition Using the MapReduce Framework

               Group 6     Members              Topic:  Text Mining

               Group 7     Members              Topic:  Mining Wine Quality with Physico-chemical Variables