Departamento de Ciência de Computadores

Data Mining 1


Important Dates


List of Exercises #1

  • Portuguese version (pdf, html)
  • English version (pdf, html)
  • List of Exercises #2 (pdf, html)

    List of Exercises R #1 - Basic

    List of Exercises R #2 - Data Analysis

    How to use WEKA and interpret results (Section 2.4, in Portuguese)


    Theoretical classes

  • Slides with hands-on in R by Prof. Luis Torgo (highly recommended)


  • Class 1 (25/09) Introduction to Data Mining
  • Class 2 (02/10) Data
  • Class 3 (09/10) Data
  • Class 4 (16/10) Data Exploration and Visualization
  • Class 5 (23/10) (no slides, used the board)
  • Class 6 (30/10)
  • Class 7 (06/11)
  • Class 8 (13/11)

    Recommended Reading

  • What statistical analysis should I use?
  • The statistics blue book (in particular the 10 Worst Statistical Mistakes and Pitfalls)
  • Data Quality and Integration Issues in Electronic Health Records
  • Top 10 Algorithms in Data Mining

    Links of interest

  • KDnuggets
  • UCI Machine Learning Repository
  • Data Mining with R
  • Introduction to Inductive Logic Programming (book)
  • Weka
  • RapidMiner
  • Yap Prolog
  • Aleph