Departamento de Ciência de Computadores

Data Mining 1


Tests



Assignments


Assignment #1: Predicting Cardiac Pathology (pdf, html), deadline: November, 23rd

Assignment #2: Kaggle Contest, deadline: January 6th, 2019


Mini Assignments



Lists of Exercises



Theoretical classes


  • Slides with hands-on in R by Prof. Luis Torgo (highly recommended)


  • Class 1 (17/09) Introduction to Data Mining
  • Class 2 (19/09) Intro and Data
  • Class 3 (24/09) Data
  • Class 4 (26/09) Data: Types, formats etc (cont.)
  • Class 5 (1/10) Data: Types, formats etc (cont.)
  • Class 6 (3/10) Data Exploration and Visualization
  • Class 7 (8/10) Data Exploration and Visualization (cont.)
  • Class 8 (10/10) Notes about Data Exploration for dataset used in practical #2
  • Class 9 (15/10) Classification: decision trees plus discussion about p-values
  • Class 10 (17/10) To read and study: (I will be away)
  • Class 11 (22/10) Questions about papers of Class 10 (in Moodle)
  • Class 12 (24/10) Relational Learning (Inductive Logic Programming)
  • Class 13 (29/10) FIRST TEST
  • Class 14 (31/10)

  • Class 15 (5/11)

  • Class 16 (7/11)

  • Class 17 (12/11)

  • Class 18 (14/11)

  • Class 19 (19/11)

  • Class 20 (21/11)

  • Class 21 (26/11)
  • Class 22 (28/11)
  • Class 23 (03/12)
  • Class 24 (05/12)
  • Class 25 (10/12): Complementary material
  • Class 26 (12/12)

    Recommended Reading



    Links of interest


  • Google Dataset Search
  • KD Nuggets
  • UCI Machine Learning Repository
  • Data Mining with R
  • Kaggle: Your home for data science
  • Weka
  • RapidMiner
  • Yap Prolog
  • Aleph
  •