
Departamento de
Ciência de Computadores
Disciplina: Sistemas de Apoio a Decisão Clínica
Mestrado em Informática Médica
23/24

EXERCISE: RULE INDUCTION FOR BREAST CANCER
What to do?
- Open the zip file with the CSV tables. Inspect the tables using
your favorite tool (it can be excel, a library, a programming
language etc).
- Describe the data contained in the tables:
- For the categorical variables, compute frequencies
- For the numerical variables, compute min, max, mean, median, standard deviation etc
- Repeat the description after separating upgrades from non-upgrades
- Plot visualization graphs for variables. For example, boxplots
for numerical variables and histograms for categorical variables
- Rule induction (modeling):
- Open cplint and select
the
bc_train
file
- Inspect the file
- Run
induce(X).
on the query panel
- Add expert rules to
bc_train
, including the modeb
and determination predicates, and learn some new rules