Departamento de Ciência de Computadores

Course: Big Data and Cloud Computing

All material previously found on this page was lost in a server crash. I am gradually restoring it. For the time being, you will find here only links to the PDF files used in the theoretical classes and some practical assignments.

Very brief review of data mining (DM) and machine learning (ML); scalability problems

Google Cloud Dataflow

Multi-relational data

Big Data in GPGPUs

Practical class #1: Molecules (batch)

Practical class #2: Stream data (stream)

Practical class #3: Multiprocessing in Python

Distributed Databases vs. Hadoop MapReduce

Recommended reading #1: Big Data computing and clouds: Trends and future directions

Recommended reading #2: Critical analysis of Big Data challenges and analytical methods