Big Data & Cloud Computing (Part 1)
Professors
- Eduardo Marques (Part 1, this page), gab. 1.72, edrdo _at_ dcc.fc.up.pt
- Inês Dutra (Part 2), gab. 1.31, ines _at_ dcc.fc.up.pt
Forums
- Announcements + Student questions → Moodle.
Aim of this course
[Detailed description at Sigarra]
This course provides an introduction to the use of cloud computing for processing big data, covering:
- Deployment of cloud-based infrastructures for big data applications.
- Programming big data applications using the cloud.
- Data mining fundamentals for big data applications.
- Hands-on experience with state-of-the-art tools and real-world data sets.
Bibliography
- Cloud Computing for Science and Engineering (available free online), I. Foster and D. Gannon, MIT Press, 2017
- Python Data Science Handbook (available free online), Jake VanderPlas, O'Reilly, 2016
- Spark: The Definitive Guide - Big Data Processing Made Simple, M. Zaharia and B. Chambers, O'Reilly Media, 2018
- Mining of Massive Datasets, 2nd edition (available free online), J. Leskovec et al., Cambridge University Press, 2014
- Hadoop: The Definitive Guide, 4th edition, T. White, O'Reilly Media, 2015
Student evaluation
- Project assignments: 40 %
- Final exam: 60 %