Big data: architectures and data analytics (2016/2017)


This page has hierarchy - Parent page: Teaching

Table of content

General information

Exam rules

  • Exam rules Academic Year 2016-2017 (pdf)

Announcements

  • (23/03/2017) The schedule of the lab activities has been published in the “practices” section
  • (15/03/2017) The email with the credential for the BigData@Polito cluster has been sent (one personalized email for each student).
    • Please contact Paolo Garza if you are already enrolled in the course but you did not receive the email.
    • The information about how to use your account to connect to the BigData@Polito cluster will be provided during the first practice at LABINF.
  • (09/03/2017) The first lab practice will be held on Tuesday, March 21, 2017 from 17:30 to 19:00 at LABINF (map)
    • Please make sure you have an account on the LABINF PCs before the first lab practice. You can register an account at LABINF (map) every day from 2pm to 3pm. Student card and Certification of enrollment are needed.
    • Next week, I will also send you an email with your credential for the BigData@Polito cluster (it is not the same account of the LABINF laboratory).
    • LAB SCHEDULE. To fit LABINF capacity students must attend all labs according to the following schedule:
      • TEAM A: Students from AA to LE – Tuesday from 5:30pm to 7pm
      • TEAM B: Students from LI to ZZ – Thursday from 5:30pm to 7pm
  • (28/02/2017) First lecture: March 9, 2017 at 13:00
  • (28/02/2017) No lab the first two weeks of the course

Materials

Exercises

Practices

  • TEAM A: Students from AA to LE – Tuesday from 5.30pm to 7pm
  • TEAM B: Students from LI to ZZ – Thursday from 5.30pm to 7pm
  • Schedule of the lab activities
  • Team A Team B
    Lab #1 Tuesday, March 21 – from 5.30pm to 7pm Thursday, March 23 – from 5.30pm to 7pm
    Lab #2 Tuesday, March 28 – from 5.30pm to 7pm Thursday, March 30 – from 5.30pm to 7pm
    Lab #3 Tuesday, April 4 – from 5.30pm to 7pm Thursday, April 6 – from 5.30pm to 7pm
    Lab #4 Tuesday, April 11 – from 5.30pm to 7pm Thursday, April 20 – from 5.30pm to 7pm
    Lab #5 Tuesday, May 2 – from 5.30pm to 7pm Thursday, May 4 – from 5.30pm to 7pm
    Lab #6 Tuesday, May 9 – from 5.30pm to 7pm Thursday, May 11 – from 5.30pm to 7pm
    Lab #7 Tuesday, May 16 – from 5.30pm to 7pm Thursday, May 18 – from 5.30pm to 7pm
    Lab #8 Tuesday, May 23 – from 5.30pm to 7pm Thursday, May 25 – from 5.30pm to 7pm
    Lab #9 Tuesday, May 30 – from 5.30pm to 7pm Thursday, June 1 – from 5.30pm to 7pm

Additional materials

  • Slides and screencasts about Java (kindly provided by prof. Torchiano) (link)
    • Suggested slides/lectures for those students who do not know Java
      • OO Paradigm and UML (The UML part in not mandatory)
      • The Java Environment
      • Java Basic Features
      • Java Inheritance
  • Docker Image with Hadoop and Spark
  • Virtual Machine with Hadoop and Spark
    • How to import and use the Virtual Machine on your laptop or personal workstation (howto.pdf)
    • Virtual machine image – BigData_localHadoop.ova (download from Dropbox) (download from Google drive) – Size of the file: ~7 GB
      • The virtual machine does not include HUE. Hence, use hdfs (from the command line) to put and get files from the HDFS file system.