Indice

LABORATORY OF DATA SCIENCE (2018/2019)

Teacher:

Teaching assistant:

News

Hours and Rooms

Classes

Lessons will be held at: Polo Didattico “L. Fibonacci”, Via F. Buonarroti 4, Pisa.

Day of Week Hour Room
Monday 09:00 - 11:00 LAB M
Tuesday 11:00 - 13:00 LAB M

Office hours by appointment, Room 374/DO, Dept. of Computer Science.

Learning Material

Slides & Registration of the classes

Past Exams

Software

F.A.Q.

Class calendar - (2018-2019)

Day Topic Slides Registration Data/Software References
17.09 09:00-11:00 Canceled - The lesson will be recovered on October 19, 2018 Room I h:11-13
18.09 11:00-13:00 Canceled - The lesson will be recovered on October 25, 2018 Room M h:9-11
1. 24.09 09:00-11:00 Introduction. File data access. Representation formats: CSV, FLV, ARFF, XML lds.01.introduction.pdf lds.02.bi_architectures.pdf lds.03.file_data_access.pdf Video 24/09/2018 - BI technology: An Overview of Business Intelligence Technology - File access: File System Interface - File Formats: Introduction to data technologies(Chps. 5, 6), Weka ARFF Format, XRFF Format
2. 25.09 11:00-13:00 Python Recap Python Recap Video 25/09/2018
3. 01.10 09:00-11:00 File data access in Python. Lab practice on file access. lds.05.fileaccess-python.pdf Video 01/10/2018 Sample data code-2018-09-25.zip
4. 02.10 11:00-13:00 Lab practice on file access and transformation from CSV2ARFF file format. Video 02/10/2018 xmlelements2csv.zip csv2arff.zip code-2018-10-01.zip
5. 08.10 09:00-11:00 Lab practice on file access. Video 08/10/2018 ex-customers.pdf data-customers.zip
6. 09.10 11:00-13:00 RDBMS access protocols: ODBC, OLE DB, JDBC. ODBC Programming. lbi.06.relationaldataaccess-1.pdf Video 09/10/2018
7. 15.10 09:00-11:00 Lab practice: stratified sampling in ODBC. lbi.06.relational_data_access-complete.pdfVideo 15/10/2018 code-2018-10-15.zip
8. 16.10 11:00-13:00 Introduction to SQL Server. ETL tools: SQL Server Integration Services (SSIS). lds.07.sqlserver.pdf lds.08.etlandssis.pdfVideo 16/10/2018 stratifiedsampling.zip
9. 19.10 11:00-13:00 SSIS samples and lab practice: update and pipeline. Video 19/10/2018 lds-ssis-samples.zip ex-midterm.pdf
10. 22.10 09:00-11:00 SSIS samples and lab practice: sampling, update, surrogate keys. Video 22/10/2018
11. 23.10 11:00-13:00 SSIS samples and lab practice: surrogate keys, slowly changing dimensions, Mid-term practice Video 23/10/2018 2016ssis.zip
12. 25.10 09:00-11:00 SSIS samples and lab practice: surrogate keys, slowly changing dimensions, Mid-term practice Video 25/10/2018 Dissimilarity.py MDP.py exam 14/4/2015 siss-mdp.zip ssis-dissimilarityindex.zip
13. 05.11 09:00-11:00 Datawarehousing and OLAP recap. Data cubes, analytic SQL, and materialized views in SQL Server. lds.09.dwandolap.pdf Video 05/11/2018 First Part Video 05/11/2018 Second Partlbi.08.afdemo.sql.zip For DW and OLAP: Decision support databases course lecture notes.
14. 07.11 11:00-13:00 OLAP with SQL Server Analysis Services (SSAS): data source views, dimensions, hierarchies. Data cubes. lds.10.ssas.pdf Video 06/11/2018 monreale_foodmart.zip Notice: Please read the instructions in the Section NEWS! 1) SSAS (olap): documentation; 2) S. Harinath et al. Professional Microsoft SQL Server Analysis Services 2012 with MDX and DAX, Wrox publisher, 2012. Chps. 4-6.
15.12.11 09:00-11:00 Parent-child hierarchies. OLAP explorative data analysis with Pivot Tables in Excel. Video 12/11/2018 First Part Video 12-13/11/2018 Pivot Tables in Excel: G. Harvey. Excel 2013 All-in-One For Dummies, 2013. Chp. VII-2.
16.13.11 11:00-13:00 Calculated metrics. ROLAP and MOLAP in SSAS. The Video of the previous lecture includes also the topic of this lecture. foodmartexplorative.xlsx MDX: 1) documentation and a useful guide on ordering; 2) S. Harinath ed al. Professional Microsoft SQL Server Analysis Services 2012 with MDX and DAX, Wrox publisher, 2012. Chp. 3.
17.19.11 09:00-11:00 Practice with MDX. Video 19/11/2018 lbi.09.mdxsample.mdx.zip
18.20.11 11:00-13:00 Practice with MDX. Video 20/11/2018 lbi.09.mdxpractice.mdx.zip
19.26.11 09:00-11:00 Practice with MDX. Video 26/11/2018 20170208.pdf
20.27.11 11:00-13:00 Reporting with Power BI Desktop. Data Mining pre-processing in WEKA. lds.12.powerbi.pdf lds.13.weka.pdf Video 27/11/2018 weka.3.7.9.light.zipwekapatch.zip
21.03.12 09:00-11:00 WEKA Classification. meta-cost-classification.pdf Video 03/12/2018 lsd.practice.ee.pdf Data-ee
22.04.12 11:00-13:00 WEKA Classification (practice) & AR lds.14.associationrules.pdf Video 1 04/12/2018 Video 2 04/12/2018 Video 3 04/12/2018 lds.practicesolution.ee.pdf
23.10.12 09:00-11:00 WEKA AR & Practice. Weka API. lds.15.wekaapi.pdf Video 1 10/12/2018 Video 2 10/12/2018 Python example for WEKA API
22.11.12 11:00-13:00 Practice for the second midterm Queries sec. Midterm Weka practice Exercise on MDX

Exams

Mid-term exams

Rule: Students may do the second mid-term even if they did have the first mid-term.

Date Hour Room Notes Marks
29.10.2018 09:00 - 12:00 Room M
17.12.2018 09:00 - 12:00 Room M

Exam sessions

Rule: Students having at least one mid-term exam may do only one part of the written exam in the exam sessions.

Session Date Time Room Notes Marks
1.16.01.2019 09:00 - 12:00 Room M
2.06.02.2019 09:00 - 12:00 Room M
3.18.06.2019 09:00 - 13:00 Room H Oral Exam on DM1 within 15 July. If you cannot do within that date you can do the oral exam on September.
4.09.07.2019 09:00 - 13:00 Room H Oral Exam on DM1 within 15 July. If you cannot do within that date you can do the oral exam on September.

Extra sessions A.A. 2017/18

Date Time Room Notes Results
29.10.2018 09:00 - 12:00 Room M

Past Editions