
LABORATORY OF DATA SCIENCE (2018/2019)

Teacher:

Teaching assistant:

News

  • [13-11-2018]: Results of the first mid-term: Results
  • [07-11-2018]: Instructions for the SSAS project presented in today's lecture. To avoid conflicts in deployment/processing, follow these steps once the solution is opened: (1) rename the project as <your account>_foodmart; (2) from the project properties select 'Deployment', then rename the database as <your account>_foodmart; (3) click the “Show all files” button just above “Solution Explorer”, right-click the .database file that appears and choose “View code”, change the ID from ruggieri_foodmart to <your account>_foodmart, and save the file; (4) change the credentials of the connection to the database on SQL Server. Alternatively, you may import the project from the SSAS server and rename it as <your account>_foodmart (step 4 is still necessary).
  • [20-10-2018]: Here you can find exercises similar to those in the first mid-term. Please try to solve them; we will discuss the solutions during the lesson on October 25, 2018.
  • [09-10-2018]: The lesson of September 17 will be made up on October 25, 2018, in Room M.
  • [09-09-2018]: Lessons will start on Monday, September 24. Please see the details below.
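
Step (3) of the SSAS instructions above is a manual edit, but the same rename can be sketched in Python. This is only an illustration under stated assumptions: that the .database file is plain text in which the old ID appears literally as ruggieri_foodmart, and that UTF-8 is a suitable encoding. The names `rename_database_id`, `rename_in_file`, and the account `jdoe` are hypothetical and not part of the course material.

```python
from pathlib import Path

OLD_ID = "ruggieri_foodmart"  # the ID named in the instructions above


def rename_database_id(text: str, account: str, old_id: str = OLD_ID) -> str:
    """Return text with every occurrence of old_id replaced by <account>_foodmart."""
    return text.replace(old_id, f"{account}_foodmart")


def rename_in_file(path: Path, account: str) -> None:
    """Rewrite the file in place (hypothetical helper; keep a backup copy first)."""
    original = path.read_text(encoding="utf-8")  # encoding is an assumption
    path.write_text(rename_database_id(original, account), encoding="utf-8")


if __name__ == "__main__":
    sample = "<ID>ruggieri_foodmart</ID>"  # assumed shape of the ID element
    print(rename_database_id(sample, "jdoe"))
```

Because the replacement is purely textual, it also changes any other occurrence of the old name in the file, which matches the intent of steps (1) and (2). Step (4) still has to be done by hand.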

Hours and Rooms

Classes

Lessons will be held at: Polo Didattico “L. Fibonacci”, Via F. Buonarroti 4, Pisa.

Day of Week | Hour | Room
Monday | 09:00 - 11:00 | LAB M
Tuesday | 11:00 - 13:00 | LAB M

Office hours

Prof. Anna Monreale: Wed 9:00 - 12:00, Dept. of Computer Science.

Learning Material

Slides & Recordings of the classes

  • The slides used in the course will be added to the calendar after each class.
  • A recording of each lecture will be published in the calendar after each class.

Past Exams

Software

F.A.Q.

Class calendar - (2018-2019)

Day | Topic | Slides | Recording | Data/Software | References
17.09 09:00-11:00 | Canceled. The lesson will be made up on October 19, 2018, Room I, 11:00-13:00 | | | |
18.09 11:00-13:00 | Canceled. The lesson will be made up on October 25, 2018, Room M, 09:00-11:00 | | | |
1. 24.09 09:00-11:00 | Introduction. File data access. Representation formats: CSV, FLV, ARFF, XML | lds.01.introduction.pdf lds.02.bi_architectures.pdf lds.03.file_data_access.pdf | Video 24/09/2018 | | BI technology: An Overview of Business Intelligence Technology; File access: File System Interface; File Formats: Introduction to data technologies (Chps. 5, 6), Weka ARFF Format, XRFF Format
2. 25.09 11:00-13:00 | Python Recap | Python Recap | Video 25/09/2018 | |
3. 01.10 09:00-11:00 | File data access in Python. Lab practice on file access. | lds.05.fileaccess-python.pdf | Video 01/10/2018 | Sample data code-2018-09-25.zip |
4. 02.10 11:00-13:00 | Lab practice on file access and on CSV-to-ARFF file format transformation. | | Video 02/10/2018 | xmlelements2csv.zip csv2arff.zip code-2018-10-01.zip |
5. 08.10 09:00-11:00 | Lab practice on file access. | | Video 08/10/2018 | ex-customers.pdf data-customers.zip |
6. 09.10 11:00-13:00 | RDBMS access protocols: ODBC, OLE DB, JDBC. ODBC Programming. | lbi.06.relationaldataaccess-1.pdf | Video 09/10/2018 | SolutionEx: 2018-10-09 |
7. 15.10 09:00-11:00 | Lab practice: stratified sampling in ODBC. | lbi.06.relational_data_access-complete.pdf | Video 15/10/2018 | code-2018-10-15.zip |
8. 16.10 11:00-13:00 | Introduction to SQL Server. ETL tools: SQL Server Integration Services (SSIS). | lds.07.sqlserver.pdf lds.08.etlandssis.pdf | Video 16/10/2018 | stratifiedsampling.zip |
9. 19.10 11:00-13:00 | SSIS samples and lab practice: update and pipeline. | | Video 19/10/2018 | lds-ssis-samples.zip ex-midterm.pdf |
10. 22.10 09:00-11:00 | SSIS samples and lab practice: sampling, update, surrogate keys. | | Video 22/10/2018 | |
11. 23.10 11:00-13:00 | SSIS samples and lab practice: surrogate keys, slowly changing dimensions, mid-term practice. | | Video 23/10/2018 | 2016ssis.zip |
12. 25.10 09:00-11:00 | SSIS samples and lab practice: surrogate keys, slowly changing dimensions, mid-term practice. | | Video 25/10/2018 | Dissimilarity.py MDP.py exam 14/4/2015 siss-mdp.zip ssis-dissimilarityindex.zip |
13. 05.11 09:00-11:00 | Data warehousing and OLAP recap. Data cubes, analytic SQL, and materialized views in SQL Server. | lds.09.dwandolap.pdf | Video 05/11/2018 First Part, Video 05/11/2018 Second Part | lbi.08.afdemo.sql.zip | For DW and OLAP: Decision support databases course lecture notes.
14. 07.11 11:00-13:00 | OLAP with SQL Server Analysis Services (SSAS): data source views, dimensions, hierarchies. Data cubes. | lds.10.ssas.pdf | Video 06/11/2018 | monreale_foodmart.zip (Notice: please read the instructions in the News section!) | 1) SSAS (OLAP): documentation; 2) S. Harinath et al. Professional Microsoft SQL Server Analysis Services 2012 with MDX and DAX, Wrox publisher, 2012. Chps. 4-6.
15. 12.11 09:00-11:00 | Parent-child hierarchies. OLAP exploratory data analysis with Pivot Tables in Excel. | | Video 12/11/2018 First Part, Video 12-13/11/2018 | | Pivot Tables in Excel: G. Harvey. Excel 2013 All-in-One For Dummies, 2013. Chp. VII-2.
16. 20.11 11:00-13:00 | Calculated metrics. ROLAP and MOLAP in SSAS. | | The video of the previous lecture also covers the topic of this lecture. | foodmartexplorative.xlsx | MDX: 1) documentation and a useful guide on ordering; 2) S. Harinath et al. Professional Microsoft SQL Server Analysis Services 2012 with MDX and DAX, Wrox publisher, 2012. Chp. 3.
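
As a small illustration of the CSV and ARFF representation formats covered in lectures 1 and 4, the following Python sketch converts CSV text into ARFF text. It is not the course's csv2arff.zip solution. It assumes the first CSV row is a header, that there are no missing values, and that names and values need no quoting. A column is declared numeric if every value parses as a float, otherwise nominal. The function names `is_number` and `csv_to_arff` are hypothetical.

```python
import csv
import io


def is_number(value: str) -> bool:
    """Return True if the string parses as a float."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def csv_to_arff(csv_text: str, relation: str) -> str:
    """Convert CSV text (first row = header) into ARFF text."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    lines = [f"@relation {relation}", ""]
    for i, name in enumerate(header):
        column = [row[i] for row in data]
        if all(is_number(v) for v in column):
            # Every value parses as a number: declare a numeric attribute.
            lines.append(f"@attribute {name} numeric")
        else:
            # Otherwise declare a nominal attribute listing the distinct values.
            values = ",".join(sorted(set(column)))
            lines.append(f"@attribute {name} {{{values}}}")
    lines += ["", "@data"]
    lines += [",".join(row) for row in data]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(csv_to_arff("age,city\n30,Pisa\n25,Rome\n", "people"))
```

Inferring the attribute type from the data keeps the sketch short. A real converter would also need to handle missing values, dates, and values that require quoting.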

Exams

Mid-term exams

Rule: Students may take the second mid-term even if they did not pass the first mid-term.

Date | Hour | Room | Notes | Marks
29.10.2018 | 09:00 - 12:00 | Room M | |
17.12.2018 | 09:00 - 12:00 | Room M | |

Exam sessions

Rule: Students who have passed at least one mid-term exam may take only one part of the written exam in the exam sessions.

Session Date Time Room Notes Marks

Extra sessions, academic year 2017/18

Date | Time | Room | Notes | Results
29.10.2018 | 09:00 - 12:00 | Room M | |

Past Editions

Last modified: 14/11/2018 at 18:17 by Anna Monreale