Strumenti Utente

Strumenti Sito


mds:smd:start

Differenze

Queste sono le differenze tra la revisione selezionata e la versione attuale della pagina.

Link a questa pagina di confronto

Entrambe le parti precedenti la revisione Revisione precedente
Prossima revisione
Revisione precedente
mds:smd:start [01/05/2019 alle 13:26 (5 anni fa)]
Salvatore Ruggieri [Class calendar]
mds:smd:start [04/11/2022 alle 12:18 (18 mesi fa)] (versione attuale)
Salvatore Ruggieri
Linea 1: Linea 1:
-<html> +====== Statistical Methods for Data Science A.Y2020/21 =====
-<!-- Google Analytics --> + 
-<script type="text/javascript" charset="utf-8"> +**This course is discontinuedStarting from A.Y. 2021/22, it has been replaced by 9 ECTS version:** 
-(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){ + 
-(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o), +  * **[[mds:sds: |Statistics for Data Science (628PP)]]**
-m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m) +
-})(window,document,'script','//www.google-analytics.com/analytics.js','ga');+
  
-ga('create', 'UA-34685760-1', 'auto', 'personalTracker', {'allowLinker': true}); 
-ga('personalTracker.require', 'linker'); 
-ga('personalTracker.linker:autoLink', ['pages.di.unipi.it', 'enforce.di.unipi.it', 'didawiki.di.unipi.it'] ); 
-   
-ga('personalTracker.require', 'displayfeatures'); 
-ga('personalTracker.send', 'pageview', 'ruggieri/teaching/smd/'); 
-setTimeout("ga('send','event','adjusted bounce rate','30 seconds')",30000);  
-</script> 
-<!-- End Google Analytics --> 
-<!-- Capture clicks --> 
-<script> 
-jQuery(document).ready(function(){ 
-  jQuery('a[href$=".pdf"]').click(function() { 
-    var fname = this.href.split('/').pop(); 
-    ga('personalTracker.send', 'event',  'SMD', 'PDFs', fname); 
-  }); 
-  jQuery('a[href$=".r"]').click(function() { 
-    var fname = this.href.split('/').pop(); 
-    ga('personalTracker.send', 'event',  'SMD', 'Rs', fname); 
-  }); 
-  jQuery('a[href$=".zip"]').click(function() { 
-    var fname = this.href.split('/').pop(); 
-    ga('personalTracker.send', 'event',  'SMD', 'ZIPs', fname); 
-  }); 
-}); 
-</script> 
-</html> 
-====== Statistical Methods for Data Science A.Y. 2018/19 ====== 
  
 =====Instructor===== =====Instructor=====
Linea 41: Linea 11:
     * Università di Pisa     * Università di Pisa
     * [[http://pages.di.unipi.it/ruggieri/]]     * [[http://pages.di.unipi.it/ruggieri/]]
-    * [[ruggieri@di.unipi.it]]   +    * [[salvatore.ruggieri@unipi.it]]   
   * **Office hours**   * **Office hours**
-    * Tuesday h 14:00 - 17:00, Department of Computer Science, room 321/DO. +    * <del>Tuesday h 14:00 - 17:00, Department of Computer Science, room 321/DO.</del> 
 +    * **Office hours only on appointment via Teams/Skype. Skype contact: salvatore.ruggieri**
  
  
Linea 50: Linea 21:
  
 ^  Day of Week  ^  Hour  ^  Room  ^  ^  Day of Week  ^  Hour  ^  Room  ^ 
-|  Monday |  14:00 - 16:00  |  Fib-N1  |  +|  Tuesday |  16:00 - 18:00  |  [[https://teams.microsoft.com/l/channel/19%3afdc694d4d7044c5eb7c81396dea7e64b%40thread.tacv2/General?groupId=a083eaab-5584-4177-b197-e4fb9637642f&tenantId=c7456b31-a220-47f5-be52-473828670aa1|Teams Virtual Room]]  |  
-|  Tuesday |  9:00 - 11:00  |  Fib-A1  |+|  Wednesday|  9:00 - 11:00  |  [[https://teams.microsoft.com/l/channel/19%3afdc694d4d7044c5eb7c81396dea7e64b%40thread.tacv2/General?groupId=a083eaab-5584-4177-b197-e4fb9637642f&tenantId=c7456b31-a220-47f5-be52-473828670aa1|Teams Virtual Room]]  
  
  
Linea 63: Linea 34:
  
  
-=====Text Books=====+=====Mandatory Teaching Material=====
  
 The following are //mandatory text books//: The following are //mandatory text books//:
Linea 77: Linea 48:
 =====Preliminary program and calendar===== =====Preliminary program and calendar=====
  
-  * [[https://esami.unipi.it/esami2/programma.php?c=38251&aa=2018|Preliminary program]]. +  * [[https://esami.unipi.it/programma.php?c=48036&aa=2020|Preliminary program]]. 
-  * [[https://www.di.unipi.it/en/education/mds/academic-calendar-2018-2019-wds|Calendar of lessons]].+  * [[https://didattica.di.unipi.it/en/master-programme-in-data-science-and-business-informatics/academic-calendar-2020-2021/|Calendar of lessons]].
  
  
-=====Project=====+=====Student project=====
  
-  * Project can be done in groups of at most students. +  * The project can be done in groups of at most students. 
-  * Project must be completed by end of July, including oral discussion (on project and all topics of the course)+  * The project must be delivered (report + code) by end of July
-  * Project replace the written exam but **students have to [[https://esami.unipi.it/esami2/|register for the written dates]] in order to fill the student's questionnaire**. +  * The oral discussion must be done by the September session, and it will cover both the project and all topics of the course. 
-  * {{ :mds:smd:smd.project.2019.pdf | Project presentation slides}}+  * The project replaces the written exam but **students have to [[https://esami.unipi.it/esami2/|register for the written dates]] in order to fill the student's questionnaire**
-  * [[https://drive.google.com/drive/folders/1rmgn0uM-YxXPdcU3DhYCL7C0rnKYW-Xc?usp=sharing|Google Drive project directory]] (accessible only to authorized students)+  * Groups ready to discuss send the project to the teacher plus availability time slots for oral discussion
 +  * {{ :mds:smd:smd.project.2021.pdf | Project presentation slides}} and [[http://patterns.di.unipi.it/sds/video/smd_project_2021.mp4|project info audio-video (.mp4)]].
 =====Written exam===== =====Written exam=====
  
-__//There are no mid-terms//.__ The exam consists of a written part and an oral part. The written part consists of exercises on the topics of the course. Each question is assigned a grade, summing up to 30 points. Students are admitted to the oral part if they receive a grade of at least 18 points. Oral consists of critical discussion of the written part and of open questions and problem solving on the topics of the course.  +__//There are no mid-terms//.__ The exam consists of a written part and an oral part. The written part consists of exercises on the topics of the course. Each question is assigned a grade, summing up to 30 points. Students are admitted to the oral part if they receive a grade of at least 18 points. Written exam consists of open questions and exercises. Example written texts: **{{ :mds:smd:smdsample.pdf | sample1}}**, **{{ :mds:smd:smdsample2.pdf | sample2}}**. Oral consists of critical discussion of the written part and of open questions and problem solving on the topics of the course.\\ 
-Registration to exams is mandatory: [[https://esami.unipi.it/esami2/|register here]]+**Online exams:** during the COVID-19 restrictions, the written part and the oral part will be online. For the written part, students will connect to a reserved Teams virtual room and will activate both microphone and web-cam. The text will be shared in the virtual room chat. Solutions will be written on sheet of papers. Each sheet will include name, surname, student id, and it will be signed. A photo of the sheets will be delivered to [[ruggieri@di.unipi.it]] at the end of the written part.  
 + 
 +Registration to exams is mandatory (**beware of the registration deadline!**): [[https://esami.unipi.it/esami2/|register here]]\\ 
  
-^  Date  ^  Hour  ^  Room  ^   
-|  18/6/2019  |  9:00 - 11:00  |  Fib-N1  | 
-|  2/7/2019  |  16:00 - 18:00  |  Fib-L1  | 
-|  24/7/2019  |  16:00 - 18:00  |  Fib-L1  | 
 =====Class calendar===== =====Class calendar=====
  
 ^ ^ Date ^ Room ^ Topic ^ Learning material ^  ^ ^ Date ^ Room ^ Topic ^ Learning material ^ 
-|1| 18.02 14:00-16:00 | N1 | Introduction. Probability and independence. | **[T]** Chpts. 1-3 |  +|01| 16.02 16:00-18:00 | Teams | Introduction. Probability and independence. [[http://patterns.di.unipi.it/sds/video/smd01_20210216.mp4|rec01 audio-video (.mp4)]]| **[T]** Chpts. 1-3 {{:mds:smd:smd01.pdf|slides01 (.pdf)}}|  
-|219.02 9:00-11:00 | A1 | R basics.    | **[R]** Chpts. 1,2.1,2.2 {{ :mds:smd:r_intro.pdf | slides}} {{ :mds:smd:2018smdr1.r | script1.R}} | +|0223.02 16:00-18:00 | Teams | R basics. [[http://patterns.di.unipi.it/sds/video/smd02_20210223.mp4|rec02 audio-video (.mp4)]]| **[R]** Chpts. 1,2.1,2.2 {{:mds:smd:smd02.pdf|slides02 (.pdf)}},  {{:mds:smd:smd02.r|script02 (.R)}}|  
-|326.02 9:00-11:00 | A1 | Discrete random variables.    | **[T]** Chpt. 4 **[R]** Chpt. 3 {{ :mds:smd:2018smdr2.r | script2.R}} | +|0324.02 9:00-11:00 | Teams | Discrete random variables. [[http://patterns.di.unipi.it/sds/video/smd03_20210224.mp4|rec03 audio-video (.mp4)]]| **[T]** Chpt. 4 **[R]** Chpt. 3 {{:mds:smd:smd03.pdf|slides03 (.pdf)}},  {{:mds:smd:smd03.r|script03 (.R)}}|  
-|44.03 14:00-16:00 | N1 | Continuous random variables. Simulation.  | **[T]** Chpts. 5, 6.1-6.2  **[R]** Chpt. 3 {{ :mds:smd:2018smdr3.r | script3.R}} | +|0402.03 16:00-18:00 | Teams | Recalls: derivatives and integrals. [[http://patterns.di.unipi.it/sds/video/smd04_20210302.mp4|rec04 audio-video (.mp4)]]| **[P]** Chpt. 1-8 {{:mds:smd:smd04.pdf|slides04 (.pdf)}},  {{:mds:smd:smd04.r|script04 (.R)}}| 
-|55.03 9:00-11:00 | A1 | Expectation and variance. R data access  | **[T]** Chpt. 7 **[R]** Chpt2. {{ :mds:smd:2018smdr4.r | script4.R}} | +|05| 03.03 9:00-11:00 | Teams | Continuous random variables. Simulation. [[http://patterns.di.unipi.it/sds/video/smd05_20210303.mp4|rec05 audio-video (.mp4)]]| **[T]** Chpts. 5, 6.1-6.2  **[R]** Chpt. 3 {{:mds:smd:smd05.pdf|slides05 (.pdf)}},  {{:mds:smd:smd05.r|script05 (.R)}}| 
-|612.03 9:00-11:00 | A1 Recalls: derivatives and integrals   | **[P]** Chpt. 1-8 {{ :mds:smd:2018smdrmath.scriptMath.R}} | +|0609.03 16:00-18:00 | Teams | Expectation and variance. Computations with random variables[[http://patterns.di.unipi.it/sds/video/smd06_20210309.mp4|rec06 audio-video (.mp4)]]| **[T]** Chpts. 7,8 {{:mds:smd:smd06.pdf|slides06 (.pdf)}},  {{:mds:smd:smd06.r|script06 (.R)}}| 
-|713.03 11:00-13:00 | I-Lab | Power laws and Zipf laws.  | [[https://arxiv.org/pdf/cond-mat/0412004.pdf | Newman's paper]] Sects. I,II,IIIA,IIIB,IIIE,IIIF\\ {{ :mds:smd:2018smdr5.script5.R}} +|0710.03 9:00-11:00 | Teams R data access and programming[[http://patterns.di.unipi.it/sds/video/smd07_20210310.mp4|rec07 audio-video (.mp4)]]| **[R]** Chpt. 2.3,2.4 {{:mds:smd:smd07.zip|script07 (.zip)}} | 
-|8| 18.03 14:00-16:00 N1 | Zipf lawsProject presentation.    |  +|0816.03 16:00-18:00 | Teams | Power laws and Zipf laws. [[http://patterns.di.unipi.it/sds/video/smd08_20210316.mp4|rec08 audio-video (.mp4)]]| [[https://arxiv.org/pdf/cond-mat/0412004.pdf | Newman's paper]] Sect I, II, III(A,B,E,F) {{:mds:smd:smd08.pdf|slides08 (.pdf)}}, {{:mds:smd:smd08.zip|script08 (.zip)}} 
-|919.03 9:00-11:00 | A1 R programming   | **[R]** Chpt2.3 {{ :mds:smd:r_intro_exercise.exercise.R}} {{ :mds:smd:2018smdr6.script6.R}} | +|0917.03 9:00-11:00 | Teams Moments, joint distributions, sum of random variables. [[http://patterns.di.unipi.it/sds/video/smd09_20210317.mp4|rec09 audio-video (.mp4)]]| **[T]** Chpts9-11 {{:mds:smd:smd09.pdf|slides09 (.pdf)}}{{:mds:smd:smd09.zip|script09 (.zip)}} | 
-|10| 20.03 11:00-13:00 | I-Lab Computations with random variablesJoint distributions | **[T]** Chpts. 8-{{ :mds:smd:2018smdr7.r | script7.R}} | +|10| 23.03 16:00-18:00 | Teams Law of large numbersThe central limit theorem[[http://patterns.di.unipi.it/sds/video/smd10_20210323.mp4|rec10 audio-video (.mp4)]]| **[T]** Chpts. 13-14 {{:mds:smd:smd10.pdf|slides10 (.pdf)}}, {{:mds:smd:smd10.r|script10 (.R)}}| 
-|11| 25.03 14:00-16:00 | N1 CovarianceSum of random variables   | **[T]** Chpts10-11 | +|11| 24.03 9:00-11:00 | Teams Project presentationGraphical summaries. [[http://patterns.di.unipi.it/sds/video/smd11_20210324.mp4|rec11 audio-video (.mp4)]]| **[T]** Chpt15 {{:mds:smd:smd11.pdf|slides11 (.pdf)}}{{:mds:smd:smd11.r|script11 (.R)}}| 
-|12| 26.03 9:00-11:00 | A1 | Law of large numbersThe central limit theorem.    | **[T]** Chpts13-14 {{ :mds:smd:2018smdr8.script8.R}} +|1230.03 16:00-18:00 | Teams | Numerical summaries. Data preprocessing in R. [[http://patterns.di.unipi.it/sds/video/smd12_20210330.mp4|rec12 audio-video (.mp4)]]| **[T]** Chpt. 16, **[R]** Chpts. 4,10 {{:mds:smd:smd12.pdf|slides12 (.pdf)}}, {{:mds:smd:smd12.r|script12 (.R)}}, {{ :mds:smd:dataprep.r | dataprep.R}} | 
-|13| 8.04 14:00-16:00 | N1 | Graphical summaries.    | **[T]** Chpt. 15 {{ :mds:smd:2018smdr9.r | script9.R}} | +|137.04 9:00-11:00 | Teams | Unbiased estimators. Efficiency and MSE. [[http://patterns.di.unipi.it/sds/video/smd13_20210407.mp4|rec13 audio-video (.mp4)]]| **[T]** Chpts. 17.1-17.3, 19, 20  {{:mds:smd:smd13.pdf|slides13 (.pdf)}}, {{:mds:smd:smd13.r|script13 (.R)}} | 
-|149.04 9:00-11:00 | A1 | Numerical summaries. Data preprocessing in R. Q&A on the project  | **[T]** Chpt. 16, **[R]** Chpts. 4,10 {{ :mds:smd:2018smdr10.r | script10.R}}, {{ :mds:smd:dataprep.r | dataprep.R}} | +|1413.04 16:00-18:00 | Teams | Maximum likelihood estimation[[http://patterns.di.unipi.it/sds/video/smd14_20210413.mp4|rec14 audio-video (.mp4)]]| **[T]** Chpt. 21 {{ :mds:smd:notes1.pdf |}}  {{:mds:smd:smd14.pdf|slides14 (.pdf)}}, {{:mds:smd:smd14.r|script14 (.R)}} | 
-|1515.04 14:00-16:00 | N1 | Unbiased estimators. Efficiency and MSE    | **[T]** Chpts. 17.1-17.3, 19, 20 {{ :mds:smd:2018smdr11.r | script11.R}} | +|1514.04 9:00-11:00 | Teams | Linear regression. Least squares estimation[[http://patterns.di.unipi.it/sds/video/smd15_20210414.mp4|rec15 audio-video (.mp4)]]| **[T]** Chpts. 17.4,22 **[R]** Chpts. 6 {{ :mds:smd:notes2.pdf |}}  {{:mds:smd:smd15.pdf|slides15 (.pdf)}}{{:mds:smd:smd15.r|script15 (.R)}} | 
-|1616.04 9:00-11:00 | A1 | Maximum likelihood. Fisher information | **[T]** Chpt. 21 {{ :mds:smd:notes1.pdf |}}| +|16| 20.04 16:00-18:00 | Teams | Multiple, non-linear, and logistic regression. [[http://patterns.di.unipi.it/sds/video/smd16_20210420.mp4|rec16 audio-video (.mp4)]]| **[R]** Chpt. 12.1,13,16.1-16.2 {{ :mds:smd:notes2.pdf |}} {{:mds:smd:smd16.pdf|slides16 (.pdf)}}, {{:mds:smd:smd16.zip|script16 (.zip)}} | 
-|1729.04 14:00-16:00 | N1 | Linear, polynomial, and non-linear regressions and least squares.  | **[T]** Chpts. 17.4,22 **[R]** Chpts. 6,12.1,16.1-16.2 {{ :mds:smd:2018smdr12.script12.R}} | +|1721.04 9:00-11:00 | Teams Logistic regression (ctd). Introduction to confidence intervals. [[http://patterns.di.unipi.it/sds/video/smd17_20210421.mp4|rec17 audio-video (.mp4)]]| **[T]** Chpts. 23.1 {{:mds:smd:smd17.pdf|slides17 (.pdf)}}, {{:mds:smd:smd17.r|script17 (.R)}} | 
-|1830.04 9:00-11:00 | A1 Confidence IntervalsGaussian, T-student, large sample method | **[T]** Chpts. 23.1,23.2,23.4,24.3,24.4 {{ :mds:smd:2018smdr13.r | script13.R}} | +|1827.04 16:00-18:00 | Teams | Confidence intervals: Gaussian, T-student, large sample method. Confidence intervals in linear regression. [[http://patterns.di.unipi.it/sds/video/smd18_20210427.mp4|rec18 audio-video (.mp4)]]| **[T]** Chpts. 23.2,23.4, 4.3,24.4 {{ :mds:smd:notes2.pdf |}} 
-| | <del>6.05 14:00-16:00</del> <del>N1</del> | **NO LESSON ON THIS DATE** | | +|19| 28.04 9:00-11:00 | Teams | Empirical bootstrap. Application to confidence intervals. [[http://patterns.di.unipi.it/sds/video/smd19_20210428.mp4|rec19 audio-video (.mp4)]]| **[T]** Chpts. 18.1,18.2,23.3 {{:mds:smd:smd19.pdf|slides19 (.pdf)}}, {{:mds:smd:smd19.r|script19 (.R)}} | 
-|19| 7.05 9:00-11:00 | A1 | Empirical and parametric bootstrap. Application to confidence intervals.   | **[T]** Chpts. 18,23.3 {{ :mds:smd:2018smdr14.r | script14.R}} | +|20| 04.05 16:00-18:00 | Teams Parametric bootstrap. Hypotheses testing. [[http://patterns.di.unipi.it/sds/video/smd20_20210504.mp4|rec20 audio-video (.mp4)]]| **[T]** Chpts. 18.3,25 {{:mds:smd:smd20.pdf|slides20 (.pdf)}}, {{:mds:smd:smd20.r|script20 (.R)}} | 
-|20| **8.05 11:00-13:00** **I-Lab** | Hypotheses testing. t-test and application to linear regressions | **[T]** Chpts. 25-27**[R]** Chpt5.{{ :mds:smd:2018smdr15.r | script15.R}} | +|2105.05 9:00-11:00 | Teams | One-sample tests of the mean and application to linear regression.[[http://patterns.di.unipi.it/sds/video/smd21_20210505.mp4|rec21 audio-video (.mp4)]]| **[T]** Chpts26-27, **[R]** Chpts5.1,5.2 {{:mds:smd:smd21.pdf|slides21 (.pdf)}}, {{ :mds:smd:notes2.pdf |}}, {{:mds:smd:smd21.r|script21 (.R)}} 
-| | <del>13.05 14:00-16:00</del> <del>N1</del> | **NO LESSON ON THIS DATE** | | +|22| 11.05 16:00-18:00 | Teams Multiple comparisonsFitting distributions.[[http://patterns.di.unipi.it/sds/video/smd22_20210511.mp4|rec22 audio-video (.mp4)]]{{ :mds:smd:ks.pdf K-S}}, {{:mds:smd:smd22.pdf|slides22 (.pdf)}}, {{:mds:smd:smd22.r|script22 (.R)}} 
-| | <del>14.05 9:00-11:00</del> | <del>A1</del>**NO LESSON ON THIS DATE** | | +|23| 12.05 9:00-11:00 | Teams Two-sample tests of the mean, and F-test.[[http://patterns.di.unipi.it/sds/video/smd23_20210512.mp4|rec23 audio-video (.mp4)]]| **[T]** Chpts28, **[R]** Chpts5.3-5.7 {{:mds:smd:smd23.pdf|slides23 (.pdf)}}, {{:mds:smd:smd23.r|script23 (.R)}} 
-|21| 20.05 14:00-16:00 | N1 | ...    | ... +|24| 18.05 16:00-18:00 | Teams | Testing correlation/independence. Multiple-sample tests of the mean.[[http://patterns.di.unipi.it/sds/video/smd24_20210518.mp4|rec24 audio-video (.mp4)]]| **[R]** Chpts7, 8 {{:mds:smd:smd24.pdf|slides24 (.pdf)}}, {{:mds:smd:smd24.r|script24 (.R)}} 
-|22| 21.05 9:00-11:00 | A1 | ...    ... | +|--| 19.05 9:00-11:00 | Teams | Office hours and project tutoring. |  |
-| | <del>27.05 14:00-16:00</del> | <del>N1</del> **NO LESSON ON THIS DATE (EU ELECTIONS)** | +
-|23| 28.05 9:00-11:00 | A1 | ...    | ... | +
-|24| **29.05 11:00-13:00** | **I-Lab** ...    | ... | +
  
  
  
-=====Previous years===== 
  
-  * [[mds:smd:2018|Statistical Methods for Data Science A.Y. 2017/18]] 
-  * [[mds:smd:2017|Statistical Methods for Data Science A.Y. 2016/17]] 
  
  
mds/smd/start.1556717177.txt.gz · Ultima modifica: 01/05/2019 alle 13:26 (5 anni fa) da Salvatore Ruggieri