Strumenti Utente

Strumenti Sito


dm:start:guidelines

Differenze

Queste sono le differenze tra la revisione selezionata e la versione attuale della pagina.

Link a questa pagina di confronto

Entrambe le parti precedenti la revisione Revisione precedente
Prossima revisione
Revisione precedente
dm:start:guidelines [03/10/2018 alle 23:29 (6 anni fa)]
Anna Monreale [Guidelines for the task on Data Understanding]
dm:start:guidelines [23/11/2020 alle 10:34 (3 anni fa)]
Riccardo Guidotti [Guidelines for the task on Classification]
Linea 1: Linea 1:
 ====== Guidelines for the task on Data Understanding ====== ====== Guidelines for the task on Data Understanding ======
    * Data understanding (30 points)    * Data understanding (30 points)
-     Data semantics (3 points) +     Data semantics (3 points) 
-   *   Distribution of the variables and statistics (7 points) +     - Distribution of the variables and statistics (7 points) 
-   *   Assessing data quality (missing values, outliers) (7 points) +     - Assessing data quality (missing values, outliers) (7 points) 
-   *   Variables transformations (6 points) +     - Variables transformations (6 points) 
-   *   Pairwise correlations and eventual elimination of redundant variables (7 points)+     - Pairwise correlations and eventual elimination of redundant variables (7 points)
  
    
Linea 36: Linea 36:
 ====== Guidelines for the task on Classification ====== ====== Guidelines for the task on Classification ======
    * Learning of different decision trees/classification algorithms with different parameters and gain formulas with the object of maximizing the performances (12 points)    * Learning of different decision trees/classification algorithms with different parameters and gain formulas with the object of maximizing the performances (12 points)
-   * Decision trees interpretation (6 points) +   * Decision trees interpretation, validation with test and training set (6 points)  
-   Decision trees validation with test and training set (6 points)+   Training of different KNN classifiers with different parameters with the object of maximizing the performances (6 points)
    * Discussion of the best prediction model (6 points)    * Discussion of the best prediction model (6 points)
  
dm/start/guidelines.txt · Ultima modifica: 23/11/2020 alle 10:34 (3 anni fa) da Riccardo Guidotti