magistraleinformatica:ir:ir22:start
Differenze
Queste sono le differenze tra la revisione selezionata e la versione attuale della pagina.
Entrambe le parti precedenti la revisioneRevisione precedenteProssima revisione | Revisione precedente | ||
magistraleinformatica:ir:ir22:start [18/01/2023 alle 15:01 (2 anni fa)] – [Exams] Paolo Ferragina | magistraleinformatica:ir:ir22:start [07/09/2023 alle 14:00 (22 mesi fa)] (versione attuale) – [Exams] Paolo Ferragina | ||
---|---|---|---|
Linea 42: | Linea 42: | ||
^ Date ^ Room ^ Text ^ Notes | | ^ Date ^ Room ^ Text ^ Notes | | ||
- | | 17/01/23, start at 09:00 | room E | {{ : | + | | 17/01/23, start at 09:00 | room E | {{ : |
- | | 08/02/23, start at 11:00 | room E | text, results, solution | | | + | | 08/02/23, start at 11:00 | room A1 | {{ : |
+ | | 05/06/2023, start at 16:00 | room C | {{ : | ||
+ | | 05/07/2023, start at 11:00 | room A1 | {{ : | ||
+ | | 24/07/2023, start at 14:00 | room C | {{ : | ||
+ | | 07/09/2023, start at 14:00 | room A1 | {{ : | ||
====== Materials for study ====== | ====== Materials for study ====== | ||
Linea 72: | Linea 77: | ||
| 07.11.2022 | Query processing: soft-AND. Phrase queries, biword index and positional index. Exact search: hashing. Prefix search: compacted trie, front coding, 2-level indexing. Edit distance with e-errors via brute-force approach, or Dynamic Programming (possibly weighted). Overlap measure with k-gram index. An index for e-error matches based on k-gram index (with false positives, no false negatives). | Sect. 2.3 and 2.4 of [MRS].\\ [[https:// | | 07.11.2022 | Query processing: soft-AND. Phrase queries, biword index and positional index. Exact search: hashing. Prefix search: compacted trie, front coding, 2-level indexing. Edit distance with e-errors via brute-force approach, or Dynamic Programming (possibly weighted). Overlap measure with k-gram index. An index for e-error matches based on k-gram index (with false positives, no false negatives). | Sect. 2.3 and 2.4 of [MRS].\\ [[https:// | ||
| 08.11.2022 | Caching and Tiered index. An efficient filter for one-error match (with false positives, no false negatives). | | 08.11.2022 | Caching and Tiered index. An efficient filter for one-error match (with false positives, no false negatives). | ||
- | | 14.11.2022 | Text-based ranking: dice, jaccard, tf-idf. Vector space model and cosine similarity doc-doc and query-doc. Storage of tf-idf and use for computing document-query similarity. Fast top-k retrieval: high idf, champion lists, many query terms, clustering.| Sect 6.2 and 6.3 and 7 from [MRS], [[https:// | + | | 14.11.2022 | Text-based ranking: dice, jaccard, tf-idf. Vector space model and cosine similarity doc-doc and query-doc. Storage of tf-idf and use for computing document-query similarity. Fast top-k retrieval: high idf, champion lists, many query terms, clustering.| Sect 6.2 and 6.3 and 7 from [MRS], [[https:// |
| 15.11.2022 | Fast top-k retrieval: fancy hits. Exact Top-K: WAND and blocked-WAND. | | | | 15.11.2022 | Fast top-k retrieval: fancy hits. Exact Top-K: WAND and blocked-WAND. | | | ||
| 21.11.2022 | Relevance feedback, Rocchio, pseudo-relevance feedback, query expansion. Performance measures: precision, recall, F1, DCG and NDCG. | Sect 8.1-8.3 and 9 [MRS]. | | | 21.11.2022 | Relevance feedback, Rocchio, pseudo-relevance feedback, query expansion. Performance measures: precision, recall, F1, DCG and NDCG. | Sect 8.1-8.3 and 9 [MRS]. | |
magistraleinformatica/ir/ir22/start.1674054114.txt.gz · Ultima modifica: 18/01/2023 alle 15:01 (2 anni fa) da Paolo Ferragina