====== Web Mining and Social Network Analysis 2014 ======

  * **Dino Pedreschi** Università di Pisa, Knowledge Discovery and Data Mining Lab [[pedre@di.unipi.it]]
  * Teaching assistants: **Luca Pappalardo** [[lpappalardo@di.unipi.it]] and **Giulio Rossetti** [[giulio.rossetti@isti.cnr.it]], Knowledge Discovery and Data Mining Lab

===== News =====

  * **Next exam session: Friday, September 12, 2014 -- h 09:00 -- Prof. Pedreschi's office**
  * Friday, June 6, 2014 - 14:00-16:00 Sala Seminari Est: PhD Workshop

===== 2014 Schedule =====

  * **Monday, h 16:00 - 18:00, Aula C**
  * **Thursday, h 11:00 - 13:00, Aula N1**

====== Goals ======

Over the past decade there has been a growing public fascination with the complex "connectedness" of modern society. This connectedness is found in many contexts: in the rapid growth of the Internet and the Web, in the ease with which global communication now takes place, and in the ability of news and information as well as epidemics and financial crises to spread around the world with surprising speed and intensity. These are phenomena that involve networks and the aggregate behavior of groups of people; they are based on the links that connect us and the ways in which each of our decisions can have subtle consequences for the outcomes of everyone else.

This short course is an introduction to the analysis of complex networks, with a special focus on social networks and the Web - its structure and function, and how it can be exploited to search for information. Drawing on ideas from computing and information science, applied mathematics, economics and sociology, the course describes the emerging field of study that is growing at the interface of all these areas, addressing fundamental questions about how the social, economic, and technological worlds are connected.
====== Syllabus ======

1) Graph theory and social networks
  * Graphs
  * Social, information, biological and technological networks
  * Strong and weak ties
  * Networks in their surrounding context

2) The World Wide Web
  * The structure of the Web
  * Link analysis and Web search
  * Web mining and sponsored search markets

3) Network dynamics
  * Information cascades
  * Power laws and rich-get-richer phenomena
  * The small-world phenomenon
  * Epidemics

====== Textbooks and materials ======

  * Slides (see Calendar).
  * **David Easley, Jon Kleinberg: Networks, Crowds, and Markets. [[http://www.cs.cornell.edu/home/kleinber/networks-book/]]**
  * **Albert-Laszlo Barabasi. Network Science Book Project (2013, ongoing) [[http://barabasilab.neu.edu/networksciencebook/]]**

Reading:

  * **M. E. J. Newman: The structure and function of complex networks**, SIAM Review, Vol. 45, pp. 167-256, 2003. ({{:wma:newman_2003.pdf|download pdf}})
  * **A.-L. Barabasi. Linked. PLUME, Penguin Group, 2002.**
  * Duncan J. Watts. //Six Degrees: The Science of a Connected Age.// Norton, New York, 2003.
  * Anand Rajaraman, Jeffrey D. Ullman, Mining of Massive Datasets. [[http://infolab.stanford.edu/~ullman/pub/book.pdf]]

Course on **Network Science** taught by **Albert-Laszlo Barabasi** at Northeastern University, Boston, MA: [[http://barabasilab.neu.edu/courses/phys5116/|link]]

====== Midterm Project ======

  * Translation of Barabasi's network science book.

====== Project ======

  * {{:wma:wma_20final_20term_202014.pdf|Project Assignment & Instructions}}

====== Calendar ======

^ ^ Date ^ Topic ^ Learning material ^ Homework ^
|1. | Monday, 17.02.2014 | Introduction to Complex Network Analysis. | {{:wma:pedreschi.wmr.2012.01.pdf|slides}} | **Reading:** Chapters 1 and 2 of Kleinberg's book and Chapters 1 and 2 of Barabasi's book. |
|2. | Thursday, 20.02.2014 | Basic network measures: degree, distance, clustering | | |
|3. | Monday, 24.02.2014 | Basic network measures: degree, distance, clustering | {{:wma:pedreschi.wmr.2012.02.pdf|slides}} | |
|4. | Thursday, 27.02.2014 | Random graphs and real networks | {{:wma:pedreschi.wmr.2012.03.pdf|slides}} | **Reading:** Chapter 3 of Barabasi's book |
| | Monday, 03.03.2014 | Lecture canceled | | |
|5. | Thursday, 06.03.2014 | Network analytics with Cytoscape | [[http://www.cytoscape.org/|Cytoscape website]] | |
|6. | Monday, 10.03.2014 | Small world, Strength of weak ties | {{:wma:pedreschi.wmr.2012.04.pdf|slides}} | **Reading:** Chapter 3 of Kleinberg's book, {{:wma:travers69smallworld.pdf|Milgram's small world experiment}}, {{:wma:watts-smallworld2003.pdf|Watts' email experiment}}, {{:wma:leskovec-im.pdf|Leskovec's IM experiment}}, {{:wma:granstrengthweakties.pdf|Granovetter's Strength of Weak Ties theory}}, {{:wma:pnas-2007-onnela-7332-6.pdf|Onnela et al.'s Strength of Weak Ties experiment}} |
|7. | Thursday, 13.03.2014 | Organization of Translation Project | [[http://barabasilab.neu.edu/networksciencebook/|NetSci Book]] | |
|8. | Monday, 17.03.2014 | Centrality measures | {{:wma:pedreschi.wmr.2012.05.centrality.pdf|slides}} | |
|9. | Monday, 24.03.2014 | Scale-free networks. Generative models: Small World model and Barabasi-Albert model (preferential attachment) | [[http://barabasilab.neu.edu/courses/phys5116/content/Class4_NetSci_2012/04_CLASS_2012_Scale-Free_Property.pdf|slides Scale Free Property]] [[http://barabasilab.neu.edu/courses/phys5116/content/Class5_NetSci_2012/05_CLASS_2012_The_Small_World.pdf|slides Small World Model]] [[http://barabasilab.neu.edu/courses/phys5116/content/Class7_NetSci_2012/07_CLASS_2012_BAmodel.pdf|slides Barabasi Albert Model]] | Read Chapters 4 and 5 of Barabasi's book. Read the original papers on the {{:wma:wsmodel.pdf| Watts-Strogatz model}} and the {{:wma:bamodel.pdf| Barabasi-Albert model}} |
|10. | Thursday, 27.03.2014 | Preferential attachment (continued) | | |
|11. | Monday, 07.04.2014 | Link prediction | {{:wma:lezionerossettilinkprediction09.05.2012.pdf|slides}} | Guest lecturer: Giulio Rossetti (PhD program in Computer Science, Università di Pisa) |
|12. | Thursday, 10.04.2014 | Network simulations with NetLogo. Network robustness to failures and attacks. | [[http://ccl.northwestern.edu/netlogo/|NetLogo website]] [[http://barabasilab.neu.edu/courses/phys5116/|Slides Barabasi Class 9]] | Read Chapter 8 of Barabasi's book. |
|13. | Monday, 14.04.2014 | The contribution of Social Network Analysis to "peace research" | [[Abstract]] | Guest lecturer: Prof. Andrea Salvini, Dept. of Social Sciences, Univ. Pisa |
|14. | Monday, 28.04.2014 | Diffusion, spreading, contagion, epidemics | {{:wma:07-cascading_leskovec_.pdf|slides Leskovec}}, {{:wma:08-cascades_leskovec_.pdf|slides Leskovec}} | Read Chapters 16 and 17 of Kleinberg's book. |
|15. | Monday, 05.05.2014 | Diffusion, spreading, contagion, epidemics (continued) | | |
|16. | Thursday, 08.05.2014 | Diffusion, spreading, contagion, epidemics (continued) | {{:wma:09-contagion.pdf|slides Leskovec}} [[http://barabasilab.neu.edu/courses/phys5116/content/Class17_NetSci_2012/17_CLASS_2012_Spreading.ppt|slides Barabasi]] | Read Chapters 19 and 21 of Kleinberg's book. |
|17. | Monday, 12.05.2014 | Diffusion, spreading, contagion, epidemics (continued) | | |
|18. | Thursday, 17.05.2014 | Network effects: Schelling's segregation model | | |
|19. | Monday, 19.05.2014 | Project assignment and organisation | | |
|20. | Friday, 06.06.2014 | **PhD workshop** | Tiziano De Matteis: //[[Models for evolving graphs]]//. Andrea De Salve: //[[Spreading in Distributed Online Social Networks]]//. Francesco Piccinno: //[[Large Graph Processing]]// | Hours: 14:00-16:00 Room: Sala Seminari Est - Dept. of Computer Science |

====== Links to previous editions ======

  * 2012-2013 edition [[WMA20122013]]
  * 2011-2012 edition [[WMA20112012]]
  * 2010-2011 edition [[WMA20102011]]
  * 2008-2009 edition [[WMA20082009]]