|
|
|
|
Stanford University Stat 252 and MS&E 238 Spring 2008 class time: Monday 3:15 - 6:05 pm Spring 2008 class location: Gates B01 This file (weigend.com/teaching/stanford) is the official course page. Other resources are probably more important: Spring 2008 list of students Spring 2008 course wiki Spring 2007 course wiki Data Mining and Electronic BusinessThis course is about people and data: Collecting data about behavior on the web, in social networks, in communication, on dating sites, etc. Mining the data, building predictive models, creating (and rejecting) hypotheses, designing cool experiments, and learning from them quickly. And figuring out what is truly new, what is similar to the past, and what the underlying drivers are. We will discuss the impact of the communication and data revolution on individuals, business, and society, i.e., to many aspects of the world we live in.The 90’s, the decade of algorithms (data mining), focused on the question: "Given these data, what insights can you get?". Great algorithms were invented, refined, and their strengths and weaknesses understood. The current decade is the decade of data (data mining), and the question has shifted to: "Given these problems, what data can you get?". Furthermore, economic aspects of data are becoming increasingly important, with the question becoming "Who pays whom?". The first half of the course focuses on data: Click data (what all can be collected and what it is useful for), intention data (such the queries from the searches you do, we will also discuss social search), attention data (such as tags on social bookmarking sites with its important application for discovery), and interaction data (of email headers and social networking sites). The second half of the quarter focuses on models and on creating appropriate structures and incentives. We will discuss models for products (recommender systems), people (reputation systems), situation and location. The second half discusses applications. They range from personalization, recommendations and online marketing (behavioral and situational targeting), to the principles behind collective intelligence, reputation systems and peer-production, as well as prediction markets as yet another way of gleaning data from people and fostering interactions between them. Students are expected to actively engage in class discussions, to have their assumptions challenged, and to bring their various backgrounds to bear to make it a great experience for themselves and everybody else. We will also have some great guest speakers come to class. After each class, a detailed write-up is created by the students as the course wiki (see 2007). To help prospective students with the decision of whether to take this course, previous syllabi (2004, 2005) might also be useful. Schedule: We meet once a week (Monday afternoons) for 3 hours. The dates in Spring 2008 are:
Course wiki: All students have full read/write access to the course wiki at stanford2008.wiskispaces.com. I encourage you to actively contribute -- the class and you will benefit. Grading: The main goal is that you get insights and that you transfer them to your area, coming up with some interesting ideas and applications. To support this objective, your grade will be determined by the following:
Readings Some of the material is very recent and originates from several academic disciplines. Besides statistics and computer science, it discusses modern marketing techniques, behavioral economics, social network analysis ideas and other concepts. Depending on your specific background and interests, the following might be useful:
Teaching Assistants, office hours and other information is on the course wiki. |
|
I am at Burning Man. I will not be able to read email or get text messages until I am back in San Francisco on Tuesday, September 2.
My email is
Click on an airport to see when I'll be there : AEP : BKK : BRC : EZE : FRA : GRU : HAM : ICN : LAX : LBC : LHR : MPX : MUC : MVD : OAK : OSL : PDP : PEK : SAN : SEL : SFO : SHA : SIN : SJC : USH : all |
|