Department of Computer Science
Pomona College
CS 160 - Introduction to Information Retrieval
Fall 2009

Instructor: Dave Kauchak
e-mail: [first_initial][last_name]@cs.pomona.edu
office hours: T/Th 10:30am-12 and 2-3pm

time: MW 11-12:15
location: Edmunds 217
web page: http://www.cs.pomona.edu/classes/cs160/
discussion board: TBA

textbook:

assignment handin: How to submit assignments


Announcements

(you are responsible for information posted here)

Ethics articles and assignments posted
homework 6 is available
Cumulative grades so far can be found here
Final project specification is available
assignment 4 is available
Sample output for assignment 3 can be found here
homework 5 is available
homework 3 solution available
assignment 3 is available
homework 4 is available
homework 2 solution available
homework 3 is available
assignment 2 is available
Remember to read the popular media article (scheduled for 9/21)
homework 1 solution available
homework 2 is available
The page numbers for the homework are different for the online version, but the problem numbers are the same. Don't worry :)
homework 1 is available
assignment 1 is available

Schedule

Note: This schedule is tentative and subject to change.
Date | Topic | Reading | Slides/Handouts
9/2 | Admin. material, Introduction | Ch. 1 except 1.2 | admin, slides, pdf
9/7 | Text pre-processing | Ch. 2, 5.1 | slides, pdf
9/9 | Index construction | Ch. 1.2, Ch. 4 | slides, pdf
9/14 | Index compression | Ch. 5 | slides, pdf
9/16 | TF-IDF | Ch. 6 except 6.4.4 | slides, pdf
9/21 | Faster TF-IDF | Ch. 7, article | slides, pdf
9/23 | Evaluation | Ch. 8 | slides, pdf
9/28 | Spelling correction | Ch. 3.3, 3.4, article | slides, pdf
9/30 | Relevance feedback/query expansion | Ch. 9 | slides, pdf
10/5 | Web search basics | Ch. 19 (except 19.3), article | slides, pdf
10/7 | Crawling | Ch. 20 | slides, pdf
10/12 | Link Analysis | Ch. 21 | slides, pdf
10/14 | Midterm | |
10/19 | fall recess | |
10/21 | Text segmentation | paper, article | slides, pdf
10/26 | Audio processing basics | paper | slides, pdf
10/28 | Audio search | paper | slides, pdf
11/2 | Image processing basics | paper, article | slides, pdf
11/4 | Project proposal discussion | |
11/9 | Document Image search | paper | slides, pdf
11/11 | Information Extraction | paper, article | slides, pdf
11/12 4:15, Rose Hills | Document modeling (substitute lecture) | paper |
11/16 | Text classification | Ch. 13 (except 13.5), 14.intro, 14.1, 14.3-6, 15-15.3 | slides, pdf
11/18 | Text classification 2 | article | slides, pdf
11/23 | Text clustering | Ch. 16 | slides, pdf
11/25 | No class, substituted on 11/12 | |
11/30 | Hierarchical clustering | Ch. 17, paper, article | slides, pdf
12/2 | Online Advertising | Ch. 3.9.2, 19.3 | slides, pdf
12/7 | Ethics in IR | |
12/9 | Review? (cross-lingual IR?) | |
12/14 | Final time 9am - project presentations | |