Instructor: Dave Kauchak
e-mail: [first_initial][last_name]@cs.pomona.edu
office hours: T/Th 10:30am-12pm and 2-3pm
time: MW 11-12:15
location: Edmunds 217
web page: http://www.cs.pomona.edu/classes/cs160/
discussion board: TBA
textbook:
assignment handin: How to submit assignments
Date | Topic | Reading | Slides/Handouts |
---|---|---|---|
9/2 | Admin. material, Introduction | Ch. 1 except 1.2 | admin, slides, pdf |
9/7 | Text pre-processing | Ch. 2, 5.1 | slides, pdf |
9/9 | Index construction | Ch. 1.2, Ch. 4 | slides, pdf |
9/14 | Index compression | Ch. 5 | slides, pdf |
9/16 | TF-IDF | Ch. 6 except 6.4.4 | slides, pdf |
9/21 | Faster TF-IDF | Ch. 7, article | slides, pdf |
9/23 | Evaluation | Ch. 8 | slides, pdf |
9/28 | Spelling correction | Ch. 3.3, 3.4, article | slides, pdf |
9/30 | Relevance feedback/query expansion | Ch. 9 | slides, pdf |
10/5 | Web search basics | Ch. 19 (except 19.3), article | slides, pdf |
10/7 | Crawling | Ch. 20 | slides, pdf |
10/12 | Link analysis | Ch. 21 | slides, pdf |
10/14 | Midterm | ||
10/19 | Fall recess | ||
10/21 | Text segmentation | paper, article | slides, pdf |
10/26 | Audio processing basics | paper | slides, pdf |
10/28 | Audio search | paper | slides, pdf |
11/2 | Image processing basics | paper, article | slides, pdf |
11/4 | Project proposal discussion | ||
11/9 | Document image search | paper | slides, pdf |
11/11 | Information extraction | paper, article | slides, pdf |
11/12, 4:15, Rose Hills | Document modeling (substitute lecture) | paper | |
11/16 | Text classification | Ch. 13 (except 13.5), 14.intro, 14.1, 14.3-6, 15-15.3 | slides, pdf |
11/18 | Text classification 2 | article | slides, pdf |
11/23 | Text clustering | Ch. 16 | slides, pdf |
11/25 | No class (replaced by the 11/12 substitute lecture) | ||
11/30 | Hierarchical clustering | Ch. 17, paper, article | slides, pdf |
12/2 | Online advertising | Ch. 3.9.2, 19.3 | slides, pdf |
12/7 | Ethics in IR | ||
12/9 | Review? (cross-lingual IR?) | ||
12/14 | Final exam time, 9am: project presentations | ||