Week 1

N-grams.

Read Chapter 6 of the textbook.
Do it soon, and bring your questions to class. You will have lots of them.
Counting n-grams
Due by 11:59 PM Monday, April 2. Please submit via HSP.
Read Section 2.1 of the textbook.
By Wednesday, April 4.

Week 2

Parsing.

Write a parser based on Earley's algorithm.
Due 11:59 PM Monday, April 16.

Week 4

Syntax-driven semantics

Read the handout from Jurafsky & Martin.

Week 5

Hidden Markov Models

Do this semantic analysis assignment.
Hand in on paper by noon Friday, April 27.
Read Chapter 9 of the textbook.
By Wednesday, May 2.

Week 6

Hidden Markov Models and Speech Recognition

Do this Hidden Markov Model assignment.
Hand in on paper by 11:10 AM Monday, May 7.

Week 7

Playing with NLP tools

Play with the CMU-Cambridge Statistical Language Modeling Toolkit v2.
These tools are installed on the CS Linux machines. To run them, you'll need to add /usr/local/CMU-Cam_Toolkit_v2/bin to your Linux $PATH environment variable. Try to figure out what each of the tools does, and what interesting information you can extract from a large document of your choice. Bring your observations to class on Wednesday.
In class, Monday, May 7.
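One way to add the toolkit directory to your $PATH is sketched below. It assumes a Bourne-compatible shell such as bash; other shells (for example csh or tcsh) use different syntax. The directory is the one given above, and nothing else about the installation is assumed.

```shell
# Assumes a Bourne-compatible shell (bash, sh, zsh).
# Append the toolkit's bin directory to PATH for the current session only.
export PATH="$PATH:/usr/local/CMU-Cam_Toolkit_v2/bin"

# Print the last PATH entry to confirm the change took effect.
echo "$PATH" | tr ':' '\n' | tail -n 1
```

The change lasts only for the current terminal session. To keep it, you could add the export line to your shell's startup file, which is typically ~/.bashrc for bash.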

Week 8

More playing with NLP tools

Try this WordNet lab.
In class, Wednesday, May 16, 2007.