Week 1
N-grams.
- Read Chapter 6 of the textbook.
- Do it soon, and bring your questions to class. You will have
lots of them.
- Counting n-grams
- Due by 11:59 PM Monday, April 2.
Please submit via HSP.
- Read Section 2.1 of the textbook.
- By Wednesday, April 4.
Week 2
Parsing.
- Write a parser based on Earley's algorithm.
- Due 11:59PM Monday, April 16.
Week 4
Syntax-driven semantics
- Read the handout from Jurafsky & Martin.
Week 5
Hidden Markov Models
- Do this semantic analysis assignment.
- Hand in on paper by noon Friday, April 27.
- Read Chapter 9 of the textbook.
- By Wednesday, May 2.
Week 6
Hidden Markov Models and Speech Recognition
- Do this Hidden Markov Model assignment.
- Hand in on paper by 11:10AM Monday, May 7.
Week 7
Playing with NLP tools
- Play with the CMU-Cambridge
Statistical Language Modeling Toolkit v2.
- These tools are installed on the CS Linux machines.
Try to figure out what each of the tools does, and what interesting info you can extract from
a large document of your choice. Bring observations to class on Wednesday.
You'll need to put /usr/local/CMU-Cam_Toolkit_v2/bin in your Linux $PATH environment variable.
In class, Monday, May 7.
Week 8
More playing with NLP tools
- Try this WordNet lab.
- In class, 16 May 2007.