Computational linguistics · Schedule of topics
Fall, 2010 · Department of Computer Science · Grinnell College

August 27. How computers process language.

August 30. Regular expressions.
In preparation for this session, please study chapter 1 and chapter 2, from the beginning through section 2.1, in Jurafsky and Martin (pages 1-26).

September 1. Finite-state automata.
In preparation for this session, please study section 2.2 (pages 26-38) in Jurafsky and Martin.

September 3. Compiling automata.
In preparation for this session, please study from section 2.3 to the end of chapter 2 (pages 38-44) in Jurafsky and Martin.

September 6. Libraries in R6RS Scheme.
In preparation for this session, you might want to read chapter 7 of the Revised6 report on the algorithmic language Scheme.

September 8. Characters and encodings; Unicode.

September 10. Representing a morpheme lexicon.
In preparation for this session, please study from the beginning of chapter 3 through section 3.3 (pages 45-56) in Jurafsky and Martin.

September 13. Finite-state transducers.
In preparation for this session, please study sections 3.4 through 3.7 (pages 57-68) in Jurafsky and Martin.

September 15. Stemming.
In preparation for this session, please study section 3.8 (page 68) in Jurafsky and Martin.

September 17. Tokenization.
In preparation for this session, please study section 3.9 (pages 68-71) in Jurafsky and Martin.

September 20. Minimizing edit distance.
In preparation for this session, please study from section 3.10 to the end of chapter 3 (pages 72-82) in Jurafsky and Martin.

September 22. N-gram counting.
In preparation for this session, please study from the beginning of chapter 4 through section 4.4 (pages 83-97) in Jurafsky and Martin.

September 24. Smoothing.
In preparation for this session, please study sections 4.5 through 4.8 (pages 97-109) in Jurafsky and Martin.

September 27. Entropy and information.
In preparation for this session, please study from section 4.10 to the end of chapter 4 (pages 114-122) in Jurafsky and Martin. (Section 4.9 is optional reading.)

September 29. (pause for breath)

October 1. Part-of-speech tagging.
In preparation for this session, please study from the beginning of chapter 5 through section 5.4 (pages 123-139) in Jurafsky and Martin.

October 4. Tagging with a hidden Markov model.
In preparation for this session, please study section 5.5 (pages 139-151) in Jurafsky and Martin.

October 6. Brill tagging.
In preparation for this session, please study sections 5.6 through 5.8 (pages 151-163) in Jurafsky and Martin.

October 8. (pause for breath)

October 11. Context-free grammars.
In preparation for this session, please study from the beginning of chapter 12 through section 12.4 (pages 385-411) in Jurafsky and Martin.

October 13. Chomsky normal form.
In preparation for this session, please study from section 12.5 to the end of chapter 12 (pages 412-425) in Jurafsky and Martin.

October 15. Cocke-Younger-Kasami parsing.
In preparation for this session, please study from the beginning of chapter 13 through section 13.4.1 (pages 427-443) in Jurafsky and Martin.

October 25. Earley parsing.
In preparation for this session, please study section 13.4.2 (pages 443-448) in Jurafsky and Martin.

October 27. Chart parsing.
In preparation for this session, please study from section 13.4.3 to the end of chapter 13 (pages 448-458) in Jurafsky and Martin.

October 29. Feature structures.
In preparation for this session, please study from the beginning of chapter 15 through section 15.1 (pages 489-492) in Jurafsky and Martin.

November 1. Unification.
In preparation for this session, please study sections 15.2 and 15.3 (pages 492-507) and figure 15.8 (page 511) in Jurafsky and Martin.

November 3 and 5. Parsing with constraints.
In preparation for this session, please study from section 15.5 to the end of chapter 15 (pages 513-528) in Jurafsky and Martin.

November 8. Representations of meaning.
In preparation for this session, please study chapter 17 (pages 545-582) in Jurafsky and Martin.

November 10. Lexical semantics.
In preparation for this session, please study chapter 19 (pages 611-635) in Jurafsky and Martin.

November 12. Computational semantics.
In preparation for this session, please study chapter 18 (pages 583-610) in Jurafsky and Martin.

November 15. Named entity recognition.
In preparation for this session, please study from the beginning of chapter 22 through section 22.1 (pages 725-734) in Jurafsky and Martin.

November 17. Relation detection and classification.
In preparation for this session, please study sections 22.2 and 22.3 (pages 734-752) in Jurafsky and Martin.

November 19. Template filling.
In preparation for this session, please study from section 22.4 to the end of chapter 22 (pages 752-764) in Jurafsky and Martin.

November 22. The Semantic Web; XML.

November 24. Information extraction from XML documents.

November 29. Transformations.

December 1 and 3. Ontologies; the Web Ontology Language (OWL).

December 6. Classical approaches to machine translation.
In preparation for this session, please study from the beginning of chapter 25 through section 25.2 (pages 859-874) in Jurafsky and Martin.

December 8. Statistical and corpus-based approaches to machine translation.
In preparation for this session, please study from section 25.3 to the end of chapter 25 (pages 874-908) in Jurafsky and Martin.

December 10. Review and retrospect; student evaluations.

· ·
This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 United States License.

This text is available on the World Wide Web as

http://www.cs.grinnell.edu/~stone/courses/computational-linguistics/syllabus.html


John David Stone · stone@cs.grinnell.edu