CS499 | Natural Language Computing (IN DEVELOPMENT)

SESSIONS 1 and 2: Psycholinguistics

Session 1

Wednesday 18 May 2:00-4:00 DC2306C

Readings:

Steven Pinker,
The Language Instinct: How the Mind Creates Language,
Perennial Classics, 2000.
Copies will be on reserve in the Davis Centre Library.

Chapter 3 Mentalese
Chapter 4 How Language Works


Session 2

Thursday 19 May 2:00-4:00 DC2306C

Readings:

Pinker continued:
Chapter 10 Language Organs and Grammar Genes
Chapter 11 The Big Bang
Chapter 13 Mind Design


SESSIONS 3 and 4: Ontologies for Natural Language Computing

Session 3: Basic Ontologies

Wednesday 1 June 2:00-4:00 DC2306C

Readings:

WordNet:
"Introduction to WordNet: An On-line Lexical Database",
George A. Miller, Richard Beckwith, Christiane Fellbaum, Derek
Gross, and Katherine Miller
(Revised August 1993)

"Nouns in WordNet: A Lexical Inheritance System",
George A. Miller
(Revised August 1993)

SENSUS:
"Building a Large-Scale Knowledge Base for Machine Translation",
Kevin Knight and S. Luk,
Proceedings of the National Conference on Artificial Intelligence (AAAI),
1994.


Session 4: Specialized Ontologies

Thursday 2 June 2:00-4:00 DC2306C

Readings:

Mikrokosmos:
"Ontology development for machine translation: Ideology and methodology",
Kavi Mahesh,
Technical report MCCS-96-292,
Computing Research Laboratory,
New Mexico State University,
Las Cruces, New Mexico, 1996.

Unified Medical Language System:
"The Unified Medical Language System: What is it and how to use it?",
Olivier Bodenreider, Jan Willis, and William Hole,
Presentation at: MEDINFO, September 8, 2004; San Francisco, CA.

"The Unified Medical Language System and the Gene Ontology:
Some critical reflections",
Anand Kumar and Barry Smith,
Published in A. Günter, R. Kruse and B. Neumann (eds.),
KI 2003: Advances in Artificial Intelligence
(Lecture Notes in Artificial Intelligence 2821),
Berlin: Springer, 2003, 135-148.


SESSIONS 5 and 6: Statistical Natural Language Processing Basics

Christopher Manning and Hinrich Schütze,
Foundations of Statistical Natural Language Processing,
The MIT Press, 2000.

Session 5

Wednesday 15 June 2:00-4:00 DC2306C

Readings:

Chapter 6 (6.1, 6.2) N-gram models
Chapter 10 (10.1 to p.353, 10.3) Part-of-speech tagging
Rule-based tagging:
"A simple rule-based part of speech tagger",
Eric Brill, DARPA Workshop, 1992.


Session 6

Thursday 16 June 2:00-4:00 DC2306C

Readings:

Chapter 12 (12.1, 12.2.3) Probabilistic parsing


SESSIONS 7 and 8: Applications of Statistical NLP

Session 7

Wednesday 6 July 2:00-4:00 DC2306C

Readings:

Manning and Schütze continued:
Chapter 15 (15.1, 15.2, 15.4, 15.5) Information retrieval


Session 8

Thursday 7 July 2:00-4:00 DC2306C

Readings:

Chapter 16 (16.1, 16.4) Text categorization

Text summarization:
Inderjeet Mani and Mark T. Maybury (eds.),
Advances in Automatic Text Summarization,
The MIT Press, 1999.
Will be on reserve in DC Library.


Session 9: Natural Language Generation

Wednesday 13 July 2:00-4:00 DC2306C

Readings:

Ehud Reiter and Robert Dale,
Building Natural Language Generation Systems,
Cambridge University Press, 2000.
Will be on reserve in DC Library.

Selections:

Chapter 3


Session 10: Machine Translation

Thursday 14 July 2:00-4:00

Readings:

Sergei Nirenburg (editor),
Readings in Machine Translation,
The MIT Press, 2002.
Will be on reserve in DC Library.

Selections:

TBA


Session 11: Biolinguistics

Wednesday 27 July 2:00-4:00 DC2306C

Readings:

Miguel A. Andrade and Alfonso Valencia,
"Automatic extraction of keywords from scientific text:
application to the knowledge domain of protein families",
Bioinformatics, 14(7), 600-607, 1998.

Christian Blaschke, Miguel A. Andrade, Christos Ouzounis,
and Alfonso Valencia.
"Automatic extraction of biological information from scientific text:
protein-protein interactions",
ISMB Tutorial 1999.

J. Pustejovsky, J. Castaño, R. Saurí, A. Rumshisky,
J. Zhang, and W. Luo,
"Medstract: Creating large-scale information servers
for biomedical libraries",
Workshop on Natural Language Processing in the Biomedical Domain,
Conference of the Association for Computational Linguistics, 2002.

Session 12: Attitude and Affect in Natural Language Systems

Thursday 28 July 2:00-4:00 DC2306C

Readings:

To be selected by the class.