LING 5200 Computational Corpus Linguistics, Fall 2004
Course description
This course is an introduction to the use of corpora in the investigation of linguistic questions. A major focus of the course is the development of computational skills, preparing the student to take CSCI 5832 (Natural Language Processing) and CSCI 6302 (Speech Synthesis and Recognition). Topics covered include:
- an introduction to the Unix operating system
- the Perl programming language
- basic software engineering
- publicly available corpora of written and spoken language
- basic issues in corpus design and construction
- tools for working with corpora
- graduate standing in the linguistics department
- an idea for a research project
- no graduate credits in CSCI
Required texts:
As of August 22nd, both of these books are available at Amazon.com at significant discounts--30% and 32%, respectively. You may also be able to find them in the computer store at UMC.
Optional texts:
- Time: Mondays, 6-8:30 p.m.
- Place: ECME 269
- In Boulder: Hellems 294, Thursdays 4:50-5:30, and by appointment
- In Denver: RC-1 6400A, by appointment
- 10 lab/homework assignments: 80% of final grade
- one substantial research project: 20% of final grade