Section for Computational Linguistics

  1. You are here:
  2. Start
  3. Department of Linguistics
  4. Section for Computational Linguistics
DALI code 4

Section for Computational Linguistics

Computational linguistics develops formalised models of natural language, realised as systems for processing text or speech. Over the past two decades, this interdisciplinary field has developed strongly as a result of a) the enormous increase of the amount of electronically available data; b) ever increasing computing power; c) methods that are data-driven, using machine learning and statistics to automatically induce models from data, usually in combination with traditional linguistic rules and representations. Computational linguistics offers new possibilities to confront descriptive and theoretical linguistics with empirical evidence, increased leverage in fields where natural language constitutes the primary data (such as the humanities, social sciences, medicine and cultural heritage), and language technology applications ranging from machine translation to dialogue systems.

Study with us!

The Department of Linguistics offers two full study programmes: a Bachelor’s Programme in Linguistics with a specialisation in Computational Linguistics, held in Swedish, and a two-year Master's Programme in Language Sciences, conducted in English. Most courses within these programmes can also be taken as free-standing courses. If you are an exchange student interested in studying at the Department of Linguistics, you can find more information on our education pages. The department also offers PhD studies with a specialisation in Computational Linguistics.

Corpora, Resources, Tools

Corpora, resources and tools

Welcome to our site containing a collection of corpora, resources and tools from the Section for Computational Linguistics

Research in Computational Linguistics

Research in the Section for Computational Linguistics is oriented towards first-language acquisition, massively parallel corpora as a vehicle for typological study, and analysis of user-generated content with particular application to medical health records. Also, the section provides several freely available language technology resources and tools, and is engaged in a project for spreading the use of language technology to other disciplines where natural language constitutes the primary data.