Semantic universals of classifier systems

The aim of the project is to examine classifiers, a type of noun categorization that is widespread in the world’s languages. Classifiers exhibit a remarkable diversity in the meanings they convey and the morphological and syntactic contexts in which they occur.

Several types of classifiers are distinguished, including the most well-known numeral classifiers which occur with numerals, as in Mandarin Chinese in the phrase yí liàng chē (one vehicle car) ‘one car’, where liàng is the classifier for vehicles. Other classifier types include noun, possessive, verbal, deictic, and locative classifiers. Classifiers are found in typologically diverse languages, ranging from the analytic languages of Southeast Asia to the polysynthetic languages of North America.

A wide range of semantic categories serve as the basis for categorization in classifier systems, including animate vs. inanimate, male vs. female, and social status, together with categories found among inanimates, including physical properties such as shape and size as well as function and value. These properties tend to correlate with the type of classifier system. For example, possessive classifiers in Oceanic languages distinguish such categories as valuable, edible, and drinkable, while verbal classifiers in the Athabaskan languages of North America are used to classify objects as, e.g., long, round, and flat/flexible.

Goal of the project

This project aims to fill in gap in research by constructing a database of classifier types in the world’s languages and determining the distribution of semantic values in classifier languages. We will create a database of 3000+ languages that will offer:

Project duration

September 2023 – September 2026

Aims

As part of the project, we will create a database of over 3000 languages based on data available from over 7000 grammars and other descriptions, and from open-access databases under a Creative Commons Licence. Automatic detection followed by manual checking will be used to identify the classifier types and semantic categories found in classifier languages. We will thus try to answer the following questions:

Based on the compiled data, we will examine fundamental aspects of classifier systems:

For example, with regard to the universality of semantic categories, our aim is to determine which categories are more common and which ones tend to occur in particular language families or areas. As regards correlations between classifier types and their semantics, we will establish whether such correlations are indeed found, or whether our knowledge of the meanings expressed by classifier systems is influenced by available descriptions of the most well-known languages representing particular types, e.g., of Mandarin Chinese or Japanese in the case of numeral classifiers.

Database

The database is being built on existing sources such as the World Atlas of Classifier Languages (WACL) (Her et al. 2022) to identify classifier languages worldwide. We use the Gramfinder software as a tool to access the published materials for each language and assess which semantic values and which types of classifiers are found in each language. Quantitative analyses controlling for geographic area and language family are conducted at the global scale to identify the universality and specificity of each semantic feature and classifier type. The interaction between classifier types and semantic values is also assessed independently. Finally, the interaction among semantic values, classifier types, area, family, and cultural traits is assessed by means of quantitative methods such as mixed models and decision trees.


At the moment, the database is under development. Progress on the database will be presented at the conferences listed in the ‘Conferences’ section.

Conferences

Attended

1) 98th Annual Meeting of the Linguistic Society of America, January 2024;

2) 21st International Congress of Linguists, Poznań, September 2024;

Upcoming

1) 26th Annual Meeting of the North American Association for the History of the Language Sciences, Philadelphia, January 2025;

2) 58th Annual Meeting of the Societas Linguistica Europaea, University of Bordeaux, August/September 2025;

3) 54th Poznań Linguistic Meeting, Poznań, September 2025;

4) 100th Annual Meeting of the Linguistic Society of America, January 2026;

5) 52nd Annual Meeting of the Berkeley Linguistic Society, Berkeley, CA, February 2026.


The project is funded by the National Science Centre (OPUS programme, 2022/47/B/HS2/02999).