ThEA - Theoretical and experimental approaches to dialectal variation and contact-induced change: a case study of Tundra Nenets

Project overview

The study of syntactic change in Siberian indigenous languages remains a largely underexplored area, with considerable potential to shed light on the shift from Subject-Object-Verb (SOV) to Subject-Verb-Object (SVO) word order. Investigating these changes may yield valuable insights into the mechanisms of syntactic restructuring, particularly the influence of language contact in driving such transformations.

Tundra Nenets — an endangered Indigenous language of northwestern Siberia, classified within the Samoyedic branch of the Uralic language family — offers a particularly compelling case study in this context. While it is fundamentally a verb-final language, instances of non-verb-final clauses have been attested, often shaped by discourse-pragmatic factors. Such structures appear across the three main dialect groups of Tundra Nenets — Western, Central, and Eastern — with varying degrees of frequency and distribution. However, the extent to which these dialects exhibit parallel patterns of syntactic change remains unclear. Given their differing levels and forms of exposure to Russian, the dominant majority language, it can be hypothesised that they may also diverge in terms of structural developments.

This project originally set out to investigate Russian-induced syntactic changes in Tundra Nenets by comparing two dialects: the Yamal dialect, spoken in the Yamalo-Nenets Okrug, and the Taymyr dialect spoken in the Taymyr Autonomous Okrug (see the map below). The study planned to employ a combined approach, integrating both theoretical and experimental syntax — along with fieldwork and experiments — a relatively novel method for investigating syntax and syntactic changes in languages of this region.

Map 1. Distribution of Tundra Nenets dialect groups examined in this project (created by Zoltán Gulyás).

The project sought to address three central research questions:

  1. Are there contact-induced syntactic changes across the dialects?
  2. Do different sociolinguistic environments give rise to distinct patterns of syntactic change?
  3. Can the Tundra Nenets language offer insights into broader typological patterns of contact-induced change?

To test our general hypothesis, we focused on a specific clause type: interrogatives. Such sentences provide a valuable basis for examining contact-induced language change, as Greenberg (1966) observed that certain interrogative strategies correlate — at least to some extent — with the basic word order of a language. By examining their structure, we can thus infer the basic syntactic configuration of the dialect under study. In addition to potential syntactic variation, the prosody of interrogative sentences may also offer insights into possible linguistic change.

Revised Objectives

The ThEA project commenced in 2018. However, due to unforeseen circumstances, including the COVID-19 pandemic and the Russian-Ukrainian war, fieldwork and direct data collection soon became unfeasible. As a result, we redirected our research focus. By that time, we had gathered pilot data, initially recorded in person in Moscow and later conducted online during the COVID-19 outbreak with a native speaker of the Yamal dialect of Tundra Nenets. Based on these pilot results, we identified three new directions for the project.

We abandoned the attempt to make systematic comparisons between the two aforementioned dialects and instead focused on conducting descriptive syntactic studies using the available pilot data of the Yamal dialect of Tundra Nenets. This shift was deemed necessary due to the lack of precise syntactic (and prosodic) descriptions and analyses of interrogatives — not only in Tundra Nenets but also in other Indigenous languages of the region. As a result, we have laid the groundwork for future comparisons and the identification of potential areal features.

We then initiated a comparative investigation, incorporating additional Indigenous languages spoken in regions neighboring Tundra Nenets. This analysis has allowed us to identify syntactic phenomena shared across these languages, including Tundra Nenets, and to provide coherent explanations for these syntactic patterns.

Finally, given the relatively large number of written, published texts available to us, we have placed a strong emphasis on the computational processing of Tundra Nenets data. This process involves data organisation, digital processing, and comprehensive linguistic annotation at the morphological, syntactic, pragmatic, and prosodic levels, followed by systematic analysis. This aspect of the project is being carried out in collaboration with the Modyco Lab at Nanterre University, the INRIA Lab at Nancy University, the Department of Computer Engineering at Boğaziçi University, and the Institute of Formal and Applied Linguistics at Charles University in Prague. By improving the accessibility and structural organisation of the data, we aim to support more in-depth investigations into the Nenets languages and their linguistic characteristics. For a more detailed description, visit the Tundra Nenets Linguistic Data Sets section of this homepage.

The project was funded by the National Research, Development and Innovation Office (NKFIH) of Hungary under Grant ID FK 129235 from 2018 to 2025.

Participants

Principal investigator

Research Team

  • Katalin Mády – Researcher
  • Réka Metzger: Computer Engineering, NLP (Natural Language Processing) Tasks (2018–2023)
  • Cecília Molnár – Researcher (Joined in 2023)
  • Péter Rebrus – Researcher
  • Uwe Reichel – Researcher

Technical Support

  • Ferenc Gulyás: Web Design
  • Zoltán Gulyás: Map Design, Cartography

Knowledge Sharing

In line with the project's commitment to disseminating research findings and promoting collaborative efforts, this section highlights relevant talks, and publications related to the project's work. The scientific works summarising the results are grouped into the three central topics of our project: descriptive studies, comparative works, and corpus/data processing.

Descriptive Studies on Tundra Nenets

Publications

  1. Mus, Nikolett & Mády, Katalin. 2023. The role and functional motivation of High target tones in Tundra Nenets. In: Gráczi, Tekla Etelka, Horváth, Viktória, Juhász, Kornélia, Kohári, Anna, Krepsz, Valéria, Mády, Katalin (Eds.) Speech Research conference: Hungarian Research Centre for Linguistics. Budapest, 23–24. February 2023 Budapest, Hungary. Hungarian Research Centre for Linguistics. 84–86.
  2. Mus, Nikolett. 2023. Tundra Nenets. In: Behnke, Anja & Wagner-Nagy, Beáta (Eds.) Clause Linkage in the Languages of the Ob-Yenisei Area. Leiden: Brill, 2023. 133–174.
  3. Mus, Nikolett. 2023. Nenets. In: Abondolo, Daniel, Valijärvi, Riitta-Liisa (Eds.) The Uralic Languages. London: Routledge. 853–896.
  4. Mus, Nikolett. 2022. The syntax of multiple wh-questions in Tundra Nenets. In Valentin Gusev, Anna Urmanchieva, Aleksandr Anikin (Eds.) Siberica et Uralica: In memoriam Eugen Helimski. Studia uralo-altaica 56:127–138.
  5. Mus, Nikolett. 2022. On the semantic distribution of copular verbs in Tundra Nenets. In Erkkilä, Riku, Toivo EH Qiu, and Timo Leisiö. Hämeenmaalta Jamalille. Kirja Tapani Salmiselle 07.04. 2022 Тапани’ĕ’’эмня падвы падар’’. 243–253.
  6. Mus, Nikolett 2020. Szórendi megszorítások a tundrai nyenyec kiegészítendő kérdésben In Kenesei, István (főszerk.); Dékány, Éva; Halm, Tamás; Surányi, Balázs (szerk.) Általános nyelvészeti tanulmányok XXXII. : Újabb eredmények a grammatikaelmélet, nyelvtörténet és uralisztika köréből, Budapest: Akadémiai Kiadó, 405–418.
  7. Mus, Nikolett 2018. The interrogative verb in Tundra Nenets. Finnisch-Ugrische Mitteilungen: 137–150.
  8. Mus, Nikolett 2018. A tundrai nyenyec 'mit mond' jelentésű kérdő ige. In Kenesei, István; Bakró-Nagy, Marianne (szerk.) Általános Nyelvészeti Tanulmányok XXX.: Uralisztikai tanulmányok, Budapest: Akadémiai Kiadó, 191–205.

Talks

  1. Katalin Mády & Nikolett Mus. The grammatic function of word-final High tones in Tundra Nenets. Speech Units Workshop, University of Zurich, 17–19 April 2023
  2. Mus, Nikolett & Mády, Katalin. The role and functional motivation of High target tones in Tundra Nenets. Speech Research Conference, Hungarian Research Centre for Linguistics, Budapest, 23–24th February 2023 (poster)
  3. Mus Nikolett. Lexikai szófajváltás tundrai nyenyec nyelvjárásokban: kérdő határozószó kialakulása kérdő ige(név)ből. Nyelvelmélet és diakrónia 5., Pázmány Péter Katolikus Egyetem, 2022. november 16–17.
  4. Mus, Nikolett. A syntactic change in Tundra Nenets: from a wh-verb to a wh-adverb. Workshop on Samoyedic Languages and Culture, Hamburg, 8–9 September 2022
  5. Mády, Katalin & Mus, Nikolett. On the function of the high tone in Tundra Nenets. Workshop on Samoyedic Languages and Culture, Hamburg, 8–9 September 2022
  6. Mus, Nikolett. A postverbal non-finite complement clause in Tundra Nenets. Syntax of Uralic languages 4 (SOUL 4), Hungarian Research Centre for Linguistics (online), 14–16 June 2022
  7. Mády, Katalin & Mus, Nikolett. The syntax and prosody of content questions in Tundra Nenets. VIII International Conference on Samoyedic Studies, Tomsk State Pedagogical University (online), 17–18 September 2021
  8. Mus, Nikolett & Surányi, Balázs. The syntax of postverbal phrases in Tundra Nenets. VIII International Conference on Samoyedic Studies, Tomsk State Pedagogical University (online), 17–18 September 2021
  9. Mus, Nikolett & Surányi, Balázs. Post-verbal phrases and their correlates in Tundra Nenets. Olomouc Linguistics Colloquium (Olinco 5), online, 10–12 June 2021
  10. Hegedűs, Veronika & Mus, Nikolett & Surányi, Balázs. Tense, agreement and copula drop in Tundra Nenets copular clauses. Olomouc Linguistics Colloquium (Olinco 5), online, 10–12 June 2021
  11. Mus, Nikolett. On the syntax of Tundra Nenets content questions, New results in the syntax of Uralic languages, Budapest, 22 October 2019
  12. Mus, Nikolett. The role of information structure in Tundra Nenets content questions, The Uralic and Altaic Languages, Tomsk, 12–13 October 2019
  13. Mus, Nikolett. The correlation of syntactic position and semantic function of wh-phrases in Tundra Nenets multiple questions. 3rd Budapest Linguistics Conference, Budapest, 6–8 June 2019
  14. Mus, Nikolett. Szórendi megszorítások a tundrai nyenyec kiegészítendő kérdésben [Word order constraints in the Tundra Nenets content question]. Diskurzus ünnepi konfigurációban. Műhelykonferencia É. Kiss Katalin köszöntésére, Budapest, 28 May 2019
  15. Mády, Katalin & Metzger, Réka & Mus, Nikolett & Rebrus, Péter & Reichel, Uwe. The syntax and prosody of Tundra Nenets interrogatives. Workshop on Uralic Prosody, Helsinki, 21 March 2019
  16. Mus, Nikolett. Theoretical and experimental approaches to dialectal variation and contact-induced change: a case study of Tundra Nenets. Linguistic diversity, minority languages and digital research infrastructures, Hamburg, 20–21 September 2018

Comparative Works on Interrogative Constructions in Various Siberian Indigenous Languages

Publications

  1. É. Kiss, Katalin & Mus, Nikolett. 2022. The reflexive cycle. From reflexive to personal pronoun in Uralic. Journal of Uralic Linguistics 1:1: 43–66.
  2. Mus, Nikolett 2020. The North Samoyedic interrogative verb meaning 'say what'. ESUKA – JEFUL 2020, 11–2: 119–136.

Talks

  1. Mus, Nikolett. On the position of wh-phrases in languages of Northern Siberia: an explanation of *V-WhP order. 21st International Congress of Linguists (ICL), Poznań, Poland, 8–14 September 2024
  2. Mus, Nikolett. A syntactic change in languages of Northern Siberia: the position of wh-phrases, and the structure of wh-questions. Arctic Congress Bodø 2024, Norway, 29 May–3 June 2024
  3. Wagner-Nagy, Beáta & Mus, Nikolett. Preliminaries on the syntax of Samoyedic languages. Summer School in Uralic Formal Linguistics. Hungarian Research Centre for Linguistics, Budapest, August 21–26 2023 (invited)
  4. É.Kiss, Katalin & Mus, Nikolett. The Reflexive Cycle. Syntax, Phonology and Language Analysis 14 (SinFonIJA 14), Novi Sad, Serbia (online), 22–24 September 2021
  5. Mus, Nikolett. Word order constraints in the Tundra Nenets wh-question. Siberian Life 2021, Hamburg (online), 4–5 February 2021
  6. Mus, Nikolett. Elements at the right periphery in an OV language: a syntactic orphan in Tundra Nenets, Potsdam Linguistics Syntax and Semantics Colloquium (online), 14 July 2020
  7. Mus, Nikolett, & Däbritz, Chris Lasse. Posztverbális összetevők észak-szibériai SOV/SVX nyelvek-ben [Postverbal constituents in SOV/SVX languages of North Siberia], Nyelvelmélet és diakrónia 4. PPKE, Budapest, 21–22 November 2019
  8. Däbritz, Chris Lasse, & Mus, Nikolett. The discourse function of the postverbal constituentsin SOV/SVO languages of Northern Siberia, 13th Conference of the Association for Linguistic Typology, Pavia, 4–6 September 2019
  9. Mus, Nikolett. Kérdő ige az északi szamojéd nyelvekben [The interrogative verb in the Northern Samoyedic languages]. Magyar Nyelvtudományi Társaság, Budapest, 13 November 2018
  10. Mus, Nikolett. The Northern Samoyedic interrogative verbs. 7th International Conference on Samoyed Studies, Tartu, 26–27 October 2018

Works on Corpus and Data Processing

Publications

  1. Mus, Nikolett & Metzger, Réka 2021. Toward a Corpus of Tundra Nenets: Stages and Challenges in Building a Corpus. In Proceedings of the 4th Workshop on Computational Methods for Endangered Languages Vol.2 (Resource Papers and Extended Abstracts), 2021.
  2. Mus, Nikolett & Metzger, Réka. 2021. A tundrai nyenyec (egynyelvű) korpusz munkálatai: kihívások, módszerek, eredmények. Nyelvtudományi Közlemények 117: 123–144.

Talks

  1. Akkurt, Furkan & Mus, Nikolett. Development of a Helsinki Finite-State Transducer for Tundra Nenets. UniDive 3rd general meeting, HUN-REN Hungarian Research Centre for Linguistics, Budapest, Hungary, 29–30 January 2025 (poster)
  2. Mus, Nikolett. Starting a UD Treebank of Tundra Nenets. 1st UniDive training school, Technical University of Moldova, Chișinău (Moldova), 8–12 July 2024 (poster)
  3. Horváth, Csilla & Mus, Nikolett. Representative corpora for endangered indigenous languages of Western Siberia. Arctic Congress Bodø 2024, Norway, 29 May–3 June 2024
  4. Mus, Nikolett. A digital text collection of Tundra Nenets. UniDive 1st general meeting, Paris-Saclay University, LISN lab, France, 16–17 March 2023 (poster)
  5. Mus, Nikolett & Metzger, Réka. Archiving Tundra Nenets materials: towards designing a balanced Tundra Nenets corpus. 7th International Conference on Language Documentation & Conservation, University of Hawai‘i at Mānoa (online), 4–7 March 2021
  6. Mus, Nikolett – Metzger, Réka. Toward a corpus of Tundra Nenets: stages and challenges in building a corpus. 4th Workshop on Computational Methods for Endangered Languages, online, 2–3 March 2021

Events

Contact

E-mail

mus.nikolett[at]gmail.com

Main

HUN-REN Hungarian Research Centre for Linguistics
Benczúr utca 33.
1068 Budapest