Resurrection: The Khazar Language Reconstruction Using Computer Science Technologies

Authors

  • Elina Makipova KIMEP University
  • Iskander Akhmetov Insitute of Information and Computational Technologies
  • Alexander Gelbukh Instituto Politécnico Nacional

DOI:

https://doi.org/10.13053/cys-28-1-4902

Keywords:

Khazar, language reconstruction, extinct languages, historical linguistics

Abstract

Decrypting or reconstructing extinctlanguages is challenging, especially when the objectiveis to reconstruct a language with no or very few textsleft, such as the Khazar language or early Slavic andUgric languages. In this paper, we lay out the historicalperspective of the Khazar people, their language, andcontemporary descendant ethnic groups, namely theChuvash and Tatar people. Then we discuss waysComputer Science can help researchers in languagereconstruction and decryption. Finally, we pilot anapproach to find Khazar/Bulgar word candidates inChuvash and Tatar languages by (1) normalizing thewords of two languages and (2) comparing them,accounting for the semantic concepts to solve thehomonymy problem, and (3) excluding common Turkicwords and borrowings from the Russian language.

Downloads

Published

2024-03-20

Issue

Section

Articles