The Forest Lion and the Bull: Morphosyntactic Annotation of the Panchatantra

Autores/as

  • Puneet Dwivedi Faculty of Mathematics and Physics, Charles University, Praha
  • Daniel Zeman Faculty of Mathematics and Physics, Charles University, Praha

DOI:

https://doi.org/10.13053/cys-22-4-3076

Palabras clave:

Dependency syntax, morphology, word segmentation, tokenization, treebank, Sanskrit

Resumen

We present the first freely available dependency treebank of Sanskrit. It is based on text from Panchatantra, an ancient Indian collection of fables. The annotation scheme we chose is that of Universal Dependencies, a current de-facto standard for cross-linguistically comparable morphological and syntactic annotation. In the present paper, we discuss word segmentation issues, morphological inventory and certain interesting syntactic constructions in the light of the Universal Dependencies guidelines. We also present an initial parsing experiment.

Descargas

Publicado

2018-12-30