Publications
- Chahan Vidal-Gorène, Nadi Tomeh, and Victoria Khurshudyan. 2024. Cross-Dialectal Transfer and Zero-Shot Learning for Armenian Varieties: A Comparative Analysis of RNNs, Transformers and LLMs. In Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, pages 438–449, Miami, USA. Association for Computational Linguistics.
- Vidal-Gorène, C., Tomeh, N., & Khurshudyan, V. (2024). Pie Model for Lemmatization, POS Tagging, and Morphological Analysis of Classical Armenian (1.0.0). The 4th International Conference on Natural Language Processing for Digital Humanities (EMNLP 2024), Miami.
- Vidal-Gorène, C., Tomeh, N., & Khurshudyan, V. (2024). Pie Model for Lemmatization, POS Tagging, and Morphological Analysis of Western Armenian (1.0.0). The 4th International Conference on Natural Language Processing for Digital Humanities (EMNLP 2024), Miami.
- Vidal-Gorène, C., Tomeh, N., & Khurshudyan, V. (2024). Pie Model for Lemmatization, POS Tagging, and Morphological Analysis of Eastern Armenian (1.0.0). The 4th International Conference on Natural Language Processing for Digital Humanities (EMNLP 2024), Miami.
- Khurshudyan, Victoria; Tomeh, Nadi; Nouvel, Damien; Donabedian, Anaid and Vidal-Gorene, Chahan (eds.). 2022. Proceedings of the International Workshop Digital Armenian: Processing Language Variation, 2022 International Conference on Language Resources and Evaluation (LREC 2022), Marseille, June 20, 2022.
- Chakmakjian, Samuel and Wang, Ilaine. 2022. Towards a Unified ASR System for the Armenian Standards. 2022. In Proceedings of The Workshop on Processing Language Variation: Digital Armenian (DigitAm) within the 13th Language Resources and Evaluation Conference (LREC2022). Khurshudyan, Victoria; Tomeh, Nadi; Nouvel, Damien; Donabedian, Anaid and Vidal-Gorene, Chahan (eds.). Marseille, France. 38-42.
- Khurshudyan, Victoria, Arkhangelskiy, Timofey, Daniel, Michael, Levonian, Dmitri, Plungian, Vladimir, Polyakov, Alexey, Rubakov, Sergey. 2022. Eastern Armenian National Corpus: State of the Art and Perspectives. In Proceedings of The Workshop on Processing Language Variation: Digital Armenian (DigitAm) within the 13th Language Resources and Evaluation Conference (LREC2022). Khurshudyan, Victoria; Tomeh, Nadi; Nouvel, Damien; Donabedian, Anaid and Vidal-Gorene, Chahan (eds.). Marseille, France. 28-37.
- Vidal-Gorène, Chahan, Khurshudyan, Victoria & Donabédian, Anaïd. 2020. Recycling and Comparing Morphological Annotation Models for Armenian Diachronic-Variational Corpus Processing. In Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial). International Conference on Computational Linguistics, 2020, Barcelona, Spain (online). 90-101.
- Vidal-Gorène, Chahan, Khurshudyan, Victoria & Donabédian, Anaïd. 2020. Modèles d’annotation morphologique pour le traitement de données multivariées de l’arménien. 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), Dec 2020, Montrouge (virtuel), France. 72-82.
Presentations
- Khurshudyan, Victoria, Kocharov, Petr & Agnes Ouzounian. Grammatical Annotation of Classical Armenian: Nominal Inflection in the Corpus-Based Approach. Twelfth International Conference on Armenian Linguistics (ICAL XII). National Association for Armenian Studies and Research (NAASR), Boston, May 31-June 2, 2023.
- Wang, Ilaine. 2023. Digitizing Armenian Linguistic Heritage (DALiH): Armenian Multivariational Corpus and Data Processing/ Numérisation du patrimoine linguistique arménien : Corpus multivariationnel d’arménien et traitement des données. Rencontre BnF DataLab : Approches computationnelles et linguistiques pour l’arménien, 12 juin 2023.
- Khurshudyan, Victoria. 2023. Presentation on Armenian Corpus Linguistics and Digitizing Armenian Linguistic Heritage. Yerevan State Linguistic University. February 2023.
- Khurshudyan, Victoria. 2023. Digitizing Armenian Linguistic Heritage (Հայերենի լեզվական ժառանգության թվայնացման խնդիրներ). Methodological seminars at Institute of Archeology and Ethnography (Մեթոդաբանության սեմինարներ ՀՀ ԳԱԱ ՀԱԻ). Yerevan. Armenia. February 2023.
- Khurshudyan, Victoria & Donabedian Anaid. Presentation of the ANR project Digitizing Armenian Linguistic Heritage (DALiH): Armenian Multivariational Corpus and Data Processing. The Lexicon-Grammar Interface in the Synchrony and Diachrony of Armenian Julius-Maximilians-Universität Würzburg, (virtual). April 4‒5, 2022.
- Khurshudyan, Victoria, Arkhangelskiy, Timofey, Daniel, Michael, Levonian, Dmitri, Plungian, Vladimir, Polyakov, Alexey, Rubakov, Sergey. Eastern Armenian National Corpus: State of the Art and Perspectives. In Proceedings of The Workshop on Processing Language Variation: Digital Armenian (DigitAm) within the 13th Language Resources and Evaluation Conference (LREC2022). Marseille. June 2022.
- Khurshudyan, Victoria. Le projet ANR DALiH – Armenian Multivariational Corpus and Data Processing”. Présentation de travaux de l’Axe 3 lors du Symposium 2022 du Labex EFL. Inalco. Paris. Juin 2022.
- Khurshudyan, Victoria & Yavrumyan, Marat. Grammatical annotation harmonisation attempt: case study on Classical Armenian and Modern Eastern Armenian on the existing annotated data. 15th General Conference of the Association Internationale des Études Arméniennes Martin Luther University Halle-Wittenberg, 2-4 September 2021.
- Khurshudyan, Victoria, Vidal-Gorène, Chahan & Donabédian, Anaïd. Modèles d’annotations morphologiques pour le traitement de données multivariées de l’arménien. 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), Décembre 2020, Montrouge (virtuel).
- Vidal-Gorène, Chahan, Khurshudyan, Victoria & Donabédian, Anaïd. Recycling and Comparing Morphological Annotation Models for Armenian Diachronic-Variational Corpus Processing. In Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial). International Conference on Computational Linguistics. Barcelona, Spain (online), August 2020.
Bibliography
Linguistics
Martirosyan, H. (2018). The Armenian dialects. In The languages and linguistics of Western Asia: An areal perspective (Hans Henrich Hock (Vol. 6, p. 46 105). De Gruyter Mouton.
Donabédian-Demopoulos, A. (2018). Middle East and Beyond - Western Armenian at the crossroads: A sociolinguistic and typological sketch. In C. Bulut (Ed.), Bulut, Christiane, Linguistic minorities in Turkey and Turkic-speaking minorities of the periphery, 111/2018, Harrazowitz Verlag (Vol. 111, p. 89 148). Harrazowitz Verlag.
Donabédian, A., & Sitaridou, I. (2021). Anatolia. In E. Adamou & Y. Matras (Eds.), The Routledge Handbook of Language Contact (p. 404 433). Routledge.
NLP for Armenian
Vidal-Gorène, C., & Kindt, B. (2020). Lemmatization and POS-tagging process by using joint learning approach. Experimental results on Classical Armenian, Old Georgian, and Syriac. Proceedings of LT4HALA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages, 22 27.
Avetisyan, K., & Ghukasyan, T. (2019). Word Embeddings for the Armenian Language: Intrinsic and Extrinsic Evaluation.