A collection of spoken non-standard Russian corpora

Anastasia Panova and Waleria Morozova

In this talk we present several corpora of varieties of Russian spoken in different regions. One part of the collection consists of dialectal corpora, which make it possible to study the phonetic and morphosyntactic characteristics of Russian dialects and the dynamics of dialect loss. The other part represents L2 varieties of Russian that are spoken as a lingua franca. Corpora of this type provide rich material for investigating interference from languages that are typologically distant from and genetically unrelated to Russian. The data were recorded during field trips by affiliated groups to different regions of Russia from 2007 to 2018. The corpora were created from 2013 to 2019 and share a similar methodology for annotation and presentation.