09.11.2018

The project “Electronic corpus of Polish texts from the 17th and 18th centuries (until 1772)” was prepared and implemented by the Polish Language History Laboratory of the 17th and 18th centuries at the Institute of the Polish Language of the Polish Academy of Sciences in cooperation with the Linguistic Engineering Team at the Institute of Computer Science of the Polish Academy of Sciences. The project was financed by the National Program for the Development of Humanities for the years 2013-2017.

The most important result of the project is the Electronic Corpus of Polish Texts from the 17th and 18th centuries (up to 1772), briefly referred to as the Baroque Corps (KorBa, the acronym was used in the name of the Internet domain: http://korba.edu.pl). KorBa has almost 13.5 segments (as understood by the founders of the National Corpus of Polish Language).

The project was heterogeneous. On the one hand, it relied on the selection of representative texts from the era, their transfer to an electronic medium, linguistic and editorial work, and on the other — on the creation of IT tools for collecting, processing, searching and presenting fragments of texts contained in the corpus or modifying existing ones, created for the needs of the bodies of contemporary texts. Thanks to the project, undoubtedly, the methods of historical-linguistic research have been modernized and incorporated into the stream of corpus linguistics.

//korba.edu.pl