The method of forming a vocabulary minimum by applying corpus analysis toolkit for concordancing

UT Research Journal. Humanities Research. Humanities


2016, Vol. 2, No. 1

Author: Olga Yu. Savina

About the author:

Olga Yu. Savina, Senior Lecturer, German Language Department, Institute for Philology and Journalism, Tyumen State University


The article examines the didactic potential of modern corpus linguistics, in particular the use of a text corpus as a source of data for teaching materials. The author sets out the steps for creating a mini-corpus of German journalistic texts using the specialized search engine Yomunda. The method relies on a corpus analysis toolkit for concordancing and covers text preprocessing and conversion, the compilation of a frequency list, and the lemmatization of the types included in that list. The article describes the compilation of a thematic vocabulary minimum from a corpus of authentic texts and its verification against the Profile Deutsch database. The analysis shows that the proposed algorithm effectively selects the most frequent lexical units in topical authentic texts. An advantage of the resulting vocabulary minimum is its derivational richness. The algorithm can be used to form a vocabulary minimum for any target group.
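
The pipeline summarized above (frequency list, lemmatization, verification against a reference list) can be illustrated with a minimal Python sketch. This is not the author's toolchain: the article works with a concordancing toolkit, a morphological analyzer, and the Profile Deutsch database, whereas the `LEMMAS` table, the `min_freq` threshold, the reference set, and all function names below are hypothetical stand-ins chosen for illustration.

```python
import re
from collections import Counter

# Hypothetical toy lemma table (assumption); a real workflow would use a
# morphological analyzer for German instead of a hand-written mapping.
LEMMAS = {"häuser": "haus", "hauses": "haus", "ging": "gehen", "geht": "gehen"}


def tokenize(text):
    """Lowercase the text and return runs of German letters as tokens."""
    return re.findall(r"[a-zäöüß]+", text.lower())


def frequency_list(texts):
    """Count how often each type (distinct word form) occurs in the texts."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts


def lemmatize(counts, lemmas):
    """Merge the counts of word forms that map to the same lemma."""
    merged = Counter()
    for form, n in counts.items():
        merged[lemmas.get(form, form)] += n
    return merged


def vocabulary_minimum(texts, lemmas, min_freq=2):
    """Return lemmas occurring at least min_freq times, most frequent first."""
    merged = lemmatize(frequency_list(texts), lemmas)
    return [lemma for lemma, n in merged.most_common() if n >= min_freq]


def verify(minimum, reference):
    """Split the minimum into lemmas present in / absent from a reference list."""
    found = [w for w in minimum if w in reference]
    missing = [w for w in minimum if w not in reference]
    return found, missing


if __name__ == "__main__":
    corpus = ["Das Haus ist alt.", "Die Häuser sind neu.", "Er geht, sie ging."]
    minimum = vocabulary_minimum(corpus, LEMMAS)
    # The reference set is a made-up stand-in for a graded word list.
    print(verify(minimum, {"haus"}))
```

Counting is done on word forms first and merged by lemma afterwards, mirroring the order of steps in the abstract, so that inflected forms of one word contribute to a single entry in the resulting list.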


  1. Aleksandrov, K. V. 2010. “Kompyuternaya programma dlya formirovaniya leksicheskoy kompetencii pri izuchenii inostrannogo yazyka” [A Computer Program for the Formation of Lexical Competence when Learning a Foreign Language]. Inostrannye yazyki v shkole [Foreign Languages at School], no. 6, pp. 67-73.
  2. Maltseva, M. S. 2011. “Korpusnye tehnologii v metodike prepodavaniya inostrannyh yazykov” [Corpus Technologies in the Methodology of Teaching Foreign Languages]. Zhurnal Socialno-ekonomicheskie yavleniya i processy [Social-Economic Phenomena and Processes], no. 8, pp. 214-218.
  3. Raitskaya, L. K. 2009. “Didakticheskie vozmozhnosti korpusnyh Internet-tehnologiy v prepodavanii inostrannogo yazyka v vysshey shkole” [Didactic Possibilities of Corpus Internet Technologies in Teaching a Foreign Language in Higher Education]. Vestnik Moskovskogo gosudarstvennogo oblastnogo universiteta [Moscow Regional University Herald], no. 4, pp. 123-127.
  4. Sosnina, Ye. P. Korpusnaya lingvistika i korpusnyy podhod v obuchenii inostrannomu yazyku [Corpus Linguistics and Corpus Approach in Foreign Language Teaching]. 
  5. Sysoev, P. V. 2010. “Lingvisticheskiy korpus v metodike obucheniya inostrannym yazykam” [Linguistic Corpus in the Methodology of Teaching Foreign Languages]. Yazyk i kultura [Language and Culture], no. 1(9), pp. 99-111.
  6. Kiyan, O. N. Autentichnost kak metodicheskaya kategoriya v obuchenii inostrannomu yazyku [Authenticity as a Methodical Category in Foreign Language Teaching]. 
  7. Shchukin, A. N. 2008. Lingvodidakticheskiy enciklopedicheskiy slovar [Linguodidactic Encyclopedic Dictionary]. Moscow: Astrel; AST; Khranitel.
  8. Zaharov, V. P. 2005. Korpusnaya lingvistika: uchebno-metod. posobie [Corpus Linguistics: A Teaching Manual]. St. Petersburg.
  9. Yomunda. “Poiskovaya mashina dlya nemeckoyazychnyh tekstov” [Search Engine for German-Language Texts].
  10. Shhipicina, L. Yu. 2013. Informacionnye tehnologii v lingvistike. Uchebnoe posobie [Information Technology in Linguistics. A Manual]. Moscow: Flinta.
  11. Anthony, L. A Freeware Corpus Analysis Toolkit for Concordancing and Text Analysis. 
  12. Lezius, W. MORPHY für Windows. Ein Tool zur Deutschen Morphologie. 
  13. Gemeinsamer Europäischer Referenzrahmen für Sprachen: Lernen, Lehren, Beurteilen. 
  14. Glaboniat, M., M. Müller, H. Schmitz, P. Rusch and L. Wertenschlag. 2005. Profile Deutsch. Version 2.0.