Open access

Research on English Corpus Construction and Optimisation of Language Teaching Strategies Supported by Data Mining Algorithms

  
17 March 2025
About this article


In this paper, the word2vec text mining method is applied to extract target English teaching resources, such as teaching materials, videos, newspapers and magazines. The collected resources are preprocessed and subjected to clustering, morpheme and association analysis to derive a series of keywords. These keywords are then used to collect additional material, and the process is iterated until a corpus of a certain size is constructed. Applying the corpus to English teaching, with the corpus as the object of study, yields a total of 184 corpus entries related to [APPOINT]. There are 11 collocations related to “be addicted to”, comprising 8 negative words and 3 positive words. Eleven common errors in college students’ writing are also identified. The performance of English majors in S colleges and universities improved significantly after English corpus teaching was applied (P = 0.003). These results indicate that the English corpus has a positive effect on English teaching.
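
The iterative construction loop described in the abstract (derive keywords, use them to collect more material, repeat until the corpus reaches a target size) can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: keywords are chosen by plain word frequency as a stand-in for the word2vec, clustering and association analysis; new material is drawn from an in-memory list of candidate documents rather than from real teaching resources; and the names `tokenize`, `top_keywords`, `expand_corpus`, `STOPWORDS`, `k` and `max_rounds` are hypothetical.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list, not the paper's preprocessing.
STOPWORDS = {"the", "a", "an", "of", "to", "in", "and", "is", "are", "for", "on", "with", "at"}


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def top_keywords(docs, k):
    """Return the k most frequent non-stopword tokens across docs.

    Assumption: raw frequency stands in for the paper's word2vec-based
    keyword derivation.
    """
    counts = Counter(
        tok for doc in docs for tok in tokenize(doc) if tok not in STOPWORDS
    )
    return [word for word, _ in counts.most_common(k)]


def expand_corpus(seed_docs, pool, target_size, k=3, max_rounds=10):
    """Grow a corpus from seed_docs by repeatedly pulling keyword matches from pool."""
    corpus = list(seed_docs)
    remaining = [doc for doc in pool if doc not in corpus]
    for _ in range(max_rounds):
        if len(corpus) >= target_size:
            break
        # Re-derive keywords from the current corpus on every round.
        keywords = set(top_keywords(corpus, k))
        matched = [doc for doc in remaining if keywords & set(tokenize(doc))]
        if not matched:
            break  # no candidate shares a keyword, so the loop cannot progress
        room = target_size - len(corpus)
        corpus.extend(matched[:room])
        remaining = [doc for doc in remaining if doc not in corpus]
    return corpus
```

Calling `expand_corpus(seed_docs, pool, target_size)` with a few seed sentences and a larger candidate list returns the seeds followed by the documents pulled in round by round. Because the keywords are recomputed each round, documents that share no keyword with the seeds can still be reached later through vocabulary introduced by earlier additions.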