Research on multi-label short text categorization method for online education under deep learning

The rapid development of the modern Internet has not only changed our way of life, but also changed the previous mode of education and learning, and the online education mode has been greatly developed and improved accordingly. In this paper, BERT model is used to extract word vectors of multilabel short texts for online education, and then BiLSTM-CNN model is used to extract features of short texts, and a classifier is constructed by Sigmoid activation function to realize the output of classification results of multilabel short texts. The validation analysis of the model’s effectiveness was conducted using the public dataset THCNEWS and the self-collected EduData as examples. The loss and Marco-P of the model after 5*10⁵ steps of training converged stably around 0.085 vs. 96.05%. The Marco-F1 values of the multi-label short text classification model on the THCNEWS and EduData datasets reach 0.915 and 0.962, which are significantly higher than the individual comparison models. Combining deep learning technology with multi-label short text classification for online education can achieve accurate classification of text data and provide new exploration ideas for improving the quality of online education.

Lingua:: Inglese

Frequenza di pubblicazione:: 1 volte all'anno
Argomenti della rivista:: Scienze biologiche, Scienze della vita, altro, Matematica, Matematica applicata, Matematica generale, Fisica, Fisica, altro

Feed RSS della rivista

Research on multi-label short text categorization method for online education under deep learning

Yinuo Guo

Pubblicato online: 19 mar 2025

Ricevuto: 11 nov 2024

Accettato: 15 feb 2025

DOI: https://doi.org/10.2478/amns-2025-0391

Parole chiaveBERT model, BiLSTM-CNN model, Text categorization, Multi-label short text, Online education

© 2025 Yinuo Guo, published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Parole chiave
BERT model, BiLSTM-CNN model, Text categorization, Multi-label short text, Online education