Parsing Korean Classical Literature by Integrating Text Mining and Semantic Analysis 
Mar 19, 2025
About this article
Published Online: Mar 19, 2025
Received: Oct 23, 2024
Accepted: Jan 29, 2025
DOI: https://doi.org/10.2478/amns-2025-0517
© 2025 Lai Wei, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Figure 1.

Figure 2.

Figure 3.

Figure 4.

TF-IDF weights of keywords
| Number | Keywords | TF-IDF | Number | Keywords | TF-IDF | 
|---|---|---|---|---|---|
| 1 | Legend | 0.0359 | 11 | Earth | 0.0200 | 
| 2 | Folklore | 0.0347 | 12 | Buddhism | 0.0196 | 
| 3 | Dynasty | 0.0345 | 13 | Morality | 0.0182 | 
| 4 | Goli | 0.0341 | 14 | Maidservant | 0.016 | 
| 5 | Aristocracy | 0.0310 | 15 | Pariah | 0.015 | 
| 6 | Confucian | 0.0305 | 16 | Taoism | 0.0144 | 
| 7 | Heaven | 0.0300 | 17 | Court | 0.0132 | 
| 8 | Imperial envoy | 0.0280 | 18 | People | 0.0126 | 
| 9 | China | 0.0246 | 19 | Benevolence | 0.0109 | 
| 10 | Identity | 0.0211 | 20 | Analects | 0.0088 | 
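For readers who want to see how weights of this kind arise, the following is a minimal sketch of a textbook TF-IDF computation. It assumes tf = term count / document length and idf = log(N / df); the exact variant behind the table is not stated here, and the toy corpus and the function name `tf_idf` are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Toy corpus with hypothetical tokens (not the data behind the table).
docs = [
    ["legend", "heaven", "legend", "court"],
    ["folklore", "heaven", "dynasty"],
    ["heaven", "morality", "legend"],
]
weights = tf_idf(docs)
```

Under this weighting, a term that occurs in every document (here `heaven`) receives weight zero, so a ranking of this kind favours terms concentrated in a few works.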
Lift values of attribute keywords for each work
| No. | Work | Consequent | Lift | Work | Consequent | Lift | Work | Consequent | Lift | 
|---|---|---|---|---|---|---|---|---|---|
| 1 | A | Legend | 0.4093 | B | Legend | 0.3246 | C | Heaven | 0.3458 | 
| 2 | A | Morality | 0.3637 | B | Identity | 0.3176 | C | Earth | 0.3152 | 
| 3 | A | Maidservant | 0.3052 | B | Folklore | 0.3102 | C | Morality | 0.2891 | 
| 4 | A | Benevolence | 0.2519 | B | Morality | 0.2551 | C | Goli | 0.2675 | 
| 5 | A | Folklore | 0.2003 | B | Aristocracy | 0.2173 | C | Identity | 0.2326 | 
| 6 | A | Court | 0.1833 | B | Confucian | 0.1878 | C | People | 0.2188 | 
| 7 | A | People | 0.1284 | B | Pariah | 0.1803 | C | Dynasty | 0.2019 | 
| 8 | A | Confucian | 0.0721 | B | People | 0.1093 | C | Pariah | 0.1476 | 
| 9 | A | Goli | 0.0504 | B | Earth | 0.0491 | C | Court | 0.1463 | 
| 10 | A | Pariah | 0.0333 | B | Dynasty | 0.0383 | C | Legend | 0.0963 | 
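As a point of reference, the sketch below computes lift as it is usually defined in association-rule mining, lift(X → Y) = P(X, Y) / (P(X) · P(Y)). This is an assumption: the table's values may come from a differently scaled or normalized measure. The transaction layout (one set of labels per text segment, containing a work label and the keywords found there) and the function name `lift` are hypothetical.

```python
def lift(transactions, antecedent, consequent):
    """Standard lift of the rule antecedent -> consequent.

    transactions: list of sets of items (assumed layout: work label plus keywords).
    """
    n = len(transactions)
    if n == 0:
        raise ValueError("transactions must not be empty")
    n_a = sum(1 for t in transactions if antecedent in t)
    n_c = sum(1 for t in transactions if consequent in t)
    n_both = sum(1 for t in transactions if antecedent in t and consequent in t)
    if n_a == 0 or n_c == 0:
        # Lift is undefined when either side never occurs; return 0.0 by convention here.
        return 0.0
    return (n_both / n) / ((n_a / n) * (n_c / n))


# Toy transactions (hypothetical, not the data behind the table).
transactions = [
    {"A", "legend"},
    {"A", "morality"},
    {"B", "legend"},
    {"B", "identity"},
]
lift_a_legend = lift(transactions, "A", "legend")
lift_a_morality = lift(transactions, "A", "morality")
lift_a_identity = lift(transactions, "A", "identity")
```

With this definition, a value of 1 means the work and the keyword occur together exactly as often as independence would predict, and larger values indicate a stronger association.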
High-frequency keywords in Korean classical literary works
| Keyword | Frequency | Keyword | Frequency | 
|---|---|---|---|
| Heaven | 2909 | Mythology | 857 | 
| Earth | 1965 | Official | 790 | 
| Folklore | 1594 | Poetry | 759 | 
| Confucian | 1403 | Goli | 748 | 
| Buddhism | 1392 | Three Kingdoms | 743 | 
| Taoism | 1287 | Chinese | 732 | 
| Morality | 1266 | Benevolence | 726 | 
| Dynasty | 1149 | Politeness | 716 | 
| Legend | 1101 | Filial piety | 703 | 
| Aristocracy | 986 | Loyalty | 684 | 
| Drama | 956 | Ethics | 675 | 
| Art | 947 | North | 621 | 
| Qu Yuan | 943 | Religious belief | 579 | 
| Identity | 942 | Analects | 569 | 
| Maidservant | 931 | Compassion | 568 | 
| Imperial envoy | 916 | Tao Yuanming | 557 | 
| People | 912 | Happiness | 551 | 
| Pariah | 903 | Disaster | 529 | 
| Court | 901 | Elegance | 516 | 
| China | 881 | Root | 500 | 
Keyword co-occurrence matrix
| | K1 | K2 | K3 | K4 | K5 | K6 | K7 | K8 | K9 |
|---|---|---|---|---|---|---|---|---|---|
| K1 | 0 | 685 | 823 | 542 | 293 | 77 | 112 | 302 | 20 | 
| K2 | 685 | 0 | 326 | 187 | 83 | 227 | 101 | 128 | 27 | 
| K3 | 823 | 326 | 0 | 152 | 54 | 89 | 115 | 378 | 23 | 
| K4 | 542 | 187 | 152 | 0 | 85 | 26 | 31 | 11 | 43 | 
| K5 | 293 | 83 | 54 | 85 | 0 | 38 | 62 | 48 | 29 | 
| K6 | 77 | 227 | 89 | 26 | 38 | 0 | 53 | 92 | 4 | 
| K7 | 112 | 101 | 115 | 31 | 62 | 53 | 0 | 221 | 9 | 
| K8 | 302 | 128 | 378 | 11 | 48 | 92 | 221 | 0 | 3 | 
| K9 | 20 | 27 | 23 | 43 | 29 | 4 | 9 | 3 | 0 | 
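A symmetric matrix with a zero diagonal, like the one above, is what results from counting how often pairs of keywords appear together. The sketch below shows one way such counts could be produced, assuming each text segment is a list of tokens and each pair is counted at most once per segment. The segmentation unit, the toy data, and the function name `cooccurrence_matrix` are hypothetical.

```python
from itertools import combinations


def cooccurrence_matrix(segments, keywords):
    """Count, for each keyword pair, the segments in which both appear."""
    index = {kw: i for i, kw in enumerate(keywords)}
    size = len(keywords)
    matrix = [[0] * size for _ in range(size)]
    for segment in segments:
        # Distinct keyword indices present in this segment.
        present = sorted({index[tok] for tok in segment if tok in index})
        for i, j in combinations(present, 2):
            matrix[i][j] += 1
            matrix[j][i] += 1  # keep the matrix symmetric
    return matrix


# Toy input (hypothetical, not the data behind the table).
keywords = ["heaven", "earth", "folklore"]
segments = [
    ["heaven", "earth", "river"],
    ["heaven", "folklore", "earth"],
    ["folklore"],
]
matrix = cooccurrence_matrix(segments, keywords)
```

The diagonal stays at zero because a keyword is never paired with itself, which matches the layout of the table.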
TF-IDF weights of Korean classical literary works
| Weight rank | Literary work | TF-IDF weight | 
|---|---|---|
| 1 | A | 0.0342 | 
| 7 | C | 0.0179 | 
| 28 | B | 0.0113 | 
| 103 | E | 0.0098 | 
| 112 | D | 0.0082 | 
