Accès libre

Folk Tales from Diverse Cultures: Digital Analysis of Content using Natural Language Processing

  
19 mars 2025
À propos de cet article

Citez
Télécharger la couverture

Figure 1.

Data flow architecture diagram
Data flow architecture diagram

Figure 2.

Architecture diagram of distributed crawler
Architecture diagram of distributed crawler

Figure 3.

Natural language processing flow chart
Natural language processing flow chart

Figure 4.

The distribution of the average number of phrases per chapter
The distribution of the average number of phrases per chapter

Figure 5.

The distribution of the average length of each chapter paragraph
The distribution of the average length of each chapter paragraph

Figure 6.

The distribution of the average sentence length per chapter
The distribution of the average sentence length per chapter

Analysis results of complex network characteristics of writers’ works

Writer Aggregation coefficient Mean distance Aggregation coefficient * Average distance
Wu Cheng’en 0.464 2.146 0.995744
Pu Songling 0.526 2.087 1.097762
Luo Guanzhong 0.531 2.138 1.135278
Zhen Guangzu 0.525 1.955 1.026375
Sima Qian 0.489 2.036 0.995604
Zuo Qiuming 0.632 1.863 1.177416

Analysis results of complex network characteristics

Title of work Aggregation coefficient Mean distance Aggregation coefficient * Average distance
“The Classic of Mountains and Seas” 0.738 1.654 1.220652
“Journey to the West” 0.582 2.189 1.273998
“Romance of the Gods” 0.622 1.746 1.086012
“Daming Palace Ci” 0.492 2.297 1.130124
“Wild History of Ming and Qing Dynasties” 0.286 3.147 0.900042
“The Legacy of Supreme Harmony” 0.393 2.532 0.995074
“Records of the Grand Historian” 0.793 1.893 1.501149
“The Biography of Zuo” 0.863 2.109 1.820067
“Ci Hai” 0.973 1.743 1.695939
This text 0.167 3.862 0.644954
Collection of scientific articles 0.274 2.753 0.754322