A study of constructed paths based on big data analysis facing the framework of modern and contemporary literature education
Online veröffentlicht: 24. März 2025
Eingereicht: 14. Okt. 2024
Akzeptiert: 07. Feb. 2025
DOI: https://doi.org/10.2478/amns-2025-0749
Schlüsselwörter
© 2025 Shaojuan Liu, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Currently in the new stage of education, the national education department responds to the wave of education reform and gives a new look to literature education. With the deepening of the education reform situation, the teaching of modern and contemporary literature should be carried out by making full use of the teaching resources of our schools, combining with the professional teaching situation, adjusting and optimizing the methods and modes of literature teaching, enhancing the quality of teaching, improving the relationship between teachers and students, and paying attention to the cultivation of the students’ comprehensive quality and enhancement, help students understand the Chinese literary system, analyze the part of the traditional teaching mode that is not compatible with the modern requirements, solve the problem of students’ disinterest in literature teaching, improve the status quo of education, and aim to inspire the teaching of Chinese modern and contemporary literature to a higher level [1-4].
As we all know, Chinese modern and contemporary literature is a compulsory course for Chinese majors in Chinese colleges and universities, and it is also a basic and core course in the undergraduate level. Usually the scope of Chinese modern and contemporary literature covers Chinese literary works from 1917 to the present day, and from the May Fourth Movement to modern literature, all of them are the scope of what should be studied in the course of Chinese modern and contemporary literature [5-6]. Chinese modern and contemporary literature connects the basic courses and elective courses of Chinese language and literature majors, explores the development of China’s century-long literary history, studies the literary works of great writers within a century, and analyzes literary phenomena, literary trends, and literary movements, and teachers should show the law of the development of the history of literature in the process of lecturing, and analyze the social background and political roots behind the literature. However, from the perspective of literature [7-8]. The curriculum of modern and contemporary literature is a teaching material formed during the long period of change in the last century, in which various factors outside the body of literature have had a significant impact on the style of literary works, and these factors come from all directions, including the world, the domestic, the history and the reality, which are always influencing the literary debates and literary creations, and even dominating the literary movements [9-10].
As an important basic course in the language discipline, current and contemporary literature has a crucial position and role. It carries the development and inheritance of literary education, and also shoulders the important mission of cultivating students’ literary literacy and humanistic feelings. In the context of modern economy and society, with the change of people’s life put way, people’s spiritual activities also change [11-12]. The general lack of humanistic values is the status quo of contemporary education. How to make the present and contemporary literature develop better in such a big environment and better reflect its humanistic value and social value is what we need to pay attention to [13].
With the development of science and technology and the progress of the times, people’s lifestyles have changed dramatically. Once literature was the spiritual food on which people depended, and good literature promoted the progress of people’s thinking and brought spiritual enlightenment to people [14]. But in the context of the era of fast-paced life and pan-entertainment, there are more and more ways of entertainment for people to choose. The rise of the Internet has even brought people diversified ways of entertainment. Fast-food consumption occupies most people’s lives, people no longer choose to read books in peace and quiet, and the proliferation of network novels has made more valuable works of modern and contemporary literature obliterated [15-16].
The same is true on campus, where students are no longer willing to think and their interest in the classics of current and contemporary literature is declining. Few students read valuable classic works of modern and contemporary literature, but only stereotypically as the content of textbooks to learn. Education is the hope for the future of a country and society, so it is especially important to use modern and contemporary literature in education and teaching. How to reflect the connotation of modern and contemporary literature in education and teaching and lead students’ literary aesthetics and literary quality is a problem that education and teaching workers need to think about [17-19].
This paper builds a knowledge point network based on mining and analyzing the knowledge point association rules of modern and contemporary literature. It clarifies the advantages of mining knowledge point association rules in students’ literature learning process, which can help teachers understand their students’ learning progress and needs. The basic concepts of knowledge point model, knowledge point judgment, and knowledge point association rule mining method are elaborated respectively, and the overall process of knowledge point association mining in modern and contemporary literature is sorted out. By extracting and establishing the association of knowledge points in the textbook, along with parameter analysis, a knowledge point network is constructed. Based on the frequent pattern, the test questions in the question bank of modern and contemporary literature are mined and analyzed in terms of wrong and right answers to verify that the knowledge points are related to each other.
Mining knowledge points based on the data generated in the process of students’ literature learning is a key link in building a personalized teaching model. Through real-time mining and analysis of students’ behavioral data of searching and collecting knowledge points in the process of literature learning, teachers are able to have a comprehensive understanding of students’ disciplinary performance, learning progress and needs, so as to provide accurate guidance and support for personalized teaching.
Mining of knowledge points based on data generated in the process of literature learning allows for real-time tracking of students’ performance in the discipline. By analyzing students’ knowledge point data in literature reading and literature comprehension tests, teachers can discover students’ subject strengths and weaknesses in a timely manner. This provides the basis for personalized instruction, which allows teachers to target and strengthen students’ knowledge weaknesses to improve overall subject matter proficiency. Mining knowledge point data can help understand students’ learning progress, which in turn can help to adjust the teaching program. Through big data analysis of students’ knowledge point learning time, progress, participation and other data, teachers can understand when, where and to what extent students are learning, and thus flexibly adjust the teaching plan, to ensure that students’ individual differences on the basis of their own learning rhythm to promote teaching. Knowledge point data mining analysis can help teachers better understand the subject needs of their students. By analyzing the data on the frequency of searching for knowledge points and the length of stay of students in the process of literature learning, teachers can find out that students are confused about certain knowledge points or topics, and provide personalized answers and guidance in a timely manner, so as to satisfy the students’ needs and improve their sense of identity in the subject. Mining knowledge points based on the data generated in the process of literary learning provides timely and comprehensive information on students’ disciplines for the personalized teaching mode. Through real-time understanding of students’ subject performance, learning progress, and needs, teachers are able to formulate personalized literature teaching strategies to improve teaching effectiveness and students’ satisfaction with the subject.
The overall flow of the course knowledge point association mining model is shown in Figure 1. The model analyzes the learner’s learning behavior records, i.e., the time spent on each page of the courseware, the bookmarking of pages, and the action of flipping back and forth to determine which pages may contain knowledge points, and mines the associations between knowledge points. These possible associations mined will be stored in a knowledge point association database, where each association

Knowledge point association mining model
In actual teaching, the knowledge points have various forms and connotations, and the knowledge points have characteristics. However, the knowledge points in the courseware still have commonality. Different knowledge points are in different branches of concepts, which in turn have definitions, attributes and behaviors, and their polymorphism. It is easy to see that concepts have a strong object-oriented idea. For this reason, this paper proposes to use ABCDR model, i.e., the concept node model diagram in Fig. 2, to represent the concept node approach.

Conceptual node model diagram
The nodes of a concept consist of five parts: attributes, behaviors, concept words, descriptions, and associations. We believe that concepts are the unity of the pentad.
The five-tuple node model of concepts is a rational and superior way to represent and encapsulate knowledge points, as demonstrated by the following aspects.
Conform to people’s cognitive structure. The adopted conceptual network organization of knowledge is in line with the cognitive structure of people. The cognitive structure of human beings is net-like rather than linear.
Favorable to knowledge sharing and reuse. With the help of the concepts of base class and derived class, and the concept of inheritance in object-oriented programming, lower knowledge points can inherit certain attributes from the upper knowledge points. Knowledge points can also be flexibly combined to form new knowledge points.
Conform to the trend of databaseization of courseware. We have encapsulated a variety of knowledge points into a five-element model, so that it can be easily stored in a database.
Generalizability. The description of knowledge uses a consistent conceptual node approach that is both highly general and facilitates the sharing and retrieval of courseware resources, which leads to an increase in reuse rate.
Expandability. In the conceptual model of knowledge points, conceptual attributes can be easily added to guide students into different learning paths based on their individual situation, including the difficulty of courseware resources and other attributes.
A concept word is a terminological lexicon used primarily as a label for concepts. It consists of a concept name and an alias or synonym that describes the concept at the word level.
The concept name is the identification of the concept, and what should be paid attention to in its naming is accuracy and uniqueness.
Accuracy should be able to exactly express the substantive meaning of the concept, avoid ambiguity, and adopt formal names and scientific names as far as possible. Generally speaking, the subject of the knowledge is a high concept of knowledge, often a very concise few words, such as “modern Chinese language”, “ancient literature”, etc., as far as possible, the subject of the knowledge as the name of the concept. Uniqueness Each concept should have a unique name to distinguish it from other concepts. However, in a large network of concepts, it is often difficult to ensure unique naming. Typical examples are: graphical “trees” and plant “trees”, medical “viruses” and computer “viruses”. To ensure the requirement of uniqueness, all concepts are assigned a unique identification number ID, which is used as an index value for concept retrieval in the concept network. Other word combinations are as follows:
Synonyms, such as “fighter” and “fighter plane”; Common name and scientific name, common name is a kind of people usually called, not as standardized as the scientific name, often and the location of the user, the industry to which it belongs to have a close relationship. For example, “computer” is the common name for “computer”, and it can be assumed that the user may be an amateur or the magazine is more lifestyle-oriented. And then there is “calcium carbonate” and “limestone”, “QQ” and “Internet pager”; Different languages, such as “Windows 2000” and “Windows 2000”; Abbreviations and full names, e.g., “NASA” and “National Aeronautics and Space Administration”; Between different translations, e.g., “lyser” and “laser”; Different spelling forms, such as “chrome” “Duluo” and “Duge”, “WIN2K” and “WINDOWS2000”.
Attributes are static characteristics of the thing represented by the concept. Attributes are data items that reflect the essential features of the concept. Attributes are represented as sets, with
The discovery of attributes lies in the care and observation of things. In general, what one cares about may be used as an attribute; what one observes from a perspective may be used as an attribute from that perspective. In an instructional system, attributes can be selected from a perspective that is applicable to instruction. The selection of attributes can be based on the following principles.
Distinguish the feature saliency queue, keep the most significant if the thousands, ignore the others, for example, in “C Language Programming”, “parameter” is a significant attribute of “function”. Easy to reuse, an existing knowledge point can be used as an attribute of another knowledge point, e.g., “parameter” can be an independent knowledge point, and at the same time, it can also be an attribute of “function”. There are many points of knowledge in teaching that cannot be subdivided and become meta-knowledge points. The more meta-knowledge points, the better for reuse. The hierarchy of inherited knowledge points is obvious, and the catalog tree of each chapter of the traditional textbook can be simply understood as a knowledge hierarchy tree. In the conceptual network, the attributes of the upper nodes are an abstraction and generalization of the attributes of all lower nodes subordinate to it, called general attributes, and the general attributes can be determined to be invariant within a certain period of time and under certain occasions. The subordinate lower node can be inherited from the upper concept, then the lower node sets a new attribute with its own unique characteristics. For example, “function pointer” as a sub-concept under “pointer” has its own unique properties. Applicable to a specific teaching environment, the situation and connotation of knowledge points are very different, we abstract them into a five-element concept node model, in order to reflect its diversity, almost all the characteristics can be put into the attribute set of the concept node. Such as the difficulty of this knowledge point, the degree of importance, the applicable object, and so on. Attribute value is a measure on a certain characteristic, depending on the nature of the attribute characteristic, usually has the following values:
Numerical values such as height, specific gravity, complexity, some of which are with a scale. Numerical values are further divided into continuous values, discrete values, which are taken exactly. Such as 4 cylinders, 10 meters, k (1024) Logical values things are true or false, judgment is established or not, connected and disconnected. Enumerated values, value domain is a set, can take one of the values. Such as gender values for {male, female}; traffic light values for {red, yellow, green}, etc. Fuzzy values Some attributes take values that give a range of values, and the boundaries of different values are not obvious. For example, when age is divided into old, middle, young, and young, there is a large degree of fuzziness. For the fuzzy values are expressed using the degree of affiliation.
A learner can favorite a page of the courseware when using the mobile client for online learning. Therefore, it is possible to determine which pages contain key points or general knowledge points of interest to that learner by the status of the learner’s favorites of the pages, and record the set
If there is only one learner, then when his dwell time on consecutive pages exceeds the threshold or collects consecutive pages, then it will be determined that these consecutive pages contain the same knowledge point. If there are only two learners and their collections of consecutive pages are
The key points are the important knowledge in the courseware that is present throughout the chapter or even the entire course, and most learners will bookmark the page containing the key points; the general knowledge points are less important than the key points. Most learners do not add it to their favorites, but it is still connected to subsequent knowledge.
The reason for distinguishing between key and general knowledge points in this paper is that it is difficult and unnecessary to find connections between key knowledge and other knowledge points, so only connections between general knowledge points will be explored. Because the focus is usually related to most of the content of the entire course or the current chapter, it is difficult to identify all of these connections. And because it is the focus, being added to the collection by the learner, it can be found easily, so it is not necessary to find out the association between the real focus of the course and the other one or two pages to prompt the learner. Let the collection of focused knowledge be
Where
Knowledge point is a localized collection of relatively complete knowledge of logical significance, which is the basic unit for transmitting teaching information in the process of teaching and learning, and it includes theories, principles, concepts, definitions, examples and conclusions.
In Internet technology, information may be presented in the form of graphics, images, documents, audio, video, etc., but each form of presentation has certain limitations. Therefore, most of the online course knowledge points combine these presentation forms to achieve optimal teaching and learning results.
In the learning process, it is difficult to achieve the ideal learning effect by learning a certain knowledge point alone without combining the relationship between that knowledge point and other knowledge points. Therefore, the relationship between knowledge points is a research hotspot in both traditional and online education. Learners can form an in-depth understanding of the systematic structure of knowledge points by mastering the relationship between knowledge points. The relationship between knowledge points is generally classified into the following types.
Parent-child relationship: A composite knowledge point is formed by combining several knowledge points, and the relationship between the composite knowledge point and these knowledge points is a parent-child relationship. And the father can participate in the composition of other composite knowledge points, which forms a hierarchical structure between the knowledge points, a course in the knowledge points usually have this hierarchical structure. leading and following relationship: in the study of a knowledge point before, sometimes need to master other knowledge points, in other words, for the study of the two knowledge points there is an inevitable sequence, then the two knowledge points that is the leading and following relationship. Reference relationship: learning a knowledge point can refer to the content of another knowledge point, but between the two knowledge points and there is no inevitable sequence of learning, then the two knowledge points that is the reference relationship. Parallelism: no clear parent-child relationship between the two knowledge points, leading and following the relationship, and the two knowledge points at the same level, then the two knowledge points is parallelism.
If there is a pattern between two or more variables, then the pattern between them can try to be explained by association. Association rules are often used to mine the intrinsic correlation between different items in an event, to extract the hidden knowledge structure, and will not change the original distribution of the sample, to avoid the “dimensionality catastrophe” caused by too many labeled items. Association rule mining usually refers to extracting the common probability of occurrence between items or entities, in order to extract potential behavioral patterns of users. Therefore, association rules are commonly employed to analyze the shopping or browsing behaviors of users. For example, in the famous story of “Beer and Diapers”, there is a 75% probability that a customer who buys diapers at the supermarket will buy beer at the same time. The results of this association rule extraction prompted the supermarket to place diapers and beer in close proximity to each other, which proved to be a significant increase in the sales of both. The effectiveness of association rules in uncovering implicit connections between events is self-evident.
From the principle of association rule, suppose
The association rule contains three metrics: support, confidence and enhancement. The support of the association rule is expressed as the ratio of the number of documents containing both itemset
Mining of association rules
The FP-growth algorithm is realized based on a special prefix tree FP-tree, which needs to be established before executing the FP-growth algorithm and is defined as follows.
Define FP-tree: FP-tree consists of an item prefix subtree and a frequent item header table;
Item prefix subtree, the nodes in the tree include: item name, item count value and node chain. The item name indicates the name of the item represented by the node, the item count value indicates the number of transactions in the path to reach the node, and the node chain connects the next node in the tree with the same item name, and the value is null if the next node does not exist. Frequent item header table, whose entries consist of two parts: the item name and a pointer to the first node with the item name.
In FP-tree, for a certain node
FP-growth algorithm is an association rule mining algorithm proposed by Chia-Wei Tsao et al. The algorithm compresses the frequent itemsets in the transaction database into a frequent pattern tree FP-tree by one scan, while retaining the association information in it, and then divides the FP-tree into a number of conditional libraries corresponding to the length of the frequent itemsets of length
Based on the above general process of course knowledge point correlation mining, the following section combines specific textbooks and test questions to construct a network of literary knowledge points.
The content of the next book of History of Modern Chinese Literature 1915-2022 (Fourth Edition) by Peking University Press is analyzed, in which the introduction, exercises and chapter guides in the textbook are not counted in the scope of knowledge point extraction. According to the definition and extraction method of knowledge points in the concept definition, the specific content of the textbook is deconstructed, and a total of 208 knowledge points are extracted, each of which has a specific corresponding paragraph in the textbook, and if a certain paragraph belongs to a certain topic, then the corresponding knowledge point belongs to a certain topic, and it is difficult to show the number of knowledge points completely due to the large number of knowledge points, so the number of statistical data of various types of knowledge points and the topics they belong to are as follows Table 1.
Network knowledge points scale of teaching materials
| Special subject | One | Two | Three | Four | Five | Six | total |
|---|---|---|---|---|---|---|---|
| Number of knowledge points | 25 | 32 | 30 | 48 | 32 | 41 | 208 |
| scale | 12.0% | 15.4% | 14.4% | 23.1% | 15.4% | 19.7% | 100% |
According to the rules of confirmation of knowledge point connection, 208 knowledge points in the research content of the next book of “History of Modern Chinese Literature 1915-2022 (Fourth Edition)” of Peking University Press were analyzed, and the neighbor matrix (208 horizontal rows and 208 vertical rows, both indicating 208 knowledge points) was constructed by using Excel software for statistics, and the analysis process was as follows: starting from the first Knowledge points to start, if the first knowledge point and the nth knowledge point related to the first line of the nth column space marked as 1, analyze each knowledge point one by one for statistics. At the same time, note that due to the sequence of knowledge points, if the nth knowledge point and the n+1th knowledge point is related, then only in the nth row of the n+1st column marking, no longer in the n+1st row of the nth column marking, so as to derive a network adjacency matrix that can be constructed without weights. (Note: using this method to record is actually recorded a directed network, but the subsequent analysis using simple undirected network analysis, so the subsequent adjustment in the Pajek software can be).
The above knowledge point extraction and judgment of knowledge point association were carried out by the two researchers based on the conceptual definition respectively, and the results of the extraction and judgment reached 90.5% agreement, and the subsequent network mapping and analysis was carried out after reaching a unification of the different judgments.
Node point degree centrality The distribution of node point degree centrality in the knowledge network of the statistical textbook is shown in Figure 3 below. Combined with Figure 3, it can be seen that the degree of 208 knowledge points ranges from 1-18, in which the number of knowledge points increases and then decreases as the degree increases, the number of knowledge points with a degree of 3 and 4 is the largest number of knowledge points, the overall majority of the knowledge point degree centered on 2-4 or so, accounting for about 59% of the total number of knowledge points when the degree value reaches 10 and after the number of knowledge points was once reduced to a total of 17, accounting for 8% of the total number of knowledge points or so. As a whole, through the software calculations, the average point of knowledge points of the center of the degree of 4.83, which means that a knowledge point is associated with about 5 knowledge points on average, from a local point of view, the number of knowledge points with a degree equal to and more than 10 accounted for only 8%, and less than 2% of the knowledge points accounted for 2%. The above data show that most of the knowledge points in the knowledge network of the textbook can be associated with two or more knowledge points, and the association between the knowledge points is close, while the knowledge points with a degree value of more than 10 can be associated with a very large number of knowledge points, which is the skeleton knowledge of the textbook knowledge network, and the percentage of this part of the knowledge is relatively small. Proximity to the center The degree of proximity to the center reflects the degree of the node in the center of the network, the greater the degree of proximity to the center, indicating that this node is more directly associated with other nodes. The use of Pajek software on the knowledge network statistics, calculated the degree of proximity to the center of each node, due to the normalization of the value is less than 1, the distribution of each value of the statistics plotted in Figure 4. As in Figure 4, the horizontal coordinate is the serial number of 208 nodes, and the vertical coordinate is the degree of proximity to the center of each node, it can be seen that the degree of each node is located in the range of 0.25-0.45, and most of them are concentrated in about 0.35 (the light purple part of Figure 4), and there are points with a larger degree of more than 0.4. This shows that some nodes have a direct correlation with many nodes (knowledge points span and are directly related to the knowledge points of different units), and most of the nodes are more evenly distributed near the nodes with a larger degree, indicating that this part of the knowledge points is directly related to one of the nodes and less with the other nodes (more associated with the knowledge points in the unit), and there are also some nodes with a small degree. It is suggested that only one or a few knowledge nodes are associated (only a small number of knowledge points in the unit are associated). Mediator Center Degree As the name suggests, the mediator centrality degree implies the ability of a node to act as an intermediary bridge to associate two other nodes, using Pajek software to statistically analyze the knowledge network to obtain the mediator centrality degree of each node, plotted as in Fig. 5. As can be seen in Fig. 5, the horizontal coordinate is the serial number of the 208 nodes, and the vertical coordinate is the number of mediator centrality degrees of individual nodes, and the mediator centrality degree of the majority of the nodes is less than 0.05,. Eight nodes (about 3% of the nodes) have larger degrees than 0.10, and a small number of nodes (15% of the nodes) have degrees located between 0.035-0.095. This shows that the textbook network plays an intermediary role in the control of knowledge is not the majority, on the one hand, in order to effectively and quickly learn knowledge, the need to focus on mastering this kind of intermediary knowledge, and on the other hand, also shows that the textbook knowledge connectivity has more ways, and there is not only one aspect of the knowledge can be associated.

Figure of number of nodes in point degree center degree

Distribution of knowledge point proximity to center

Distribution of center degree of knowledge center
Frequent pattern mining of the data in the examination system of the history of modern Chinese literature can be studied from the perspective of test questions, knowledge points, etc. In this section, we take the “test questions” as the basic research object, conduct frequent pattern mining and association rule acquisition, and then analyze the corresponding knowledge points of the test questions with the results.
The goal of this section is to mine the association between simultaneous wrong answers to test questions, the question bank of about 6000 questions, 150 questions are extracted from each paper, the probability of each question being extracted is about 0.025, the probability of answering the right answer to the wrong test question is obtained according to the teacher’s law of the questions of the course are about 0.5, according to empirical calculation of the average probability of the appearance of each item in the dataset is 0.015. In the case of determining the degree of support, according to the confidence level. The change curve of the number of correlation rules in the interval [0.25,0.65] found that the number of correlation rules began to decrease in the confidence level of 0.4, and then the decrease tends to level off, combined with the specific number of correlation rules in the dataset, the confidence level of 0.45 was selected.
After you have determined a method to use frequent pattern mining and support-confidence levels, you need to implement correlation rule mining on your computer. The FP-growth algorithm is a typical algorithm based on the support-confidence framework, which achieves good results by limiting the generation of candidates to find frequent itemsets, but cannot avoid the “negative correlation defect” of the support-confidence framework, for example, an association rule is obtained by using the FP-growth algorithm, as shown in equation (8).
The implication of this rule is that if the student answers question 5610 correctly, then the probability of his correct answer to question 5782 is 0.6, but in fact, the support degree of question 5782 is 0.65, that is, the probability of answering question 5782 correctly in the total data set is 0.65, which means that the former question has a negative impact on the latter question, and these “negative correlations” need to be screened out in the mining results, so it is necessary to introduce a boost degree to solve the “negative correlation” defect in the support-confidence framework.
Setting the support s=0.015 confidence c=0.45 for the association between wrongly answered test questions, the data of wrongly answered test questions got 978 association rules, which were sorted according to the confidence of the rules, and the top 10 are shown in Table 2.
Association rules between incorrect answers at the same time
| Pre-test question | Corresponding knowledge point | Post test question | Corresponding knowledge point | Support degree | Confidence degree | Lift degree | |
|---|---|---|---|---|---|---|---|
| 1 | 5936 | teahouse | 5889 | 1950s - 1970s plays | 0.012 | 0.72 | 2.475 |
| 2 | 6725 | Liu Qing | 5889 | 1950s - 1970s novelist | 0.010 | 0.621 | 2.142 |
| 3 | 6257 | Ten-year literary trend | 5889 | Literary trends from 1950s to 1970s | 0.013 | 0.630 | 2.067 |
| 4 | 5961 | Taiwan novel | 5889 | 1950s - 1970s Literature in Hong Kong and Taiwan | 0.012 | 0.625 | 2.036 |
| 5 | 5583 | Yu Guangzhong | 5889 | Taiwan new poetry | 0.011 | 0.617 | 2.050 |
| 6 | 8675 | Wang Zengqi | 5889 | 1980s prose | 0.014 | 0.523 | 2.023 |
| 7 | 5547 | Lu Yao | 5832 | 1980s novels | 0.013 | 0.561 | 2.571 |
| 8 | 6481 | Mo Yan | 5526 | 1980s novels | 0.012 | 0.575 | 2.8124 |
| 9 | 6321 | On the hoof | 5872 | 1990s novels | 0.015 | 0.53 | 1.987 |
| 10 | 5648 | Wang Anyi | 5834 | 1990s novels | 0.013 | 0.567 | 2.536 |
According to the association rules in Table 2, we found that if students answered the pre-test questions incorrectly, the rate of answering the post-test questions incorrectly also increased relatively, which shows that there is a knowledge association between the pre-test questions and the post-test questions. Further analysis revealed that students were prone to give incorrect answers to the pre-test questions that examined the works of specific literary authors, and correspondingly, to give incorrect answers to the post-test questions on the superordinate concepts such as the “literary genres” to which these specific authors belonged. From this, we can find that students are easy to answer the knowledge points of the first few units of the textbook incorrectly when they are studying History of Modern Chinese Literature 1915-2022. It is speculated that the reason for this may be that students studied these units at the beginning of the semester, and the time interval between them and the final exam is long, which leads to the situation that the relevant knowledge points are forgotten more. Teachers can improve their teaching planning accordingly to address this situation.
The association rule mining between simultaneous right-answer questions has the same method as the association rule mining between simultaneous wrong-answer questions in 4.2.1, based on which, the association between simultaneous right-answer questions is directly analyzed in this section.
The association rules between simultaneous right answer questions are counted, and the content of the questions is viewed to investigate the students’ perception of the difficulty of the questions, and the questions that appear as the top 5 frequencies of the antecedent and the consequent items are obtained as shown in Table 3.
The most frequent correct answers to the questions
| Preceding paragraph | frequency | content | Difficulty | consequent | frequency | content | Difficulty |
|---|---|---|---|---|---|---|---|
| 6341 | 21 | Taiwan and Hong Kong novels | harder | 5505 | 721 | 1980s - 1990s genre of fiction | easiness |
| 6515 | 16 | Lai Shengchuan | easiness | 5803 | 613 | 1980s - 1990s poetry | easiness |
| 5874 | 16 | Misty poetry | easiness | 5503 | 571 | 1980s - 1990s poetry | easiness |
| 5873 | 16 | Modern and contemporary minority novels | easiness | 5862 | 520 | Modern and contemporary minority literature | easiness |
| 6861 | 15 | Popular culture and network literature | normal | 5498 | 453 | Literary trends since 2000 | easiness |
The analysis of the topics in Table 3 reveals that the topics frequently appearing as the antecedent are basically examining the basic knowledge points despite their varying difficulty, and that answering the topics examining the basics indicates that the students’ mastery of the basic knowledge points is firm, and naturally, there is a high probability of answering the other topics correctly; and the questions with high frequency of the latter items are generally simple, which indicates that answering the antecedent problems correctly has a high probability of answering the simpler questions correctly after the latter items as well.
To analyze the knowledge points of specific test questions, the knowledge points in the middle or back unit of the textbook are relatively easy for students, and the reason for this may be that most teachers will focus on the middle or back unit, and the time interval between the study of the knowledge points in the middle or back unit and the final exam is relatively short, and students’ memory and review of the relevant knowledge points are more in place. Teachers have the ability to modify their teaching plans accordingly.
This paper focuses on how to construct a network of associated knowledge points in modern and contemporary literature using big data analysis technology. According to the clear knowledge point association rule mining process, 208 knowledge points of the textbook History of Modern Chinese Literature 1915-2022 (4th edition) are extracted and established in association, and the relevant parameters are analyzed, and it is found that the vast majority of the knowledge points in the knowledge network of the textbook are able to produce direct associations with 2 and more knowledge points, and the knowledge points are closely associated with each other. Most of the nodes intermediary center degree is less than 0.05, accounting for about 3% of the nodes with a degree of more than 0.10, accounting for 15% of the nodes with a degree of between 0.035-0.095, playing the role of intermediary, and not the majority of the knowledge points with strong control ability. Further to the 6000 test questions in the modern literary history examination system for frequent pattern mining, analyze the correlation between simultaneous wrong answers to test questions and the correlation between simultaneous right answers to test questions, to verify that the students’ mastery of a certain knowledge point affects the understanding of other knowledge points, and that the teacher needs to adjust the relevant knowledge point lecture time and other teaching plans according to the students’ knowledge point mastery.
Using big data analysis technology, combined with modern and contemporary literature textbooks and test questions, to explore the correlation between different knowledge points and students’ mastery of different knowledge points, can scientifically and clearly find out the problems that need to be improved in the process of teachers’ teaching and students’ learning, and help the education of modern and contemporary literature to realize systematization and systematization.
