An Exploration of Artificial Intelligence Assisted Strategies in English Reading Teaching

With the deepening of the new curriculum reform and the rapid development of artificial intelligence, the application scenarios of artificial intelligence technology in English teaching are increasingly rich and diversified, and its role has become more and more prominent, and it has begun to be used to explore new teaching modes to adapt to the learning needs and characteristics of different students, to promote the personalized development of students, and to provide strong support for the construction of a highly effective and student-centered English language classroom [1-4]. The new curriculum standard clearly puts forward that teachers should actively utilize network resources to enrich the content and form of teaching with innovative teaching methods, enhance students’ interest and initiative in learning, and thus significantly improve the effect of classroom teaching [5]. The use of artificial intelligence technology to assist education can enhance the teaching effect and meet students’ learning needs [6]. Artificial intelligence assisted teaching needs to follow three principles, namely, the principle of student nature, the principle of development, the principle of moderation [7]. Assisted teaching should be based on students’ learning attitudes, learning methods, learning status, learning habits and other specific circumstances, to decide whether to use artificial intelligence, when and where to use, and how to use, in order to maximize the students’ learning needs, to ensure that students achieve real learning [8-10]. Artificial intelligence-assisted independent learning by students changes students’ attitudes and learning styles, realizes learning anytime and anywhere, and makes progress in learning ability and information literacy [11-12]. It aims to improve the traditional teaching methods so that students can get good development. It does not mean that it can be applied to every aspect of teaching, and it should follow the principle of moderation and reasonably use artificial intelligence for teaching.

At present, most students inevitably encounter many difficulties in English learning, such as insufficient vocabulary, unclear grammar rules, difficulties in reading comprehension and other problems, which lead them to be easily intimidated in learning, and then affect the enthusiasm and motivation of English learning. In order to effectively solve this problem, schools need to give full play to the leading role, scientific and rational use of artificial intelligence to assist teaching, as an opportunity to create a more vivid, intuitive, inspiring English learning environment for students [13-14].

In the development of the times, artificial intelligence has been used quite a lot in English teaching. Literature [15] utilizes AI image recognition technology and AI-supported optical character recognition to place students in a real-world environment to learn English vocabulary while instructing them with self-regulated strategies, showing that image recognition is superior. Literature [16] concludes that AI programs can assist non-native English speakers in scientific writing through Elicit and ResearchRabbit. Scispace Copilot summarizes the content of References, Grammarly and Paperpal corrects grammar, and ChatGPT rewrites the text and reduces checking. Similarly, literature [17] shows that ChatGPT can also perform English composition writing, with a predominantly active-passive use of grammar. In addition, literature [18] used artificial intelligence to build an online English teaching system that can be personalized for students with different achievement levels to improve teaching efficiency. Similarly, literature [19] designed an online intelligent English learning system that combines the ideas of artificial intelligence and expert systems, which can develop appropriate teaching strategies for students and conduct scientific training tests on the knowledge that has been learned, so as to accurately improve the teaching effect. Literature [20] implemented an immersive teaching experience for students with virtual reality technology supported by artificial intelligence and machine learning, and the experimental teaching effect was remarkable. Literature [21] develops chatbots with AI technology to assist English learning, and intelligent chatting assists schools in cultivating students’ interest in learning. Further, literature [22] set up an AI-supported English online teaching model, combined with deep belief network localization, which realistically captures students’ real-time location. Synthesizing the above related literature, we find that the application of artificial intelligence in English teaching mainly focuses on several aspects such as teaching level, grammar, writing, learning interest, learning efficiency, and mastering students’ dynamics, and lacks the related research on AI-assisted English reading teaching.

The article first proposes strategies for teaching English reading in four dimensions: a mixed cognitive diagnostic model, scaffolding teaching model, inquiry teaching model, and cognitive diagnostic assessment model. Secondly, the joint research design, based on the Q-matrix theory, completed the preparation of cognitive diagnostic test papers for “English reading”, and formed a set of final test papers to be distributed and recycled. After that, we conducted further research on the technology related to AI-assisted English reading teaching, and first proposed a knowledge tracking model based on deep forgetting modeling. Taking the knowledge tracking model based on key-value pair memory network as a framework, using LSTM to improve the long-term dependence of the model, and considering three kinds of forgetting factors, the residual network is used to realize the deep forgetting modeling for the forgetting behavior. Meanwhile, a CF-DKD method combining learners’ cognitive and external memory matrices is proposed to predict learners’ performance in future test questions and diagnose their knowledge mastery status. Finally, the relationship between the attributes in reading ability is demonstrated through data-based analysis, and the applicability of the DINA model in the cognitive diagnosis study of language proficiency is verified; meanwhile, two classes with comparable reading levels are selected for empirical analysis of the effect of English reading teaching.

2

Strategies for Building an English Reading Teaching Model Based on Diagnostic Assessment

2.1

Strengthening the mechanism of second language reading and processing

English reading teaching involves bilingual conversion between English and Chinese, which falls under the category of bilingual reading teaching. Based on the current lower-order teaching model, reading instruction should first explore the contributions of various factors by strengthening relevant research in second language reading. The cognitive diagnostic model, on the other hand, precisely subdivided the macro-competence in the field of English reading into different cognitive attributes such as knowledge, skills, strategies, etc., and utilized the compensatory model, non-compensatory model, and saturation model to study a certain attribute such as vocabulary or syntax, the attribute measured by the reading task, etc., to explore the intrinsic relationship between the reading skills, and to reveal the complexity of the characteristics of second-language reading. Obviously, the mixed cognitive diagnostic model covers the macro and micro levels of theory, which needs to be used by English reading teachers as a guide to explore the relationship between reading vocabulary, syntax, details, reasoning, and main idea, to build a theoretical framework between language knowledge and strategic skills, and to incorporate the reading performances of students at different levels into the framework for in-depth exploration, to explain the mechanism of bilingual reading processing, and finally to provide practical references for the mixed cognitive diagnostic model. Provide practical reference.

2.2

Improve the English Reading Teaching System

English reading teaching based on diagnostic assessment needs to focus on constructing systems such as teaching context and design on the basis of the processing mechanism, from redefining the position of the teaching subject in reading teaching to updating the teaching method in order to realize the goal of improving the English reading level and classroom teaching efficiency [23]. Based on this, the scaffolding teaching mode, as a new trend of reading teaching at this stage and even for a longer period of time in the future, is a new type of teaching method based on constructivism and other theories that focuses on the learner and cultivates students’ problem-solving and autonomy. The model respects the students’ protagonist status as the premise, puts the students in the center, objectively requires teachers to practice the teaching method of scaffolding, set up and throw questions to guide students step by step, so that students really understand the main idea of the reading materials, thus speculate the potential meaning of the article, and further explore the meaning of the corresponding vocabulary, phrases, and so on.

2.3

Strengthening autonomous cognitive abilities

Diagnostic assessment is a product of the information age, a mode of teaching introduced by modern educational concepts and technical equipment, etc., with innate openness. Based on this, the mode of English reading teaching should be expanded from classroom teaching to extracurricular teaching. Classroom reading teaching should be transformed from static to dynamic, with teachers guiding, assisting and supervising students to read independently, and completing inquiry and discussion through teacher-student interaction or student-student interaction, so as to lay a foundation for students to improve their comprehensive English ability. Extracurricular reading teaching creates an open environment through the medium of informatized education platforms and digital resources in libraries, so that students can go out of the classroom and carry out various forms of cooperative learning activities.

2.4

Strengthening teaching diagnostic studies

English reading, as an interactive activity between readers and texts, involves a multifaceted and complex cognitive processing process. In English reading teaching, teachers and students are able to act as readers together, and need to utilize different cognitive skills, language skills, etc., to integrate and process the information of the reading text, so as to achieve the goal of comprehending the whole chapter. Cognitive diagnostic assessment is based on the cognitive psychology model, incorporating the cognitive model of the relevant reading tasks into the model, and providing qualitative or quantitative scientific diagnostic results for different students by measuring the learners’ reading knowledge, skills, and strategic processing processes, especially by scientifically analyzing the secret behind their reading results. Under this premise, the definition of cognitive diagnostic attributes of English reading, model construction and result feedback are the key links, which need to be combined with linguistics and other perspectives to classify the reading attributes into language knowledge, strategy skills, cognition and other dimensions, and then use cognitive psychology knowledge to carry out coding analysis of different dimensions, laying a foundation for the construction of the model. In the construction of diagnostic models, students of different groups at the same level or students of different levels in the same group should be taken as samples, focusing on analyzing the differences between individuals, so as to generate objective feedback results [24].

3

Study design

3.1

Purpose of the study

In this study, the G-DINA cognitive diagnostic model is used to conduct cognitive diagnosis of English reading. The results of cognitive diagnostic assessment can accurately reveal the cognitive attributes that students have mastered and have not mastered, so that teachers can figure out the students’ mastery of each cognitive attribute of English reading, and then more reasonably carry out the design of teaching and arrange the teaching content. The students themselves can understand where they are “sick” and what knowledge and skills they have not mastered, and review and consolidate them in time, and the parents can have a more in-depth understanding of their children’s learning situation.

3.2

Content of the study

Based on the G-DINA model, this study selects English “English reading” as the research topic, analyzes the internal logic of “English reading” knowledge, and diagnoses students’ “English reading” cognitive situation. The diagnosis is determined by the results of the diagnosis, and the students’ knowledge of English reading is evaluated. Based on the results of the diagnosis, we will provide guidance for students’ remedial teaching and teachers’ remedial teaching. The details are as follows: 1)

Identify the cognitive attributes associated with the topic of “reading in English” in the context of the content of the textbook and the test questions, and establish a hierarchical structure among these attributes.

2)

According to the Q matrix theory, the “English Reading” cognitive diagnostic test paper was compiled, and the reliability of the test, the difficulty and discrimination of the questions were tested by the pre-test, and the applicability of the G-DINA model in the diagnosis of “English Reading” was verified, and the test questions were perfected to obtain the final test paper of “English Reading” cognitive diagnosis.

3)

Collecting students’ responses to the test, using Excel and the cognitive diagnostic analysis platform flexCDMs, etc., and choosing the G-DINA model to analyze in detail the cognitive structure of the participating students in “English reading,” and providing teachers with targeted remedial teaching suggestions.

3.3

Subjects of the study

According to Piaget’s four-stage theory of cognitive development, for students in the first year of high school, after experiencing the systematic learning in the middle school stage, they have established a solid foundation of English knowledge and have a grasp of English reading methods. Before coming into contact with English reading, students familiarize themselves with the related knowledge, which lays the foundation for their understanding of English reading.

In order to gain a deeper understanding of students’ perception of English reading, this study selected 2668 students from 10 classes in a high school as the sample.

3.4

Research ideas

The research idea is shown in Figure 1. Based on literature analysis and in-depth discussions with front-line teachers and masters of education in the same major, the relevant cognitive attributes of “English Reading” were preliminarily determined, and the hierarchical relationship between these attributes was established. In order to further confirm the accuracy and rationality of these cognitive attributes and their hierarchical structure, a questionnaire on the recognition of the cognitive attributes and hierarchical relationships of “English Reading” was distributed to front-line teachers. By collecting and analyzing the feedback information of the questionnaire, the cognitive attributes and hierarchical relationships of “English Reading” were finally determined.

Secondly, the test paper of “English Reading” was prepared based on the Q matrix theory. In order to ensure the quality of the test paper, resources such as textbooks, tutorials and previous college entrance examination questions were comprehensively utilized to design a test paper that could cover all cognitive attributes and examine each attribute several times, and it was repeatedly revised with the opinions of front-line teachers. After the initial test paper is completed, it is necessary to carry out pre-testing, analyze the subjects’ response data, test the quality of the initial test paper, and determine whether or not to make adjustments to the test questions based on the results of the test, so as to finally form a standardized cognitive diagnostic test paper for “English Reading.”

After completing the preparatory work, the students were given the “English Reading” cognitive diagnostic test papers and the test was scheduled to be conducted at a uniform time. After the test was completed, the papers were collected and carefully corrected. For the purpose of subsequent analysis, students’ scores on each item, gender, class, and other information were coded. Next, the coded data were comprehensively analyzed using Excel, the cognitive diagnostic software flexCDMs, and SPSS27.0 software. The G-DINA model was chosen for cognitive diagnosis based on full consideration of the theoretical basis and model data fit. We analyzed the attribute mastery patterns and the probability of mastery of each attribute, and compared the differences in the mastery of “English reading” between different classes and different genders.

Finally, based on the results of the cognitive diagnosis, possible pathways for students to learn “Reading in English” are proposed, and some remedial teaching suggestions are made based on them.

4

Cognitive Diagnosis of Artificial Intelligence-Assisted English Reading Teaching

4.1

Knowledge Tracking Modeling

4.1.1

Principles of IRT

IRT, also known as Latent Trait Theory or Item Curve Theory, is a modern testing theory proposed to overcome the shortcomings of classical testing theory.

In the mathematical model of IRT, it is assumed that the learning ability of learner i can be measured, denoted by θ_i. Each item, i.e., learner exercise test, will have a difficulty coefficient of Exercise j denoted as β_j. The probability of a learner answering a question correctly by answering an exercise numbered j on a test is expressed using probability P(a), P(a) determined by the item response function, which is characterized by the following: if a learner has a higher learning ability, this learner has a higher probability of answering a question correctly: if the question answered is more difficult, the probability of the learner answering the question correctly is lower. In item response theory models, logistic regression models are commonly used as item response functions: (1) $P (a) = s i g m o i d (θ - β_{j}) = \frac{1}{1 + \exp (- (θ - β_{j}))}$

The item response functions for the two-parameter logistic regression IRT model are as follows: (2) $P_{i} (θ_{i}) = \frac{1}{1 + \exp (- 1.7 a_{j} (θ_{i} - b_{j}))}$

In the two-parameter logistic regression model, a_j denotes the discriminant parameter of item j, which represents the learner’s discrimination ability, and b_j denotes the discriminant parameter of item j, which represents the difficulty coefficient of the exercise.

IRT is designed for educational testing environments, and the model assumes that learners’ abilities do not change in the testing environment; however, in knowledge tracking, students’ knowledge status changes over time, and therefore, the initial IRT cannot be used directly for knowledge tracking tasks.

4.1.2

Knowledge Tracking Model Considering Forgetting Behavior

Knowledge tracking models considering forgetting behavior can be divided into two categories, knowledge tracking models based on the extension of the BKT model and deep knowledge tracking models [25].

In the proposed BKT model considering forgetting behavior, the default is that the learner forgets the previously learned knowledge after a few days of learning, and a new parameter “forgetn” is introduced to indicate the forgetting rate on a certain day after the learning interaction, but this model cannot simulate the forgetting behavior on a smaller scale.

On the basis of the BKT model considering forgetting behavior, a knowledge tracking model is proposed to estimate the probability of forgetting through the exercise sequence, for example, there is an example of exercises with exercise sequence A₁–A₂–B₁–A₃–B₂–B₃–A₄ denoting and knowledge point A,B, assuming that the probability of forgetting between A₁,A₂ is F, then the probability of forgetting between A₂ and A₃ is 1–(1–F)², and that the probability of forgetting between A₃ and A₄ is 1–(1–F)³. However, the model does not take into account the interval between the before and after answering sequences.

Considering the three forgetting behaviors, the DKT model is extended to propose a DKT model with forgetting ability, and the model’s structure for modeling forgetting behavior is shown in Figure 2.

First, the model expresses the exercise interaction sequence x_t = (q_t,r_t) embedding as the exercise embedding vector v_t, and the three forgetting factors embedding as the vector c_t, and operates the two to obtain the embedding vector $v_{t}^{c}$ after the forgetting process: (3) $v_{t}^{c} = f (v_{t}, c_{t})$

Here f denotes one of the tandem, product or tandem product operations.

Then, the knowledge mastery state value h_t and the knowledge mastery state value after forgetting treatment are calculated $h_{t}^{c}$ . Finally, the probability of answering the question correctly next time is predicted based on $h_{t}^{c}$ : (4) $h_{t} = R N N (v_{t}^{c}, h_{t - 1})$ (5) $h_{t}^{c} = f (h_{t}, c_{t + 1})$ (6) $y_{t} = s i g m o i d (W^{o u t} h_{t}^{c} + b^{o u t})$

4.1.3

IRT-based knowledge tracking models

Deep-IRT’s model combines IRT and DKVMN model, which retains the performance of DKVMN model, provides mental explanations for both learners and exercises, and improves the interpretability of deep knowledge tracking model. In the DKVMN model, for an exercise q_t received at timestamp t, a feature vector f_t is generated after model training, which contains the learner’s knowledge mastery state and exercise embedding information for exercise q_t. The Deep-IRT model uses a neural network to build a student ability network layer and an exercise difficulty network layer, uses f_t vectors to infer the learner’s learning ability on exercise q_t, and uses the Exercise Embedding Vector q_t to compute the difficulty of the exercises: (7) $θ_{t j} = t a n h (W_{θ} f_{t} + b_{θ})$ (8) $β_{j} = t a n h (W_{β} q_{t} + b_{β})$

Here θ_ti and β_j denote the learning ability and the difficulty of the exercise for knowledge point j at time stamp t, respectively, and then the probability that the learner will answer the question correctly is calculated p_t based on the learning ability and the difficulty of the exercise: (9) $p_{t} = s i g m o i d (3.0 * θ_{t j} - β_{j})$

The output of the learning ability network layer was scaled up by multiplying it by a factor of 3.0 for practical reasons, and the item response theory that was refined in this model can be applied to other models.

4.2

Dynamic cognitive diagnostics based on learning and forgetting factors

4.2.1

Problem definition

CF-DKD is a supervised learning method that tracks the learner’s knowledge proficiency through the learner’s sequence of test responses on each assessment.

The task description of CF-DKD is as follows: learner history interaction data is a sequence of learner responses to test questions at different moments, denoted by X = {x₁,x₂,…,x_t–1}. where the answer-response record, cell x_t = (e_t,r_t,f_t,l_t), is a quaternion indicating that a particular learner executes trial e_t,(e_t ∈ {e1,…,e|E|}) once per trial at a certain moment t. The incorrect and correct responses are r_t, which is a binary variable r_t = {0,1}. If the learner answers correctly, r_t = 1. Otherwise, r_t = 0. In addition, we use f_i,l_i to denote the cognitive factors forgetting and learning, respectively.

4.2.2

Solutions and frameworks

The solution is shown in Figure 3. We propose a dynamic knowledge diagnostic method (CF-DKD) incorporating cognitive features to track changes in learners’ potential knowledge mastery status.

The complete one-step CF-DKD framework is shown in Fig. 4. CF-DKD is a time-series model that utilizes a memory network to dynamically store potential knowledge capabilities, and it provides a cognitive rule-based interface for reading and writing hidden information.

4.2.3

Cognitive Feature Extraction

Based on the coding basis of extrinsic features, CF-DKD achieves the modeling of cognitive laws of learning and forgetting by classifying extrinsic knowledge features, interaction features, behavioral features, and temporal features according to cognitive laws, i.e., the modeling of cognitive laws of learning and forgetting. By introducing forgetting and learning factors to improve learners’ knowledge mastery [26]. The forgetting and learning features are shown in Figure 5.

4.2.4

Memory Layer Reading

We use the key-value matrix pairs M_cw and M_sw to store knowledge mastery states in CF-DKD, instead of using a single hidden layer to store knowledge mastery states in the more traditional DKT model. M_ek is an invariant key matrix that stores potential knowledge, and M_sw is a dynamic value matrix that stores each learner’s knowledge proficiency. These two memory matrices have the same slots, each representing a potential knowledge. The memory layer consists of two steps:

1)

Key reading

For a given test question input e_i(e, ∈ E) and Step 1, we first encode the test questions with a one-hot vector, where E is the set of test questions and |E| denotes the number of exercises. Due to the sparsity of the one-hot vector, we map it to a dense space, e_i multiplied by an embedding matrix A ∈ R^|E|×d_k to obtain a continuous vector $k_{i} \in ℝ_{i}^{d}$ .

(10)

k_{i} = e_{i} * A

In order to obtain the relevance of the current test question to the potential knowledge, we use the attention mechanism by calculating the inner product vector k_i of the embedding vector of the current test question with the key matrix M_ece as follows: (11) $w_{t} (i) = s o f t \max (k_{i}^{T} M_{e x \int}^{'} (i))$

where $s o f t \max (z_{i}) = e_{z} i \sum_{i = 1}^{n} e^{z,}$ and dw_i(i) ∈ [0,1].

2)

Value reading

Given the relevant weights, we retrieve the potential knowledge states at the time of the trial question from the learner’s value matrix. Thus, the trial mastery state is computed by multiplying the weighted sum of all relevant potential knowledge states by the corresponding correlation weights.

4.2.5

Knowledge status updates

First, given test questions and corresponding response tuples as inputs, the external memory approach uses wipe gates and add gates to update the hidden layer, ignoring the effect of long-term cognitive dependence on learners’ knowledge acquisition. Although various forgetting variables are added to the DKT+F model to affect the updating process, only one hidden layer cannot accurately represent the real hidden learning process of RNNs. Finally, we use an additional key-value mnemonic to store potential states. Thus, to balance the learning and forgetting factors, we propose two gates inspired by the thresholding mechanism to adaptively fuse the two features. Examples are the forgetting gate from LSTM, the updating gate from GRU, and the adding and deleting gate from GKT.

In our CF-DKD update method, the forgetting gate F_t controls the information to be deleted from the post-new-trial value matrix $M_{s t u}^{t - 1}$ , which contains the embedded current responses 3 v_t and the long-term forgetting factor f_t. The temporary forgetting vector ${\tilde{f}}_{t}$ is generated by combining the learner’s responses v_t(v_t ∈ ℝ^|d_v|) and forgetting features f_t(f_t ∈ ℝ^|d_f|). The forgetting information F_t can be computed by the fully connected layer with activation as: (12) ${\tilde{f}}_{t} = φ (f_{t}, ν_{t})$ (13) $F_{ι} = s i g m o i d (F^{T} \cdot {\tilde{f}}_{t} + b_{f})$

f_t = φ(Δs_t,Δr_t) and φ(·) are the integration functions that will be introduced in section “(VI) Prediction”. F^τ is a weight matrix F^T ∈ ℝ(d_r + d_f)p(d_r+d_f), where each element is a scalar between 0 and 1.

Similar to the forgetting gate, the learning gate L_t controls which knowledge needs to be enhanced in the current knowledge state $M_{s t u}^{t - 1}$ through the current response v_t and the long-term memory of the learning factor t_t. Therefore, we use the same integration method to integrate these two factors into a temporary learning vector ${\tilde{l}}_{t}$ . We can obtain the learning information through a fully connected layer with tanh activation L_t. The vector L_t is computed as (14) ${\tilde{l}}_{t} = φ (Δ C_{t}, ν_{t})$ (15) $L_{t} = \tanh (L^{T} \cdot {\tilde{l}}_{t} + b_{1})$

where is the L^T-weight matrix.

(16)

L^{T} \in ℝ^{(d_{v} + d_{i})} \times (d_{v} + d_{i})

However, not all information has the same effect of weakening or strengthening prior information for the gate mechanism. We therefore multiply $M_{s t u}^{t - 1} (i)$ with the correlation weight w_t to solve the problem of correlated trials during reading. Thus, the value memory matrix $M_{s t u}^{t - 1} (i)$ is updated in the following way: (17) $M_{s t u}^{t} = M_{s t u}^{t - 1} (i) [1 - w_{t} (i) F_{t}] + w_{t} (i) L_{t}$

4.2.6

Prediction of learners’ future performance

How can we integrate various factors to predict learners’ performance on the next trial?

1)

Multiple integration functions

Given the various features of a new trial < k₁,f_l,l_l >, the first important task is to integrate them into a unified tensor. First, the most popular integration method is splicing, which stacks all the feature vectors together without changing the original vectors. Second, multiplication modifies the original vectors by multiplying the contextual information. Therefore, we use matrix multiplication to perform this method. Third, splicing and multiplication combine the first two methods to further enhance cognitively relevant information.

2)

Learner performance prediction

After combining all features with a uniform representation vector φ_in, we connect it to the read content vector read_t.

A two-layer feed-forward neural network is used to obtain the likelihood of answering a question correctly, as shown in the following equation: (18) $f o r w a r d_{t} = \tanh (W_{1}^{T} [r e a d_{t}, φ_{i n}]) + b_{1}$ (19) $p_{t} = s i g m o i d (W_{2}^{T} \cdot f o r w a r d_{t} + b_{2})$

Where the first layer employs a tanh activation function, $\tanh (z_{i}) = (e^{z_{i}} - e^{- z_{i}^{'}}) / (e^{z_{i}} + e^{- z_{i}})$ . The second layer employs a sigmoid activation function to obtain the final prediction, which is a scalar indicating the probability of answering the test question correctly e_t. In addition, sigmoid(z_i) = 1/(1+e^−z).

5

Experimentation and analysis

5.1

Data analysis and discussion

5.1.1

Describing statistics

Based on the students’ scores in the reading section, the candidates’ reading scores were converted into levels according to the score-level conversion criteria. The CEDT test reading scores were converted into levels according to the score-level conversion criteria and the ratio of the number of students as shown in Table 1. Reading paper score (0,30) can be converted into 11 levels, 010-025 level students into the C level class (beginner), 030-040 level into the B level class (intermediate), 045-060 level into the A level class (advanced) for categorized teaching. The majority of candidates’ total scores are concentrated in the (13,15) and (16,18) bands, corresponding to the 035 level (22.00%) and the 040 level (17.8%).

Table 1.

CEDT tests reading scores and grade conversion standards and proportion

Score	Grade	Number	Percentage	Class	Number	Percentage
0	010	174	6.52%	C	419	15.70%
1-3	015	50	1.87%
4-6	020	85	3.19%
7-9	025	110	4.12%
10-12	030	308	11.54%	B	1370	51.35%
13-15	035	587	22.00%
16-18	040	475	17.80%
19-21	045	439	16.45%	A	879	32.95%
22-24	050	347	13.01%
25-27	055	90	3.37%
28-30	060	3	0.11%

The total reading scores of the overall students were analyzed with descriptive statistics using the SPSS statistical package and the descriptive statistics of the overall reading scores of the students are shown in Table 2. The sample size of the cohort was large (N=2668), the full distance of the sample distribution was wide (Range=29), and the inter-individual variation was large (Std=4.75), which is encompassing and representative, and the reading ability of the subject sample as a whole was good (Mean=15.82, Mode=16).

Table 2.

The overall reading score is descriptive

Sample size	In effect	2668
Sample size	Deletion	0
Mean	15.82
Median	17
Mode Number	16
Standard Deviation	4.75
Partial State	-0.178
Peak	-0.221
Full Distance	29
Minimum Value	0.00
Maximum Value	29

A histogram of the distribution of total student reading scores is shown in Figure 6. The normal Q-Q probability plot of total reading scores is shown in Figure 7. Observing the graph, it can be seen that the overall distribution of students’ total reading scores slightly deviates from the normal distribution line, with a skewness index of -0.175 (absolute value less than 1), which is slightly negatively skewed.

5.1.2

Cognitive diagnostic analysis

In this section, a series of cognitive diagnostic analyses of the test’s empirical data are conducted using the G-DINA model. Multilevel diagnostics are conducted for students in general, for groups of different levels and for individual students, while the relationships between reading attributes are discussed to make a case for the applicability of the G-DINA model in language testing.

1)

Probability of mastery of overall student attributes

In this paper, the most common MLE estimation method will be used for diagnostic analysis. First, the G-DINA model is used to estimate the average probability of mastery of five reading attributes among 2668 subjects. The overall attribute mastery probabilities of the students are shown in Table 3. According to the magnitude of the mastery probabilities, the difficulty of attribute mastery was ranked from difficult to easy:

A4 (Articulation and Generalization) > A5 (Reasoning) > A2 (Complex Sentences and Structures) > A3 (Explicit Detailed Information) > A1 (Words and Phrases)

Overall, students’ mastery of attribute A4 (articulation and generalization) is weak, with a probability of mastery of 0.473, indicating that most students have not yet mastered it. Comparatively speaking, for the other four attributes A1, A2, A3 and A4, the probability of mastery is in the range of (0.5,0.8), which means that most of the students have mastered these attributes well.

Table 3.

Student: the overall property of the student is the probability

Code	Attribute	Master probability
A1	Words and phrases	0.7116
A2	Complex sentences and structures	0.548
A3	Explicit details	0.653
A4	Cohesion and generalization	0.473
A5	reasoning	0.5146

2)

Overall mastery patterns and distribution of students

Through cognitive diagnostic analysis, we can identify the main patterns of students’ overall mastery of attributes and their distribution probabilities, which reflect the main types of cognitive errors made by students. In this paper, there are 32 cognitive modes for the five attributes in theory, and the main modes and distribution probabilities of students’ reading attributes are shown in Table 4 (“0” means that they have not mastered the attribute, and “1” means that they have mastered it). The largest proportion of patterns is all mastery (11111), with 18.22% of students mastering all the attributes. On the one hand, it shows that this group of students has better reading skills, and on the other hand, it also shows that all the five reading attributes are related to each other, which reflects the difficult differentiation of language skills. This pattern, followed by (00000), i.e., not mastering all the attributes, also has a higher distribution probability (11.99%) since it does not involve mastering any attribute and is not constrained by the relationship between the attributes.

Table 4.

The students read the main mode and distribution probability of the properties

	Master Mode	Percentage	Number		Master Mode	Percentage	Number
1	11111	18.22%	486	17	10011	43	1.61%
2	00000	11.99%	320	18	10001	41	1.54%
3	10100	9.00%	240	19	01001	39	1.46%
4	10000	8.17%	218	20	10101	27	1.01%
5	01000	8.32%	222	21	00111	12	0.45%
6	11100	8.13%	217	22	10110	14	0.52%
7	11010	4.09%	109	23	11000	10	0.37%
8	11011	4.05%	108	24	11001	11	0.41%
9	10111	3.19%	85	25	01010	6	0.22%
10	00100	3.04%	81	26	01011	9	0.34%
11	01100	2.92%	78	27	00010	5	0.19%
12	00101	2.29%	61	28	00011	2	0.07%
13	01101	2.32%	62	29	11101	2	0.07%
14	11110	2.21%	59	30	00110	0	0.00%
15	01110	2.02%	54	31	00001	0	0.00%
16	10010	1.76%	47	32	01111	0	0.00%

The overall attribute patterns and distribution ratio of students are shown in Figure 8. The results show that the two patterns, (11111) and (00000), are the most prevalent patterns of attribute mastery. This finding is consistent with other cognitive diagnostic studies. The phenomenon is determined by the high positive correlation between the attributes, which means that the mastery of one of the attributes tends to predict the mastery of the other, and vice versa.

3)

Attribute mastery characteristics of different groups

The reading attribute mastery probabilities and difference tests of the three level groups are shown in Table 5. The chi-square test found that the probability of mastery of the five attributes of the three level groups was compared, and the analysis found that the five P-values were lower than the critical value (α=0.005, p<0.001), which shows that the probability of mastery of the five attributes of the three level groups are significantly different. In other words, for the five reading attributes, the mean value of mastery probability of level A group is significantly higher than that of level B group, and the mean value of mastery probability of level B group is significantly higher than that of level C group. In contrast, the variance in the probability of mastery of the attributes was greater for the B-level group than for the A- and C-level groups. The A-level group is more proficient in each attribute, with mastery probabilities above 0.8. The mastery probabilities of the five attributes are more balanced, with the smallest variation.

Table 5.

Reading properties master probability and difference tests

	Attribute
	Sample size	A1	A2	A3	A4	A5
Class A	419	0.9911	0.7296	1.3245	1.0915	0.6082
Class B	1370	0.5898	0.2854	0.66	0.463	0.5372
Class C	879	0.5434	0.4217	0.4552	0.7224	1.0231
Calorie value		425.656	238.068	411.1	462.402	32.867
Freedom		3	3	3	3	3
Significance		0.000	0.000	0.000	0.000	0.000

The differences in the probability of mastery of the reading attributes for the three level groups are shown in Table 6. Compared with the level A group, the level B and C groups have some differences in the probability of mastery of each attribute, with the most significant differences in attributes A4 and A5. For both the B and C level groups, the attribute with the worst probability of mastery is A5, which is significantly lower than the average probability of mastery of the other attributes, indicating that remedial learning in this area should be strengthened in the primary and intermediate student groups.

Table 6.

The reading properties of the three horizontal groups master the difference

	A1	A2	A3	A4	A5
A-B	0.1126	0.278	0.2183	0.3466	0.3915
A-C	0.3662	0.3566	0.3977	0.4171	0.5274

5.1.3

Individual Student Attribute Mastery Characteristics

In this section, 10 students were randomly selected from the cohort to observe their scoring patterns and attribute mastery. The attribute mastery of the 10 individual students is shown in Table 7 (“0” means that the question was answered incorrectly and “1” means that the question was answered correctly). As can be seen from the table, although the total scores of this group of students were the same, and their reading comprehensiveness was the same as grade B, their scoring patterns on reading varied greatly, and none of them was the same. At the same time, the mastery of the five attributes involved in reading was completely different. Clearly, there are qualitative differences in the knowledge and cognitive structures of students with the same total and composite scores. In traditional tests, these qualitative differences among individual students are often obscured by a total score.

Table 7.

The individual properties of the student are in control

Student code	Answer pattern	Total	Read the properties
Student code	Answer pattern	Total	A1	A2	A3	A4	A5
1	1111111 0 1 0 0 0 0 1111 0 0 0 111 0 0 0 0 0 0 0	20	1	0	1	0	1
2	1 0 11 0 0 0 11111 0 1111 0 1 0 11 0 0 0 0 0 0 11	20	1	0	0	1	1
3	111111 0 111 0 1 0 1 0 0 1 0 0 111 0 0 0 00000	20	0	1	1	0	1
4	111 0 0 111 0 11 0 11 0 0 1 0 1111 0 0 0 0 0 0 11	20	1	1	1	0	1
5	0 0 0 0 11111111 0 0 111111 0 1 0 0 0 0 0 0 0 0	20	1	0	1	0	0
6	1 0 1 0 11 0 0 1111 0 11 0 1 0 1 0 11 0 0 1 00000	20	0	1	1	0	0
7	111 0 0 11111 0 0 0 111 0 11 0 11 0 0 0 0 0 1 0 1	20	0	1	1	0	1
8	1111 0 1 0 1 0 1 0 1 0 1 0 11 0 1111 0 0 0 0 0 0 11	20	1	0	0	1	1
9	11111111110101010001100000 0 0 0 0	20	1	0	1	0	1
10	0 11 0 1111 0 111 0 11 0 0 0 1111 0 0 0 0 0 0 1 0	20	1	1	1	0	0

5.1.4

Relationships between reading attributes

1)

Correlation between attributes

The overall students’ mastery probabilities of the five attributes were correlated to explore whether there was a significant correlation between the attributes. The correlation analysis between the mastery probabilities of the five attributes is shown in Table 8 (** indicates that the correlation is significant at a confidence level (two-sided) of 0.01). As seen from the Spearman correlation test in the table, the correlation coefficient r between the five reading attributes ranges from (0.415, 0.852) (α=0.01), and there is a significant and highly positive correlation between the attributes, which further validates the correlation between the reading attributes. Among them, attribute A1 was most highly correlated with attribute A3 (r = 0.852), followed by attribute A1 and attribute A3 (r = 0.849). The above analysis shows that the reading attributes are not independent of each other, but rather interconnected and dependent upon each other. One of the attributes may be more closely linked to another attribute, so that the mastery of one attribute may also predict the mastery of another attribute and vice versa.

Table 8.

Correlation analysis of the probability between the five species

			A1	A2	A3	A4	A5
Spearman ‘s rho	A1	Pearson correlation	1	0.475**	0.852**	0.471**	0.812**
		Sig.(2-tailed)		0.000	0.000	0.000	0.000
		N	2668	2668	2668	2668	2668
	A2	Pearson correlation	0.492**	1	0.415**	0.753**	0.772**
		Sig.(2-tailed)	0.000		0.000	0.000	0.000
		N	2668	2668	2668	2668	2668
	A3	Pearson correlation	0.849**	0.418**	1	0.512**	0.675**
		Sig.(2-tailed)	0.000	0.000		0.000	0.000
		N	2668	2668	2668	2668	2668
	A4	Pearson correlation	0.472**	0.755**	0.522**	1	0.845**
		Sig.(2-tailed)	0.000	0.000	0.000		0.000
		2668	2668	2668	2668	2668	2668
	A5	Pearson correlation	0.821**	0.775**	0.674**	0.845**	1
		Sig.(2-tailed)	0.000	0.000	0.000	0.000
		N	2668	2668	2668	2668	2668

2)

Joint effects of attributes

The probability of answering the test questions correctly in the joint mode of each attribute is shown in Table 9. In the table, in test question 19, the probability of answering the question correctly for the interaction attribute of mastering attribute A2 × attribute A3 is 0.9024, which is much higher than the probability of answering the question correctly for mastering only a single attribute (0.5624 and 0.7809), which indicates that there exists a high degree of dependence and complementarity between the two attributes of the question. This shows that there is no exclusionary relationship between the individual reading attributes, but rather they are interrelated and interdependent. At the same time, the probability of answering the question correctly is not a simple addition of the two probabilities when mastering attributes A2 and A3 respectively, but also includes the interaction between the two.

Table 9.

The questions in the combination mode of each attribute are the probability

Issue number	Measured property	Attribute union pattern	Answer probability
1	A2-A3	A00	0.4161
2	A2-A3	A10	0.5624
3	A2-A3	A01	0.7809
4	A2-A3	A11	0.9024
5	A1-A2-A3	A000	0.5602
6	A1-A2-A3	A100	0.6143
7	A1-A2-A3	A001	0.4364
8	A1-A2-A3	A010	0.5479
9	A1-A2-A3	A101	0.598
10	A1-A2-A3	A110	0.7444

5.2

Analysis of the Effectiveness of English Reading Teaching

5.2.1

Experimental background

1)

Subjects’ Reading Achievement

In this section, we continue to use students in a high school as a sample for the study, in order to gain a deeper understanding of the students’ cognitive level of “reading in English”, and to analyze in detail the cognitive structure of the students who participated in the test in terms of “reading in English”. Before the beginning of the experiment, the study chooses the English reading scores of Class A (experimental class) and Class B (control class) in the first semester of the school in 2022-2023 (total score of 30 points) and the usual scores (total score of 70 points) as the midterm reading scores, with a total score of 100 points, and the descriptive tables of midterm reading scores of Classes A and B are as shown in Table 10 and Table 11. The table shows that the average English midterm score of class A is 68.7. The standard deviation of the scores is 18.53, and the average score of English midterm examination of class B is 67.81, and the standard deviation of the scores is 19.43. From the observation of the visual data, it can be concluded that the average scores of the two parallel classes are similar, and the standard deviation of the scores is similar, which is comparable. Moreover, the results of the midterm examination of the whole grade show that the level of both classes is in the middle to lower level.

Table 10.

Reading performance description table in class A

Grade 1
N	In effect	48
	omission	1
Average		68.695654173813952
Median		73.600000000000000
Mode Number		85.000000000000000
Standard Deviation		18.527010258112589

Table 11.

Reading performance description table in class B

Grade 2
N	In effect	42
	omission	0
Average		67.81
Median		75.00
Mode Number		82
Standard Deviation		19.428

Since Class A and Class B are two independent individuals when comparing their reading performance horizontally, the independent samples t-test should be used to determine whether there is a significant difference between the performance of the two classes in terms of statistical significance, and the results of the independent samples t-test for Class A and Class B are shown in Table 12. From the test results of the table, the observed value of 0.382 by F-test and the probability value of 0.529 (>0.05) indicate that there is no significant difference between Class A and Class B in terms of their performance in English reading achievement.

Table 12.

The independent sample t test of class A and class B

		Levene’s variance test		For the average t test for the average
		F	Significance	T	df	Significance (double tail)
Grade	The number of equal variations is adopted	0.382	0.529	0.201	85	0.833
Grade	No equal variation			0.201	83.261	0.833
		For the average t test for the average
					Confidence interval of 95% difference
		Mean difference		Standard error	Lower limit	Upper limit
Grade	The number of equal variations is adopted	0.8315		3.9952	-7.1132	8.7654
Grade	No equal variation	0.8315		3.9949	-7.1152	8.7674

2)

Subjects’ reading strategy mastery

The next item is to test the results of the questionnaire on reading strategy mastery in the two classes, which contains 20 questions, of which each cognitive strategy corresponds to 1 question. The questionnaire was distributed through the questionnaire star, 50 copies were distributed in class A, 50 copies were recovered, 50 points were valid, the recovery rate of the questionnaire was 100%, and the validity rate of the questionnaire was 100%. Based on the results of the survey, the mean value of the score for each question in each class was calculated, and the results of the questionnaire on the mastery of reading strategies in class A at midterm are shown in Figure 9. From the figure, it can be seen that the three best-performing strategies in Class A are cultural knowledge background, topic sentence, and pre-reading metacognitive strategies, and the three worst-performing strategies are mind mapping, categorization, and selective labeling.

The results of the questionnaire on the mastery of reading strategies in Class B at midterm are shown in Figure 10. From the figure, it can be seen that the three best-performing strategies of class B students are cultural knowledge background, topic sentence, and pre-reading metacognitive strategy, and the worst-performing ones are mind mapping, classification, selective marking, and visualization. It can be concluded that the students in Class A and Class B are similar in the variability of reading strategy mastery, and the worst and best reading strategies of the two classes are basically the same.

5.2.2

Analysis of experimental effects

In order to verify that the method of this paper can effectively improve students’ reading levels, the researcher conducted a questionnaire survey to assess students’ mastery of reading strategies and two dimensions of English reading scores.

The results of the questionnaire survey on the mastery of reading strategies using the method of this paper and the traditional method are shown in Figures 11 and 12. Comparison with the pre-experimental results can be visualized that the overall level of both groups improved after the experiment, but the experimental group improved more obviously, and most of the scores of the traditional group were below 4.0. The before and after of the experimental group showed significant differences, while the before and after of the traditional group yielded very similar results. This indicates that from the visual descriptive statistics there is a significant difference between the experimental group and the control group after the experiment, and that the experimental group’s level of mastery of reading strategies increased more than that of the control group.

5.2.3

Cross-comparison analysis of reading scores

In order to more scientifically verify the effect of this paper’s method on reading achievement, this study used cross-comparison analysis to compare the experimental group’s midterm and final reading scores, as well as the experimental group’s final reading scores with those of the control group. The comparison of the experimental group’s midterm and final reading scores can prove whether the experimental group learning with this paper’s method effectively improves the reading scores, while the comparison of the experimental group’s final reading scores with the control group can filter out the influence of other variables besides this paper’s method, thus proving that this paper’s method can effectively improve students’ reading scores. The descriptive statistical trilinear table of the final reading achievement of the experiment using the method of this paper and the traditional teaching method is shown in Table 13 and Table 14. The table shows that the mean scores of the final reading scores of the experimental and control classes were 74.88 and 69.25 with standard deviations of 12.322 and 19.527 respectively. A comparison of the size of the numbers leads to the conclusion that the experimental class taught using the methodology of this paper had higher final reading scores than the control class that was taught according to the traditional model, and that there was even less disparity within the class.

Table 13.

Final reading performance description statistics

Experimental group
N	In effect	47
	Omission	1
Average		74.88
Median		79.05
Mode number		82
Standard deviation		12.322

Table 14.

Final reading performance description statistics

Control group
N	In effect	42
	Omission	0
Average		69.25
Median		73.00
Mode Number		70
Standard Deviation		19.527

6

Conclusion

With the development of education informatization, more and more science and technology are used in teaching, and this paper discusses the auxiliary role of artificial intelligence technology in English reading teaching.

By analyzing the students’ overall attribute mastery probability, it is found that the students’ overall attribute mastery difficulty is in the order from difficult to easy as A4 (articulation and generalization) > A5 (reasoning) > A2 (complex sentences and structures) > A3 (explicit detail information) > A1 (words and phrases), and most of the students have mastered the attributes of A1, A2, A3 and A4 better, and the probability of mastery is in the range between (0.5,0.8) .

In addition, the experiment used cross-comparison analysis to exclude the influence of other variables on this experiment except for the ‘method of this method’, and the average final reading score of the experimental group and the control group was 74, respectively. With a score of 88 and 69.25, it can be concluded that the final reading score of the experimental group taught by the method was higher than that of the control group taught according to the traditional model, and the gap within the class was smaller. It has been accurately proved that the difference between the experimental and control classes after the experiment is caused by differences in teaching methods.

Funding:

Project 1

Year: 2025

Granting Organization: Science Research Fund Project of Yunnan Provincial Department of Education

Title: Research on the Path of Local Colleges and Universities’ Language and Culture Service System Construction to Boost Rural Revitalization

Number: 2025J1013

Project 2

Year: 2024

Granting Organization: Industry-University Collaborative Education Program of the Ministry of Education

Title: Research on the Path of Integrating Ideological and Political Education into the Teaching of English Reading Courses in Colleges and Universities

Number: 2412053427

Project 3

Year: 2024

Granting Organization: Supply-Demand Docking Employment and Education Program of the Ministry of Education

Title: Research on Innovative Strategies for the Cultivation Model of Targeted English Talents in Universities under the Background of Digital Transformation

Number: 2024122570446

Langue:: Anglais

Périodicité:: 1 fois par an
Sujets de la revue:: Sciences de la vie, Sciences de la vie, autres, Mathématiques, Mathématiques appliquées, Mathématiques générales, Physique, Physique, autres

RSS Feed de la revue

An Exploration of Artificial Intelligence Assisted Strategies in English Reading Teaching

Qijun Zhao

Publié en ligne: 21 mars 2025

Reçu: 10 oct. 2024

Accepté: 05 févr. 2025

DOI: https://doi.org/10.2478/amns-2025-0632

Mots clésIRT, Knowledge Tracking Model, Dynamic Cognitive Diagnostic Model, English Reading Teaching

© 2025 Qijun Zhao, published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Mots clés
IRT, Knowledge Tracking Model, Dynamic Cognitive Diagnostic Model, English Reading Teaching