The Role of Data-Driven Instructional Models in the Development of Students’ Critical Thinking Skills in Core Literacy Education

Core literacy refers to a set of comprehensive abilities and literacies that individuals need to possess when facing complex problems and situations, and it is a basic ability and quality that can support individuals’ learning, work and life in different fields [1-2]. Core literacies include, but are not limited to, critical thinking, innovative thinking, cooperation, communication skills, information literacy, cultural literacy, emotional literacy, etc. These literacies not only help individuals solve problems and cope with challenges but also promote their personal growth and social development [3-5]. The appropriate teaching mode is the key to developing students’ core literacy. However, the current traditional teaching mode is not effective in developing students’ core literacy due to insufficient pre-course guidance and evaluation feedback from teachers, resulting in poor pre-course preview quality, in-class interaction level and higher-order thinking development [6-9]. In addition, the existing empirical studies related to the development of students’ core literacy are mostly result-oriented, lacking the support of data-driven system design and process dynamic data, and relying only on traditional teaching analytics tools. They cannot effectively monitor and reproduce the offline learning process of students, making it difficult to comprehensively and realistically measure the effectiveness of students’ learning [10-14]. Therefore, using learning analytics technology as an entry point to establish a data-driven teaching model oriented to core literacy can exercise students’ higher-order thinking skills and develop their core literacy [15-18]. For critical thinking, the data-driven teaching model promotes communication and interaction in the learning process through purposeful and organized judgment and dialogue, in which interpretation, analysis, evaluation and reasoning are generated, which provides an exercise field for developing critical thinking [19-21].

Based on the socio-spatial theory and the use and satisfaction theory, this study establishes a theoretical framework for the environment of the technical data-driven teaching model consisting of physical space, activity space and psychological space and analyzes its impact on the development of students’ critical thinking ability in core literacy education. The elements of physical space, activity space and psychological space in the teaching model were taken as explanatory variables, and the development of critical thinking ability as explanatory variables, respectively, and relevant questionnaires were designed and distributed in a school for data collection. Finally, descriptive statistical analysis, correlation analysis and regression analysis were performed on the data, respectively, to explore the practical significance of the data-driven teaching mode in promoting the development of students’ critical thinking ability.

2

Research design

2.1

Modeling of theoretical assumptions

2.1.1

Three-dimensional elements of a data-driven instructional model

Through the qualitative analysis of classroom observation data and student interview data, the key elements and their characteristics that support the development of higher-order thinking in high school language are analyzed from the real classroom context, and the theoretical model of the data-driven teaching mode to support the development of student’s critical thinking ability is constructed as shown in Fig. 1, combining with the previous understanding of the structure and function of the technology-enriched classroom environment.

The data-driven teaching model acts on the process of students’ critical thinking ability development with three dimensions of rich physical space, rich activity space and rich psychological space, and supports students’ critical thinking ability development. The theoretical model is explained below in terms of the interrelationships between three-dimensional spatial elements and their role in the development of higher-order thinking in high school language.

2.1.2

Research hypotheses

Hypothesis 1: The data-driven instructional model activity space element significantly promotes the development of student’s critical thinking skills in core literacy education.

Hypothesis 2: The physical space element of the data-driven instructional model significantly promotes the development of students’ critical thinking skills in core literacy education.

Hypothesis 3: The data-driven instructional model mental space element significantly promotes the development of student’s critical thinking skills in core literacy education.

Hypothesis 4: The data-driven instructional model shows a correlation with the development of students’ critical thinking skills.

2.2

Data collection

2.2.1

Objects of study

In the pre-survey phase of this study, one prestigious high school, one high-quality high school, and one average high school were selected, and there was a large gap between the student population of the three high schools. The pre-survey included three grades of students, and one class was randomly selected for each grade. The “pre-survey” used on-site organization of the questionnaire mode. The two surveys issued 320 questionnaires and recovered 300 valid questionnaires, with a recovery rate of 93.8%.Among them, 135 were girls and 165 were boys.

2.2.2

Scale design

1)

Measurement Scale of Three Elements of Data-Driven Instruction

The connotation and characteristics of the key elements of the three-dimensional degree space of the data-driven teaching model have been portrayed in depth and detail in the previous section, according to which the observational indexes of the key elements of the three-dimensional degree space can be determined, as shown in Table 1. The internal consistency coefficient of the whole scale is 0.791.

2)

Critical Thinking Skills Scale for Students

The California Critical Thinking Disposition Scale, or the Scale for short, has questionnaire questions derived from the personality traits proposed by the U.S. Delphi Item Group regarding critical thinkers. Corresponding attitudes are categorized into six levels, from strongly agree to strongly disagree, and subjects choose the option that best meets their attitudes among the six levels based on sentence descriptions. The scale has a total of questions, including seven dimensions: curiosity, openness, systematicity, truth-seeking, self-confidence, and maturity and the number of questions for each dimension is maintained at the question, with an overall reliability of 0.93 [22]. The Cologne Bach coefficients of the dimensions in the sample test results ranged from about 0.51 to 0.83, the Cologne Bach reliability coefficient of the whole scale was 0.88, and the internal consistency coefficient of the total scale in this study was 0.812.

Table 1.

Data drive teaching mode key elements indicator

Observation dimension	Observation index
Physical space Active space	Multimedia resources
	Cognitive construction tool
	Ac evaluation tool
Observation dimension Physical space Active space	Learning task
	Learning support
	Learning community
	Learning evaluation
Observation dimension	Incentive will
Observation dimension	Atmosphere experience

2.3

Data processing methods

2.3.1

Multiple regression models

1)

Computational modeling

The multiple linear regression model is an extension of the univariate linear regression model, and its basic principles are similar to those of the univariate linear regression model, except that it is computationally more complex. A multiple linear regression model is an equation that describes how the dependent variable y depends on the independent variable x₁,x₂,…….x_k and the error term ε, and its general form is: (1) $y = b_{0} + b_{1} x_{1} + b_{2} x_{2} + ... + b_{k} x_{k} + ε$ \[y={{b}_{0}}+{{b}_{1}}{{x}_{1}}+{{b}_{2}}{{x}_{2}}+...+{{b}_{k}}{{x}_{k}}+\varepsilon \]

Where: b₀ is a constant term that represents the estimated value of y when all the respective variables are zero; b₁,b₂,⋯,b_k is the sample partial regression coefficient, or known as the partial regression coefficient of y corresponding to x₁,x₂,……x_k, which represents the average change in y for each unit change in x₁,x₂,……x_k when the other independent variables are unchanged; ε is known as the random variable of the error term, which is the variability that is contained inside the y but cannot be explained by the linear relationship of the k independent variables [23].

When the original information establishes the multivariate regression model, in order to ensure that the regression model has excellent explanatory ability and predictive effect, the theoretical conditions that should be satisfied are: (1)

Linearity: the relationship between the dependent variable and the independent variable is linear. That is to say, the independent variable has a significant effect on the dependent variable and has a close linear correlation.

(2)

Independence, the random error term is independent across sample points with no autocorrelation. We have a random sample {(X_i1,X_i2,⋯,X_ik,Y_i):i=1,2,⋯,n} containing n observations. This ensures that the error ε is itself random, i.e., free of autocorrelation, Cov(ε_i–E(ε_i))(ε_j–E(ε_j))=0.

(3)

Mutual exclusivity: the independent variables should be mutually exclusive. That is, the degree of correlation between the independent variables should not be higher than the degree of correlation between the independent variables and the cause of the dependent variable.

(4)

Completeness: the independent variables should be complete statistics, and their predictive values should be easy to determine.

(5)

Normality, the random error term obeys a normal distribution with zero mean and variance.

(6)

Variance uniformity: the random error term has equal variance at different sample points.

Estimation of biased regression coefficients:

One of the purposes of regression analysis is to develop regression equations that enable researchers to predict the value of the dependent variable based on the known independent variables. The estimation of the partial regression coefficients for a multiple regression model, like the same linear regression equation, is also done by solving the parameters by the least squares method, provided that the sum of squared errors (∑e²) is required to be minimized. Taking the bilinear regression model as an example, the standard set of equations for solving the regression parameters is: (2) $y = b_{0} + b_{1} x_{1} + b_{2} x_{2}, (k = 2)$ \[y={{b}_{0}}+{{b}_{1}}{{x}_{1}}+{{b}_{2}}{{x}_{2}},(k=2)\] (3) $B = [\begin{matrix} b_{0} \\ b_{1} \\ b_{2} \end{matrix}], X = (\begin{matrix} 1 & x_{11} & x_{12} \\ ... & ... & ... \\ 1 & x_{n 1} & x_{n 2} \end{matrix})$ \[B=\left[ \begin{matrix} {{b}_{0}} \\ {{b}_{1}} \\ {{b}_{2}} \\ \end{matrix} \right],X=\left( \begin{matrix} 1 & {{x}_{11}} & {{x}_{12}} \\ ... & ... & ... \\ 1 & {{x}_{n1}} & {{x}_{n2}} \\ \end{matrix} \right)\]

The formula to derive b₀,b₁b₂ is as follows: (4) ${\begin{array}{l} \sum y = n b_{0} + b_{1} \sum x_{1} + b_{2} \sum x_{2} \\ \sum x_{1} y = b_{0} \sum x_{1} + b_{1} \sum x_{1}^{2} + b_{2} \sum x_{1} x_{2} \\ \sum x_{2} y = b_{0} \sum x_{2} + b_{1} \sum x_{1} x_{2} + b_{2} \sum x_{2}^{2} \end{array}$ \[\left\{ \begin{array}{*{35}{l}} \sum{y}=n{{b}_{0}}+{{b}_{1}}\sum{{{x}_{1}}}+{{b}_{2}}\sum{{{x}_{2}}} \\ \sum{{{x}_{1}}}y={{b}_{0}}\sum{{{x}_{1}}}+{{b}_{1}}\sum{x_{1}^{2}}+{{b}_{2}}\sum{{{x}_{1}}}{{x}_{2}} \\ \sum{{{x}_{2}}}y={{b}_{0}}\sum{{{x}_{2}}}+{{b}_{1}}\sum{{{x}_{1}}}{{x}_{2}}+{{b}_{2}}\sum{x_{2}^{2}} \\ \end{array} \right.\]

Solve this equation to find the value of b₀,b₁,b₂. The following matrix method can also be used to find b=(x′x)^–1·x′y i.e: (5) $[\begin{matrix} b_{0} \\ b_{1} \\ b_{2} \end{matrix}] = {[\begin{matrix} n & \sum x_{1} & \sum x_{2} \\ \sum x_{1} & \sum x_{1}^{2} & \sum x_{1} x_{2} \\ \sum x_{2} & \sum x_{1} x_{2} & \sum x_{2}^{2} \end{matrix}]}^{- 1} [\begin{matrix} \sum y \\ \sum x_{1} y \\ \sum x_{2} y \end{matrix}]$ \[\left[ \begin{matrix} {{b}_{0}} \\ {{b}_{1}} \\ {{b}_{2}} \\ \end{matrix} \right]={{\left[ \begin{matrix} n & \sum{{{x}_{1}}} & \sum{{{x}_{2}}} \\ \sum{{{x}_{1}}} & \sum{x_{1}^{2}} & \sum{{{x}_{1}}}{{x}_{2}} \\ \sum{{{x}_{2}}} & \sum{{{x}_{1}}}{{x}_{2}} & \sum{x_{2}^{2}} \\ \end{matrix} \right]}^{-1}}\left[ \begin{matrix} \sum{y} \\ \sum{{{x}_{1}}}y \\ \sum{{{x}_{2}}}y \\ \end{matrix} \right]\]

2) Tests of the model

Hypothesis testing of multiple regression model includes goodness-of-fit test, significance test of multiple regression equation as a whole with hypothesis test of partial regression coefficients.

A goodness-of-fit test is a test of the goodness of fit of a model to a sample of observations, which can be accomplished by constructing a statistic that expresses the degree of fit. The overall sum of squares TSS=∑(Y–Ȳ)², the regression sum of squares ESS=∑(Ŷ–Ȳ)², the residual sum of squares RSS=∑(Y–Ŷ)², Ȳ is the sample value and Ŷ is the estimated value. The relationship between them is TSS = ESS + RSS.

Goodness-of-fit test statistic decidable coefficient r²: (6) $r^{2} = \frac{E S S}{T S S} = 1 - \frac{R S S}{T S S}$ \[{{r}^{2}}=\frac{ESS}{TSS}=1-\frac{RSS}{TSS}\]

The closer r² is to 1, the better the model fits.

The test of significance of the equation aims to make an inference about whether the linear relationship between the explanatory variables and the explanatory variables in the model holds significantly in the aggregate.

The original hypothesis b₁=0,b₂,⋯,b_k = 0 and the alternative hypothesis b_i are not all 0. According to the definition of mathematical statistics, under the condition that the original hypothesis holds, the statistics are as follows: (7) $F = \frac{E S S / k}{R S S / (n - k - 1)}$ \[F=\frac{ESS\;/\;k}{RSS\;/\;(n-k-1)}\]

Obey the F distribution with degree of freedom (k,n–k–1). Given the significance level α, the critical value Fα(k,n–k–1), the value of the statistic F is derived from the sample, and the original hypothesis can be rejected or accepted by either F>Fα(k,n–k–1) or F≤Fα(k,n–k–1) to determine whether the overall linear relationship of the original equation holds significantly.

Hypothesis test for biased regression coefficients: the t test. In such a test, we need to make a statistically significant (i.e., with a certain level of confidence) test of whether a given (overall) parameter in the model satisfies the dummy original hypothesis b_i = a_i, where a_i is some given known number. In particular, when a_i=0, it is called a significance test for the parameter. If the original hypothesis is rejected, it means that the explanatory variable x_i has a significant linear effect on the explanatory variable Y, and the estimate ${\hat{b}}_{i}$ \[{{\hat{b}}_{i}}\] will dare to be used. Conversely, it means that the explanatory variable x_i does not have a significant linear effect on the explanatory variable Y and the estimate ${\hat{b}}_{i}$ \[{{\hat{b}}_{i}}\] would not be meaningful to us.

Since the model parameter b_i obeys the following normal distribution: ${\hat{b}}_{i} ~ N (b_{i}, σ^{2} c_{i i})$ ${{\hat{b}}_{i}}\tilde{\ }N({{b}_{i}},{{\sigma }^{2}}{{c}_{ii}})$ , where c_i denotes the ith element on the main diagonal of the matrix (X′X)−1 and σ² is the variance of the random error term, it is replaced by its estimator in the actual calculation: (8) ${\hat{σ}}^{2} = \frac{\sum ε_{i}^{2}}{n - k - 1} = \frac{ε^{'} ε}{n - k - 1}$ \[{{\hat{\sigma }}^{2}}=\frac{\sum{\varepsilon _{i}^{2}}}{n-k-1}=\frac{{\varepsilon }'\varepsilon }{n-k-1}\]

A statistic can be constructed as follows: (9) $t = \frac{{\hat{b}}_{i} - b_{i}}{S_{i}} = \frac{{\hat{b}}_{i} - b_{i}}{\sqrt{c_{i i} \frac{ε^{'} ε}{n - k - 1}}}$ \[t=\frac{{{{\hat{b}}}_{i}}-{{b}_{i}}}{{{S}_{i}}}=\frac{{{{\hat{b}}}_{i}}-{{b}_{i}}}{\sqrt{{{c}_{ii}}\frac{{{\varepsilon }^{\prime }}\varepsilon }{n-k-1}}}\]

Given the significance level α, the critical value t_a/2(n–k–1) is obtained, whereupon the original hypothesis can be rejected or accepted based on: |t|>t_a/2(n–k–1) or |t|≤t_a/2(n–k–1), thus determining whether the corresponding explanatory variables should be included in the model.

2.3.2

Correlation analysis

The principle of correlation analysis is the process of analyzing the degree of correlation between two or more variables and expressing the results of the calculations through appropriate indicators. Variable selection generally consists of two parts: rules and query mechanisms. By rules, we mean the rules for evaluating the joint correlation between variables in a set of variables [24]. The main purpose of the query mechanism is to delete or add variables to the set of variables. Evaluation rules, also known as evaluation criteria, play a decisive role in variable selection. Both parts and feature construction are inseparable from correlation. Correlation is a general concept used to describe the degree of data closeness between variables and the amount of mutual information contained, including asymmetric causal and driving relationships. The basic idea of feature selection based on correlation analysis is to select effective variables with higher correlation with the prediction target, i.e., feature variables contributing to the performance of the prediction model, by analyzing the correlation strength relationship between variables.

Commonly used correlation coefficients: 1)

Pearson’s correlation coefficient

Define the aggregate of the two-dimensional variable X,Y to be (X,Y)^T,(x₁,y₁)^T,(x₂,y₂)^T,⋯,(x_n,y_n)^T is an experimental sample of the variable X,Y, which is also an observation, to obtain the observation matrix M: (10) $M = {[\begin{matrix} x_{1} & x_{2} & \dots & x_{n} \\ y_{1} & y_{2} & \dots & y_{n} \end{matrix}]}^{T}$ \[M={{\left[ \begin{matrix} {{x}_{1}} & {{x}_{2}} & \cdots & {{x}_{n}} \\ {{y}_{1}} & {{y}_{2}} & \cdots & {{y}_{n}} \\ \end{matrix} \right]}^{T}}\]

The means of variable X and variable Y were calculated separately: (11) $\bar{x} = \frac{1}{n} \sum_{i = 1}^{n} x_{i}$ \[\bar{x}=\frac{1}{n}\sum\limits_{i=1}^{n}{{{x}_{i}}}\] (12) $\bar{y} = \frac{1}{n} \sum_{i = 1}^{n} y_{i}$ \[\bar{y}=\frac{1}{n}\sum\limits_{i=1}^{n}{{{y}_{i}}}\]

Equation (11) represents the mean of observed data for variable X and equation (12) is the mean of observed data for variable Y. The data variance is further calculated: (13) $S_{x x} = \frac{1}{n - 1} \sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}$ \[{{S}_{xx}}=\frac{1}{n-1}\sum\limits_{i=1}^{n}{{{({{x}_{i}}-\bar{x})}^{2}}}\] (14) $S_{y y} = \frac{1}{n - 1} \sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}$ \[{{S}_{yy}}=\frac{1}{n-1}\sum\limits_{i=1}^{n}{{{({{y}_{i}}-\bar{y})}^{2}}}\] (15) $S_{x y} = \frac{1}{n - 1} \sum_{i = 1}^{n} (x_{i} - \bar{x}) (y_{i} - \bar{y})$ \[{{S}_{xy}}=\frac{1}{n-1}\sum\limits_{i=1}^{n}{({{x}_{i}}-\bar{x})}({{y}_{i}}-\bar{y})\]

Equations (13), (14), and (15) compute the covariance of the observations for variable X, variable Y, and two-dimensional variable X,Y, respectively, defining S as the covariance matrix of the observations, then: (16) $S = [\begin{matrix} S_{x x} & S_{x y} \\ S_{y x} & S_{y y} \end{matrix}]$ \[S=\left[ \begin{matrix} {{S}_{xx}} & {{S}_{xy}} \\ {{S}_{yx}} & {{S}_{yy}} \\ \end{matrix} \right]\]

In matrix S, there is S_xy=S_x, so S The diagonal elements are equal and it is a symmetric matrix. Moreover, according to Schwarz inequality, there is: (17) $S_{x y}^{2} \leq S_{x x} S_{y y}$ \[S_{xy}^{2}\le {{S}_{xx}}{{S}_{yy}}\]

Therefore the matrix S is a positive definite matrix. Define the correlation coefficient of the observations of X,Y is given by: (18) $r_{x y} = \frac{S_{x y}}{\sqrt{S_{x x}} \sqrt{S_{y y}}}$ \[{{r}_{xy}}=\frac{{{S}_{xy}}}{\sqrt{{{S}_{xx}}}\sqrt{{{S}_{yy}}}}\]

|r_xy|≤1 holds according to Schwarz’s inequality. r_xy is used to measure the degree of linear correlation of the variable X,Y. Let the overall two-dimensional variable X,Y be (X,Y)^T, then the correlation coefficient of (X,Y)^T is calculated as: (19) $r_{X Y} = \frac{C o v (X, Y)}{\sqrt{V a r (X)} \sqrt{V a r (Y)}}$ \[{{r}_{XY}}=\frac{Cov(X,Y)}{\sqrt{Var(X)}\sqrt{Var(Y)}}\]

Where Var(X),Var(Y) is the variance of variable X and variable Y, respectively, and Cov(X,Y) is the covariance of the two-dimensional aggregate (X,Y)^T. The range of earson’s correlation coefficient is rϵ[–1,1], and the absolute value of Pearson’s coefficient is usually used to determine the strength of correlation between two variables, the rules are as follows: r>0 means positive correlation, r<0 means negative correlation; |r|=0 means there is no linear relationship; and |r|=1 means complete linear correlation.

In short, the absolute value of the Pearson coefficient is close to 1, which proves that the correlation between the two variables is stronger; on the contrary, the smaller the absolute value is, the weaker the correlation between the variables is, and the stronger the independence is. Meanwhile, it should be noted that the use of Pearson’s correlation coefficient needs to follow the following principles.

(1)

Pearson’s correlation coefficient is only applicable to linear correlation data analysis.

(2)

The calculation of the Pearson coefficient will have a greater impact when the observed data of the variables have extreme values.

(3)

Using the Pearson correlation coefficient, it is required that the two-dimensional overall (X,Y)^T being tested obeys a bivariate normal distribution.

2) Distance correlation coefficient

The formula for the distance correlation coefficient for the two variables X and Y is as follows: (20) $d C o r (X, Y) = {(\frac{d C o v (X, Y)}{\sqrt{d C o v (X, X) d C o v (Y, Y)}})}^{\frac{1}{2}}$ \[dCor\left( X,Y \right)={{\left( \frac{dCov(X,Y)}{\sqrt{dCov(X,X)dCov(Y,Y)}} \right)}^{\frac{1}{2}}}\]

Where the covariance dCor(X,Y) is defined as: (21) $d C o v (X, Y) = \frac{1}{n^{2}} \sum_{k, l = 1}^{n} X_{k l} Y_{k l}$ \[dCov(X,Y)=\frac{1}{{{n}^{2}}}\sum\limits_{k,l=1}^{n}{{{X}_{kl}}}{{Y}_{kl}}\]

The relevant term in the above equation is defined as: (22) $X_{k l} =_{k l} - {\bar{x}}_{k \cdot} - {\bar{x}}_{\cdot l} + \bar{x}$ \[{{X}_{kl}}{{=}_{kl}}-{{\bar{x}}_{k\cdot }}-{{\bar{x}}_{\cdot l}}+\bar{x}\] (23) ${\bar{x}}_{k •} = \frac{1}{n} \sum_{i = 1}^{n} x_{k l} {\bar{x}}_{\cdot l} = \frac{1}{n} \sum_{k = 1}^{n} x_{k l} \bar{x} = \frac{1}{n^{2}} \sum_{k, l = 1}^{n} x_{k l}$ \[{{\bar{x}}_{k\bullet }}=\frac{1}{n}\sum\limits_{i=1}^{n}{{{x}_{kl}}}\quad {{\bar{x}}_{\cdot l}}=\frac{1}{n}\sum\limits_{k=1}^{n}{{{x}_{kl}}}\quad \bar{x}=\frac{1}{{{n}^{2}}}\sum\limits_{k,l=1}^{n}{{{x}_{kl}}}\]

The term of interest for the random variable Y is defined above. The distance correlation coefficient is an improved method of Pearson’s correlation coefficient. When Pearson’s correlation coefficient is 0, we can not conclude that the two variables are independent because Pearson can not measure the nonlinear relationship, but if the distance correlation coefficient is 0, it can be confirmed that the two variables are independent of each other.

3) Information Entropy and Mutual Information

The concept of information entropy refers to the average amount of information in a message after eliminating redundancy, which is used to express the quantitative performance of the message. For a set of random variables X containing discrete values, the information entropy represents the uncertainty associated with the information source X, and represents the overall characteristics of the information in terms of its average significance. Assuming X={x₁,x₂,⋯,x_n} is the set of all events and p(x) is the probability of a given x, the information entropy of X is defined as: (24) $H (X) = E_{x} (I (x)) = - \sum_{i = 1}^{n} p (x_{i}) \log p (x_{i})$ \[H(X)={{E}_{x}}(I(x))=-\sum\limits_{i=1}^{n}{p}({{x}_{i}})\log p({{x}_{i}})\]

Where H(X) is the information entropy of X, I(x) stands for self-information or informativeness and denotes the entropy contributed by a single piece of information, and E is the expectation function.

For random variables x and y containing finite values, if x and y are signals from two sources that are not independent of each other, and their joint distribution is set to be p(x,y) and their marginal distributions to be p(x) and p(y), then the joint information entropy of x and y is: (25) $H (X, Y) = - \sum_{x, y} p (x, y) \log p (x, y)$ \[H(X,Y)=-\sum\limits_{x,y}{p}(x,y)\log p(x,y)\]

Among them: (26) $p (x, y) = p (X = x, Y = y) p (x) = p (X = x) p (y) = p (Y = y)$ \[p(x,y)=p(X=x,Y=y)p(x)=p(X=x)p(y)=p(Y=y)\]

The conditional entropy of X in the Y condition is then expressed as: (27) $\begin{array}{l} H (X | Y) = - \sum_{y} p (y) H (X | Y = y) \\ = - \sum_{x} \sum_{y} p (y) p (x | y) \log p (x | y) \\ = - \sum_{x, y} p (x, y) \log p (x | y) \end{array}$ \[\begin{align} & H(X|Y)=-\sum\limits_{y}{p}\left( y \right)H(X|Y=y) \\ & =-\sum\limits_{x}{\sum\limits_{y}{p}}(y)p(x|y)\log p(x|y) \\ & =-\sum\limits_{x,y}{p}(x,y)\log p(x|y) \\ \end{align}\]

Mutual information in information theory denotes the interdependent inclusion of two variables X and Y, i.e., the amount of information that Y contains about. For two discrete random variables X and Y, the mutual information I(X;Y) between them is defined as: (28) $I (X; Y) = E_{x, y} (S I (x, y)) = \sum_{x, y} p (x, y) \log \frac{p (x, y)}{p (x) p (y)}$ \[I(X;Y)={{E}_{x,y}}(SI(x,y))=\sum\limits_{x,y}{p}(x,y)\log \frac{p(x,y)}{p(x)p(y)}\]

Where p(x,y) denotes the joint probability distribution of X and Y, and SI(x,y) is the mutual information between the sample points emanating from X and Y. For two continuous random variables X and Y, I(X;Y) is defined as: (29) $I (X; Y) = \iint p (x, y) \log \frac{p (x, y)}{p (x) p (y)} d x d y$ \[I(X;Y)=\iint{p}(x,y)\log \frac{p(x,y)}{p(x)p(y)}dxdy\]

Mutual information can also be represented by information entropy with the following formula: (30) $\begin{matrix} I (X; Y) = H (X) - H (Y | X) \\ = H (Y) - H (Y | X) \\ = H (X) + H (Y) - H (X, Y) \end{matrix}$ \[\begin{align} & I(X;Y)=H(X)-H(Y|X) \\ & =H(Y)-H(Y|X) \\ & =H(X)+H(Y)-H(X,Y) \end{align}\]

3

Empirical results and analysis

3.1

Results of descriptive statistical analysis

3.1.1

Sample descriptive statistics

The overall profile of the study population is shown in Table 2. Among them, 131 students belonging to quality schools (43.67%), 55% of the total number of females, 51.67% of the total number of students from urban areas, and 198 students from arts and sciences (66%), which is in line with the thinking characteristics of the development of critical thinking skills.

Table 2.

Overall student statistics

Categories	Project	N	Percentage(%)
School	Elite school	78	26
	Quality	131	43.67
	General	91	30.33
Gender	Male	135	45
Gender	Female	165	55
Grade	A high	178	59.33
Grade	High two	122	40.67
Home	Countryside	145	48.33
Home	City	155	51.67
Disciplines	Liberal arts	198	66
Disciplines	Science	102	34

3.1.2

Proportion of critical thinking tendencies

The students’ CTDI-CV scores for each dimension and the percentage of critical thinking tendencies are shown in Table 3. The total score of students’ CTDI-CV is (293.14±30.6), which is positive critical thinking. The highest score in each dimension is curiosity, the lowest is self-confidence, and there are a few people who have positive and strong critical thinking in each dimension, in which the proportion of the number of people in the curiosity dimension (22.4%) is higher than the other dimensions.

Table 3.

The dimensions of each dimension and the percentage of critical thinking

Project	Score(score, $\bar{x} \pm ss$ \[\bar{x}\pm \text{s}\])	Each dimensional score distribution [n(%)]
Project	Score(score, $\bar{x} \pm ss$ \[\bar{x}\pm \text{s}\])	Negative thinking(≤30 score)	Ambiguity(31~39 score)	Positive thinking(≥40 score)	Positive thinking(≥50 score)
Seek the truth	43.25±6.09	32(18.2)	48(31.2)	78(49.3)	11(6.2)
Liberation thought	51.45±2.99	4(2.1)	37(24.2)	115(71.3)	11(6.2)
Analytical ability	51.24±1.13	4(2.1)	48(31.2)	97(62.1)	16(9.6)
systematization	48.26±2.56	11(6.3)	67(42.7)	78(49.3)	11(6.2)
self-confidence	43.52±5.72	22(12.2)	79(48.51)	61(38.3)	9(5.8)
Thirst for knowledge	52.23±5.28	6(2.3)	46(38.5)	82(51.3)	35(21.4)
Cognitive maturity	48.94±3.26	17(10.2)	51(30.2)	89(56.4)	13(7.8)
Total score	293.14±30.6	0(0.0)	67(42.8)	96(52.9)	4(2.1)

3.2

One-way analysis of variance

3.2.1

Differences in students’ critical thinking in terms of gender

In order to examine whether there is a significant difference between the student’s critical thinking in terms of gender, an independent samples t-test was conducted on the students’ total critical thinking scores and their sub-dimension scores, and the results of the t-test are shown in Table 4. The table shows that although the mean of the total critical thinking scores of male students (266.19) is higher than that of female students (264.28), it does not reach a significant difference (P=0.598>0.05), so there is no significant difference between the total scores of the student’s critical thinking in terms of gender but there is a significant difference between the sub-dimensions of truth-seeking, self-confidence, and cognitive maturity, where the scores of male students are significantly higher than those of female students in the sub-dimensions of truth-seeking and self-confidence. Aspect boys scored significantly higher than girls, while in the cognitive maturity sub-dimension, girls scored significantly higher than boys.

Table 4.

University critical thinking is different in gender

	Male(N=35)		Fmale(N=165)		T value	Sig
	M	SD	M	SD	T value	Sig
Seek the truth	37.81	6.836	36.13	4.751	2.084^*	0.041
Liberation thought	38.99	5.41	37.21	4.677	1.137	0.277
Analytical ability	38.93	6.011	39.18	5.777	0.522^*	0.639
systematization	35.75	4.863	36.02	4.712	.344	0.769
self-confidence	38.91	6.398	36.11	5.493	2.566^*	0.011
Thirst for knowledge	39.85	5.675	41.42	6.576	-1.349	0.16
Cognitive maturity	35.95	7.004	38.21	5.863	-2.441^*	0.011
Total score	266.19	35.21	264.28	27.212	-.431	.598

^*P≤0.05;^**P≤0.01;^***P≤0.001

3.2.2

Differences in students’ critical thinking in terms of specialization

Similarly, to examine whether there is a significant difference between students in terms of arts and sciences, we conducted an independent samples t-test. The differences in college students’ critical thinking in terms of majors are shown in Table 5, which shows that there is a significant difference between the students’ total scores of critical thinking in terms of majors (P=0.018<0.05). Also, there is a significant difference between the two subdimensions of truth-seeking and analytical ability, where the science students’ Critical Thinking is significantly higher than that of Arts students, and Science students’ scores in the two sub-dimensions of truth-seeking and emancipation are also significantly higher than that of Arts students.

Table 5.

Students’ critical thinking majors

	Science(N=102)		Liberal arts(N=198)		T value	Sig
	M	SD	M	SD	T value	Sig
Seek the truth	37.12	6.845	35.84	5.648	2.145^*	0.041
Liberation thought	38.36	5.879	37.64	5.267	1.137	0.077
Analytical ability	39.69	6.711	38.44	6.171	2.084^*	0.039
systematization	36.08	5.715	35.84	5.071	1.123	0.169
self-confidence	36.56	7.313	37.84	5.857	-.566	0.011
Thirst for knowledge	41.16	7.56	40.26	6.253	1.441	0.061
Cognitive maturity	38.12	7.878	37.27	6.185	1.349	0.076
Total score	267.09	25.421	263.13	32.534	1.898^*	0.018

^*P≤0.05;^**P≤0.01;^***P≤0.001

3.3

Correlation analysis

Pearson correlation analysis was carried out to correlate the total score of students’ critical thinking ability with the total score and each factor of the data-driven teaching mode, and the results of the analysis are shown in Table 6. From the table, it can be seen that there is a significant correlation between the dimensions of students’ data-driven teaching mode and their critical thinking ability on the total score and some factors. On each factor, there is a significant correlation of 0.01 between the desire for knowledge, systematization ability and all factors in the data-driven teaching model. Truth-seeking was significantly correlated with physical and mental space. However, open-mindedness, analytical ability, and self-confidence in critical thinking were not correlated with the data-driven instructional model. There is a significant positive correlation between the total score and all factors of critical thinking skills of high school students, which confirms hypothesis 4.

Table 6.

The analysis of critical thinking and data driving mode of students

Project	Open mind	Thirst for knowledge	Cognitive maturity	Systematization	Look for the truth	Analytical ability	Self-confidence	Total score
Active space	0.046	0.157^***	0.033	0.122^**	0.068	0.042	0.052	0.113^**
Physical space	0.017	0.176^***	-0.008	0.173^**	0.092^*	0.081	0.078	0.132^**
Psychological space	0.041	0.174^***	0.043	0.166^**	0.121^**	0.054	0.058	0.132^**
Total score	0.278	0.192^**	0.036	0.174^**	0.097^*	0.063	0.072	0.151^**

^*P≤0.05;^**P≤0.01;^***P≤0.001

3.4

Multiple regression analysis

In carrying out the linear regression analysis, it is necessary to check whether there is multivariate covariance between the variables, which is judged by observing the values of eigenvalues, conditional indices, and checking the values of tolerance and variance inflation factor, and the diagnostic results are shown in Table 7. According to the data in Table 7, it can be learned that the value of the variance inflation factor of all dimensions is less than 10, the franchised value TOL is far away from 1, the eigenvalues are greater than 0.01, and the condition index is less than 30. Therefore, there is no multivariate covariance between the predictor variables, and it can be analyzed by using multiple regression.

Table 7.

Colinear diagnosis

Dimension	Eigenvalue	Conditional index	Common linear statistics
Dimension	Eigenvalue	Conditional index	Authorized value	Coefficient of variance expansion
1	0.012	19.231	0.523	1.231
2	0.031	17.452	0.532	1.892
3	0.022	19.224	0.611	1.593

The normal distribution plot is shown in Figure 2, in which the residual data from the regression analysis are evenly distributed on or near the diagonal and do not deviate very significantly, thus proving that the regression standardized residuals satisfy the normal distribution.

The multiple regression analysis will be carried out by taking the activity space, physical space, and mental space elements as independent variables and the total score of students’ critical thinking skills in nuclear literacy education as dependent variables, and the results of the analysis are shown in Table 8. From the results of the regression analysis in the table, it can be seen that the fit of this linear regression is acceptable, R-squared = 0.296, and these four influences can together explain 29.6% of the variance in the total score of students’ development of critical thinking skills, and the R-squared is referred to as the coefficient of determination, which is used to measure the degree of fit between the regression straight line and the data. The F value is 53.052, p<0.001, which reaches the highly significant level, indicating that the model regression effect is better, and the B value of the three factors, activity space, physical space and psychological space factors, is greater than 0, and the significance is less than 0.05, which indicates that all three factors have a significant positive impact on the development of student’s critical thinking skills, which verifies the hypothesis 1, hypothesis 2, as well as the hypothesis 3 observation. Beta coefficients: The maximum value is in the activity space, followed by physical and mental spaces. Therefore, the magnitude of the influence of the three factors on the development of student’s critical thinking skills is activity space > physical space > psychological space. Entering into this regression model, the regression equation about the development of student’s critical thinking skills is students’ critical thinking skills = 1.088 + 0.182^*activity space + 0.145^*physical space + 0.121^*mental space.

Table 8.

Regression analysis

Model	Non-standard error factor		Standard coefficientBeta	T	significance
Model	B	Standard error	Standard coefficientBeta	T	significance
Constants	1.088	0.142		8.314	<0.001
Active space	0.182	0.041	0.232	5.232	<0.001
Physical space	0.145	0.038	0.193	5.088	<0.001
Psychological space	0.121	0.036	0.095	3.145	0.032
R²				0.296
F				53.052
P				<0.001
Dependent variables: the total score of critical thinking in students

4

Conclusion

In this paper, 300 students from three universities were selected as subjects to study the relationship between the influence of a data-driven teaching model on the development of students’s critical thinking skills, and the results found that the students’ CTDI-CV total score was (293.14±30.6), positive critical thinking, and the highest score of all dimensions was inquisitiveness, and the lowest score was self-confidence, with the proportion of the number of people in the inquisitiveness dimension (22.4%) being higher than the other dimensions. Secondly, the level of critical thinking ability of male and female students is comparable, and there is no significant difference. However, science students possess better critical thinking abilities than those of arts students.

According to the results of multiple regression analysis, the F-value is 53.052, p<0.001, which reaches a highly significant level. The model demonstrates a superior regression effect. At the same time, the B-value of the activity space, physical space, and mental space factors is greater than 0, and the significance is less than 0.05. This indicates that all three factors have a significant positive impact on the development of the student’s critical thinking ability. This suggests that the data-driven model can significantly promote the development of the student’s critical thinking skills.

Język:: Angielski

Częstotliwość wydawania:: 1 razy w roku
Dziedziny czasopisma:: Nauki biologiczne, Nauki biologiczne, inne, Matematyka, Matematyka stosowana, Matematyka ogólna, Fizyka, Fizyka, inne

Kanał RSS czasopisma

The Role of Data-Driven Instructional Models in the Development of Students’ Critical Thinking Skills in Core Literacy Education

Min Zhang

Data publikacji: 17 mar 2025

Otrzymano: 17 paź 2024

Przyjęty: 28 sty 2025

DOI: https://doi.org/10.2478/amns-2025-0177

Słowa kluczoweData-driven instruction, Critical thinking skills, Multiple regression, Correlation analysis

© 2025 Min Zhang, published by Sciendo

This work is licensed under the Creative Commons Attribution 4.0 International License.

Słowa kluczowe
Data-driven instruction, Critical thinking skills, Multiple regression, Correlation analysis