Quantitative assessment of brand promotion effect of agricultural products based on multiple regression analysis

As an important economic pillar, the agricultural products industry plays a key role in market competition. The importance of branding for agricultural products companies is becoming increasingly important as consumers are becoming more concerned about food safety and quality. Agricultural products companies need to focus on and improve their branding strategies to increase their market share, enhance brand recognition, and build strong relationships with consumers [1-3]. The agricultural products industry is characterized by increasing competitive intensity, including competitors within the same industry as well as substitutes from other food categories. In this competitive environment, branding has become an important tool for agricultural product companies to gain market share, attract new customers and retain existing customers [4-6]. And branding is a key strategy for agricultural product companies to build corporate image, increase product awareness and increase consumer loyalty. Through branding, agricultural companies can effectively communicate the unique value of their products, quality assurance and fit with consumer needs [7-9].

In the context of the development of rural revitalization strategy, with the application of digital technology expanding, the branding of agricultural products has higher requirements. At this stage, the branding of agricultural products has gradually become a prerequisite for promoting the development of modern agriculture, and has become an indispensable and important part of the construction process of modern agriculture [10-12]. Thus, it must be based on the existing brand construction, combined with the background and connotation of the development of rural revitalization strategy, the basic requirements, etc., to enhance the awareness of brand promotion of farmers, create a brand promotion system for agricultural products that meets the market demand, improve the satisfaction of the target customer groups, and realize the creation and promotion of agricultural product brands [13-15]. With the improvement of people’s living standards, ecological agricultural products are gradually favored by consumers. In the absence of obvious safety identification standards, ecological agricultural products enterprises in order to win the attention of more consumers, and gradually embark on the road of branding, to attract consumers with a big brand, therefore, the brand promotion of agricultural products has become an important topic for many ecological agricultural products production enterprises to study [16-18].

The important task of brand promotion of agricultural products is to increase brand awareness and reputation, and ultimately sell the products with corresponding brand names [19]. Therefore, consumers’ purchasing behavior of branded agricultural products has become an important factor in measuring the effectiveness of agricultural product brand promotion. As people’s attention to food quality and safety is increasing, the recognition and demand for agricultural product brands are also increasing [20-21]. The optimization of brand promotion strategy can improve the visibility and reputation of agricultural products brand, which helps to improve the market competitiveness and profitability of enterprises. The research on the optimization of brand promotion strategy for agricultural products is of great significance to promote the development of agricultural products market and the operation of modern supply chain of Chinese agricultural products, as well as to improve the international competitiveness of Chinese agricultural products [22-23].

In the field of research on branding of agricultural products , Butova, T. G and others, in order to improve the competitiveness and quality of Russian agricultural products in foreign markets, proposed to consider the branding technology as an indicator of the focus and development of agricultural products, thus focusing on the analysis of constraints to the development of local brands of agricultural products, in particular the problem of insufficient definition of the brand structure with the aim of optimizing the design of territorial brands of agricultural products [24]. Zheng, X built a theoretical model of brand-driven mode, and combined with the fuzzy set qualitative comparative analysis method, analyzed the brand-driven mode and applicable conditions, pointed out that the main paths of regional branding of Chinese agricultural products in the present time are mainly four modes: resource-dependent, technology-led, culture-led and industry-led, and pointed out that resource-dependent type has high requirements on policies, infrastructure improvement degree. While technology-induced type has high requirements on the education level of regional farmers [25]. Orria, B discusses the “new countryside” branding and promotion of quality local food being promoted by Campanian farmers, who are proposing the application of an integrated approach to local development in order to guarantee the quality of local food, culture and environmental resources, while promoting a sustainable view of development and consumption, and rejecting consumerism [26]. Sun, Y introduced the current policy documents and exploration hotspots around China’s agricultural product branding, tried to analyze the macro-micro factors affecting China’s agricultural product branding, and set up an analytical model of agricultural product branding based on the Chekran method [27]. From the agricultural product brand policy support, local development integrated approach, common agricultural product brand building path as well as the international market competition of agricultural products and other multi-dimensional, the current agricultural product brand construction analysis, but also some scholars focus on the agricultural product brand publicity and promotion strategy and impact. Suman, S elaborated that branding and promotion of agricultural products have significant impact on farmers’ choice of agricultural products and conducted descriptive study and conducted a survey on the same and found that farmers prioritize branding when purchasing agricultural inputs especially in purchasing seeds and fertilizers [28]. Reyes, S. R. C et al. revealed the problems of single channel and poor effect of traditional agricultural products promotion mode, and proposed an agricultural products brand promotion mode mediated by short videos, aiming to enhance the brand awareness and audience of agricultural products and provide strong support for online and offline marketing of agricultural products [29].

The article first analyzes the influencing factors of the brand promotion effect of agricultural products. Then the principal component analysis method and the establishment and testing method of multiple linear regression model are outlined. Subsequently, the principal component analysis method is used to determine the characteristic roots and principal component coefficients, and the number of principal component quantities that affect the effect of brand promotion of agricultural products is determined according to the calculation results. After determining the influencing factors, the multiple linear regression model was applied to establish the model and further estimate and test the model of the influence of the brand promotion effect of agricultural products, so as to carry out multiple regression analysis model fitting.

2

Quantitative Assessment Methods of Agricultural Product Branding Effectiveness

2.1

Factors affecting the effectiveness of brand promotion for agricultural products

1)

Program Innovation

Agricultural product proponents usually attach considerable importance to project innovativeness, hoping to obtain unique and personalized products/services in return. The need for novelty stems from people’s pursuit of individuality and freshness, and the innovativeness of produce projects can serve as an important quality signal. It is argued that produce supporters are more inclined to fund and participate in projects with a higher degree of novelty. Projects with higher innovativeness are more likely to gain sustained attention and recommendation from supporters, which can enhance the brand promotion effect of the project.

2)

Cycle Continuity

Initiators tend to shorten the project cycle to avoid exposing potential problems. However, the initiators of high-quality projects do not have this concern, and they tend to choose a longer cycle in order to gain more attention and support. This paper argues that when the cycle has a certain degree of continuity, it can expose the project’s deficiencies and defects to a greater extent, so cycle continuity can be used as one of the quality signals. When the project cycle has a certain degree of continuity, it can reduce the uncertainty of supporters due to information asymmetry, and the possibility of supporters’ continued attention and recommendation is greater, which improves the brand promotion effect of the project.

3)

Product Reliability

The success of agricultural products depends on the nature of the project itself, and high-quality projects can be identified through product quality and other aspects. Supporters learn about the program through the online produce platform and receive innovative products in return for their support. Whether the product information is falsely exaggerated and truthfully and reliably represents the true quality of the produce program, so product reliability can be used as a type of quality signal. Product reliability will affect the supporters’ experience of the agricultural products and their evaluation of the initiator. When the products are more reliable, the perceived value of the supporters is higher, and there is a greater likelihood of sustained attention and recommendation, which will enhance the branding effect of the project.

4)

Detailed information

The agricultural platform requires the initiator to disclose project information in a timely manner. The more detailed the information is, the more in-depth the supporters’ understanding of the project and the product is, and the less uncertainty they feel, so this paper takes the detailedness of information as one of the quality signals. When project information is disclosed in a more timely and detailed manner, the smaller the perceived risk of the supporters, the greater the possibility of their continued attention and recommendation, thus enhancing the brand promotion effect of the project.

5)

Word-of-mouth (IWOM)

The first IWOM information that supporters pay attention to is the word-of-mouth of the project itself. At the early stage of project promotion and launch, leading supporters or experienced supporters will form judgments, and they will post evaluations on the agricultural platform through reviews and scores, which make up the IWOM of the project. The better the project’s IWOM is, the more it can gain the attention and recommendation of subsequent supporters, thus enhancing the project’s brand promotion effect.

6)

Initiator IWOM

As the executor of the project, the IWOM of the initiator is also the information that supporters focus on. The evaluations harvested by the initiator in his previous business and agricultural product experiences are the initiator’s IWOM. The better the originator’s IWOM is, the more attention and recommendation it can get from subsequent supporters, thus enhancing the branding effect of the project.

7)

Financing effect

This paper argues that the financing effect of agricultural products will have an impact on the brand promotion effect, mainly due to the following reasons: ① Time sequence. The motivation of agricultural product supporters is different from that of ordinary consumers, and their participation in agricultural products is mainly to obtain some more niche products and to satisfy curiosity, etc. They pay attention to the agricultural product project and the brand promotion effect of the project first. They first focus on agricultural projects and products themselves, and then form a support decision to influence the financing effect, after the formation of a satisfactory evaluation of the project and product and then continue to pay attention to the initiator and influence the brand promotion effect. ② Ease of realization. To provide promoters with convenient financing is the original purpose of the emergence of agricultural products, the success rate of domestic agricultural projects is generally higher, the project basically get better financing results. The realization of brand promotion is the incidental value of agricultural products, and it is more difficult to obtain better brand promotion results.

8)

Initial number of fans

Supporters who are interested in the initiator of agricultural products and are willing to pay continuous attention usually collect the initiating store to become fans [30].

2.2

Principal component analysis

The basic idea of principal component analysis is to represent the measured multiple indicators by a linear combination of a few potentially independent principal component indicators, which constitutes a linear combination that can reflect the main information of the original multiple measured indicators.

There is n sample, in which m characteristic variables constitute the matrix as follows: 1 $X = [\begin{array}{rcl} x_{11} & x_{21} & Λ & x_{n 1} \\ x_{12} & x_{22} & Λ & x_{n 2} \\ M & M & M & M \\ x_{1 m} & x_{2 m} & Λ & x_{n m} \end{array}]$ $$X = \left[ {\begin{array}{*{20}{r}} {{x_{11}}}&{ {x_{21}}}&\Lambda &{ {x_{n1}}} \\ {{x_{12}}}&{ {x_{22}}}&\Lambda &{ {x_{n2}}} \\ M&M&M&M \\ {{x_{1m}}}&{ {x_{2m}}}&\Lambda &{ {x_{nm}}} \end{array}} \right]$$

Normalizing the matrix yields: 2 $Z_{j} = \frac{x_{i j} - {\bar{x}}_{j}}{s_{j}}$ $${Z_j} = \frac{{{x_{ij}} - {{\bar x}_j}}}{{{s_j}}}$$

where ${\bar{x}}_{j} = \frac{1}{n} \sum_{i = 1}^{n} x_{i j}$ $${\bar x_j} = \frac{1}{n}\sum\limits_{i = 1}^n {{x_{ij}}}$$, $s_{j}^{2} = \frac{1}{n - 1} \sum_{i = 1}^{n} (x_{i j} - {\bar{x}}_{j})$ $${s_j}^2 = \frac{1}{{n - 1}}\sum\limits_{i = 1}^n {\left( {{x_{ij}} - {{\bar x}_j}} \right)}$$, i = 1, 2, Λ, n, j = 1, 2, Λ, m. The normalized feature variables are converted into principal components: 3 $F_{j} = Z L_{j} = L_{j 1} Z_{1} + L_{j 2} Z_{2} + Λ + L_{j m} Z_{m}$ $${F_j} = Z{L_j} = {L_{j1}}{Z_1} + {L_{j2}}{Z_2} + \Lambda + {L_{jm}}{Z_m}$$

where j = 1, 2, 3Λm.

If the cumulative contribution of principal components: 4 $\sum_{j = 1}^{k} (λ_{j} / \sum_{j = 1}^{m} λ_{j}) \geq 85 %$ $$\sum\limits_{j = 1}^k {\left( {{{{\lambda_j}} \mathord{\left/ {\vphantom {{{\lambda_j}} {\sum\limits_{j = 1}^m {{\lambda_j}} }}} \right. } {\sum\limits_{j = 1}^m {{\lambda_j}} }}} \right)} \geq 85\%$$

Then the first k principal components are selected as principal component variables, or the first few principal components with eigenvalues greater than 1 (or some specified eigenvalue) are selected as principal component components [31]. For example, if the sales price P is used as the dependent variable and the principal component variables are used as the independent variables, a linear function is used to model the price: 5 $P = α_{0} + α_{1} F_{1} + α_{2} F_{2} + Λ + α_{k} F_{k} + \in$ $$P = {\alpha_0} + {\alpha_1}{F_1} + {\alpha_2}{F_2} + \Lambda + {\alpha_k}{F_k} + \in$$

To summarize, this paper considers fitting all three function cases, comparing the fitting effect with the prediction effect through statistical analysis and testing, and selecting the function form that can achieve the optimal evaluation effect of agricultural brand promotion.

2.3

Multiple linear regression models

The number of explanatory variables (independent variables) of the univariate linear regression model is one. In the practical application of the problem, the number of independent variables will be more than one, such as in this paper, the impact of the roadway speed is not only affected by the speed of the same roadway adjacent moments, but also by the roadway around the neighboring roadway speed constraints. That is, when the number of independent variables reaches more than two, it is called a multiple linear regression model. Among them, the literature is a multiple linear regression estimation model of road speed based on the spatial relationship of road segments, which obtains relatively high probability and accuracy of speed error, but does not incorporate temporal correlation, in order to ensure the fitting accuracy and data coverage, this paper applies spatial and temporal correlation together in the model to fill in the missing data of floating vehicles.

The following is a brief description of the multiple linear regression model as the theoretical basis of this thesis.

2.3.1

Multiple linear regression modeling

General form of multiple linear regression model: 6 $y_{i} = α_{0} + α_{1} x_{1 i} + α_{2} x_{2 i} + \dots + α_{k} x_{k i} + ε_{i} i = 1, 2, \dots, n$ $${y_i} = {\alpha_0} + {\alpha_1}{x_{1i}} + {\alpha_2}{x_{2i}} + \cdots + {\alpha_k}{x_{ki}} + {\varepsilon_i}i = 1,2, \cdots ,n$$

In the model y_i is referred to as the dependent variable while x_1i, x_2i, ⋯, x_ki is referred to as the independent variable. At k = 1, equation (6) is known as a univariate linear regression model and at k > 2, equation (6) is known as a multiple linear regression model [32]. The dependent variable y_i consists of an error term random variable ε_i and a linear function α₀ + α₁x_1i + α₂x_2i + ⋯ + α_kx_ki of the k independent variables.

The parameter estimation of the multiple linear regression model is much more complicated than the univariate linear regression model, and the tool of matrix is introduced to simplify the computation and analysis in order to facilitate the computation and analysis, and to facilitate the generalization of the results from univariate totals to general multivariate totals.

A set of random samples in the overall set of $(y_{i}, x_{1 i}, x_{2 i}, \dots, x_{k i})$ $$\left( {{y_i},{x_{1i}},{x_{2i}}, \cdots ,{x_{ki}}} \right)$$, i = 1, 2, ⋯, n. In then (6) can be written as the following matrix representation: 7 $\begin{matrix} y_{1} = α_{0} + α_{1} x_{11} + α_{2} x_{21} + \dots + α_{k} x_{k 1} + ε_{1} \\ y_{2} = α_{0} + α_{1} x_{12} + α_{2} x_{22} + \dots + α_{k} x_{k 2} + ε_{2} \\ ⋮ \\ y_{n} = α_{0} + α_{1} x_{1 n} + α_{2} x_{2 n} + \dots + α_{k} x_{k n} + ε_{n} \end{matrix}$ $$\begin{array}{*{20}{c}} {{y_1} = {\alpha_0} + {\alpha_1}{x_{11}} + {\alpha_2}{x_{21}} + \cdots + {\alpha_k}{x_{k1}} + {\varepsilon_1}} \\ {{y_2} = {\alpha_0} + {\alpha_1}{x_{12}} + {\alpha_2}{x_{22}} + \cdots + {\alpha_k}{x_{k2}} + {\varepsilon_2}} \\ \vdots \\ {{y_n} = {\alpha_0} + {\alpha_1}{x_{1n}} + {\alpha_2}{x_{2n}} + \cdots + {\alpha_k}{x_{kn}} + {\varepsilon_n}} \end{array}$$

Where k - the number of independent variables.

α₀, α₁, ⋯, α_k - parameters to be determined.

ε_i - random variable.

Expressed using matrix operations as: 8 $[\begin{matrix} y_{1} \\ y_{2} \\ ⋮ \\ y_{n} \end{matrix}] = [\begin{matrix} 1 & x_{11} & \dots & x_{k 1} \\ 1 & x_{12} & \dots & x_{k 2} \\ ⋮ & ⋮ & ⋮ \\ 1 & x_{1 n} & \dots & x_{k n} \end{matrix}] [\begin{matrix} α_{0} \\ α_{1} \\ ⋮ \\ α_{k} \end{matrix}] + [\begin{matrix} ε_{1} \\ ε_{2} \\ ⋮ \\ ε_{n} \end{matrix}]$ $$\left[ {\begin{array}{*{20}{c}} {{y_1}} \\ {{y_2}} \\ \vdots \\ {{y_n}} \end{array}} \right] = \left[ {\begin{array}{*{20}{c}} 1&{ {x_{11}}}& \cdots &{ {x_{k1}}} \\ 1&{ {x_{12}}}& \cdots &{ {x_{k2}}} \\ \vdots & \vdots &{ }& \vdots \\ 1&{ {x_{1n}}}& \cdots &{ {x_{kn}}} \end{array}} \right]\left[ {\begin{array}{*{20}{c}} {{\alpha_0}} \\ {{\alpha_1}} \\ \vdots \\ {{\alpha_k}} \end{array}} \right] + \left[ {\begin{array}{*{20}{c}} {{\varepsilon_1}} \\ {{\varepsilon_2}} \\ \vdots \\ {{\varepsilon_n}} \end{array}} \right]$$

Denote by $y = [\begin{matrix} y_{1} \\ y_{2} \\ ⋮ \\ y_{n} \end{matrix}]$ $$y = \left[ {\begin{array}{*{20}{c}} {{y_1}} \\ {{y_2}} \\ \vdots \\ {{y_n}} \end{array}} \right]$$, $x = [\begin{matrix} 1 & x_{11} & \dots & x_{k 1} \\ 1 & x_{12} & \dots & x_{k 2} \\ ⋮ & ⋮ & ⋮ \\ 1 & x_{1 n} & \dots & x_{k n} \end{matrix}]$ $$x = \left[ {\begin{array}{*{20}{c}} 1&{ {x_{11}}}& \cdots &{ {x_{k1}}} \\ 1&{ {x_{12}}}& \cdots &{ {x_{k2}}} \\ \vdots & \vdots &{ }& \vdots \\ 1&{ {x_{1n}}}& \cdots &{ {x_{kn}}} \end{array}} \right]$$, $α = [\begin{matrix} α_{0} \\ α_{1} \\ ⋮ \\ α_{k} \end{matrix}]$ $$\alpha = \left[ {\begin{array}{*{20}{c}} {{\alpha_0}} \\ {{\alpha_1}} \\ \vdots \\ {{\alpha_k}} \end{array}} \right]$$, $ε = [\begin{matrix} ε_{1} \\ ε_{2} \\ ⋮ \\ ε_{n} \end{matrix}]$ $$\varepsilon = \left[ {\begin{array}{*{20}{c}} {{\varepsilon_1}} \\ {{\varepsilon_2}} \\ \vdots \\ {{\varepsilon_n}} \end{array}} \right]$$. This can be expressed as a matrix: 9 $y = X α + ε$ $$y = X\alpha + \varepsilon$$

Noting $\hat{α} = [\begin{matrix} {\hat{α}}_{0} \\ {\hat{α}}_{1} \\ ⋮ \\ {\hat{α}}_{k} \end{matrix}]$ $$\widehat \alpha = \left[ {\begin{array}{*{20}{c}} {{{\hat \alpha }_0}} \\ {{{\hat \alpha }_1}} \\ \vdots \\ {{{\hat \alpha }_k}} \end{array}} \right]$$, $e = [\begin{matrix} e_{1} \\ e_{2} \\ ⋮ \\ e_{n} \end{matrix}]$ $$e = \left[ {\begin{array}{*{20}{c}} {{e_1}} \\ {{e_2}} \\ \vdots \\ {{e_n}} \end{array}} \right]$$, the matrix representation of the sample is: 10 $y = X \hat{α} + e$ $$y = X\widehat \alpha + e$$

where ${\hat{α}}_{0}, {\hat{α}}_{1}, \dots, {\hat{α}}_{k}$ $${\hat \alpha_0},{\hat \alpha_1}, \cdots ,{\hat \alpha_k}$$ is the fitted value of α₀, α₁, ⋯, α_k respectively, the regression equation is: 11 ${\hat{y}}_{i} = {\hat{α}}_{0} + {\hat{α}}_{1} x_{1 i} + {\hat{α}}_{2} x_{2 i} + \dots + {\hat{α}}_{k} x_{k i}$ $${\hat y_i} = {\hat \alpha_0} + {\hat \alpha_1}{x_{1i}} + {\hat \alpha_2}{x_{2i}} + \cdots + {\hat \alpha_k}{x_{ki}}$$

Where ${\hat{α}}_{0}$ $${\hat \alpha_0}$$ - constant.

${\hat{α}}_{1}, \dots, {\hat{α}}_{k}$ $${\hat \alpha_1}, \cdots ,{\hat \alpha_k}$$ - partial regression coefficient.

2.3.2

Basic assumptions of the model

The purpose of the underlying assumptions is to facilitate the estimation of the parameters of the model, and the following assumptions are made about the equation:

Assumption 1 The model is assumed to be parametrically linear.

Assumption 2 The independent variables are independent of the random error term, i.e.: 12 $cov (x_{j i}, ε_{i}) = 0, j = 1, 2, \dots, k$ $$\operatorname{cov} \left( {{x_{ji}},{\varepsilon_i}} \right) = 0,j = 1,2, \cdots ,k$$

Assumption 3 Zero mean assumption, i.e.: 13 $E (ε_{i}) = 0, i = 1, 2, \dots, n$ $$E\left( {{\varepsilon_i}} \right) = 0,i = 1,2, \cdots ,n$$

Assumption 4 Homoskedasticity assumption, i.e.: 14 $var (ε_{i}) = σ^{2}, i = 1, 2, \dots, n$ $$\operatorname{var} \left( {{\varepsilon_i}} \right) = {\sigma^2},i = 1,2, \cdots ,n$$

Assumption 5 No autocorrelation assumption: 15 $cov (ε_{i}, ε_{j}) = 0, i \neq j, i = 1, 2, \dots, n, j = 1, 2, \dots, n$ $$\operatorname{cov} \left( {{\varepsilon_i},{\varepsilon_j}} \right) = 0,i \ne j,i = 1,2, \cdots ,n,j = 1,2, \cdots ,n$$

Assumption 6 Normality is assumed, viz: 16 $ε_{i} ~ N (0, σ^{2}), i = 1, 2, \dots, n$ $${\varepsilon_i}\sim N\left( {0,{\sigma^2}} \right),i = 1,2, \cdots ,n$$

2.3.3

Estimation of model parameters

After specifying the regression theoretical model, the unknown parameters of the model are estimated. Ordinary least squares is the most commonly used method for estimating the unknown parameters and is also the classical estimation method. When there are regression problems that do not meet the model assumptions, some new methods can be used, which are also based on the ordinary least squares method, such as partial least squares estimation, principal component regression, ridge regression and so on. Due to the fact that the computation is very large when there are many parameter variables, it is necessary to use computer software for the operation.

Based on the principle of least squares, the estimate ${\hat{α}}_{i}$ $${\hat \alpha_i}$$ of α_i is to be satisfied: 17 $Q = \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2} \to \min$ $$Q = \sum\limits_{i = 1}^n {{{\left( {{y_i} - {{\hat y}_i}} \right)}^2}} \to \min$$

The derivation of Eq: 18 ${\begin{matrix} n {\hat{α}}_{0} + (\sum_{i = 1}^{n} x_{1 i}) {\hat{α}}_{1} + \dots + (\sum_{i = 1}^{n} x_{k i}) {\hat{α}}_{k} = \sum_{i = 1}^{n} y_{i} \\ (\sum_{i = 1}^{n} x_{1 i}) {\hat{α}}_{0} + (\sum_{i = 1}^{n} x_{1 i}^{2}) {\hat{α}}_{1} + (\sum_{i = 1}^{n} x_{1 i} x_{2 i}) {\hat{α}}_{2} + \dots + (\sum_{i = 1}^{n} x_{1 k} x_{k i}) {\hat{α}}_{k} = \sum_{i = 1}^{n} x_{1 i} y_{i} \\ \dots \dots \\ (\sum_{i = 1}^{n} x_{k i}) {\hat{α}}_{0} + (\sum_{i = 1}^{n} x_{1 i} x_{k i}) {\hat{α}}_{1} + (\sum_{i = 1}^{n} x_{2 i} x_{k i}) {\hat{α}}_{2} + \dots + (\sum_{i = 1}^{n} x_{k i}^{2}) {\hat{α}}_{k} = \sum_{i = 1}^{n} x_{k i} y_{i} \end{matrix}$ $$\left\{ {\begin{array}{*{20}{c}} {n{{\hat \alpha }_0} + \left( {\sum\limits_{i = 1}^n {{x_{1i}}} } \right){{\hat \alpha }_1} + \cdots + \left( {\sum\limits_{i = 1}^n {{x_{ki}}} } \right){{\hat \alpha }_k} = \sum\limits_{i = 1}^n {{y_i}} } \\ {\left( {\sum\limits_{i = 1}^n {{x_{1i}}} } \right){{\hat \alpha }_0} + \left( {\sum\limits_{i = 1}^n {x_{1i}^2} } \right){{\hat \alpha }_1} + \left( {\sum\limits_{i = 1}^n {{x_{1i}}} {x_{2i}}} \right){{\hat \alpha }_2} + \cdots + \left( {\sum\limits_{i = 1}^n {{x_{1k}}} {x_{ki}}} \right){{\hat \alpha }_k} = \sum\limits_{i = 1}^n {{x_{1i}}} {y_i}} \\ { \ldots \cdots } \\ {\left( {\sum\limits_{i = 1}^n {{x_{ki}}} } \right){{\hat \alpha }_0} + \left( {\sum\limits_{i = 1}^n {{x_{1i}}} {x_{ki}}} \right){{\hat \alpha }_1} + \left( {\sum\limits_{i = 1}^n {{x_{2i}}} {x_{ki}}} \right){{\hat \alpha }_2} + \cdots + \left( {\sum\limits_{i = 1}^n {x_{ki}^2} } \right){{\hat \alpha }_k} = \sum\limits_{i = 1}^n {{x_{ki}}} {y_i}} \end{array}} \right.$$

is further written in matrix form: 19 $A a = B$ $$Aa = B$$

Assume that the coefficient matrix A is full rank, where: 20 $\begin{array}{rcl} A & = & X^{T} X, B = X^{T} Y, X = [\begin{matrix} 1 & x_{11} & \dots & x_{k 1} \\ 1 & x_{12} & \dots & x_{k 2} \\ ⋮ & ⋮ & ⋮ \\ 1 & x_{1 n} & \dots & x_{k n} \end{matrix}] \\ Y & = & [\begin{matrix} y_{1} \\ y_{2} \\ ⋮ \\ y_{n} \end{matrix}], a = [\begin{matrix} {\hat{α}}_{0} \\ {\hat{α}}_{1} \\ ⋮ \\ {\hat{α}}_{k} \end{matrix}] \end{array}$ $$\begin{array}{rcl} A &=& {X^T}X,B = {X^T}Y,X = \left[ {\begin{array}{*{20}{c}} 1&{ {x_{11}}}& \cdots &{ {x_{k1}}} \\ 1&{ {x_{12}}}& \cdots &{ {x_{k2}}} \\ \vdots & \vdots &{ }& \vdots \\ 1&{ {x_{1n}}}& \cdots &{ {x_{kn}}} \end{array}} \right] \\ Y &=& \left[ {\begin{array}{*{20}{c}} {{y_1}} \\ {{y_2}} \\ \vdots \\ {{y_n}} \end{array}} \right],a = \left[ {\begin{array}{*{20}{c}} {{{\hat \alpha }_0}} \\ {{{\hat \alpha }_1}} \\ \vdots \\ {{{\hat \alpha }_k}} \end{array}} \right] \\ \end{array}$$

The measured least squares method is estimated as: 21 $a = A^{- 1} B = {(X^{T} X)}^{- 1} X^{T} Y$ $$a = {A^{ - 1}}B = {\left( {{X^T}X} \right)^{ - 1}}{X^T}Y$$

2.3.4

Model testing

After the unknown parameters of the model are estimated and the multiple linear regression equation is established, the multiple linear regression equation needs to be tested for significance. 1)

Goodness-of-fit test of regression equation the magnitude of the coefficient of determination R² can indicate how well the estimated equation fits the sample observations, i.e. SST = SSR + SSE.

Where $S S T = \sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}$ $$SST = \sum\limits_{i = 1}^n {{{\left( {{y_i} - \bar y} \right)}^2}}$$ is the total sum of squared deviations, $S S R = \sum_{i = 1}^{n} {({\hat{y}}_{i} - \bar{y})}^{2}$ $$SSR = \sum\limits_{i = 1}^n {{{\left( {{{\hat y}_i} - \bar y} \right)}^2}}$$ is the regression sum of squares, which is a parameter that reflects the effect of regression, and $S S E = \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}$ $$SSE = \sum\limits_{i = 1}^n {{{\left( {{y_i} - {{\hat y}_i}} \right)}^2}}$$ is the residual sum of squares. y_i is the regression value on the ith sample point $(x_{1 i}, x_{2 i}, \dots, x_{k i})$ $$\left( {{x_{1i}},{x_{2i}}, \cdots ,{x_{ki}}} \right)$$ and $\bar{y}$ $$\bar y$$ is the sample mean of y.

The correlation coefficient is: 22 $r = \frac{cov (X, Y)}{\sqrt{D X \cdot D Y}}, \hat{r} = \frac{\sum_{i = 1}^{n} (x_{i} - \bar{x}) (y_{i} - \bar{y})}{\sqrt{\sum_{i = 1}^{n} {(x_{i} - \hat{x})}^{2}} \cdot \sqrt{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}}$ $$r = \frac{{\operatorname{cov} (X,Y)}}{{\sqrt {DX \cdot DY} }},\hat r = \frac{{\sum\limits_{i = 1}^n {\left( {{x_i} - \bar x} \right)} \left( {{y_i} - \bar y} \right)}}{{\sqrt {\sum\limits_{i = 1}^n {{{\left( {{x_i} - \hat x} \right)}^2}} } \cdot \sqrt {\sum\limits_{i = 1}^n {{{\left( {{y_i} - \bar y} \right)}^2}} } }}$$

R² is the square of the correlation coefficient: 23 $R^{2} = 1 - \frac{S S E}{S S T}$ $${R^2} = 1 - \frac{{SSE}}{{SST}}$$

In one-way linear regression, if the value of R² is close to 1, then it indicates that the regression equation is a good fit to the actual observations, and vice versa, the worse the fit.

In practical problems, if R² increases by adding an independent variable, it is assumed that the model can be made to fit better by adding an independent variable [33]. However, the reality is that an increase in R² due to an increase in the number of independent variables does not correlate well with a good fit, and then R² needs to be adjusted. This brings us to the adjusted goodness of fit:

In a real problem, if R² increases by adding an independent variable, this would lead one to believe that the model would be fitted better by adding an independent variable. However, the reality is that an increase in R² due to an increase in the number of independent variables does not correlate well with goodness of fit, and then R² needs to be adjusted. This brings us to the adjusted goodness of fit: 24 $R_{1}^{2} = 1 - \frac{S S E}{n - k - 1} / \frac{S S T}{n - 1}$ $$R_1^2 = 1 - {{\frac{{SSE}}{{n - k - 1}}} \mathord{\left/ {\vphantom {{\frac{{SSE}}{{n - k - 1}}} {\frac{{SST}}{{n - 1}}}}} \right. } {\frac{{SST}}{{n - 1}}}}$$

Where n − k − 1 - Degrees of freedom (residual sum of squares).

n − 1 - degrees of freedom (overall sum of squares).

In multiple linear regression, the closer $R_{1}^{2}$ $$R_1^2$$ is to 1, the better the fit. 2)

The test of significance of a regression equation can be illustrated by the F-test for significance:

F - test of the sum of squares decomposition formula: 25 $\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2} = \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2} + \sum_{i = 1}^{n} {({\hat{y}}_{i} - \bar{y})}^{2}$ $$\sum\limits_{i = 1}^n {{{\left( {{y_i} - \bar y} \right)}^2}} = \sum\limits_{i = 1}^n {{{\left( {{y_i} - {{\hat y}_i}} \right)}^2}} + \sum\limits_{i = 1}^n {{{\left( {{{\hat y}_i} - \bar y} \right)}^2}}$$

Denoted as L_yy = Q + U. Can be obtained: 26 $F = \frac{U}{Q / (n - 2)} ~ F (1, n - 2)$ $$F = \frac{U}{{Q/(n - 2)}}\sim F(1,n - 2)$$

Rejecting domain $χ_{0} = {F > F_{1 - α} (1, n - 2)}$ $${\chi_0} = \left\{ {F > {F_{1 - \alpha }}(1,n - 2)} \right\}$$ indicates that the regression is good.

3

Empirical analysis of the effectiveness of brand promotion for agricultural products

The data of evaluation indexes are all derived from the sales data of an agricultural product in Taobao in 2023, and the relevant sales data on December 1, 2023 are selected for empirical analysis in this paper.

3.1

Determination of Influential Factors Based on Principal Component Analysis

The eigenvalues and contribution rates are shown in Table 1. From the table, it can be seen that the cumulative variance contribution rate of the first 4 principal components reaches 87.38%, which basically retains the information of the original indicators, so that the original 8 indicators are transformed into 4 new indicators, and the 4 indicators have a linear relationship with the original indicators.

Table 1.

Eigenvalue and contribution rate

	Characteristic root	Contribution rate(%)	Cumulative contribution(%)
1	2.85	35.63	35.63
2	2.1	26.25	61.88
3	1.04	13.00	74.88
4	1	12.50	87.38
5	0.51	6.38	93.76
6	0.33	4.13	97.89
7	0.15	1.88	99.77
8	0.02	0.25	100

The principal component coefficient matrix is shown in Table 2. From the table, it can be seen that the first principal component is mainly a comprehensive reflection of the 3 indicators of cycle continuity, product reliability and information exhaustiveness, with a contribution rate of 42.24%. The second principal component is mainly a composite reflection of the 2 indicators of financing effect and project reputation, with a variance contribution rate of 27.82%. The third principal component is mainly the project innovativeness and the number of initial fans, with a variance contribution rate of 10.12%. The fourth principal component is mainly the originator’s word of mouth, with a variance contribution rate of 7.2%.

Table 2.

Principal component coefficient matrix

Index	Principal Component
Index	1	2	3	4
Financing Effect	0.85	0.86	0.38	0.13
Project Innovation	0.14	0.58	0.86	0.27
Periodic Continuity	0.71	0.78	0.25	0.28
Product Reliability	0.41	0.31	0.55	0.24
Detail	0.89	0.07	0.19	0.4
Project Reputation	0.55	0.93	0.24	0.57
Reputation Of Originator	0.43	0.4	0.3	0.93
Initial Number Of Fans	0.49	0.37	0.91	0.06

3.2

Estimation and testing of the model of the impact of brand promotion effect of agricultural products

3.2.1

Sample data sources

China’s agricultural product promotion data mainly come from several major websites and platforms, such as Taobao, Shake, Xiaohongshu, WeChat Video No. These websites and platforms regularly publish relevant transaction data, which are objective, true and fair. The experiments in this section take the promotional sales data of an agricultural product on the Taobao platform on December 1, 2023 as the research object. Descriptive statistics are shown in Table 3.

Table 3.

Descriptive statistics

	Observed Value	Mean	Median	Maximum Value	Minimum Value	Standard Deviation	Degree Of Bias	Kurtosis	P Value
Promotion Effect	225	9.828	9.261	10.215	9.489	0.215	3.018	35.696	0
Financing Effect	225	17.841	6.061	12.441	0.2	0.329	1.97	1.925	0.587
Project Innovation	225	16.657	2	6	0	0.014	17.755	2.779	0.599
Periodic Continuity	225	8.363	1.5	2	1	0.021	14.112	2.18	0.477
Product Reliability	225	7.954	3.5	4	1	0.003	2.741	2.883	0.149
Detail	225	20.292	0.5	1	0	0.057	10.312	2.355	0.096
Project Reputation	225	2.134	0.5	1	0	0.202	20.017	1.704	0.002
Reputation Of Originator	225	8.446	2	2	1	0.178	21.013	1.639	0.186
Initial Number Of Fans	225	2.137	2	2	1	0.032	11.193	2.435	0

3.2.2

Modeling

The study of the process of explaining changes in a dependent variable by multiple influences as independent variables is multiple regression analysis. For multiple linear regression models the estimation of structural parameters and variance is done mainly by least squares. The collected data were substituted into the model and the regression and statistical tests obtained. The regression results are shown in Table 4. According to the relevant theory in statistics, the closer the P-value of the results of the statistical analysis software in the table is to 1, it means that the influence of the factor on the effect of agricultural brand promotion is more insignificant, and the closer the P-value is to 0, it means that the influence of the factor on the effect of agricultural brand promotion is more significant, and the better the results are. As can be seen from the table, the correlation between financing effect, information exhaustiveness and project reputation is relatively high, and its P-value is 0, 0.0761 and 0.0842 respectively.

Table 4.

Regression

Independent variable	Coefficient	Standard deviation	T-statistic	P value
Financing Effect	0.207894	0.343334	1.81357	0.6227
Project Innovation	0.012231	0.233795	0.37927	0.4125
Periodic Continuity	0.032513	0.153191	1.53056	0.2308
Product Reliability	0.235319	0.180799	2.8163	0.2352
Detail	0.041436	0.131607	0.58968	0.0761
Project Reputation	0.149697	0.277089	1.8261	0.0842
Reputation Of Originator	0.03234	0.128173	0.27102	0.6763
Initial Number Of Fans	0.306701	0.173075	2.7514	0.7341
Adjusted R²=0.832261 R²=0.836522 F statistic=75.62
DW=1.82322 Prob=0

3.2.3

Testing of the model

After parametric regression analysis and interpretation of the economic significance of the model, the assessment model needs to be quality tested to verify the consistency (or accuracy) of its evaluation results. The assessment consistency test is usually conducted through ratio analysis. The calculation results of assessment consistency indicators are summarized in Table 5. As can be seen from the table, the median assessment ratio is 1.00261, the arithmetic mean of the ratios is 1.0352, and the weighted mean of the ratios is 0.9522, which are all within the range of 0.9-1.1 required by IAAO. The coefficient of dispersion is 4.02311, which meets the requirement of ≦15 and can be considered reasonable. The correlation difference of 0.9877 meets the IAAO’s requirement of 0.98-1, so the assessment model meets the test criteria, indicating that the model is usable in batch assessment. This shows that it is feasible to use the multiple regression analysis method in the brand promotion effect of agricultural products.

Table 5.

Evaluate the consistency index calculation results

Median	Ratio arithmetic mean	Weighted ratio mean	Discrete coefficient	Price correlation
1.00261	1.0352	0.9522	4.02311	0.9825

3.3

Model Fitting

This system uses data on brand promotion of agricultural products for regression model fitting, which can be used to determine the importance of the influencing factors by using the stepwise entry method in the SPSS software, or by choosing the all entry method to screen the influencing factors based on the analysis results. First you need to enter the influence variable data. Select the variable to enter the model method, set the statistical test method, and the system fits the model according to the least squares method and generates the analysis graph. The histogram of standardized residuals is shown in Figure 1. It can be seen that the residuals conform to a normal distribution.

The standardized residual P-P plot is shown in Figure 2. The regression model was judged to be linear based on the output of the plot.

4

Conclusion

In order to analyze the influencing factors of agricultural brand promotion effect, the article introduces the multiple linear regression model to explore, and the results are as follows:

In the principal component coefficient matrix analysis, the first principal component is mainly cycle continuity, product reliability, and information exhaustiveness, the second principal component is mainly financing effect and project word-of-mouth, the third principal component is mainly project innovativeness and initial number of fans, and the fourth principal component is mainly initiator’s word-of-mouth, whose contribution rates are 42.24%, 27.82%, 10.12% and 7.2%, respectively.

The analysis of regression results shows that the correlation between financing effect, information exhaustiveness and project word-of-mouth and the brand promotion effect of agricultural products is relatively high, and its P-value is 0, 0.0761 and 0.0842, respectively.

In the Internet era, agricultural products must take the road of branding in order to realize long-term sustainable development. To this end, the brand concept should be established, branding should be emphasized, and excellent brands of agricultural products should be created to improve brand awareness.

Language:: English

Publication timeframe:: 1 times per year
Journal Subjects:: Life Sciences, Life Sciences, other, Mathematics, Applied Mathematics, General Mathematics, Physics, Physics, other

Journal RSS Feed

Quantitative assessment of brand promotion effect of agricultural products based on multiple regression analysis

Yanjun Su

Published Online: Sep 29, 2025

Received: Jan 13, 2025

Accepted: May 02, 2025

DOI: https://doi.org/10.2478/amns-2025-1115

KeywordsMultiple linear regression model, Principal component analysis, Brand promotion effect, Agricultural products

© 2025 Yanjun Su, published by Sciendo.

This work is licensed under the Creative Commons Attribution 4.0 International License.

Keywords
Multiple linear regression model, Principal component analysis, Brand promotion effect, Agricultural products