Dynamic channel model estimation based on gradient descent method and its optimization in massive MIMO
Publié en ligne: 21 mars 2025
Reçu: 26 oct. 2024
Accepté: 10 févr. 2025
DOI: https://doi.org/10.2478/amns-2025-0653
Mots clés
© 2025 Jinhui Chen et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
With the advent of the information age, wireless communication network plays an extremely important strategic role in the social and economic development of the country, and penetrates into various fields of society, and its development shows the trend of high-speed, broadband, heterogeneous and ubiquitous, and ultimately constitutes the air and ground, three-dimensional intersection and seamless link of the global network [1-4]. At the same time, wireless communication networks are also facing serious challenges, mainly in the relative “shortage” and “waste” of spectrum resources, the simultaneous existence of multiple heterogeneous network standards, ubiquitous access and services, and the complexity of network management [5-8]. The root cause of these problems is that the network is not a “network”, but a “network”. The root cause of these problems is that the static wireless network is difficult to adapt to the dynamic changes in the environment, one of the effective ways to solve these problems is cognitive radio technology.
The study of multiple-input multiple-output (MIMO) channels has now become a hot research topic in cognitive radio networks. Most of the current studies on MIMO model the channel from a static scenario of some fixed propagation environment to describe the characteristics of radio wave propagation in the channel, which will lead to a lack of perception of dynamic changes in the characteristics of radio wave propagation at the user terminal [9-13]. Wireless MIMO systems effectively utilize the spatial degrees of freedom of multi-antenna transceivers, which provide significant improvements in increasing channel capacity, link reliability, and reducing channel interference, but these properties cannot be optimized at the same time in practical applications [14-16]. Therefore, observable MIMO channel models are developed using suitable methods for estimating channel parameters and enhancing the ability of user terminals to gain a deeper understanding of the multidimensionality and multiscale characteristics of radio wave propagation in wireless environments that are in a dynamic stochastic nature [17-18].
In this paper, the Stein Variational Gradient Descent (SVGD) method is firstly applied to the channel simulation of large-scale MIMO in dynamic environments, which utilizes the maximum likelihood function established by the SVGD sampling model to improve the accuracy of channel estimation with the help of the deterministic updating direction of particles. Then, the performance of SVGD-based channel estimation algorithm is optimized in large-scale MIMO scenarios, and a dynamic channel simulation technique based on SVGD is proposed, which supports static channel, dynamic channel and birth/death channel simulation, and can also support large-scale MIMO fading types. Finally, the channel estimation problem is transformed into a variational inference optimization problem by taking advantage of the low-rank nature of massive MIMO channels, and the performance of the new method is simulated and experimented.
When exact inference is used to solve the dynamic channel model, the accuracy of the model computation results is improved, but at the same time, a high computational complexity is introduced. Therefore, to strike a balance between computational complexity and accuracy, this paper uses approximate inference to solve the dynamic channel model. There are two common methods for approximate inference. The first one is to complete the approximation by using a randomized sampling algorithm, and the commonly used method is MCMC. The second one is to complete the inference by using deterministic particle updating method, and the commonly used method is Stein’s variational gradient descent method.
Markov chain Monte Carlo (MCMC) is a method for solving integrals by simulation, which is widely used in various fields. Currently, MCMC has been able to solve many difficult high-dimensional complex models and non-Bayesian problems [19]. The main idea of MCMC is to construct a Markov chain and obtain a smooth distribution after many iterations, at which time the particles obtained by sampling the smooth distribution can approximate the target distribution.The main sampling method of MCMC is Metroplis-Hastings.
The Metroplis-Hastings algorithm is a classical MCMC sampling method, which is based on the idea of “rejection sampling”, and each iteration is updated according to the probability 1) Initialize the starting value 2) Sample a candidate point 3) Given candidate point From this, it can be found that the normalization constant 4) When
In the Metropolis sampling process, it is calculated first:
A random number is then generated and compared with the acceptance probability
Hastings optimized the Metropolis sampling method and proposed the Metropolis-Hastings method, which sets the acceptance probability
The Metropolis-Hastings method is able to use an asymmetric transfer probability function of:
Thus, the condition of strict symmetry of the recommendation distribution is avoided.
Stein Variational Gradient Descent (SVGD) is a variational inference method that uses particle deterministic updating to approximate the target distribution, avoiding the need to solve for regularization coefficients in the process of approximating the target distribution [20]. SVGD solves for the fastest descent direction in a different way compared to traditional variational inference algorithms. The traditional variational inference algorithm approximates and then optimizes when solving for the fastest descent direction, while SVGD uses optimization first and then goes for approximation.
Suppose that by minimizing the KL scatter, a simpler distribution
Suppose that
where
Let
where A
Let
Therefore, it can be found that the value of the gradient of Eq. (9) is equal to the value of the KSD, i.e:
The above equation shows a specific iterative process, transforming step by step from the initialized distribution to finally approximate the target distribution. First, the initialized particle
Doing transformation
Doing transformation
In summary, Stein variational gradient descent is a new particle-based variational inference algorithm. It combines the advantages of variational inference and deterministic updating of particle methods to effectively utilize gradient information for the approximation of the target distribution, thus achieving a faster convergence rate. Unlike traditional variational inference that constructs a parametric approximation of the target distribution by minimizing the KL scatter, SVGD directly approximates the target distribution with a series of particles, which are continuously iteratively updated to reduce the KL scatter as fast as possible in all possible velocity fields of the regenerative kernel Hilbert space of a positive definite kernel. Furthermore, it is theoretically possible to apply SVGD directly to high-dimensional models. However, standard SVGD involves kernel functions for updating particles at each iteration, but its kernel functions are defined over all variable dimensions, and the use of global kernel functions over all variables introduces a loss in the algorithm’s performance in higher dimensions. Therefore, SVGD does not produce distributed message passing like the confidence propagation algorithm (BP). In this paper, we mainly consider the problem of estimating the mean of high-dimensional model variables using SVGD, and do not consider variance estimation, so the performance defect of SVGD on high dimensions will not affect the accuracy of the experimental results in this paper.
In dynamic scenarios, the transceiver antenna of the communication device is in a mobile state, the propagation environment changes continuously, and the channel parameters such as delay, maximum Doppler shift, and path loss should have randomness. At the same time, the continuous change of the channel scene also causes the delay, maximum Doppler shift and path loss to change regularly and continuously. The theoretical model of channel impact response at moment
where

Schematic diagram of time-varying impulse response of dynamic channel
According to the microwave signal transmission path to establish a multi-channel wireless channel simulation platform shown in Figure 2, the platform contains user parameter configuration unit, RF input unit, channel parameter storage unit, channel simulation unit, RF output unit and other modules. When the multi-channel channel simulation platform works, it first selects the channel mode and configures the channel parameters through the user software interface. Then, the channel parameters are passed to the FPGA through the data interface, and the FPGA stores the parameters according to the user-selected channel mode, and the RF input module downmixes the input RF signal to IF. Then downconvert to baseband in FPGA, simulate channel fading, delay, loss, and superimpose channel noise on the baseband signal according to user-configured parameters, and then after upconversion to IF number, output to RF output unit to upmix to RF.

Block diagram of multi-channel dynamic channel simulation platform
Among them, the user parameter configuration unit is the core process of dynamic channel parameter configuration of the simulation platform, which is through the user parameter configuration software on the PC, and the flow of the user parameter configuration unit is shown in Figure 3. User parameter function configuration, parameter calculation, parameter transmission, and formation of dynamic channel parameters are needed to form the frame flow structure shown in Figure 4. The parameter function configuration mainly consists of the user selecting each channel mode, including three modes: static channel, mobile channel, and birth/death channel. Then, the channel parameters are configured by the user, and the number of fading paths, the delay of each path, the loss, the fading type, the moving speed, the communication frequency, and the signal-to-noise ratio need to be configured in the static mode. In dynamic mode, it is necessary to configure the number of fading paths, the fading type, the starting traveling speed, the traveling acceleration, the communication frequency, the path loss, the basic delay, the delay variation range, the delay variation rate, the signal-to-noise ratio and so on. In fading mode, it is necessary to configure the moving speed, communication frequency, path loss, basic delay, delay change range, delay interval, number of fading positions, fading period, signal-to-noise ratio, etc.

Flowchart of user parameter configuration unit

Frame structure of channel parameters
The parameter calculation is based on the Stein Variational Gradient Descent (SVGD) method to calculate the channel parameter calculation with parameter localization as follows:
1) Calculate the Doppler frequency according to SVGD The Doppler frequency is calculated for static channel as:
When moving the channel, the moving speed at moment 2) Calculate the delay of each path according to the parameters set by the user In the mobile channel mode, the path delay of each path at the And satisfy 0 ≤ 3) In the birth/death channel mode, the path delay at the where 4) Calculate the fading factor, discrete Doppler, and phase of each path according to the user setting of each path fading type and spectrum. 5) Fixed-point processing of calculated channel parameters
Parameter transmission is based on the user’s choice of channel mode, the fixed-point channel parameters are assembled into frames in a certain order, and header information is added, including the channel mode, channel dynamic update rate, signal-to-noise ratio, etc., and transmitted to the FPGA through the data interface.
Compared with the MIMO system in traditional 4G mobile communications, the massive MIMO system in 5G communications deploys a dense array of antennas at the base station end, which results in an increase in the transmission capacity of the channel and an increase in the spectrum utilization of the system, as well as the ability to satisfy more low-latency and high-reliability services. However, the large number of antennas makes the channel estimation techniques for massive MIMO systems face unprecedented challenges, so it is crucial to investigate high-performance channel estimation methods [21].
The principle of massive MIMO is to configure hundreds or thousands of antennas at the transceiver ends to simultaneously provide services to multiple single-antenna users sharing the same time-frequency time slots, and the framework of the massive MIMO system is shown in Fig. 5 [22]. Consider the single-cell multi-user massive MIMO case, the cell base station end is configured with Block diagram of a massive MIMO system
Where,
By definition, element (
where
Among them:
Introducing the above communication channel model to multi-antenna users, consider that the cell base station end is equipped with
where
where
where
Considering that the channel undergoes flat fading in massive MIMO systems, Eq. (24) and Eq. (25) can be simplified and written as:
In this section, the channel estimation problem is modeled as a variational inference optimization model by exploiting the low-rank nature of the massive MIMO channel matrix, and the Stein variational gradient descent (SVGD) algorithm is used to recover the channel state information of the user using the log function as the matrix penalty function. In this method, the log function has a better ability to induce sparsity than the kernel paradigm and penalizes all elements in a uniform way. The log function can impose a larger penalty on small elements in the matrix than on elements with larger values, a property that makes the log function closer to the rank of the matrix than the kernel paradigm, resulting in more accurate channel estimation accuracy.
In the finite scatterer propagation environment, the massive MIMO channel matrix exhibits a low-rank property, and in order to accurately estimate the channel state information at the receiver, the channel estimation problem is transformed into a rank minimization constraint problem according to the SVGD-based dynamic channel model:
While in massive MIMO operating in time-division duplex mode, the commonly used channel estimation method is non-blind channel estimation i.e., sending guide frequency sequences. In the guide frequency training phase, all the users simultaneously send a guide frequency sequence Φ of length
where
Among them:
Since the above rank minimization constraint problem is an NP-hard problem, it is computationally difficult to solve, but it has been shown that the matrix rank minimization problem can be converted to a kernel-paradigm minimization constraint problem due to the fact that the kernel-paradigm minimization problem is the optimal convex approximation of the rank minimization problem. Where
In order to solve Eq. (31), it is necessary to know the rank information in advance and then compute this optimization problem using the iterative hard thresholding method. However, obtaining a priori information about the rank of the matrix in advance is very difficult, so the low rank matrix can be transformed into an unconstrained kernel-paradigm convex optimization problem by transforming it into an unconstrained kernel-paradigm convex optimization problem:
Further Eq. (35) is transformed into a weighted kernel paradigm minimization problem:
Among them:
where
where
The simulation data used in this paper is generated by Deep MIMO, a multi-channel dynamic channel simulation platform. In order to test the ability of the SVGD-based dynamic channel model to adapt to different environments, two types of large-scale MIMO datasets are generated in this paper, LOS and NLO, respectively, and the outdoor scenario O1_60 ray-tracing dataset used in the LOS scenario. The test area is shown in Fig. 6. In the dynamic channel scenario, the test area is divided into three stages from east to west, from near to far from the base station, and the user numbers are R551-R654, R655-R789, and R790-R920, respectively, and the specific parameters of the simulation are shown in Table 1.

Test area in LOS scenario
Simulation parameters of LOS scenarios
Argument | Numerical value |
---|---|
Carrier frequency | 50 GHz |
System bandwidth | 0.5 GHz |
Number of subcarriers | 1 |
Multipath quantity | 10 |
Base station number | 1 |
Base station antenna array | (1,64,32) |
Antenna spacing | 0.5 |
Phase shifter resolution | 5 bit |
User area number | R551-R654, R655-R789, R790-R920 |
The NLOS scene uses the outdoor scene O1_28B, the overall layout of this scene is similar to O1_60, the only difference is the existence of blocking and reflective surfaces near base station 3, the NLOS scene is shown in Fig. 7. The specific parameter settings of the NLOS scene are shown in Table 2. The NLOS scene realizes the NLOS condition by adding blocking at base station 3, and two reflective surfaces are set up on both sides for create more NLOS paths. In the dynamic scenario, the test area is divided into three phases from east to west, from far to near to the base station, and the user numbers are R551-R650, R651-R750, and R751-R850, respectively.

NLOS scenario
Simulation parameters of NLOS scenarios
Argument | Numerical value |
---|---|
Carrier frequency | 30 GHz |
System bandwidth | 0.5 GHz |
Number of subcarriers | 1 |
Multipath quantity | 5 |
Base station number | 2 |
Base station antenna array | (1,32,2) |
Antenna spacing | 0.5 |
Phase shifter resolution | 5 bit |
User area number | R551-R650, R651-R750, R751-R850 |
After generating the simulation data it has to be pre-processed before it is fed into the network for training, first the data needs to be normalized as shown in Eq:
where
The dynamic channel model based on the Stein variational gradient descent (SVGD) algorithm (hereinafter referred to as the SVGD model), with the remaining hyper-parameters shown in Table 3. The SVGD model consists of two networks, Actor and Critic. The input to the Actor network is state
Network parameters of SVGD model
Argument | Numerical value |
---|---|
Learning rate | 0.001 |
Weight attenuation | 0.02 |
Playback buffer size | 10000 |
Lot size | 1036 |
In this subsection, the beamforming capability of the SVGD model is tested by selecting the user channel input model at the middle position of the LOS scenario, and the training curves record the optimal beamforming gain magnitude of the SVGD model during the iteration process. The training curve is shown in Fig. 8 and compared with the DFT codebook with 32 beams and the equal gain combination (EGC) performance upper limit, which assumes that the phase shifter variations are continuous, the number of beams within the codebook is equal to the number of users and the channel information is known. From the training curves, it can be seen that the performance of the beams obtained using the SVGD model is already the same as the optimal beams searched in the DFT codebook at about 3200 iterations. And it reaches 90% of the EGC upper limit at 28500 iterations, and finally stabilizes at a gain of 35.5 at 35200 iterations.

Beamforming training curve of SVGD model
The beams obtained from the SVGD model trained in the LOS scene are shown in Fig. 9 and compared with the EGC optimal beam cluster. Among them, Fig. 9(a) shows the beam obtained by the SVGD model after learning, and the beam map shown in Fig. 9(b) is the result after the optimal beam is solved for the root of the 4th power, which aims at enlarging the details of the paraflap for easy observation. From the figure, it can be seen that the beam obtained by the SVGD model learning can accurately point to the user and approximate the optimal beam pattern in the details of the paraflaps well, which is the reason why the SVGD beamforming can outperform the traditional DFT codebook.

The beam obtained by training SVGD model in LOS scene
For channel optimization of large-scale MIMO systems, the most important issue is how to simulate the spatial domain characteristics of the target channel, including the horizontal and vertical dimensions, in the test area (TV) using a 3D spherical multi-probe device scheme with comprehensive consideration of the horizontal azimuth and downward inclination angles. The main goal of the SVGD-based dynamic channel model is to get the right power weight allocation using the gradient descent method to replicate the spatial domain characteristics of the target channel.
If the probe weights are valued using the Min-Sum objective function, corresponding to different horizontal arrival angles

Min-Sum·0bjective function simulation effect of LOS scene
The NLO scenario uses minimizing the maximum spatial correlation simulation error (Min-Max) with the following objective function:
where

NLO scenario simulation results using Min-Max objective function
For the channel estimation problem of massive MIMO systems, this paper takes advantage of the low-rank nature of massive MIMO matrices to model the channel estimation problem as an SVGD-based variational inference optimization model. The virtual antenna pair selection schematic for the spherical anechoic darkroom of the massive MIMO scenario is shown in Fig. 12. The test region is sampled by selecting the positions of antenna pairs

Schematic diagram of virtual antenna pair selection in spherical anechoic darkroom
The virtual antenna pair
The microwave darkroom environment (especially phase measurements) is difficult to keep smooth during long time measurements. As a result, the wave propagation phase is unstable, which will adversely affect conventional spatial deconvolution methods that require phase information.
Figure 13 exhibits the 400 Markov chain Monte Carlo (MCMC) simulations implemented with the reference algorithm and the SVGD algorithm proposed in this paper, where a uniformly distributed phase deviation from -3.5° to 3.5° is added to each simulation. This section follows the simulation parameters for

Monte Carlo simulation results of reference algorithm and SVGD algorithm
Massive MIMO technology is becoming more important in 5G communications and even in next-generation mobile communications. Compared with traditional MIMO technology, the hundreds of antenna arrays at both ends of the massive MIMO base station make the system energy and spectral efficiency greatly improved, and will provide people with higher-quality and richer communication methods. In addition, due to the huge antenna arrays in the communication system, especially when the massive MIMO technology is applied in the frequency division duplex mode, the channel estimation in the downlink will cause a large problem of guide frequency pollution. At this time, the superior performance channel estimation technique plays a pivotal role. However, there are still many challenges in the application of large-scale MIMO technology, such as the current channel estimation techniques have the shortcomings of low estimation accuracy, large channel guide frequency overhead and high computational complexity. Therefore, to address the shortcomings of the above channel estimation techniques, this paper proposes a channel optimization scheme based on Stein’s variational gradient descent method by taking advantage of the inherent low-rank characteristics of large-scale MIMO channels, and establishes a dynamic channel model by applying the theory of low-rank matrix recovery and the SVGD algorithm to channel estimation problems. Finally, experimental data on fading is provided, and the simulation error between the experimental results and theoretical values is less than 0.05 as a whole, which proves the feasibility and practicality of the method.
This research was sponsored by the Beijing Nova Program (No.20240484645).