Research on Traffic Parameter Measurement Methods for Intelligent Transportation Systems

As people’s requirements for traffic safety, efficiency and comfort are getting higher and higher, intelligent transportation has been widely used as an emerging traffic management method [1-2]. Intelligent transportation system through the use of a variety of advanced technical means, can quickly and accurately detect and analyze the road traffic parameters, so as to provide accurate data for traffic management to support the realization of intelligent traffic management [3-5].

Traffic parameters refer to traffic flow, speed, distance, traffic density, traffic accidents and other series of parameters, these parameters are important indicators for evaluating traffic conditions and traffic management effects [6-8]. According to different classification standards, traffic parameters can be divided into many kinds, for example, according to the classification of time scale, it can be divided into different levels such as hours, days, weeks and years [9-10]. According to the classification of measurement methods, they can be divided into field, real-time, fixed-point and mobile types. In ITS, the commonly used traffic parameters mainly include traffic flow, vehicle speed and vehicle distance [11-14].

At present, the commonly used traffic parameter detection techniques in ITS mainly include in-vehicle system, roadside detection system and video detection system [15-16]. Vehicle-mounted system refers to the installation of vehicle-mounted sensors and other equipment in the vehicle, real-time measurement of vehicle speed, distance and other parameters, so as to realize the detection of traffic parameters [17-18]. Roadside detection system is through the installation of traffic flow detectors on both sides of the road, the use of ground sensing coils and other sensors to detect the situation of passing vehicles in real time, and transmit the data to the central server for analysis and processing [19-20]. The video detection system mainly relies on video technology, through intelligent algorithms to detect and track the vehicles in the video image, so as to obtain real-time traffic parameter data [21-23].

This paper designs an intelligent transportation system and explains its architecture and functional modules. The coil detector and microwave detector are used to collect the traffic data of a city’s GY expressway, and the flow rate, speed and occupancy rate are selected as the characterization parameters of traffic flow, and the correlation between the three is analyzed based on time and space. On this basis, the spatial characteristics of traffic flow data are extracted by using GCN, which is good at dealing with topological structure relations, and then the temporal characteristics of traffic flow data are comprehensively obtained by BiLSTM, which has the advantage of dealing with time series problems, and the two are combined in order to fully explore the spatial and temporal correlation of the traffic flow of the detection point of a section of continuous section, and to construct the combined GCN-BiLSTM prediction model. The predicted and actual values of the traffic flow of each model are compared, and the prediction errors of the four models are analyzed by selecting evaluation indexes to assess the prediction effect of the GCN-BiLSTM model on the traffic flow parameters in this paper.

2

Intelligent transportation system design

With the acceleration of urbanization, traffic management is facing serious challenges. The traditional traffic system is not able to handle the growing number of vehicles and traffic demand, so it is crucial to implement intelligent technology to enhance the traffic system. Intelligent transportation systems have an important application value in improving the intelligent management level of the transportation department and improving the road traffic environment. The article designs the intelligent transportation system based on multi-dimensional functional requirements.

2.1

System architecture design

The overall architecture of the intelligent transportation system is shown in Figure 1. The bottom layer is the data acquisition layer, which needs to use the intelligent monitoring equipment at each traffic intersection to collect traffic data, and then use the network layer to transmit the collected traffic data to the server. Because the amount of traffic data is too large, so the intelligent transportation system is also a big data-related system, in the data collection, data storage, data cleaning and data processing stage will involve the use of big data-related components, data collection stage using big data components Kafka, data storage stage using big data components HBase, data flow processing stage using big data components Storm and so on, while the need to use the map through the map of the data collection stage. Storm and others need to assist the system in completing its functions through interfaces such as map GPS. The application layer in the intelligent transportation system has a real-time monitoring module, information query module, security management module, statistical analysis module and rights management module, which is built using Spring, SpringMVC framework.

The following section describes the overall design of the system from the perspective of five view models: scenario view, logical view, process view, development view, and physical view. The scenario view describes the business scenario of the system from the user’s perspective. The logical view of the system mainly shows the design of the object model in the system, and describes the services provided by the system to the users from the user’s perspective. The system is divided into three layers according to the logical structure. The top layer is the display layer, which displays the user interface, mostly including the monitoring page, query page, management page, analysis page, and permissions page. The middle layer is the business logic layer, which mainly includes monitoring business logic processing, querying business logic processing, management business logic processing, analyzing business logic processing and authority management business logic processing. The bottom layer is the data layer, which mainly includes monitoring data, query data, management data, analysis data, and permission management data.

The development view of the system mainly describes the module organization and management of the software from the developer’s point of view. The system display part includes HTML files, CSS files, JavaScript files and Image resource files. The monitoring module includes video monitoring, vehicle monitoring, and traffic monitoring. The query module includes vehicle violation query services, driver information query services, and violation fact query services. Security Management Module includes Alarm Management Service, Resource Management Service and Arming Management Service. The query analysis module includes analysis service and statistical display service. The permission management module includes user management, role management, and department management. The system collects data through a Kafka cluster, stores it in an HBase cluster and an Oracle database, and processes it using a Storm cluster.

The physical view of the system is mainly for operation and maintenance personnel, describing the physical relationship between the system software and hardware, as well as the actual deployment. After the user logs into the system, the request is forwarded to the specific function server through the Nginx server, and at the same time, the database is clustered to ensure security.

The process view of the system is mainly oriented towards the integrator of the system, describing the relevant processes and threads of the system and their communication. After successfully logging into the system, users can click on specific pages to use related functions according to their needs. The system will process the data according to the user’s input, save the processing results in the database after successful processing, and return the processing results to the front-end display page for display.

2.2

System Module Division

The system can be classified into five modules from the perspective of functional modules, and the structure of the functional modules of the intelligent transportation system is shown in Figure 2, which are real-time monitoring module, information query module, safety management module, statistical analysis module and authority management module. Real-time monitoring module is mainly used to view road vehicle-related information, including road video monitoring, road vehicle monitoring and traffic flow monitoring. The information query module is mainly used to query the vehicle and related driver information, including vehicle violation query, vehicle type query, driver information query, illegal fact query, alarm query, electronic police query and suspected license plate query. The security management module is mainly used to manage security deployment and control, as well as resources, including security deployment and control management, security alarm management, and security resource management. Statistical analysis module is mainly used for statistical analysis of traffic data, including query analysis and statistical analysis. Permission management module is mainly used to manage the user’s permission, including user management, role management and department management.

Intelligent transportation systems are comprehensive systems that apply modern technology and information management methods to improve transportation efficiency, safety, and environmental friendliness. Traffic parameter measurement is the key means to realize the functional module of intelligent transportation systems, so it is of great significance to study the method of traffic parameter measurement. Based on this, this paper explores the measurement of traffic parameters in intelligent transportation systems.

3

Analysis of traffic data and characterization parameters

Traffic data is the basis for traffic flow prediction, road risk early warning, traffic path guidance and other research, and the quality of its collection will directly affect the final results of traffic parameter measurement, so the scientific and standardized data collection and checking is crucial for traffic parameter measurement.

3.1

Traffic data collection

This paper takes a city’s GY highway as a research object, and uses coil detectors and microwave detectors to collect traffic flow data of GY highway at 8 cross sections for 25 weekdays, fully combining the collection advantages of the two types of detectors.

3.1.1

Data detectors

1)

Coil detector

The coil detector is a detection device based on the principle of electromagnetic induction, which realizes the collection of traffic flow data by detecting the inductance change of the induction coil, and consists of a frame, a base plate, a processor, a detection card, and a terminal block. The coil detector can determine the inductance change by detecting the phase change through the phasor, and it can also determine the inductance change by detecting the oscillation frequency through the coupling circuit. When the vehicle passes through the two induction coils respectively, it will cause the coil inductance to change, and then it can detect the passing state of the vehicle, and then transmit the state signal to the detector to realize data acquisition.

2)

Microwave detector

Microwave detector is the use of linear frequency modulation technology continues to transmit signals to the road surface, through the return signal detection and analysis, and then collect the traffic volume, speed and occupancy and other information. Microwave vehicle detectors consist of microwave detectors, controllers, power supplies, and columns.

3.1.2

Traffic data acquisition

Considering the data demand of the study, this paper decides to select 8 sections of GY highway in a city as the object of traffic flow data collection.The 8 sections of GY highway are numbered Section 1~Section 8, respectively, in which the first 4 sections use coil detector to collect the data, and the last 4 sections use microwave detector to collect the data.

In order to make full use of the time-related features such as between daily and weekly traffic flow data, the weekdays from 2023-12-01 to 2023-12-25 were selected as the collection time, totaling 25 days, in which the time interval of data collection was 5 minutes. In order to make full use of the traffic flow data between types and other related features, traffic volume, speed and occupancy were selected as the data collection types, totaling three types of data.

3.2

Traffic flow characterization parameters

The three basic parameters used in this paper to characterize traffic flow are traffic volume Q (veh/h), speed V (km/h) and occupancy O (%). 1)

Flow

Flow refers to the number of vehicles and pedestrians passing through a certain place or a certain section of the road per unit of time, usually refers only to the number of motor vehicles. There are two common ways of calculation:

Time-averaged flow: (1) $Q_{t} = N / T$ $${Q_t} = N/T$$

Where: Q_i - traffic volume per unit of time (vehicles / h), N - total traffic volume during the observation period (vehicles), T - the length of the observation (h).

Average traffic volume between zones: (2) $Q_{s} = {\bar{V}}_{s} \times K_{q}$ $${Q_s} = {\bar V_s} \times {K_q}$$

Where: Q_x - average traffic volume (vehicles / h), ${\bar{V}}_{s}$ $${\bar V_s}$$ - average speed (km / h), K_q - traffic density (vehicles / km). 2)

Speed

Speed refers to the distance traveled by the vehicle in a unit of time, according to the difference in the conditions of use and purpose, the speed is divided into location speed, time average speed and interval average speed, the specific calculation is shown in the formula (3), (4) and (5).

Location speed: (3) $V = 3.6 \times \frac{L}{T} (\begin{matrix} T \to 0 \end{matrix})$ $$V = 3.6 \times \frac{L}{T}(\begin{array}{c} {T \to 0} \end{array})$$

Where: V - vehicle speed at the location (km/h), L - distance traveled by the vehicle (m), T - observation period (s).

Time-averaged vehicle speed: (4) ${\bar{V}}_{t} = \frac{1}{N} \sum_{m = 1}^{N} v_{m}$ $${\bar V_t} = \frac{1}{N}\sum\limits_{m = 1}^N {{v_m}}$$

Where: ${\bar{V}}_{t}$ $${\bar V_t}$$ - time average speed (km/h), v_m - location speed of the mrd vehicle (km/h), N - total traffic volume (vehicles) during the observation period.

Average speed of vehicles in the interval: (5) ${\bar{V}}_{s} = L \cdot N / \sum_{m = 1}^{N} t_{m}$ $${\bar V_s} = L \cdot N\:/\:\sum\limits_{m = 1}^N {{t_m}}$$

Where: ${\bar{V}}_{s}$ $${\bar V_s}$$ - the average speed of the interval (km / h), L - the interval mileage (km), N - the total traffic volume in the observation period (vehicles), t_m - the time taken by the fifth vehicle to pass through the interval (h). --The time taken by the mth vehicle to pass through the interval (h). 3)

Occupancy

Occupancy refers to the degree of possession of the road by vehicles on the road, divided into two concepts of time occupancy and spatial occupancy, the specific calculation method is shown in equations (6) and (7).

Time occupancy: (6) $R_{t} = \frac{1}{T} \sum_{m = 1}^{N} t_{m}$ $${R_t} = \frac{1}{T}\sum\limits_{m = 1}^N {{t_m}}$$

Where: R_i - lane time occupancy (%),T - total observed time (h), t_m - time occupied by the mth vehicle (h), N - total number of vehicles in the roadway (vehicles).

Spatial occupancy: (7) $R_{s} = \frac{1}{1000 L} \sum_{m = 1}^{N} L_{m}$ $${R_s} = \frac{1}{{1000L}}\sum\limits_{m = 1}^N {{L_m}}$$

Where: R_s - lane time occupancy (%), L - observed roadway mileage (km), L_m - length of the mth vehicle (m), N - total number of vehicles on the roadway (vehicles).

3.3

Spatial and temporal correlation analysis

Traffic flow data is a sequence data with both temporal and spatial characteristics, its temporal characteristics show that the data of the later moment/cycle can be regarded as the continuation of the data of the previous moment/cycle in terms of characteristic attributes, while its spatial characteristics show that the traffic flow data will be interfered by the traffic status of adjacent lanes and upstream and downstream segments. Therefore, analyzing the correlation of traffic flow can help to extract the correlation characteristics of the data in depth, so as to carry out the subsequent measurement of traffic parameters.

3.3.1

Time correlation

1)

Time correlation of traffic volume

In this paper, the traffic volume is analyzed by using the data of 2023.12.04-2023.12.08 for inter-weekday correlation, and by using the data of 2023.12.04, 2023.12.11, 2023.12.18 for weekly period correlation. The traffic inter-weekday correlation coefficients are shown in Figure 3, and the traffic week-period correlations are shown in Figure 4. The correlation coefficients between weekdays and between week periods are all greater than 0.88, indicating the existence of very strong temporal correlation. Through the weekly period correlation analysis of the data of 12.04, 12.11 and 12.18 time nodes, it is found that the correlation between the two of them shows a bimodal normal distribution.

2)

Speed time correlation

In this paper, speed utilizes the data from 2023.12.04-2023.12.08 for correlation analysis between weekdays, Figure 5 shows the correlation coefficient between speed weekdays, the correlation coefficient between weekdays decreases with the estrangement of the time level, and the correlation coefficients between weekdays are all greater than 0.82, which indicates that there is an extremely strong correlation.

The data from 2023.12.04, 2023.12.11, and 2023.12.18 were selected for week-to-week correlation analysis. Figure 6 shows the weekly period correlation coefficients of speed. The weekly period correlation coefficients fall within the range of values [0.68, 0.84] indicating strong correlation.

3)

Time correlation of occupancy rate

The occupancy rate analysis in this paper uses the same time selection as the correlation analysis. After calculation, the correlation coefficient of the occupancy rate data for each working day and weekly period is greater than 0.78, indicating the existence of strong time correlation.

3.3.2

Spatial correlation

The data of GY highway 2023.12.08 was selected to carry out the study of spatial correlation of traffic volume, speed and occupancy. 1)

Spatial correlation of traffic volume

In this paper, the traffic volume analyzes the spatial correlation between lanes (L1~L4) using the data of section Section 1, and analyzes the spatial correlation between sections using the data of L1 lanes in 8 sections. The correlation coefficients between traffic volume lanes are shown in Figure 7, and the correlation coefficients between traffic volume sections are shown in Figure 8. The correlation coefficients of the traffic volume data of each lane and the traffic volume data of each cross-section are greater than 0.85, which is extremely strong correlation.

2)

Speed space correlation

Similarly, the correlation coefficients between speed lanes are shown in Fig. 9, and the correlation coefficients between speed sections are shown in Fig. 10. The correlation coefficients of speed data between lanes are all greater than 0.86, indicating that there is a very strong correlation between the lanes.The correlation coefficients of data between Section 1 and the remaining 7 sections are in the range of [0.46, 0.58], indicating that there is a general correlation, and the correlation coefficients of speed data between the remaining 7 sections are all greater than 0.6, indicating that there is a strong correlation.

3)

Spatial correlation of occupancy rate

After calculation, the correlation coefficient between each lane and section of the occupancy rate is greater than 0.87, indicating that the selected data have very strong spatial correlation.

4

Deep learning-based measurement of traffic parameters

Based on the above studies, traffic flow data have strong similarity in time and space. In this chapter, based on the consideration of the spatial relationship in which the study road section is located and the spatial and temporal correlation characteristics of the traffic flow, a traffic parameter measurement model is established based on graph convolutional neural network (GCN) and bi-directional long and short-term memory network (BiLSTM).

4.1

Graph Convolutional Neural Networks

As an extension of CNN on the graph domain, graph convolutional neural network (GCN) enables neural networks to directly process graph-structured data with arbitrary topology by redefining the convolution operation. As a result, GCNs can directly learn the representation of graph-structured data and are able to capture the intrinsic connections between nodes on the graph more efficiently, thus more comprehensively capturing the potential information within the graph-structured data.

4.1.1

Basic framework

The basic framework of a graph convolutional neural network consists of an input layer, a graph convolutional layer, and an output layer. 1)

Input layer

The input layer is used to receive a representation of the graph structure data, which usually consists of node information and adjacency matrix. The node information contains the feature vectors of each node, while the adjacency matrix contains the connection relationships between nodes and their respective connection strengths or weights.

2)

Graph Convolution Layer

The graph convolution layer is the core module of GCN, and its role is to utilize the adjacency matrix of the graph structure data to perform graph convolution operation, so as to realize the information propagation between nodes and feature extraction operation on the graph.

3)

Output Layer

The output layer is responsible for mapping the features extracted from the graph convolution layer into the appropriate output space, according to the specific needs of specific tasks. According to the different tasks, the structure of the output layer will be different. If the features extracted from the graph convolutional layer are used for the classification task, the output layer can be composed of a fully connected layer and a Softmax layer. Among them, the fully connected layer can combine the nonlinear activation function to realize further feature transformation and dimension adjustment of the features extracted from the graph convolutional layer to meet the category requirements of the classification task, while the Softmax layer is used to obtain the corresponding probability distributions of each category in the classification task, and then realize the specific classification task.

4.1.2

Principle of graph convolution

The Fourier transform is a classical method for achieving graph convolution. Specifically, the graph Fourier transform is used to perform feature extraction and frequency domain filtering operations on the graph signal. Before performing the graph Fourier transform, it is necessary to obtain the graph Laplacian matrix L corresponding to the target graph, and perform the singular value decomposition operation on L to obtain the eigenvector matrix U and the eigenvalue diagonal matrix Λ, which is calculated as follows: (8) $L = D - A$ $$L = D - A$$ (9) $L = U Λ U^{T}$ $$L = U\Lambda {U^T}$$

where $D \in ℝ^{N \times N}$ $$D \in {\mathbb{R}^{N \times N}}$$ is the diagonal matrix and $D_{i} i = \sum_{j} A_{i j}$ $${D_i}i = \sum\limits_j {{A_{ij}}}$$.

Using the eigenvector matrix U of L as the base of the graph Fourier transform, the graph Fourier transform operation of the graph signal f on the null domain can be defined as: (10) $f = U^{T} f$ $$f = {U^T}f$$

where f is the graph signal converted to the spectral domain after performing the graph Fourier transform.

The graph convolution operation is equivalent to applying a convolution filter to the graph signal f on the spectral domain with respect to the eigenvalues of the graph Laplacian matrix L, which is defined as follows: (11) $y = ω (L) f = ω (U Λ U^{T}) f = U ω (Λ) U^{T} f$ $$y = \omega (L)f = \omega (U\Lambda {U^T})f = U\omega (\Lambda ){U^T}f$$

where y denotes the output signal obtained from the graph convolution operation and ω(Λ) denotes the convolution filter function, which can be expressed as: (12) $ω (Λ) = d i a g (ω (λ_{0}), ω (λ_{1}), ..., ω (λ_{N - 1}))$ $$\omega (\Lambda ) = diag(\omega ({\lambda _0}),\omega ({\lambda _1}),...,\omega ({\lambda _{N - 1}}))$$

where λ_i denotes the ird eigenvalue in Λ.

Considering that the computational cost required to directly compute all the eigenvectors of the graph Laplacian matrix is too expensive, the filter function ω(Λ) is simplified by using the Kth order Chebyshev polynomials, which enables fast localization and achieves a reduction in computational overhead. The simplified function ω(Λ) can be expressed as: (13) $ω (Λ) = \sum_{k = 0}^{K} θ_{k} T_{k} (\tilde{Λ})$ $$\omega (\Lambda ) = \sum\limits_{k = 0}^K {{\theta _k}} {T_k}(\tilde \Lambda )$$

where $\tilde{Λ} = (2 I λ_{max}) Λ - I$ $$\tilde \Lambda = (2I{\lambda _{{\text{max}}}})\Lambda - I$$, λ_max denote the largest eigenvalues of the Laplace matrix L and I denotes the unit matrix. θ denotes the coefficients of the Chebyshev polynomials. T_k( · ) denotes the Chebyshev polynomial of the kth order, which is defined as follows: (14) ${\begin{array}{l} T_{0} (u) = 1 & T_{1} (u) = u \\ T_{k} (u) = 2 u & T_{k - 1} (u) - T_{k - 2} (u) \end{array}$ $$\left\{ {\begin{array}{l} {{T_0}(u) = 1}&{{T_1}(u) = u} \\ {{T_k}(u) = 2u}&{{T_{k - 1}}(u) - {T_{k - 2}}(u)} \end{array}} \right.$$

where u denotes the real independent variable.

By combining Eq. (11) and Eq. (14), the graph convolution operation can be simplified as: (15) $\begin{array}{rcl} y & = & U \sum_{k = 0}^{K} θ_{k} T_{k} (\tilde{Λ}) U^{T} f \\ = & \sum_{k = 0}^{K} θ_{k} T_{k} (\tilde{L}) f \end{array}$ $$\begin{array}{rcl} y &=& U\sum\limits_{k = 0}^K {{\theta _k}} {T_k}(\tilde \Lambda ){U^T}f \\ &=& \sum\limits_{k = 0}^K {{\theta _k}} {T_k}(\tilde L)f \\ \end{array}$$

where $\tilde{L} = (2 I λ_{\max}) L - I$ $$\tilde L = (2I{\lambda _{\max }})L - I$$.

Considering that the maximum eigenvalue of the graph Laplacian matrix is approximately equal to 2, i.e., λ_max ≈ 2, and that the symmetrically normalized graph Laplacian matrix L^xym is often used instead of the original L, this paper further simplifies the graph convolution operation and generalizes the inputs to the graph-structured data equipped with N nodes and each node has a feature dimension of D. Thus, the graph convolution operation used in this paper is shown below: (16) $\begin{array}{rcl} g r a p h_c o n v (X, A) & = & \sum_{k = 0}^{K} θ_{k} T_{k} (L^{s y m} - I) X \\ = & \sum_{k = 0}^{K} θ_{k}^{'} T_{k} (D^{- \frac{1}{2}} A D^{- \frac{1}{2}}) X \end{array}$ $$\begin{array}{rcl} graph\_conv(X,A) &=& \sum\limits_{k = 0}^K {{\theta _k}} {T_k}({L^{sym}} - I)X \\ &=& \sum\limits_{k = 0}^K {\theta _k^\prime } {T_k}({D^{ - \frac{1}{2}}}A{D^{ - \frac{1}{2}}})X \\ \end{array}$$

where graph_conv(·) denotes the graph convolution operation and θ′ = − θ.

4.2

Improved LSTM algorithm

4.2.1

LSTM

Long Short-Term Memory (LSTM) neural network is a special type of recurrent neural network that addresses the drawbacks of long term dependencies by introducing gate mechanisms and memory cells that allow to maintain long term dependent information and mitigate gradient vanishing and gradient explosion. Each LSTM node consists of at most a set of data flow units responsible for storage delivery, with an upper line in each unit connecting the model as a transportation line to pass data from the past to the present, and the independence of the units helps the model to handle filters that add values from one unit to another. Ultimately, the -shaped neural network layers that make up the gate drive the units to optimal values by processing or letting data pass through.

The LSTM expression is as follows, which describes the update process of the memory storage structure in the LSTM layer. Where, x is the input vector of the LSTM model, h is the output vector of the model, f, i, o denotes the activation values of the forgetting, input and output gates, respectively, C and $\tilde{C}$ $$\tilde C$$ denote the cell states and their candidate values, the subscript t denotes the time, σ, tach denotes the sigmoia and tach activation functions, respectively, and w and b denote the weights and the bias matrices, respectively.

In the first step, the LSTM layer decides which information will be discarded from the previous unit state C_t−1. Therefore, the activation value f_t of the forgetting gate at moment t is computed based on the inputs x_t at moment t, the outputs h_t−1 at moment (t − 1). As well as the deviation matrix of the forgetting gate b_f. The Sigmoid function norms all the activation values between 0 (all forgotten) and 1 (all remembered): (17) $f_{t} = σ (w_{f} [h_{t - 1}, x_{t}] + b_{f})$ $${f_t} = \sigma ({w_f}[{h_{t - 1}},{x_t}] + {b_f})$$

In the second step, the LSTM layer decides which information will be added to the cell state C_t. This step consists of two steps: first, computing ${\tilde{C}}_{t}$ $${\tilde C_t}$$, who may be added to the cell state, and second, computing the activation value of the input gate i_t: (18) $i_{t} = σ (w_{i} [h_{t - 1}, x_{t}] + b_{i})$ $${i_t} = \sigma ({w_i}[{h_{t - 1}},{x_t}] + {b_i})$$ (19) ${\tilde{C}}_{t} = t a c h (w_{C} [h_{t - 1}, x_{t}] + b_{C})$ $${\tilde C_t} = tach({w_C}[{h_{t - 1}},{x_t}] + {b_C})$$

In the third step, the new cell state C_t is computed: (20) $C_{t} = f_{t} C_{t - 1} + i_{t} {\tilde{C}}_{t}$ $${C_t} = {f_t}{C_{t - 1}} + {i_t}{\tilde C_t}$$

In the last step, the output value h_t is obtained by the following two equations: (21) $O_{t} = σ (w_{O} [h_{t - 1}, x_{t}] + b_{O})$ $${O_t} = \sigma ({w_O}[{h_{t - 1}},{x_t}] + {b_O})$$ (22) $h_{t} = O_{t} t a c h (C_{t})$ $${h_t} = {O_t}tach({C_t})$$

4.2.2

BiLSTM

In usual time series processing, LSTM tends to ignore future information. Bidirectional Long Short-Term Memory Neural Network (BiLSTM) uses two separate hidden layers on top of LSTM to process the sequence data in both forward and backward directions.Connecting the two hidden layers to the same output layer stores both previous and later information as the current time base of the time series data, so the Theoretically the prediction performance will be better than one-way LSTM.The hidden layer output of BiLSTM includes the activation output of the forward hidden layer and the activation output of the backward hidden layer.The BiLSTM expression is as follows, where σ is the activation function, H_t is the hidden layer output, and the outputs are generated by updating the forward structure ${\vec{h}}_{t}$ $${\vec h_t}$$ and the backward structure ${\overset{\leftarrow}{h}}_{t}$ $${\overleftarrow h_t}$$: (23) $\vec{h_{t}} = σ (W_{x \vec{h}} x_{t} + W_{\vec{h} \vec{h}} \vec{h_{t - 1}} + b_{\vec{h}})$ $$\overrightarrow {{h_t}} = \sigma ({W_{x\vec h}}{x_t} + {W_{\vec h\vec h}}\overrightarrow {{h_{t - 1}}} + {b_{\vec h}})$$ (24) $\overset{\leftarrow}{h_{t}} = σ (W_{x \overset{\leftarrow}{h}} x_{t} + W_{\overset{\leftarrow}{h} \overset{\leftarrow}{h}} \overset{\leftarrow}{h_{t - 1}} + b_{\bar{h}})$ $$\overleftarrow {{h_t}} = \sigma ({W_{x\overleftarrow{h} }}{x_t} + {W_{\overleftarrow{h} \overleftarrow{h} }}\overleftarrow {{h_{t - 1}}} + {b_{\bar h}})$$ (25) $H_{t} = W_{x \vec{h}} \vec{h} + W_{\vec{h} y} \overset{\leftarrow}{h_{t}} + b_{y}$ $${H_t} = {W_{x\vec h}}\vec h + {W_{\vec hy}}\overleftarrow {{h_t}} + {b_y}$$

In summary, LSTM deep neural networks can effectively take into account the temporal order of the input data, which can solve the problems of gradient recursion and gradient explosion that traditional RNNs are prone to.Bi LSTM deep neural networks are able to take future data into account on the basis of LSTM. However, due to the complexity of the deep neural network structure, its training often requires various optimization techniques to speed up the training process and solve the overfitting problem.

4.3

GCN-BiLSTM modeling

Graph Convolutional Neural Network (GCN) can extract the spatial characteristics of time-series data through feature learning, and BiLSTM can maximize the temporal characteristics of the data by mining the data from two directions. In order to make full use of the advantages of a single model and to grasp the potential spatio-temporal characteristics of the traffic flow data, this study combines the GCN and the BiLSTM for the measurement of traffic parameters. The station traffic flow observation data can be processed and merged into a matrix P containing spatio-temporal characteristics, which is expressed as follows: (26) $P = [\begin{matrix} P_{1} \\ P_{2} \\ ⋮ \\ P_{m} \end{matrix}] = [\begin{matrix} p_{1} (t - n) & p_{1} (t - n + 1) & \dots & p_{1} (t - 1) \\ p_{2} (t - n) & p_{2} (t - n + 1) & \dots & p_{2} (t - 1) \\ ⋮ & ⋮ & ⋮ \\ p_{m} (t - n) & p_{m} (t - n + 1) & \dots & p_{m} (t - 1) \end{matrix}]$ $$P = \left[ {\begin{array}{c} {{P_1}} \\ {{P_2}} \\ \vdots \\ {{P_m}} \end{array}} \right] = \left[ {\begin{array}{c} {{p_1}(t - n)}&{{p_1}(t - n + 1)}& \cdots &{{p_1}(t - 1)} \\ {{p_2}(t - n)}&{{p_2}(t - n + 1)}& \cdots &{{p_2}(t - 1)} \\ \vdots & \vdots &{}& \vdots \\ {{p_m}(t - n)}&{{p_m}(t - n + 1)}& \cdots &{{p_m}(t - 1)} \end{array}} \right]$$

Where n denotes the historical time span and m denotes the amount of sample data.

The hybrid model system proposed in the paper has input layer, GCN layer, BiLSTM layer, connectivity layer and output layer, where minimizing the mean square error of predicted and observed values of traffic parameters is the objective function of the algorithm of the model. The layers in the model are now described. 1)

Input layer. The traffic flow data measured at the observation point is preprocessed as the input of the combination model.

2)

Graph Convolution Layer: The GCN layer extracts the spatial features of the time series to maximize the retention of the spatial change pattern of the traffic flow data.

3)

Long and short-term network layer. Utilizing the advantage of the long and short-term memory network model for time series, we mined the data from two directions to maximize access to the temporal characteristics of the data.

4)

Connection layer. The output of the aforementioned layer is used as the input of the connection, and all the extracted information features are fused through this layer to obtain the traffic flow prediction results.

5)

Output layer. Output the prediction results.

Where the detection sites in a road section are described as G = (V, E), V represents the number of detection points, 4 detection points per section and 12 sections in total, i.e., $V = {v_{i}}_{1}^{48}$ $$V = \left\{ {{v_i}} \right\}_1^{48}$$. The relationship between neighboring detection points can be described as an interaction, i.e., A ∈ R^{48 × 48}, A_ij represents the degree of influence between two points.

In this research, the construction and training steps of the combined model are as follows:

Step1: Clean the raw data obtained from the processing and repair the identified abnormal data at the same time, and then normalize the data operation so that the data is between [0,1]. Combined with the amount of sample data and target requirements, the data dimension setting is carried out.

Step2: Split the data into windows, determine the ratio of training set and test set as 8:2, set the number of training iterations as 100, set the batch size as 64, set Dropout as 0.5, and set the step size as 4.

Step3: Model training, parameter tuning, and determine the termination conditions. Determine the optimal number of convolutional layers through continuous experimentation to achieve the optimal effect.

Step4: Use the trained model to make predictions, get the prediction results and output the model evaluation index.

The prediction results of traffic parameters are directly related to the selection and determination of key parameters in the model. The parameters that need to be tuned during the construction of the model include the number of iterations, the batch size, the number of neural network layers, etc. After experimental testing, it is found that the prediction effect of the model reaches the best when the number of bi-directional LSTM layers is set to 2, and the number of iterations at this time is 80.

4.4

Experimental results and analysis

4.4.1

Experimental setup

The experimental software environment of this paper is completed using the MATLAB2017b platform. The experimental data is selected from the coil and microwave detection data of the GY expressway in a city, which is the same source of experimental data as in the previous chapter, including flow, speed, and occupancy. Parameters of model input and output: the experiment is for traffic flow prediction to study the application of input traffic speed and occupancy, and the output is traffic flow.

In this paper, GCN-LSTM, GCN and LSTM algorithms are selected for research and analysis to illustrate the advantages of the GCN-BiLSTM traffic parameter measurement model, and the four methods are used to conduct comparative experiments to verify the superiority of the GCN-BiLSTM model.

4.4.2

Experimental results

The traffic flow is predicted for the whole day on weekdays by observing the changes in data and the actual traffic state. The data collection time is from 6:00 to 22:00 for 25 consecutive days, with a 5-minute interval between the collection time, the data collected on the first 20 days of the road section is used as a learning sample, and the data on the 5th day of the road section is used as a testing sample. Traffic flow parameters based on the GCN-BiLSTM model, the comparison algorithms are GCN-LSTM, GCN and LSTM three baseline models, respectively, GCN-BiLSTM, GCN-LSTM, GCN and LSTM model of the traffic flow prediction effect is shown in Fig. 11 to Fig. 14. It can be seen that the GCN-BiLSTM model in this paper has the closest predicted value curve and true value curve for the traffic flow parameters, followed by the GCN-LSTM model, and the predicted and true value errors of the GCN and LSTM models are relatively large.

By outputting the comparison graphs of data points and also analyzing and comparing the error as well as accuracy of the output results of the two groups of models at the same time, the three evaluation indexes selected are:

MAE indicates the mean absolute error, which is the average of the absolute error between the true value and the predicted value.

MAXRE is the maximum relative error, which is the magnitude of the deviation between the predicted value and the actual value.

MSE indicates the mean square error, which is a measure of the deviation between the predicted value and the observed value.

The results of the comparison of traffic flow prediction error results are shown in Figure 15. The average absolute error, mean square error and maximum relative error of the GCN-BiLSTM model among the methods for performing traffic flow parameter prediction are superior to the prediction error results of the GCN-LSTM, GCN and LSTM models, and the error results are 1.027, 1.606 and 0.511, respectively, which also proves that the overall prediction effect of the GCN-BiLSTM is closer to the true value, through the error illustrates the availability and superiority of the GCN-BiLSTM model in the field of traffic parameter measurement.

5

Conclusion

This topic is based on the intelligent transportation system, which uses deep learning algorithms to study the measurement methods of traffic parameters. The traffic data of GY expressway in a city is collected as the research object, the traffic flow characterization parameters and spatio-temporal correlation are analyzed, and the traffic flow prediction model based on GCN-BiLSTM is proposed by using graph convolutional neural network and bidirectional long and short-term memory neural network, and the performance is analyzed through experiments. The main conclusions of the study are as follows: 1)

Using the Pearson phase relation to quantify the correlation degree of traffic flow between weekdays, between weeks, between lanes and between cross-sections, it is found that each characterization parameter of traffic flow has similarity in time and space distribution and belongs to a strong correlation relationship with each other, and most of the results of correlation coefficient of each characterization parameter are above 0.7.

2)

The prediction results of the GCN-BiLSTM model on traffic flow in this paper are all better than those of the GCN-LSTM, GCN and LSTM models, and their predicted value curves are almost the same as the real value curves, and the error results of MSE, MAE and MAXRE are all less than 1.7, which indicates that the model has excellent accuracy in measuring traffic parameters.

Intelligent transportation systems can significantly improve the efficiency and accuracy of traffic management, which is of great significance to the development of urban transportation. By designing and implementing an efficient traffic parameter measurement method, this paper can help the intelligent transportation system to enhance the functions of real-time monitoring and safety management, and then optimize the urban traffic flow, reduce traffic congestion, improve road safety, and improve the user travel experience.

Idioma:: Inglés

Calendario de la edición:: 1 veces al año
Temas de la revista:: Ciencias de la vida, Ciencias de la vida, otros, Matemáticas, Matemáticas aplicadas, Matemáticas generales, Física, Física, otros

RSS Feed de revista

Research on Traffic Parameter Measurement Methods for Intelligent Transportation Systems

Chunhong He

Bin Ren

Publicado en línea: 24 mar 2025

Recibido: 07 nov 2024

Aceptado: 12 feb 2025

DOI: https://doi.org/10.2478/amns-2025-0770

Palabras claveGCN, BiLSTM, Correlation analysis, Parameter measurement, Intelligent transportation system

© 2025 Chunhong He et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Palabras clave
GCN, BiLSTM, Correlation analysis, Parameter measurement, Intelligent transportation system