Research on target detection and tracking algorithms in real-time video streaming
, , y
29 sept 2025
Acerca de este artículo
Publicado en línea: 29 sept 2025
Recibido: 27 ene 2025
Aceptado: 09 may 2025
DOI: https://doi.org/10.2478/amns-2025-1130
Palabras clave
© 2025 Wenjing Yu, Daitao Wang, Hongjun Qiu and Jianbin Zhong, published by Sciendo.
This work is licensed under the Creative Commons Attribution 4.0 International License.
Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7.

Figure 8.

Figure 9.

Figure 10.

Figure 11.

Comparison of improved inter-frame difference method on UAV123 dataset
Detection method | Precision% | MAP% | Recall% | FPS (f/s) | |||
---|---|---|---|---|---|---|---|
Car | Bus | Van | Others | ||||
Interframe difference method | 86.09 | 86.01 | 84.37 | 84.87 | 85.34 | 84.68 | 24 |
Recluster the anchor box only | 87.27 | 88.98 | 86.36 | 85.54 | 87.04 | 86.53 | 29 |
Only after improving the extraction network | 89.56 | 87.88 | 88.36 | 86.41 | 88.05 | 86.41 | 25 |
Only after improving the loss function | 90.15 | 87.54 | 91.29 | 87.09 | 89.02 | 87.96 | 30 |
Improved interframe difference method+Mean shift | 96.62 | 94.28 | 94.68 | 93.22 | 94.70 | 89.92 | 32 |
Comparison of results of different target detection algorithms
Detection method | Precision% | MAP% | Recall% | FPS (f/s) | Parameter quantity/M | GFLOPs/G | |||
---|---|---|---|---|---|---|---|---|---|
Car | Bus | Van | Others | ||||||
Faster-RCNN | 89.63 | 88.36 | 90.88 | 88.21 | 89.27 | 85.96 | 11 | 51.1 | 124.1 |
SSD-512 | 87.84 | 86.43 | 85.64 | 87.7 | 86.90 | 84.24 | 15 | 47.6 | 129.7 |
YOLOv3 | 84.36 | 85.06 | 85.61 | 86.07 | 85.28 | 83.32 | 8 | 48.7 | 109.7 |
YOLOx | 85.97 | 86.81 | 86.72 | 85.10 | 86.15 | 82.56 | 21 | 38.1 | 114.28 |
YOLOv7 | 87.18 | 88.58 | 87.93 | 89.52 | 88.30 | 86.36 | 23 | 49.8 | 117.3 |
OURS | 96.62 | 94.28 | 94.68 | 93.22 | 94.70 | 89.92 | 32 | 37.4 | 112.9 |
MOT16 comparison experiment
Method | Accuracy | Precision | Recall | MT | ML | MOTA | IDF1 | IDSW |
---|---|---|---|---|---|---|---|---|
DeepSORT2 | 95.37% | 93.72% | 94.54% | 35.21% | 18.18% | 62.53 | 61.20 | 894 |
JDE | 94.15% | 93.89% | 94.02% | 35.72% | 17.21% | 63.39 | 67.36 | 1022 |
RAR16 | 87.14% | 85.18% | 86.15% | 33.76% | 18.62% | 61.22 | 64.33 | 560 |
TAP | 92.14% | 90.73% | 91.43% | 37.84% | 19.97% | 65.50 | 67.31 | 839 |
CNNMTT | 94.59% | 93.41% | 94.00% | 37.59% | 20.74% | 68.06 | 71.49 | 1106 |
POI | 94.48% | 94.12% | 94.30% | 38.74% | 21.42% | 63.42 | 70.86 | 699 |
TubeTK | 95.39% | 95.37% | 95.11% | 39.20% | 17.48% | 69.47 | 71.39 | 1185 |
CTracker | 96.18% | 96.18% | 96.37% | 40.18% | 17.84% | 71.30 | 72.05 | 1244 |
OURS | 97.57% | 98.82% | 98.19% | 43.86% | 16.62% | 75.76 | 73.37 | 1376 |
Cloud server configuration
Module | Model/Version |
---|---|
Internal memory | 4 Cores |
Graphics card | 32GB |
Hard disk | Tesla V108 |
Operating system | Ubuntu |
Python | 3.8.15 |
CUDA | 11.8 |
CuDnn | 8.2.0 |
MOT17 comparison experiment
Method | Accuracy | Precision | Recall | MT | ML | MOTA | IDF1 | IDSW | FP | FN |
---|---|---|---|---|---|---|---|---|---|---|
DeepSORT2 | 96.76% | 94.62% | 95.43% | 38.34% | 17.45% | 64.78 | 63.56 | 1534 | 22746 | 143976 |
JDE | 95.55% | 94.65% | 93.49% | 39.03% | 16.34% | 65.43 | 68.40 | 1735 | 22420 | 193791 |
RAR16 | 90.56% | 87.25% | 88.43% | 36.51% | 17.35% | 62.90 | 64.78 | 1575 | 19527 | 135461 |
TAP | 94.50% | 92.07% | 93.01% | 38.52% | 17.43% | 66.34 | 68.50 | 1903 | 18811 | 183747 |
CNNMTT | 95.43% | 94.20% | 93.94% | 39.07% | 19.34% | 68.96 | 75.63 | 2154 | 15345 | 184952 |
POI | 95.32% | 95.38% | 95.30% | 40.39% | 21.07% | 65.04 | 72.11 | 1865 | 22966 | 167528 |
TubeTK | 96.03% | 96.12% | 96.09% | 43.56% | 16.40% | 71.33 | 73.75 | 2287 | 16778 | 168269 |
CTracker | 96.77% | 96.84% | 97.14% | 45.43% | 15.45% | 73.986 | 72.94 | 2335 | 15549 | 149950 |
OURS | 98.91% | 99.43% | 99.52% | 49.45% | 14.34% | 80.61 | 75.37 | 3408 | 10184 | 129203 |