Research on Classification of College Students’ Physical Fitness Test Scores Based on Neural Network
Data publikacji: 27 lut 2025
Otrzymano: 08 paź 2024
Przyjęty: 08 sty 2025
DOI: https://doi.org/10.2478/amns-2025-0098
Słowa kluczowe
© 2025 Longyun Ren, published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.
The advent of COVID-19 in 2019 has increased our awareness of how important our immunity is. Building up your immune system by exercising is the only effective method of preventing germs and viruses. This new pandemic is a wake-up call for all of us, particularly for our current university students. A healthy body is a foundation for university students to learn and live and an important element to guarantee their comprehensive development. University students are our nation’s future support; their sound development is vital to our nation’s future. The Ministry of Education has asked universities to conduct health examinations every year and present the results to understand their health. Physical examination is a comprehensive evaluation of students’ physical condition in terms of body shape, quality, function and sports capability. Therefore, it is an efficient education instrument for improving students’ health and encouraging them to participate in sports activities[1].
Nowadays, our society has many challenges and competitive situations, so there is a higher demand for talented persons’ fundamental qualities. Higher learning institutions are confronted with serious challenges as a major basis for cultivating high-quality personnel. University P. E. is important in developing university students’ interest in athletics and encouraging them to participate in life’s activities. Nowadays, physical activity has been one of the most important ways to enhance one’s health. Therefore, it is good for everyone’s health to be healthy. Sports can strengthen the body, improve mental quality, and make the person’s muscles more powerful[2].
Over the years, higher education institutions have attached great importance to the intelligence development of undergraduates and neglected their body situation. Along with the rise in the level of people’s daily life, the common life of university students has been enriched, which leads to a shortage of sports activities and a decrease in the body quality of some students. Bad health will influence students’ learning and living, even influencing their future job and social position. Based on the physical examination data, it is possible to observe the real body quality of university students[3]. Every year, the physical examination can let the students realize their physical changes, the significance of sports, and encourage them to enhance their health. We can concentrate on our everyday learning and work only if we have good health. All in all, physical training is essential.
University students actively participate in sports to become more powerful. P. E. is one of the most important approaches to enhance university students’ body quality. The P. E. examination is one of the most important means for teachers to know their body quality. Therefore, it helps to make a scientific and rational curriculum and set up an efficient cultivation system. For this reason, this article has carried on the analysis to the P. E. Students’ P. E. Teachers’ P. E. We can make various training programs for the students of various sports to enhance their physical capabilities[4].
Traditionally, PE teachers use the National Student’s Physical Health Standards to assess their measured values. Then, they can make an appropriate addition or subtraction of marks based on their average performance. Because there are a lot of factors that influence the performance of players, the calculating procedure of overall points is complex, which results in poor performance forecasts. The entire computation procedure is manually operated, and achieving a strict uniform evaluation criterion for several years has not been possible. It is hard to see if there has been any improvement in the performance of the pupils when they are measured by the same pupils for several years.
Along with the development of AI theory, many university P. E. teachers and P. S. researchers have proposed many auto-forecasting models for university students. Then, the related parameters were calculated using the university students’ history information, and it was applied to forecast the overall performance of the university students. So, it is very important to establish a high-precision athletic ability evaluation model for predicting performance. For rational evaluation of university students’ body quality and increased forecast efficiency and standard computing level, an integrated evaluation model is proposed for forecasting university athletic examination results. The forecast of the experiment result by this method can decrease the computation by hand and save a lot of time, and it is consistent with the forecast criterion of last year.
Physical fitness test data can reflect the basic physical information of contemporary college students, such as body shape and body function. This test can help teachers accurately understand the physical weaknesses of students in each school or even in each region, adjust the physical education curriculum reasonably according to the actual physical fitness of students, and help students improve their physical fitness and immunity. The test is conducted once a year, and the results of the test can make students deeply feel the changes in their physical fitness to encourage them to increase physical exercise reasonably. We can improve our health by participating in sports and developing the good habit of regular exercise. Through physical fitness tests, college students can deeply understand the importance of physical exercise. Only with a strong body can we better devote ourselves to the future work.
This paper will analyze the data obtained from the teacher’s unassessed physical fitness test and the final comprehensive score of the student assessed by the teacher. According to the National Students’ Physical Health Standards, the total score given by physical education teachers is divided into four grades, including excellent (more than 90 points), good (80 to 89.9 points), pass (60 to 79.9 points) and fail (less than 60 points). Each year, the proportion of other groups is calculated to observe the distribution of students’ overall physical fitness, as shown in the table below.
The proportion of students in each category from 2016 to 2019
Year | Excellent category | Good class | Passing category | Fail class |
---|---|---|---|---|
2019 | 0.85% | 15.82% | 74.89% | 8.44% |
2018 | 1.03% | 16.73% | 72.26% | 9.98% |
2017 | 1.34% | 20.29% | 67.76 | 10.61% |
2016 | 0.99% | 19.71% | 69.17% | 10.13% |
This article discusses the whole achievement level, the relation between 8 measurements and total marks, and the mean value of 8 indexes in each kind of integrated grade. The data set’s most recent year was analyzed visually with a random sample of pupils in every class. Then, the relative factors of all measuring indicators were computed, and the effects of different test items on synthetic marks were observed, as illustrated in the following chart.
The correlation between attributes and synthesis score (2019)
Height | Weight | Vital capacity | 50m Run | Standing long jump | Sitting forward bending | Endurance Project | Power project | |
---|---|---|---|---|---|---|---|---|
Consolidated results | — | — | -0.0274 | -0.2078 | 0.1579 | 0.3992 | -0.6848 | 0.4328 |
Sort | — | — | 6 | 4 | 5 | 3 | 1 | 2 |
Data visualization is the main way of communicating the message using a graphical approach. It is essentially a visual conversation. Based on the data visual analysis of the sample, it is possible to see the real state of the students’ bodies. In 2019, as illustrated in Figure (a), SLJ, Sitting Front Curve (SR), Stamina (SP) and Strength (PP) were all higher than the mean values of “excellent.” The boy’s physique was excellent, and he was more flexible in sitting posture (SR). Based on the three years of examination results, we can see that PP is improved while the rest of the subjects are stable. This indicates that the students should enhance their physical fitness. In Figure (b), the student’s sitting front curve (SR) test results were significantly lower than the average for their total performance group and did not increase over 3 years. This student has a marked lack of physical agility. Nevertheless, his Endurance (SP) and Strength (PP) scores were outstanding since they were important in determining how well they performed.
In Figure (c), the student’s physical indicators are lower, and the area enclosed by the curve is significantly reduced compared with the first two students. The student’s 2019 sitting Forward Bend (SR) and Strength Items (PP) scores were well below the average for the “pass” category. Comparing the data of the past three years, we see a significant improvement in the standing long jump (SLJ). 50m running (S) and endurance (SP) did not change over the three years, but the weight (W), as well as sitting forward bend (SR) and PP performance, decreased and fluctuated greatly. All physical indicators decreased after the student entered the school, and the overall score decreased. Figure (d) shows that the student’s test scores in recent years have been very erratic, with most of the items tested scoring well below the average for the various attributes of the “failing category” but weight (W) significantly above the average. Excess body weight can lead to a decline in various physical fitness indicators and, in severe cases, can lead to various diseases. Students should have a clear understanding of their physical condition to improve.
According to the results of the visual analysis, different training suggestions will be given to students with outstanding performance characteristics according to their actual physical condition. “Students with excellent test scores”: These students can undergo intensive physical training to ensure training safety. High-intensity training methods can strengthen the muscles in the main parts of the body, which will continue to stimulate the body’s potential. These students have excellent physical indicators and can be selected as athletes to represent the school in various sports competitions.

Four years of data of selected men
Along with the social development, people’s study area is getting more complicated. Using one property to describe something in detail is hard, and you need to think about it as a whole. So, the multiindicator assessment method emerged. A multi-indicator assessment is a more comprehensive and detailed description of the object using several properties. Multiple indicators have various properties, and there is a great difference in size and degree between them. If the dimensions of different properties are too big, and the former property is analyzed, it will greatly affect the result and reduce the effectiveness of the other properties. So, we need to regulate the properties of raw data to guarantee analytical results’ reliability.
There are a lot of ways to standardize data. One of the more common approaches is the standardization of the data, which is the uniform distribution of the data into the interval [0,1]. Commonly used standard approaches are Min - Max, Fuzzy Quantizing, Z Score, and Logarithm Function Transformation. Using Z - Z-score normalization, the measuring results of 8 QMS are normalized so that the cell limit and dimensional relation are removed, and the forecast time is shortened. Normalization of the Z Score is a way to standardize the data by using raw data’s average and standard deviation. Following is the standard calculation formula. The standardized approach can transform the raw data to a normal distribution with a median value of 0 and a variance of 1.
In the above formula,
In everyday learning, we may see some data sets listed before us. These data sets may have a particular association, which may be negative or no association. Therefore, we need a tool to measure correlations and analyze several data sets. That tool is data correlation analysis. Correlation between data refers to a specific relationship between data input attributes. Nowadays, in the Large Data Age, data relevance analysis can find inner relationships among objects rapidly and effectively. Relativity means a relation exists among two or more variables whose aim is to discover the hidden information within a dataset. Correlation analysis plays a very important role in data reduction and outlier repair. It is the core technology of data preprocessing. Without correlation analysis, the information expressed between the data may overlap to a certain extent, seriously affecting the accuracy of subsequent modeling and prediction.
The correlation coefficient can describe the relation among the variables. The sign of the correlative factor shows that the relation is positive or negative, and it shows the intensity of the relation. It has no correlation when it is 0, while it has a complete correlation when it is 1. The Pearson Correlation Coefficient, Spearman’s Correlation Factor, Part Correlation Factor, Kendall Correlation Factor, etc. The Pearson Relativity Factor was applied to compute the correlative degree of every property. Because the original data is normalized, the relationship among properties is not altered[5]. Therefore, the normalized data can compute the relationship among the properties. The method for calculating the correlation coefficient is given in the following equation:
Take the data from the last year of 2019 in the dataset as an example, calculate the correlation between each attribute, and show the results in Table 3 below.
The correlation between attributes
Height | Weight | Vital capacity | 50m run | Standing long jump | Sitting forward bending | Endurance Project | Power project | |
---|---|---|---|---|---|---|---|---|
1.0000 | 0.6106 | 0.6631 | -0.5396 | 0.5976 | -0.2374 | 0.1305 | -0.6750 | |
1.0000 | 0.5442 | -0.2694 | 0.2901 | -0.1876 | 0.3332 | -0.5438 | ||
0.5442 | 1.0000 | -0.5372 | 0.5737 | -0.1440 | 0.1314 | -0.6386 | ||
-0.2694 | -0.5372 | 1.0000 | -0.7478 | 0.1766 | 0.0790 | 0.6322 | ||
0.2901 | 0.5737 | -0.7478 | 1.0000 | -0.1497 | -0.0492 | -0.6619 | ||
-0.1876 | -0.1440 | 0.1766 | -0.1497 | 1.0000 | -0.1230 | 0.3287 | ||
0.3332 | 0.1314 | 0.0790 | -0.0492 | -0.1230 | 1.0000 | -0.2490 | ||
-0.5438 | -0.6386 | 0.6322 | -0.6619 | 0.3287 | -0.2490 | 1.0000 |
As shown in the table above, there is a strong correlation between the items measured in the fitness test results. The 50-meter running and standing long jump showed a negative correlation with a correlation coefficient 0.7478[6-7]. Because the 50-meter race data is time data, there is a negative correlation between time data and the score; the longer the time, the lower the score. A positive correlation exists between height, weight and lung capacity, and a strong negative correlation between height and strength events. Taller people require more force to measure than others, affecting the work speed. Since there is a strong correlation between attributes, this leads to redundant information affecting the accuracy of the model. If the correlation between the input data is too strong, the weights in the network connected to the input neurons play a similar role. The data correlation is too strong, the weight relationship trained in the network is not portable, and the model cannot be applied to other years’ data. So, PCA should be adopted to convert the raw data and remove the strong relativity before NN training.
A major problem with data handling is the complexity of the data. PCA is one of the most popular methods for minimizing the size of datasets. It can expose and simplify complicated relations among different variables[8]. Some primary variables with strong relativity are transformed into independent ones by means of coordinate conversion, and the computed independent variables are the main parts. PCA aims to substitute many relevant variables for a less significant number of non-relevant variables and retain as much raw data as possible. The main part is a linear combination of the initial variables, illustrated in the following diagram[9].

Principal component analysis (PCA) model
The mathematical model expression is shown in the following formula.
After the model is changed into a matrix form, the following formula is shown:
See the following formula:
At this time, the following formula can be obtained:
Part of a nerve net [10-12]. The model uses an error-back learning method based on multiple layers of nonlinear feed-forward networks. The network consists of three levels: the entry-level, concealed, and output. The BP study method consists of the error signal’s forward and backward transmission.
Connections among neurons create a connectivity model among nerve networks randomly allocated to each link by a computer. The forward transmission phase means that the raw data signal is transmitted via a hidden level from an input to an output level. In other words, the output of a preceding node serves as an input for the next one. The following diagram illustrates the fundamental architecture of BP Neural Network’s forward propagation[13-15].

Basic structure of forward propagation stage BP neural network
Each neuron cell
It is possible to get a better forecast precision during the forecast by using the activation function. Activation functions have various types, such as Sigmoid, Hyperbola, and ReLU. The Sigmoidogram function is utilized to activate the output message. The activation function activates the input layer’s output value as
Then, it takes the hidden level data as the input level and transmits it to the output level. The output value
The following formula activates the output value to obtain the final output layer data.
Based on the 2016 Physique Test Data, 80% of the Student Sample is taken as the Training Set, and the other 20% is the Model Assessment Kit. There are differences in the standards and grading methods between boys and girls. For this reason, male and female test data are divided into two groups, each constructed to predict the results. Based on 80 percent of the training data, continuous adjustment and optimization were carried out, and the results were presented in the table[16].
The parameters of the model
Parameter name | Parameter value |
---|---|
Number of Input Layer Neurons | 8 |
Number of Output Layer Neurons | 1 |
Number of Neurons in the Hidden Layer | 11 |
Threshold | 0.005 |
Learning rate | 0.1 |
Maximum Iteration Count | 1.0e10 |
Training algorithm | rprop+ |
Fault function | sse |
Activation function | logistic |
The threshold is taken as a condition to terminate the training, as illustrated in the chart above, and it is expressed as a default in the error function. If it is impossible to achieve a predefined value, then the maximal iteration count will force the training to halt when it is impossible to stop the iteration. The arithmetic of “prop +” is based on the weight of the error backpropagation method and the BP NN[17-18]. Then, an error function “sse” is applied to compute the value of the error at the termination of the forward transmission. The activation function uses the parameter “logistic” as the Sigmoid activation function. The Neural Count of the Hidden Level is calculated based on the Average Square Error (MSE) and the Equation below.
Based on the PCA, the ANN model is built for 80% of the data input model. To increase the precision of the model, a forecast model for boys and girls was built, and the experiment results were assessed[19-20]. Experimental data not involved in the model’s training were entered into the model. At last, the forecast effect of male and female was combined with that of male and female, and the forecast capability of the entire model was observed. Based on a random sample of 40 participants from the 2016 dataset, the following graph illustrates the discrepancy between the results obtained from those 40 participants and those of the model[21-23].
The following row diagram compares the predictions with the results obtained from a sample of 40 random pupils. The solid line indicates the real value, while the dotted line indicates the prediction. The results indicate that both lines are highly coincident with each other, and only a small number of samples exhibit remarkable errors. Experimental results indicate that the forecasting model of QMS has high accuracy and excellent performance. The MSE (MSE) is 1.361713.

Comparison of predicted data with actual data samples

Error value frequency distribution
The absolute error is in the range of [-1, 1], which takes up approximately two-thirds of the data set. The highest error rate is about 0. The error rate is so low that it is below 0. 01 when the absolute error is larger than 3. In this diagram, the dotted line is the density profile of the error distribution, which is close to that of the normal distribution. Visual results indicate that the forecast capability of this model is quite good, which will help forecast the synthetic performance of the PE test.
The absolute error absolute value of the 20% test set in 2016 and the data prediction result in 2019 is binned into six intervals, which are [0,1), [1,2), [2,3), [3,4), [4,5), [5, ∞), their distribution is shown in the following table.
Percent of absolute error distribution predicted by the model established 2016 data
[0, 1) | [1, 2), | [2, 3) | [3, 4) | [4, 5) | [5, ∞) | |
---|---|---|---|---|---|---|
2016 | 64.38% | 28.57% | 6.27% | 0.7% | 0.04% | 0.02% |
2019 | 40.84% | 27.16% | 15.71% | 8.06% | 4.06% | 4.17% |
As can be seen from the chart above, the 2016 prediction was excellent, with an absolute error below 2 for 92.95 percent of the figures and a mere 0.06 percent higher than 4. This shows that this model has a good prediction ability. When the models created by the 2016 scoring criteria were applied to 2019, the predictive accuracy of the models declined. The percent absolute error rate decreased by 23.54 percent on a 0-to-1 scale, but the total predicted outcome remains significant. The standard for 2016 should be suitable for 2019 if the standard is strict, and the forecast precision of the model will be similar to that of 2016. A significant reduction in the forecast impact suggests that manual calculations have resulted in differences in the 2016 and 2019 scoring criteria.
Based on 2016 data, this model is based on 2016, and the weight of each model is based on 2016. When the 2016 rating scale was used to predict the 2019 total score, the results showed a difference in the rating scale. Because the teacher’s hand is involved in the computation, the standard of the full marks is not unified. Because of the differences in the past classification criteria, the full score cannot be used to describe the variation in the fundamental body quality of the students. It’s hard to tell if a student’s body quality has increased or declined with time. Not making the best use of data for many years. Based on this model, we can find a stable relationship between project data and overall score and ensure the model can be studied. That’s a great method to eliminate all of the complex conventional fitness tests, and it can also save you the time needed to calculate the overall points. Based on the uniform classification criterion, we can get a clearer view of the variation in students’ body quality by using the Radar Chart. Thus, the model’s prediction of fitness test results is essential.
In this article, we make a classification based on synthetic and measurement marks and deeply analyze the fundamental constitution of every class. By analyzing the findings visually, we can get a better understanding of the reality of every student. It also offers a variety of training programs to assist the P. E. teachers in developing more rational teaching programs.
In addition, a forecast model for synthetic performance is proposed, and BP Neural Network and PCA are applied to forecast the synthetic performance. Using a performance forecasting model to forecast the total performance can save time for computing and resolve the issue of inconsistent evaluation standards due to human computation for many years. Compared with the conventional calculating method, measuring the body’s quality is more valuable. As the points have not changed from one year to another, the results of their yearly examinations can be compared. Comparing can show the improvement of the student’s body quality. If there are no obvious changes, we can adjust the training program to assist the teachers in formulating their teaching programs.