Exploration of machine learning based on big data in sports models and physical education teaching
Abstract
Under the background of big data-driven education digitization, physical education in colleges and universities is facing the problems of data isolation (< 35% compatibility rate of multi-source devices) and training homogenization (only a 28% coverage rate of personalized programs). In this study, we integrated six existing data specifications, including the Student Physical Fitness Standard, and designed a standardized data collection scheme for physical fitness monitoring: (1) We developed a data meta-standard that included 12 types of wearable devices (heart rate/step rate/oxygen uptake) and three types of training scenarios (classroom/outside classroom/competition) and unified the collection protocols for the 23 core metrics; (2) We adopted a machine-learning preprocessing method K-Nearest Neighbors (KNN) filling of missing values + Isolation Forest detection of outliers) to make the data standardized. (3) Using machine learning preprocessing (missing value KNN filling + anomaly detection by Isolation Forest), the data completeness rate is increased from 61% to 89%; (4) Constructing a comprehensive physical fitness scoring model (XGBoost algorithm, with 8 basic physical fitness and 5 dynamic response indicators) and combining it with the reinforcement learning recommender engine, generating a hierarchical teaching plan (with 3 levels of difficulty adaptation and 5 types of sports intervention templates) for teachers. Empirical evidence shows that the program improves the efficiency of teachers’ lesson planning by 46.42% and increases the matching degree of students’ personalized training by 3.2 times. Compared with traditional empirical teaching, the standardized data-based intelligent model reduces the error rate of physical fitness assessment from 19.7% to 8.3% and provides teachers with a three-in-one tool: class heat map—individual warning list—lesson plan knowledge base, which pushes the physical education teaching from “experience-driven” to “data-driven”. This study provides a reusable methodological framework for the digital transformation of college sports.
References
1. Chen M, Mao S, Liu Y. Big data: A survey. Mobile Networks and Applications. 2014; 19(2): 42–52.
2. Hopkins MS. Big data, analytics and the path from insights to value. MIT Sloan Management Review. 2011; 52(2): 21–22.
3. Wigan MR, Clarke R. Big Data. IEEE Computer Society Press. 2013; 43(4): 452–514.
4. Xu L. Study on multimedia teaching and the Innovate of college PE: Evaluation of teaching effect. International Journal of Security and its Applications. 2016; 10(2):255–266.
5. Assuncao MD, Calheiros RN, Bianchi S. Big Data calculate and Clouds: Challenges, Solutions, and Future Directions. Journal of Parallel and Distributed calculate. 2013; 79–80(3): 3–15.
6. Lohr S. The age of Big Data. New York Times. 2012; 11(4): 4–62.
7. Li G, Cheng X. Research Status and Scientific Thinking of Big Data. Bulletin of Chinese Academy of Sciences. 2012; 4(4): 432–622.
8. Meng X, Ci X. Big Data Management: Concepts, Techniques and Challenges. Journal of Computer Research and Development. 2013; 50(1): 146–169.
9. George G, Haas MR, Pentland A. Big data and management[J]. Academy of management Journal, 2014, 57(2): 321-326.
10. Fan W, Bifet A. Mining big data: Current status, and forecast to the future. ACM SIGKDD Explorations Newsletter. 2014; 16(1): 1–5.
11. Chen Y, Alspaugh S, Katz R. Interactive analytical processing in big data systematics. Proceedings of the VLDB Endowment. 2012; 5(4): 54–131.
12. Guo W. Exploration and Empirical Analysis of the Teaching Ability Evaluation Model of Physical Education Teachers in Higher Vocational Colleges Under the Background of Big Data with Integrated Collaborative CAD. Computer-Aided Design and Applications. Published online September 4, 2024: 153–164. doi: 10.14733/cadaps.2025.s5.153-164
13. Madden, S. From Databases to Big Data. IEEE Internet calculate. 2012; 16(3): 4–6.
14. Jagadish HV, Gehrke J, Labrinidis A. Big Data and Its Technical Challenges. Communications of the ACM. 2014; 57(7): 86–94.
15. Bhatia A, Vaswani G. BIG Data–A Review. International Journal of Engineering Sciences & Research Technology. 2013; 2(8): 42–47.
16. Wu HY, Zhang H, Qin J. Research on Teaching Innovate of PE in Higher professional and Technical learn. Zhejiang Sport Science. 2010; 5(5): 5–22.
17. Hu JP, Yu KH, Polytechnic J. Study on Current Situation and Innovate Strategy of PE Course in Higher professional learn. Journal of Beijing Sport University. 2013; 42(52): 53–225.
18. Sarıkaya S. Effectiveness of A Data-Based Decision-Making Professional Learning Community for Physical Education Teachers and Its Impact on students’ knowledge and Physical Activity Level. Open Metu; 2023.
19. Rao W. Design and implementation of college students’ physical education teaching information management system by data mining technology. Heliyon. 2024; 10(16).
20. Xin X, Shu-Jiang Y, Nan P, et al. Review on A big data-based innovative knowledge teaching evaluation system in universities. Journal of innovation & knowledge. 2022; 7(3): 100197.
21. Petrachkov O, Yarmak O, Biloshitskiy V, et al. The influence of morphofunctional condition on the physical fitness level of Ukrainian soldiers. Journal of Physical Education and Sport. 2022; 22(9): 2182–2189.
22. Ashaari MA, Singh K SD, Abbasi GA, et al. Big data analytics capability for improved performance of higher education institutions in the Era of IR 4.0: A multi-analytical SEM & ANN perspective. Technological Forecasting and Social Change. 2021; 173: 121119.
23. Fernandez-Rio J, Iglesias D. What do we know about pedagogical models in physical education so far? An umbrella review. Physical Education and Sport Pedagogy. 2024; 29(2): 190–205.
24. Alam A, Mohanty A. From Bricks to Clicks: The Potential of Big Data Analytics for Revolutionizing the Information Landscape in Higher Education Sector[C]//International Conference on Data Management, Analytics & Innovation. Singapore: Springer Nature Singapore; 2023. pp. 721–732.
25. Chiva-Bartoll O, Ruiz-Montero PJ, Olivencia JJL, et al. The effects of service-learning on physical education teacher education: A case study on the border between Africa and Europe. European Physical Education Review. 2021; 27(4): 1014–1031.
Copyright (c) 2025 Author(s)

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright on all articles published in this journal is retained by the author(s), while the author(s) grant the publisher as the original publisher to publish the article.
Articles published in this journal are licensed under a Creative Commons Attribution 4.0 International, which means they can be shared, adapted and distributed provided that the original published version is cited.