Advancing genetics of agriculture and biomolecules: A BERTopic-based analysis of research evolution in leading agricultural universities
Abstract
This study investigates the evolution of research at leading agricultural universities, with a particular focusing on genetics of agriculture and biomolecules as central themes. The objective is to identify trends, knowledge evolution pathways, and the relationship between scientific innovation and technological application. Utilizing the BERTopic model, a word-embedding-based topic extraction approach, the study analyzed data from cited articles and citing patents sourced from Web of Science and Lens databases. Key methodologies included advanced text preprocessing, topic clustering using Uniform Manifold Approximation and Projection (UMAP) and Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN), and knowledge evolution analysis based on topic heat and cosine similarity metrics. The findings indicate that research on the genetics of agriculture and biomolecules play a critical role in driving both fundamental science and application-oriented innovation. A strong correlation between cited articles and citing patents was observed, particularly at institutions such as the University of Tokyo and Kyoto University. Notably, genetics-related scientific outputs were associated with denser knowledge networks, while biomolecule-focused patents demonstrated more pronounced application trends, highlighting the translational potential of these innovations. Over time, research in genetics of agriculture and biomolecules intensified, underpinning their critical role in addressing global challenges like food security and sustainable development. This analysis offers insights into interdisciplinary convergence and the dynamic interplay between science and technology, contributing to strategic planning and policy development for agricultural innovation.
References
1. Laurett R, Paço A, Mainardes EW. Sustainable development in agriculture and its antecedents, barriers and consequences—An exploratory study. Sustainable Production and Consumption. 2021; 27: 298–311.
2. Prakash CS, Fiaz S, Nadeem MA, et al. Sustainable agriculture through technological innovations. In: Sustainable Agriculture in the Era of the OMICs Revolution. Springer; 2023. pp. 223–239.
3. Qayyum M, Zhang Y, Wang M, et al. Advancements in technology and innovation for sustainable agriculture: Understanding and mitigating greenhouse gas emissions from agricultural soils. Journal of Environmental Management. 2023; 347: 119147. doi: 10.1016/j.jenvman.2023.119147
4. FAO, IFAD, UNICEF, et al. In: In Brief to The State of Food Security and Nutrition in the World 2023. FAO; 2023.
5. Perkmann M, Tartari V, McKelvey M, et al. Academic engagement and commercialisation: A review of the literature on university—industry relations. Research Policy. 2013; 42(2): 423–442. doi: 10.1016/j.respol.2012.09.007
6. Ma LT. Leading universities should provide leading social services. The Shanghai Mercury, 9 August 2019.
7. Min H, Chen YH. The Generation Logic and Realization Mechanism of the Characteristic Strategy in World-Class Universities with Characteristics of Agriculture. Heilongjiang Researches on Higher Education. 2021; 39(08): 51–56.
8. Torotwa I, Ding Q, Awuah E, He R. Biomimetic tool design improves tillage efficiency, seedbed quality, and straw incorporation during rototilling in conservation farming. Journal of Agricultural Engineering. 2023; 54(1). doi: 10.4081/jae.2023.1327
9. Al-Kareem AAA, Aoues Y, Eid M, et al. Integrating optimization and reliability tools into the design of agricultural machines. In: Proceedings of the 20ème Congrès Français de Mécanique; 2 September 2011; Besançon, France.
10. Alfyorov O, Grynchenko O, Ponomarenko V, et al. Agricultural Equipment Design Optimization Based on the Inversion Method. Agriculture. 2022; 12(9): 1410. doi: 10.3390/agriculture12091410
11. Nag PK, Gite LP. Ergonomics application in design of farm tools and equipment. In: Human-Centered Agriculture: Ergonomics and Human Factors Applied. Springer; 2020.
12. Erndwein L, Ganji E, Killian ML, Sparks EE. Comparative biomechanical characterization of maize brace roots within and between plants. bioRxiv. 2019; 547794.
13. Kolb E, Legué V, Bogeat-Triboulot MB. Physical root—soil interactions. Physical Biology. 2017; 14(6): 065004. doi: 10.1088/1478-3975/aa90dd
14. Tomobe H, Tsugawa S, Yoshida Y, et al. A mechanical theory of competition between plant root growth and soil pressure reveals a potential mechanism of root penetration. Scientific Reports. 2023; 13(1): 7473. doi: 10.1038/s41598-023-34025-x
15. Hamza O, Bengough AG, Bransby MF, et al. Novel biomechanical analysis of plant roots. In: Eco-and Ground Bio-Engineering: The Use of Vegetation to Improve Slope Stability. Springer; 2007. pp. 13–20.
16. Kolb E, Quiros M, Meijer GJ, et al. Root—Soil Interaction. In: Soft Matter in Plants. Royal Society of Chemistry; 2022.
17. Baggs EM, Cairns JE, Mhlanga B, et al. Exploiting crop genotype-specific root-soil interactions to enhance agronomic efficiency. Frontiers in Soil Science. 2023; 3: 1–18.
18. Arreguit J, Ramalingasetty ST, Ijspeert A. FARMS: Framework for Animal and Robot Modeling and Simulation. bioRxiv. 2023. doi: 10.1101/2023.09.25.559130
19. Fethke NB, Schall MC, Chen H, et al. Biomechanical factors during common agricultural activities: Results of on-farm exposure assessments using direct measurement methods. Journal of Occupational and Environmental Hygiene. 2020; 17(2–3): 85–96. doi: 10.1080/15459624.2020.1717502
20. Behnke R, Fernandez-Gimenez ME, Turner MD, Stammler F. Pastoral migration: mobile systems of livestock husbandry. In: Animal Migration. Oxford University Press; 2011.
21. Hasahya E, Thakur K, Dione MM, et al. Analysis of patterns of livestock movements in the Cattle Corridor of Uganda for risk-based surveillance of infectious diseases. Frontiers in Veterinary Science. 2023; 10. doi: 10.3389/fvets.2023.1095293
22. Johansson CL, Muijres FT, Hedenström A. The physics of animal locomotion. In: Animal Movement Across Scales. Oxford University Press; 2014.
23. Calero-Medina C, Noyons ECM. Combining mapping and citation network analysis for a better understanding of the scientific development: The case of the absorptive capacity field. Journal of Informetrics. 2008; 2(4): 272–279. doi: 10.1016/j.joi.2008.09.005
24. Dubarić E, Giannoccaro D, Bengtsson R, Ackermann T. Patent data as indicators of wind power technology development. World Patent Information. 2011; 33(2): 144–149. doi: 10.1016/j.wpi.2010.12.005
25. Xu HY, Yue ZH, Wang C, et al. Multi-source data fusion study in scientometrics. Scientometrics. 2017; 111: 773–792.
26. Narin F, Hamilton KS, Olivastro D. The increasing linkage between US technology and public science. Research policy. 1997; 26(3): 317–330.
27. Meyer M. Does science push technology? Patents citing scientific literature. Research policy. 2000; 29(3): 409–434.
28. Kleinberg J. Bursty and hierarchical structure in streams. In: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining; 23–26 July 2002; Edmonton, Canada. pp. 91–101.
29. Bai Y, Li H. Mapping the evolution of e-commerce research through co-word analysis: 2001–2020. Electronic Commerce Research and Applications. 2022; 55: 101190. doi: 10.1016/j.elerap.2022.101190
30. Girvan M, Newman MEJ. Community structure in social and biological networks. Proceedings of the National Academy of Sciences. 2002; 99(12): 7821–7826. doi: 10.1073/pnas.122653799
31. Liang LM, Xie CX. Investigation of China’s Nanotechnology Study Based On Frequency Analysis of Key Words. Studies in Science of Science. 2003; 138(02): 42.
32. Qiu JP, Wen FF. Visualization Analysis of the Research Front and Hot Domains of Library and Information Science in the Past Five Years: Studies Based On the Quantitative Analysis of 13 High-Impact International Journals. Journal of Library Science in China. 2011; 2(31): 51–60.
33. Mane KK, Börner K. Mapping topics and topic bursts in PNAS. Proceedings of the National Academy of Sciences. 2004; 101(suppl_1): 5287–5290. doi: 10.1073/pnas.0307626100
34. Fan Z, Ke S, Meili R. Research on similarity algorithm improvement of dynamic tracing disciplinary themes based on co-word analysis. In: Proceedings of the 2010 2nd IEEE International Conference on Information Management and Engineering; 16–18 April 2010; Chengdu, China. pp. 506–510.
35. Qiu JP, Wen XT, Song YH. Knowledgometrics. Science Press; 2014.
36. Wang Q, Qian L, Liu XW. A Review on Technical Methods for Knowledge Evolution Analysis. Library and Information Service. 2023; 67(7): 121–134.
37. Huang L, Hou Z, Fang Y, et al. Evolution of CCUS Technologies Using LDA Topic Model and Derwent Patent Data. Energies. 2023; 16(6): 2556. doi: 10.3390/en16062556
38. Qi YS, Zhu N, Zhai YJ. A Comparative Study On Topic Heats Evolution in the Field of Information Science a Comparative Study On Topic Heats Evolution in the Field of Information Science. Library and Information Service. 2016; 60(16): 99–109.
39. Zhang P, Liu D. Topic evolutionary analysis of short text based on word vector and BTM. Data Analysis and Knowledge Discovery. 2019; 3(3): 95–101.
40. Zeng JF, Huang YT, Chen JY, et al. Theme Evolution for Technology of Integrated Publishing. Digital Library Forum. 2023; 19(4): 9–18.
41. Jeyaraj A, Zadeh AH. Evolution of information systems research: Insights from topic modeling. Information & Management. 2020; 57(4): 103207. doi: 10.1016/j.im.2019.103207
42. Quille RVE, Barros JM, Júnior MB, et al. Detecting Favorite Topics in Computing Scientific Literature via Dynamic Topic Modeling. IEEE Access. 2023; 11: 41535–41545. doi: 10.1109/access.2023.3269660
43. Wang Z, Chen J, Chen J, Chen H. Identifying interdisciplinary topics and their evolution based on BERTopic. Scientometrics. 2023; 1–26.
44. Xu H, Winnink J, Yue Z, et al. Topic-linked innovation paths in science and technology. Journal of Informetrics. 2020; 14(2): 101014. doi: 10.1016/j.joi.2020.101014
45. Teng GQ, Jiang Y, Tuo R. Comparative Study on Domain Knowledge Evolution Based on Multiple Data Source Dimensions: Taking Graphene Field Research in the United States as an Example. Information and Documentation Services. 2023; 44(06): 61–70.
46. Lu X, Zhang Y, Lin C, Wu F. Evolutionary Overview and Prediction of Themes in the Field of Land Degradation. Land. 2021; 10(3): 241. doi: 10.3390/land10030241
47. Wei T, Jiang T, Feng D, Xiong J. Exploring the Evolution of Core Technologies in Agricultural Machinery: A Patent-Based Semantic Mining Analysis. Electronics. 2023; 12(20): 4277. doi: 10.3390/electronics12204277
48. (2025, January 25).The Lens-free&open patent and scholarly search. Retrieved from https://www.lens.org.
49. Coxhead A. A New Academic Word List. TESOL Quarterly. 2000; 34(2): 213–238. doi: 10.2307/3587951
50. USPTO. (2025, January 25). Patents Glossary. Retrieved from https://www.findlaw.com/smallbusiness/intellectual-property/patents-glossary.html.
51. Grootendorst M. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv. 2022.
52. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. Journal of machine Learning research. 2003; 3(2003): 993–1022.
53. Wang XH, Gao M. The Key Technology Identification Method Based on BERT-LDA and Its Empirical Research: A Case Study of Agricultural Robots. Library and Information Service. 2021; 65(22): 114–125.
54. Devlin J, Chang MW, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv. 2018.
55. McInnes L, Healy J, Melville J. Umap: Uniform manifold approximation and projection for dimension reduction. arXiv. 2018.
56. McInnes L, Healy J, Astels S. hdbscan: Hierarchical density based clustering. The Journal of Open Source Software. 2017; 2(11): 205. doi: 10.21105/joss.00205
57. Wu RP, Li YN, Liu S, et al. DTM-Based Analysis of Hot Topics and Evolution of American Artificial Intelligence Strategy. Journal of Intelligence. 2023; 42(12): 134–143.
58. Mann GS, Mimno D, McCallum A. Bibliometric impact measures leveraging topic analysis. In: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries; 11–15 June 2006; New York, NY, USA. pp. 65–74.
59. Sun LZ, Zhou P, Cao ZW, et al. Reflections and Practices of Interdisciplinary Cross-Research On Agriculture in Universities. China University Science & Technology. 2020; 11: 26–30.
60. Zhao Zj, Ku YM, Cao GX. Reflections on the Optimization of Discipline Construction of Higher Agricultural Universities in China in the 21st Century. Higher Education Exploration. 2006; (05): 50–52.
61. Ling D. Study on the Integration and Promotion of Disciplines and Specialties in Higher Agricultural Colleges and Universities in the New Century. Journal of Anhui Agricultural Sciences. 2008; 36(11): 4702.
62. Moshelion M, Altman A. Current challenges and future perspectives of plant and agricultural biotechnology. Trends in Biotechnology. 2015; 33(6): 337–342. doi: 10.1016/j.tibtech.2015.03.001
63. Edmeades GO, McMaster GS, White JW, Campos H. Genomics and the physiologist: bridging the gap between genes and crop response. Field Crops Research. 2004; 90(1): 5–18. doi: 10.1016/j.fcr.2004.07.002
64. Narin F, Noma E. Is technology becoming science? Scientometrics. 1985; 7(3–6): 369–381. doi: 10.1007/bf02017155
65. Glänzel W, Meyer M. Patents cited in the scientific literature: An exploratory study of’reverse’citation relations. Scientometrics. 2003; 58: 415–428.
66. Sun YN, Du J, Li YJ, et al. Can Funding for Basic Research Make a Technological Impact?—Analysis Based on the Analysis of National Natural Science Foundation of China. World Sci-Tech R & D. 2023; 45(S1): 10–17.
67. Lu C, Zhu L, Su Y, et al. Industry-Academia Collaboration in the Field of Functional Nucleic Acids: An Analysis Based on Papers and Patents. Applied Mathematics and Nonlinear Sciences. 2024; 9(1). doi: 10.2478/amns-2024-1072
68. Huang LC, Liu YM, Wu FF, et al. Research Framework of Technical Knowledge Diffusion Characteristics Based on Patent Full Citation Information: The Example of Grapheme. Science of Science and Management of S. & T. 2017; 38(04): 149–161.
69. Jiang ZY. Analysis on the evolution of Shanghaibiomedical industry based on patent thicket [Master of Thesis]. East China Normal University; 2021.
70. Strock CF, Rangarajan H, Black CK, et al. Theoretical evidence that root penetration ability interacts with soil compaction regimes to affect nitrate capture. Annals of Botany. 2022; 129(3): 315–330. doi: 10.1093/aob/mcab144
71. Cavalieri-Polizeli KMV, Marcolino FC, Tormena CA, et al. Soil Structural Quality and Relationships With Root Properties in Single and Integrated Farming Systems. Frontiers in Environmental Science. 2022; 10. doi: 10.3389/fenvs.2022.901302
72. Bengough AG, Loades K, McKenzie BM. Root hairs aid soil penetration by anchoring the root surface to pore walls. Journal of Experimental Botany. 2016; 67(4): 1071–1078. doi: 10.1093/jxb/erv560
73. Keller T, Or D. Farm vehicles approaching weights of sauropods exceed safe mechanical limits for soil functioning. Proceedings of the National Academy of Sciences. 2022; 119(21). doi: 10.1073/pnas.2117699119
74. Shahgholi G, Moinfar A, Khoramifar A, et al. Investigating the Effect of Tractor’s Tire Parameters on Soil Compaction Using Statistical and Adaptive Neuro-Fuzzy Inference System (ANFIS) Methods. Agriculture. 2023; 13(2): 259. doi: 10.3390/agriculture13020259
75. Moinfar A, Shahgholi G, Abbaspour-Gilandeh Y, et al. Investigating the Effect of the Tractor Drive System Type on Soil Behavior under Tractor Tires. Agronomy. 2021; 11(4): 696. doi: 10.3390/agronomy11040696
76. Hazelkorn E. In: Rankings and the Reshaping of Higher Education: The battle for world-class excellence. Palgrave Macmillan; 2015.
77. Liu PD, Wu ZC. Research on the Community with a Shared Future for Mankind: A Visual Bibliometric Analysis Based on CiteSpace Knowledge Mapping. Teaching and Research. 2022; (01): 70–81.
78. Liu YH, Zhang HT, Zhang KX, et al. Implementation of Tank Service for Food Security Policy Consultation: Policy Text Analysis. Information Studies: Theory & Application. 2024; 1–14.
79. Isoaho K, Gritsenko D, Mäkelä E. Topic modeling and text analysis for qualitative policy resear. Policy Studies Journal. 2021; 49(1): 300–324. doi: 10.1111/psj.12343
80. Wang X, Ren H, Liu Y. Visualization Method for Technology Theme Map with Clustering. Data Analysis and Knowledge Discovery. 2022; 6(1): 91–100.
81. Etzkowitz H, Loet L. The dynamics of innovation: from National Systems and “Mode 2” to a Triple Helix of university-industry-government relations. Research policy. 2000; 29(2): 109–123.
82. Cui J, Hou QM. The Theory, Practice and Inspiration of Building of University-Led Innovation Ecosystem in Japan. Forum on Science and Technology in China. 2024; (07): 176–188.
Copyright (c) 2025 Author(s)

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright on all articles published in this journal is retained by the author(s), while the author(s) grant the publisher as the original publisher to publish the article.
Articles published in this journal are licensed under a Creative Commons Attribution 4.0 International, which means they can be shared, adapted and distributed provided that the original published version is cited.