What Do Scopus Index Keywords Reveal About Educational Data Mining Research? A Bibliometric Analysis (2014-2024)

Authors

DOI:

https://doi.org/10.56294/dm20251199

Keywords:

Educational Data Mining, Learning Analytics, Bibliometric Analysis, Research Evolution, Machine Learning

Abstract

Introduction: Educational Data Mining (EDM) has emerged as a pivotal interdisciplinary field addressing the increasing demand for data-driven educational enhancement. However, a comprehensive understanding of its developmental trajectory is hindered by fragmented literature reviews and a lack of longitudinal analysis spanning critical technological and educational transformations.
Objective: This study investigates the evolution of EDM research over the transformative decade from 2014 to 2024 through systematic bibliometric analysis, aiming to identify growth patterns, thematic developments, and methodological innovations.
Method: We conducted an extensive analysis of 436 peer-reviewed publications indexed in Scopus, employing rigorous keyword analysis, mathematical modeling of research trends, and systematic thematic classification to examine temporal evolution patterns. The methodology utilized PRISMA-guided selection procedures, standardized keyword extraction and normalization, and quantitative measures including growth ratios, Shannon diversity indices, and thematic strength calculations.
Results: Our analysis reveals remarkable research growth, with a 777.8 % increase in publication output, representing a compound annual growth rate of 24.3 %. The findings document a significant paradigmatic shift from descriptive analytics toward predictive methodologies, evidenced by a 215-fold growth in Machine Learning and AI themes and the complete emergence of deep learning applications. Thematic evolution analysis identified 47.3 % of recent keywords as entirely new terms, indicating substantial conceptual expansion.
Conclusions: The research demonstrates EDM's transition from foundational exploration (2014-2017) through rapid expansion (2018-2020) to sophisticated maturation (2021-2024), characterized by methodological pluralism and the integration of advanced computational techniques.
 

References

1. Siemens G, Long P. Penetrating the Fog: Analytics in Learning and Education. EDUCAUSE Review. 2011;46(5).

2. Shum SB, Ferguson R. Social learning analytics. Educational Technology and Society. 2012;15(3).

3. Romero C, Ventura S. Educational data mining and learning analytics: An updated survey. Wiley Interdiscip Rev Data Min Knowl Discov. 2020;10(3).

4. Baker R, Siemens G. Educational Data Mining and Learning Analytics Ryan S.J.d. Baker, Teachers College, Columbia University George Siemens, Athabasca University 1. Cambridge Handbook of the Learning Sciences. 2013;

5. Peña-Ayala A. Educational data mining: A survey and a data mining-based analysis of recent works. Vol. 41, Expert Systems with Applications. 2014.

6. Sutoyo E, Almaarif A. Educational Data Mining for Predicting Student Graduation Using the Naïve Bayes Classifier Algorithm. Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi). 2020 Feb 8;4(1):95–101.

7. Romero C, Ventura S. Educational data mining: A review of the state of the art. Vol. 40, IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews. 2010.

8. Khalil H, Ebner M. MOOCs Completion Rates and Possible Methods to Improve Retention - A Literature Review. EdMedia: World Conference on Educational Media and Technology. 2014;2014(1).

9. Dutt A, Ismail MA, Herawan T. A Systematic Review on Educational Data Mining. Vol. 5, IEEE Access. 2017.

10. Chen CC, Wang NC, Tang KY, Tu YF. Research issues of the top 100 cited articles on information literacy in higher education published from 2011 to 2020: A systematic review and co-citation network analysis. Australasian Journal of Educational Technology. 2022 Nov 26;38(6):34–52.

11. Chen L, Chen P, Lin Z. Artificial Intelligence in Education: A Review. IEEE Access. 2020;8:75264–78.

12. Aldowah H, Al-Samarraie H, Fauzy WM. Educational data mining and learning analytics for 21st century higher education: A review and synthesis. Vol. 37, Telematics and Informatics. 2019.

13. Viberg O, Hatakka M, Bälter O, Mavroudi A. The current landscape of learning analytics in higher education. Vol. 89, Computers in Human Behavior. 2018.

14. Ahmad ST, Watrianthos R, Samala AD, Muskhir M, Dogara G. Project-based Learning in Vocational Education: A Bibliometric Approach. International Journal Modern Education and Computer Science. 2023;15(4):43–56.

15. Muskhir M, Luthfi A, Watrianthos R, Usmeldi U, Fortuna A, Dwinggo Samala A. Emerging Research on Virtual Reality Applications in Vocational Education: A Bibliometric Analysis. Journal of Information Technology Education: Innovations in Practice. 2024;23:005.

16. Irfan D, Watrianthos R, Nur Bin Yunus FA. AI in Education: A Decade of Global ResearchTrends and Future Directions. International Journal of Modern Education and Computer Science. 2025 Apr 8;17(2):135–53.

17. Gough D, Oliver S, Thomas J. An introduction to systematic reviews / David Gough, Sandy Oliver, James Thomas. SAGE Publications Ltd. 2012.

18. Donthu N, Kumar S, Mukherjee D, Pandey N, Lim WM. How to conduct a bibliometric analysis: An overview and guidelines. J Bus Res. 2021;133.

19. Silva M do ST, Oliveira VM de, Correia SÉN. Scientific mapping in Scopus with Biblioshiny: A bibliometric analysis of organizational tensions. Contextus – Revista Contemporânea de Economia e Gestão. 2022;20.

20. Baas J, Schotten M, Plume A, Côté G, Karimi R. Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies. Quantitative science studies. 2020;1(1):377–86.

21. Batubara HS, Jalinus N, Rizal F, Watrianthos R. Mapping the Frontier: A Bibliometric Analysis of AI in Tertiary Education. International Journal of Learning, Teaching and Educational Research. 2024 Oct 30;23(10):62–81.

22. Watrianthos R, Triono Ahmad S, Muskhir M. Charting the Growth and Structure of Early ChatGPT-Education Research: A Bibliometric Study. Journal of Information Technology Education: Innovations in Practice. 2023;22:235–53.

23. Samala AD, Bojic L, Bekiroğlu D, Watrianthos R, Hendriyani Y. Microlearning: Transforming Education with Bite-Sized Learning on the Go—Insights and Applications. International Journal of Interactive Mobile Technologies (iJIM). 2023 Nov 15;17(21):4–24.

24. Aria M, Cuccurullo C. bibliometrix : An R-tool for comprehensive science mapping analysis. J Informetr. 2017 Nov;11(4):959–75.

25. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. Vol. 372, The BMJ. 2021.

26. Rethlefsen ML, Kirtley S, Waffenschmidt S, Ayala AP, Moher D, Page MJ, et al. PRISMA-S: an extension to the PRISMA Statement for Reporting Literature Searches in Systematic Reviews. Syst Rev. 2021;10(1).

27. Tsai CF, Tsai CT, Hung CS, Hwang PS. Data mining techniques for identifying students at risk of failing a computer proficiency test required for graduation. Australasian Journal of Educational Technology. 2011;27(3):481–98.

28. Williamson B. Big Data in Education: The digital future of learning, policy and practice. Big Data in Education: The digital future of learning, policy and practice. 2020.

29. Gašević D, Dawson S, Siemens G. Let’s not forget: Learning analytics are about learning. TechTrends. 2015;59(1).

30. Sweller J. Cognitive load during problem solving: Effects on learning. Cogn Sci. 1988;12(2).

31. Zimmerman BJ. Becoming a self-regulated learner: An overview. Vol. 41, Theory into Practice. 2002.

32. Hwang GJ, Tsai CC. Research trends in mobile and ubiquitous learning: A review of publications in selected journals from 2001 to 2010. Vol. 42, British Journal of Educational Technology. 2011.

Downloads

Published

2025-10-09

Issue

Section

Original

How to Cite

1.
Edi F, Ambiyar A, Waskito W, Samsir S, Watrianthos R. What Do Scopus Index Keywords Reveal About Educational Data Mining Research? A Bibliometric Analysis (2014-2024). Data and Metadata [Internet]. 2025 Oct. 9 [cited 2025 Oct. 20];4:1199. Available from: https://dm.ageditor.ar/index.php/dm/article/view/1199