DISCOVERING LEARNER PERSONAS IN AI-ASSISTED ENGLISH LANGUAGE LEARNING USING COSINE-BASED CLUSTERING: IMPLICATIONS FOR PERSONALIZED SUPPORT IN GCC CONTEXTS

Marwan Alshar‘e; Shaher Elayyan; Abdallah Abualkishik; Khaled Abuhmaidan; Wasin Al Kishri

doi:10.46281/bjmsr.v11i2.2852

Published: 2026-04-14

Keywords:

AI-Assisted Language Learning; English as a Second Language (ESL); Learner Profiling; Unsupervised Learning; Clustering Analysis; Cosine Similarity; Educational Data Mining; Oman Education

Marwan Alshar‘e

Associate Professor, Faculty of Computing and IT, Sohar University, Sohar, Oman

https://orcid.org/0000-0001-5187-4902

Shaher Elayyan

Assistant Professor, Faculty of Education and Arts, Sohar University, Sohar, Oman

https://orcid.org/0000-0003-0630-2301

Abdallah Abualkishik

Associate Professor, Faculty of Computing and IT, Sohar University, Sohar, Oman

https://orcid.org/0000-0001-9961-5563

Khaled Abuhmaidan

Associate Professor, Faculty of Computing and IT, Sohar University, Sohar, Oman

https://orcid.org/0000-0003-2346-6201

Wasin Al Kishri

Assistant Professor, Faculty of Computer Studies, Arab Open University, Muscat, Oman

https://orcid.org/0000-0003-0833-5633

Abstract

The rapid expansion of artificial intelligence (AI)-assisted English language learning tools has introduced substantial variability in learner outcomes due to differences in behavioural patterns, task engagement, and usage strategies among non-native learners, particularly within Omani educational contexts. This heterogeneity creates a methodological challenge in identifying consistent learner profiles without relying on predefined or subjective labels. This study investigates the effectiveness of unsupervised cosine-based clustering in identifying distinct learner personas in AI-assisted English learning environments. The study utilizes a dataset of 15,000 learner interaction records obtained from Kaggle, incorporating demographic attributes, behavioural features, task modalities, and learning outcome indicators. A structured experimental methodology is employed, beginning with baseline Euclidean K-Means clustering, followed by dimensionality reduction using Singular Value Decomposition (SVD), and subsequent clustering using cosine similarity across multiple algorithms, including K-Means, Gaussian Mixture Models, Agglomerative Clustering, and BIRCH. The results reveal that cosine-based K-Means clustering (k = 6) achieves a Silhouette Score of 0.678 compared to 0.10 for baseline Euclidean clustering, representing an absolute improvement of 0.578 and approximately a sixfold increase in clustering performance. Compared to SVD-based Euclidean clustering (Silhouette = 0.41), cosine similarity improves clustering effectiveness by approximately 65%, while the Davies–Bouldin Index decreases to 0.56 and the Calinski–Harabasz Index increases to 33,074. The findings indicate that cosine-based unsupervised modelling effectively identifies distinct learner personas, demonstrating that learning-gain variations are driven by behavioural interaction patterns rather than usage intensity alone.
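
The contrast between Euclidean and cosine-based grouping described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three feature names and the learner vectors below are hypothetical and are not taken from the Kaggle dataset. The sketch only shows why a cosine measure tends to group learners by the mix of their behaviour rather than by how much they use the tool.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance; grows with differences in overall magnitude."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_distance(a, b):
    """1 minus cosine similarity; depends only on the angle between vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


# Hypothetical feature order (an assumption for illustration only):
# (speaking tasks, writing tasks, hints requested)
light_user = [2.0, 1.0, 1.0]
heavy_user = [20.0, 10.0, 10.0]  # same behavioural mix, ten times the usage
other_mix = [1.0, 2.0, 6.0]      # similar volume to light_user, different mix

for name, other in [("heavy_user", heavy_user), ("other_mix", other_mix)]:
    print(
        name,
        "euclidean:", round(euclidean_distance(light_user, other), 3),
        "cosine:", round(cosine_distance(light_user, other), 3),
    )
```

Under the Euclidean measure, `light_user` sits closer to `other_mix` than to `heavy_user`, because usage volume dominates the distance. Under the cosine measure the ordering reverses, since `light_user` and `heavy_user` point in the same direction. This matches the abstract's reading that behavioural interaction patterns, rather than usage intensity alone, separate the personas.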

JEL Classification Codes: G32, F65, L66, L25, M41.

Issue

Vol. 11 No. 2 (2026): Continuous Publication

Section

Research Paper/Theoretical Paper/Review Paper/Short Communication Paper

This work is licensed under a Creative Commons Attribution 4.0 International License.

Author Biographies

Marwan Alshar‘e, Associate Professor, Faculty of Computing and IT, Sohar University, Sohar, Oman

Marwan Alshar‘e is an Associate Professor in the Faculty of Computing and Information Technology at Sohar University, Oman. He is an accomplished academic and researcher with extensive experience in computer science and information technology, specializing in areas such as software engineering, data systems, and emerging digital technologies. Dr. Alshar‘e has contributed to both teaching and research, actively engaging in curriculum development and mentoring undergraduate and postgraduate students. His scholarly work includes publications in reputable journals and conferences, reflecting his commitment to advancing knowledge in computing and IT. In addition to his academic responsibilities, he collaborates with industry and academic partners, supporting innovation and the practical application of technology in addressing real-world challenges.

Shaher Elayyan, Assistant Professor, Faculty of Education and Arts, Sohar University, Sohar, Oman

Shaher Elayyan is an Assistant Professor in the Faculty of Education and Arts at Sohar University, Oman. He is an academic professional with a strong background in education, humanities, and interdisciplinary studies. Dr. Elayyan is dedicated to teaching, research, and community engagement, contributing to the development of innovative educational practices and student-centered learning environments. His academic interests include curriculum development, pedagogy, and the integration of modern educational technologies. He has been actively involved in mentoring students and participating in scholarly activities, including research publications and academic conferences. Through his work, Dr. Elayyan aims to enhance educational quality and promote critical thinking, creativity, and lifelong learning among students.

Abdallah Abualkishik, Associate Professor, Faculty of Computing and IT, Sohar University, Sohar, Oman

Abdallah Abualkishik is an Associate Professor in the Faculty of Computing and Information Technology at Sohar University, Oman. He is a dedicated academic and researcher with expertise in computer science and information technology, with particular interests in areas such as software engineering, intelligent systems, and data-driven applications. Dr. Abualkishik has a strong commitment to teaching excellence, contributing to curriculum design and the delivery of high-quality education for undergraduate and postgraduate students.

He is actively engaged in research, with publications in reputable journals and conferences, reflecting his contributions to advancing knowledge in computing and IT. In addition to his academic work, Dr. Abualkishik collaborates with peers and industry partners on research and development initiatives, aiming to bridge the gap between theoretical knowledge and practical applications. His work supports innovation and the effective use of technology to solve real-world problems.

Khaled Abuhmaidan, Associate Professor, Faculty of Computing and IT, Sohar University, Sohar, Oman

Khaled Abuhmaidan is an Associate Professor in the Faculty of Computing and Information Technology at Sohar University, Oman. He is an experienced academic and researcher in the field of computer science, with expertise spanning areas such as information systems, software development, and modern computing technologies. Dr. Abuhmaidan is committed to delivering high-quality education, actively contributing to curriculum development and fostering an engaging learning environment for students.

His research interests focus on advancing innovative solutions in computing and IT, and he has contributed to scholarly publications in recognized journals and conferences. In addition to his teaching and research roles, Dr. Abuhmaidan collaborates with academic and industry partners, supporting the application of technology to address contemporary challenges. Through his work, he aims to promote academic excellence, technological innovation, and the development of skilled graduates equipped for the evolving digital landscape.

Wasin Al Kishri, Assistant Professor, Faculty of Computer Studies, Arab Open University, Muscat, Oman

Wasin Al Kishri is an Assistant Professor in the Faculty of Computer Studies at the Arab Open University in Muscat, Oman. He is an academic professional specializing in computer science and information technology, with a focus on advancing teaching and research in modern computing disciplines. Dr. Al Kishri is dedicated to delivering high-quality education, fostering student engagement, and supporting the development of technical and analytical skills among learners.

His academic interests include areas such as software development, information systems, and emerging technologies. He is actively involved in curriculum development and contributes to academic research through publications and participation in scholarly conferences. In addition to his teaching and research responsibilities, Dr. Al Kishri engages with academic and professional communities to promote innovation and the practical application of computing solutions in real-world contexts.

How to Cite

Alshar‘e, M., Elayyan, S., Abualkishik, A., Abuhmaidan, K., & Al Kishri, W. (2026). DISCOVERING LEARNER PERSONAS IN AI-ASSISTED ENGLISH LANGUAGE LEARNING USING COSINE-BASED CLUSTERING: IMPLICATIONS FOR PERSONALIZED SUPPORT IN GCC CONTEXTS. Bangladesh Journal of Multidisciplinary Scientific Research, 11(2), 37–47. https://doi.org/10.46281/bjmsr.v11i2.2852
