Using Ai to Analyze Language Learners' Discourse: A Corpus-Based Study of Learner Language Development

Authors

  • Arfa Maham Department of Computer Science & Information Technology, Government Sadiq College Women, University Bahawalpur, 63100, Pakistan. Author
  • Muniba Saleem Department of Computer Science & Information Technology, Government Sadiq College Women, University Bahawalpur, 63100, Pakistan. Author
  • Muhammad Ismail Rahu Lecturer at the Department of English, Quaid-e-Awam University of Engineering, Science and Technology, Nawabshah. Author
  • Sohail Ahmad MPhil. in English Linguistics. SSE English School Education Department (SED), Govt. of Punjab, Pakistan Author

DOI:

https://doi.org/10.63056/ACAD.004.04.1132

Keywords:

Patterns , language development , spoken discourse , natural language processing , machine learning tools , lexical variability , grammatical precision , syntactic intricacy , coherence.

Abstract

This research examined patterns of language development of Pakistani university students using AI-driven corpus tools. The researchers gathered 150 students’ written and spoken discourse samples from Punjab and Sindh provinces for six months and built a small learner corpus of around 500,000 words. The researchers used natural language processing and machine learning tools to evaluate the samples for lexical variability, grammatical precision, syntactic intricacy, and coherence. The researchers used a mixed-method design for the study, incorporating quantitative frequency analysis and qualitative thematic analysis. The analysis demonstrated advanced and less advanced learners’ levels of proficiency and lexical sophistication and syntactic complexity to a higher degree. The researchers found patterns of common errors, which included articles, propositions, and subject-verb agreement. The AI managed to recognize the gaps of interlanguage and track the development level. Qualitative analysis produced five themes: L1 transfer, rule overgeneralization, lexical discourse fossilization, organization problems, and code-switching. The study demonstrated the corpus-based approach’s ability to detect the language learner’s level of development. This research helped understand the second language acquisition processes of South Asian learners and illustrated how AI technology can benefit learning and teaching research conducted to improve language education.

Downloads

Published

2025-11-25

How to Cite

Maham, A. ., Saleem, M. ., Ismail Rahu, M., & Ahmad, S. . (2025). Using Ai to Analyze Language Learners’ Discourse: A Corpus-Based Study of Learner Language Development. ACADEMIA International Journal for Social Sciences, 4(4), 2787-2797. https://doi.org/10.63056/ACAD.004.04.1132

Similar Articles

41-50 of 525

You may also start an advanced similarity search for this article.