Personality Recognition from Text in Indian Languages : Challenges, Progress, and Future Directions
DOI:
https://doi.org/10.32628/CSEIT251117136Keywords:
Personality Recognition, Computational Linguistics, Indian Languages, Low-Resource NLP, Big Five Model, Code-Switching, Multimodal AI, Transfer LearningAbstract
Personality Recognition (PR) from text is an emerging area within computational linguistics that seeks to automatically infer an individual’s personality traits based on their written or spoken language. While substantial progress has been achieved for English and other high-resource languages, research on Indian languages remains at an early stage. This review presents a comprehensive synthesis of the current work on PR for Indian languages. We examine the linguistic, cultural, and technical challenges unique to this context such as morphological complexity, code-switching, data scarcity, and the cultural applicability of Western personality frameworks like the Big Five model. The paper systematically reviews available datasets, methodologies (ranging from traditional machine learning to state-of-the-art large language models), and evaluation strategies. Our analysis highlights a clear performance gap between English-based systems and those developed for Indian languages, primarily driven by limited resources and cultural differences in language use. We conclude by identifying key research gaps and proposing a roadmap for the future emphasizing the creation of large, diverse, and multimodal datasets; the adaptation of culturally relevant personality models; and the fine-tuning of large language models for India’s multilingual environment.
Downloads
References
Vinciarelli, A., & Mohammadi, G. (2014). A survey of personality computing. IEEE Transactions on Affective Computing, 5(3), 273-291. DOI: https://doi.org/10.1109/TAFFC.2014.2330816
Patil, J., Patil, V., Prajapati, K., Patel, D., Trivedi, S., & Patel, R. (2024, August). Enhanced Depression Detection on Social Media Using Advanced Machine Learning and Linguistic Analysis Techniques. In International Conference on Intelligent Computing and Communication (pp. 263-275). Singapore: Springer Nature Singapore. DOI: https://doi.org/10.1007/978-981-96-1264-2_23
Farnadi, G., Sitaraman, G., Sushmita, S., Celli, F., Kosinski, M., Stillwell, D., Davalos, S., Moens, M. F., & De Cock, M. (2016). Computational personality recognition in social media. User Modeling and User-Adapted Interaction, 26(2), 109-142. DOI: https://doi.org/10.1007/s11257-016-9171-0
Majumder, N., et al. (2017). Deep Learning-based Personality Recognition from Facebook Data in Hindi. ICON.
Singh, V., et al. (2019). Leveraging Multilingual BERT for Personality Recognition in Code-Switched Hindi-English Text. arXiv preprint.
Kumar, A., & Reddy, P. (2021). Personality Prediction from Tweets in Indian Languages using Ensemble Learning. Journal of Intelligent Information Systems.
Patil, J., & Sheth, J. (2022). Data Preparation and Quality Challenges for the Personality Recognition in Indian Languages using Machine Learning and Deep Learning Approaches. Journal of IoT in Social, Mobile, Analytics, and Cloud, 4(1), 33-40. DOI: https://doi.org/10.36548/jismac.2022.1.004
Poria, S., et al. (2017). A Review of Affective Computing: From Unimodal Analysis to Multimodal Fusion. Information Fusion. DOI: https://doi.org/10.1016/j.inffus.2017.02.003
Patil, J., & Sheth, J. (2022). Deep Learning and Machine Learning Approaches for the Classification of Personality Traits. In Rising Threats in Expert Applications and Solutions: Proceedings of FICR-TEAS 2022 (pp. 139-146). Singapore: Springer Nature Singapore. DOI: https://doi.org/10.1007/978-981-19-1122-4_16
(Team), M. (2021). MuRIL: Multilingual Representations for Indian Languages. Google AI Blog.
Patil, J., & Sheth, J. (2021). Comparative study of data sources, features, and approaches for automatic personality classification from text. Int. J. Comput. Appl, 174, 0975-8887. DOI: https://doi.org/10.5120/ijca2021920968
Patil, M. J. A., & Godhwani, M. P. B. (2016). Review of Name Entity Recognition in Marathi Language. IJSART, 2(6).
Downloads
Published
Issue
Section
License
Copyright (c) 2025 International Journal of Scientific Research in Computer Science, Engineering and Information Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.