Personality Recognition from Text in Indian Languages : Challenges, Progress, and Future Directions

Authors

  • Jayashri Patil School of Engineering, P P Savani University, Kosamba, Gujarat, India Author
  • Kamini Sharma School of Engineering, P P Savani University, Kosamba, Gujarat, India Author

DOI:

https://doi.org/10.32628/CSEIT251117136

Keywords:

Personality Recognition, Computational Linguistics, Indian Languages, Low-Resource NLP, Big Five Model, Code-Switching, Multimodal AI, Transfer Learning

Abstract

Personality Recognition (PR) from text is an emerging area within computational linguistics that seeks to automatically infer an individual’s personality traits based on their written or spoken language. While substantial progress has been achieved for English and other high-resource languages, research on Indian languages remains at an early stage. This review presents a comprehensive synthesis of the current work on PR for Indian languages. We examine the linguistic, cultural, and technical challenges unique to this context such as morphological complexity, code-switching, data scarcity, and the cultural applicability of Western personality frameworks like the Big Five model. The paper systematically reviews available datasets, methodologies (ranging from traditional machine learning to state-of-the-art large language models), and evaluation strategies. Our analysis highlights a clear performance gap between English-based systems and those developed for Indian languages, primarily driven by limited resources and cultural differences in language use. We conclude by identifying key research gaps and proposing a roadmap for the future emphasizing the creation of large, diverse, and multimodal datasets; the adaptation of culturally relevant personality models; and the fine-tuning of large language models for India’s multilingual environment.

Downloads

Download data is not yet available.

References

Vinciarelli, A., & Mohammadi, G. (2014). A survey of personality computing. IEEE Transactions on Affective Computing, 5(3), 273-291. DOI: https://doi.org/10.1109/TAFFC.2014.2330816

Patil, J., Patil, V., Prajapati, K., Patel, D., Trivedi, S., & Patel, R. (2024, August). Enhanced Depression Detection on Social Media Using Advanced Machine Learning and Linguistic Analysis Techniques. In International Conference on Intelligent Computing and Communication (pp. 263-275). Singapore: Springer Nature Singapore. DOI: https://doi.org/10.1007/978-981-96-1264-2_23

Farnadi, G., Sitaraman, G., Sushmita, S., Celli, F., Kosinski, M., Stillwell, D., Davalos, S., Moens, M. F., & De Cock, M. (2016). Computational personality recognition in social media. User Modeling and User-Adapted Interaction, 26(2), 109-142. DOI: https://doi.org/10.1007/s11257-016-9171-0

Majumder, N., et al. (2017). Deep Learning-based Personality Recognition from Facebook Data in Hindi. ICON.

Singh, V., et al. (2019). Leveraging Multilingual BERT for Personality Recognition in Code-Switched Hindi-English Text. arXiv preprint.

Kumar, A., & Reddy, P. (2021). Personality Prediction from Tweets in Indian Languages using Ensemble Learning. Journal of Intelligent Information Systems.

Patil, J., & Sheth, J. (2022). Data Preparation and Quality Challenges for the Personality Recognition in Indian Languages using Machine Learning and Deep Learning Approaches. Journal of IoT in Social, Mobile, Analytics, and Cloud, 4(1), 33-40. DOI: https://doi.org/10.36548/jismac.2022.1.004

Poria, S., et al. (2017). A Review of Affective Computing: From Unimodal Analysis to Multimodal Fusion. Information Fusion. DOI: https://doi.org/10.1016/j.inffus.2017.02.003

Patil, J., & Sheth, J. (2022). Deep Learning and Machine Learning Approaches for the Classification of Personality Traits. In Rising Threats in Expert Applications and Solutions: Proceedings of FICR-TEAS 2022 (pp. 139-146). Singapore: Springer Nature Singapore. DOI: https://doi.org/10.1007/978-981-19-1122-4_16

(Team), M. (2021). MuRIL: Multilingual Representations for Indian Languages. Google AI Blog.

Patil, J., & Sheth, J. (2021). Comparative study of data sources, features, and approaches for automatic personality classification from text. Int. J. Comput. Appl, 174, 0975-8887. DOI: https://doi.org/10.5120/ijca2021920968

Patil, M. J. A., & Godhwani, M. P. B. (2016). Review of Name Entity Recognition in Marathi Language. IJSART, 2(6).

Downloads

Published

31-10-2025

Issue

Section

Research Articles

How to Cite

[1]
Jayashri Patil and Kamini Sharma, “Personality Recognition from Text in Indian Languages : Challenges, Progress, and Future Directions”, Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol, vol. 11, no. 5, pp. 398–401, Oct. 2025, doi: 10.32628/CSEIT251117136.