AI-Driven Cloud Optimization: A Machine Learning Approach to Kubernetes Autoscaling in Financial Platforms
DOI:
https://doi.org/10.32628/CSEIT24102152Keywords:
Kubernetes, Cost Optimization, Intelligent Autoscaling, Financial Platforms, AI-Driven Solutions, Cloud ComputingAbstract
Cloud cost efficiency remains a critical concern for enterprises, especially in always-on Kubernetes environments. This paper details a cost optimization strategy using AI-driven intelligent autoscaling to dynamically provision compute nodes based on real-time workload demands. The solution incorporates usage telemetry, machine learning-based bin-packing algorithms, and custom eviction policies to balance performance and cost. Deployed in a high-availability, HIPAA-aligned payment integrity platform, the framework demonstrated monthly cloud savings of over 40%, while maintaining SLA adherence. This extended analysis provides comprehensive technical details of the ML algorithms, architectural components, implementation specifics, and performance characteristics. The approach serves as a replicable model for financial and healthcare institutions aiming to control operational costs without compromising reliability or compliance.
Downloads
References
Chen, K., & Liu, S. (2019). Machine learning approaches for cloud resource management: A comprehensive survey. Journal of Cloud Computing, 8(1), 1-24.
Johnson, R., Martinez, P., & Wong, L. (2020). Kubernetes autoscaling in production: Lessons from financial services implementations. IEEE Cloud Computing, 7(3), 45-53.
Patel, A., & Thompson, D. (2018). Cost optimization strategies for containerized applications in regulated industries. ACM Computing Surveys, 51(4), 78-95.
Rodriguez, M., Kim, J., & Davis, R. (2017). Intelligent resource provisioning in cloud environments: Algorithms and implementation strategies. Computer Networks, 125, 134-148.
Singh, V., & Miller, C. (2019). Compliance-aware cloud optimization: Balancing cost and regulatory requirements. Information Security Journal, 28(2), 89-104.
Wang, H., Zhang, Q., & Li, X. (2020). Predictive scaling for container orchestration platforms: A machine learning approach. Future Generation Computer Systems, 108, 192-205.
Oleti, Chandra Sekhar. (2023). Cognitive Cloud Security : Machine Learning-Driven Vulnerability Management for Containerized Infrastructure. International Journal of Scientific Research in Computer Science, Engineering and Information Technology. 773-788. 10.32628/CSEIT23564528. DOI: https://doi.org/10.32628/CSEIT23564528
Kamadi, Sandeep. (2022). AI-powered rate engines: modernizing financial forecasting using microservices and predictive analytics. International journal of computer engineering & technology. 13. 220-233. 10.34218/IJCET_13_02_024. DOI: https://doi.org/10.34218/IJCET_13_02_024
Arcot, Siva Venkatesh. (2022). Federated Learning Framework for Privacy- Preserving Voice Biometrics in Multi-Tenant Contact Centers. International Journal For Multidisciplinary Research. 4.
Subbian, Rajkumar. (2023). Advanced Data-Driven Frameworks for Intelligent Underwriting Risk Assessment in Property and Casualty Insurance. International Journal of Scientific Research in Computer Science, Engineering and Information Technology. 880-893. 10.32628/CSEIT2342437. DOI: https://doi.org/10.32628/CSEIT2342437
Oleti, Chandra Sekhar. (2022). The future of payments: Building high-throughput transaction systems with AI and Java Microservices. World Journal of Advanced Research and Reviews. 16. 1401-1411. 10.30574/wjarr.2022.16.3.1281. DOI: https://doi.org/10.30574/wjarr.2022.16.3.1281
Arcot, Siva Venkatesh. (2022). Secure Cloud-Native GNN Architecture for Multi-Channel Contact Center Flow Orchestration. International Journal of Scientific Research in Computer Science Engineering and Information Technology. 8. 565-581. 10.32628/CSEIT2541328. DOI: https://doi.org/10.32628/CSEIT2541328
Sandeep Kamadi. (2022). Proactive Cybersecurity for Enterprise Apis: Leveraging AI-Driven Intrusion Detection Systems in Distributed Java Environments. International Journal of Research in Computer Applications and Information Technology (IJRCAIT), 5(1), 34-52. https://iaeme.com/MasterAdmin/Journal_uploads/IJRCAIT/VOLUME_5_ISSUE_1/IJRCAIT_05_01_004.pdf DOI: https://doi.org/10.34218/IJRCAIT_05_01_004
Arcot, Siva Venkatesh. (2022). Federated Learning Framework for Privacy- Preserving Voice Biometrics in Multi-Tenant Contact Centers. International Journal For Multidisciplinary Research. 4.
Gollapudi, Pavan Kumar. (2022). Intelligent Data Analytics Platform for Insurance Domain Test Data Management and Privacy Preservation. International Journal of Scientific Research in Computer Science, Engineering and Information Technology. 8. 553-564. 10.32628/CSEIT2541327. DOI: https://doi.org/10.32628/CSEIT2541327
Gollapudi, Pavan Kumar. (2022). Predictive Analytics for Proactive Quality Assurance in Guidewire Cloud Implementations. International Journal of Scientific Research in Computer Science Engineering and Information Technology. 8. 520-536. 10.32628/CSEIT23902190. DOI: https://doi.org/10.32628/CSEIT23902190
Chandra Sekhar Oleti. (2022). Serverless Intelligence: Securing J2ee-Based Federated Learning Pipelines on AWS. International Journal of Computer Engineering and Technology (IJCET), 13(3), 163-180. https://iaeme.com/MasterAdmin/Journal_uploads/IJCET/VOLUME_13_ISSUE_3/IJCET_13_03_017.pdf DOI: https://doi.org/10.34218/IJCET_13_03_017
Praveen Kumar Reddy Gujjala. (2022). Enhancing Healthcare Interoperability Through Artificial Intelligence and Machine Learning: A Predictive Analytics Framework for Unified Patient Care. International Journal of Computer Engineering and Technology (IJCET), 13(3), 181-192. https://iaeme.com/Home/issue/IJCET?Volume=13&Issue=3 DOI: https://doi.org/10.34218/IJCET_13_03_018
Gujjala, Praveen Kumar Reddy. (2022). Data science pipelines in lakehouse architectures: A scalable approach to big data analytics. World Journal of Advanced Research and Reviews. 16. 1412-1425. 10.30574/wjarr.2022.16.3.1305. DOI: https://doi.org/10.30574/wjarr.2022.16.3.1305
Downloads
Published
Issue
Section
License
Copyright (c) 2024 International Journal of Scientific Research in Computer Science, Engineering and Information Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.