Bio
Nagarjuna Malladi is a Principal Site Reliability Engineer at Oracle America, Inc., with over 12 years of expertise in enhancing system reliability, optimizing monitoring solutions, and driving operational efficiency. Nagarjuna has a proven track record in leveraging advanced CI/CD tools and cloud technologies to improve service uptime and reduce operational costs. Currently, at Oracle, Nagarjuna implements real-time monitoring systems for Oracle’s Health Sciences Global Business Unit (HSGBU), using tools like Prometheus, Grafana, and the ELK stack, resulting in a 15% improvement in uptime and a 40% reduction in log analysis time. His work includes developing Service Level Indicator (SLI), Service Level Objective (SLO), and Service Level Agreement (SLA) metrics to ensure seamless communication between development and operations teams, significantly enhancing customer satisfaction. Prior to Oracle, Nagarjuna held key roles at Cisco and Apple, where he pioneered cloud resource optimization strategies, reducing operational costs by 30%, and led initiatives to automate deployment processes and infrastructure management, cutting deployment times by 40%. At Cisco, he was instrumental in developing Python automation scripts for Jenkins pipelines and deploying microservices architectures for scalable environments. Nagarjuna holds a Bachelor’s degree in Information Technology from Jawaharlal Nehru Technological University, Kakinada, India, and is certified in Oracle Cloud Infrastructure (OCI) Foundations and Observability. His expertise spans cloud services such as AWS and OCI, programming languages including Bash, Python, and Go, and an array of monitoring tools like Thousand Eyes, Nagios, and Zabbix. His commitment to innovation and operational excellence continues to drive the resilience and scalability of critical infrastructures across industries.