Summary
Engineer with customer outward focus and extensive experience in Linux Systems, large-scale Cloud Infrastructure, and Container Orchestration. Passionate about systems automation, troubleshooting complex systems, and enhancing observability to drive high availability and resilience in cloud environments. Skilled at delivering exceptional support experiences and adept at resolving mission-critical issues to empower partners, customers, engineering teams.
Education
- M.S. in Computer Engineering, University of Bridgeport
- Bachelors in Technology – Electronics & Communications
Certifications
- RedHat Certified Engineer 9 - 160-139-218 .
- RedHat Certified System Admin. 8 - 160-139-218 .
- Certified Kubernetes App. Developer - LF-7e8li1lr2e .
- Certified Kubernetes Admin. - CKA-1900-003762-0100.
- ITIL v3 Certified - 4711930.1191862
Skills
Operating Systems:
Red Hat Enterprise Linux (RHEL) 6, 7, 8, 9, Ubuntu 16+, SUSE Linux Enterprise Server 12/15.
Virtualization:
KVM, Libvirt, VMware ESXi 5.X/6.X, Microsoft Hyper-V
Container Management:
Azure Kubernetes Service (AKS), Docker, Kubernetes, Microservices.
Scripting & Coding:
PowerShell, AzureCLI, Bash & Python.
CI/CD & Automation:
Azure DevOps, GitHub Actions, GitHub, Git, Ansible, Terraform, ARM Templates.
Observability:
Prometheus, Grafana, Kusto, Azure Data Explorer.
Cloud Environments:
Microsoft Azure, Amazon Web Services (AWS), Verizon (Terremark).
Others:
Pacemaker Clustering, NGINX, Performance Tuning, Root Cause Analysis, Troubleshooting Distributed Systems, Technical Training Deliveries, Customer Obsession, Collaboration with Engineering Teams, Working with global teams across different continents, Case & Trend Analysis, Cross functional support collaborations.
Experience
Microsoft, Texas
March 2018 to Present
Sr. Escalation Engineer
- Led Red Hat Enterprise Linux Escalation Partnership - Served as Global Red Hat Lead, conducting monthly case analysis and collaborating with Microsoft Account teams to reduce customer escalations and improve integrated support experience for Azure Linux workloads running in RHEL. Reduced RedHat partner escalations from 90–120 per month to less than 25 cases /month, over a 2.5-year period. That is a 75+% volume drop in escalations with RedHat partner.
- Architected scalable debugging infrastructure - Designed and implemented Linux debugging environment on Azure Kubernetes Service for automated RCAs & support incidents, reducing incident resolution time by 50%.
- Complex Technical Issue Resolutions - Provided Tier-3 escalation support for mission-critical Linux workloads (RHEL, SUSE, Ubuntu) across Fortune 100 customers, specializing in performance optimization, root cause analysis, design optimizations, OS upgrades, High Availability Clustering and kernel-level troubleshooting.
- Enhanced Operational Excellence through Documentation & Tooling - Authored comprehensive public-facing troubleshooting guides and runbooks for RHEL on Azure, increasing team Linux proficiency by 35% and reducing partner escalation volume. Developed Insights in internal tooling to speed up issue isolation and case resolution. Developed key modules for the Kernel Process & Debugging Technology Group, creating an advanced Linux Debugging (L400) course and automating the certification lab environment, reducing lab setup time by 70%.
- Managed enterprise lifecycle transitions - Orchestrated RHEL7 Extended Life Cycle Support rollout and RHEL6 sunset strategy across Azure infrastructure, ensuring zero-downtime migrations for enterprise customers. Led engineering efforts for RHEL6 Extended Lifecycle Support on Azure, managing all customer add-on requests and billing workflows, ensuring compliance and smooth service continuity.
- Cross-functional Collaboration with Engineering Teams - Partnered with Azure Platform Engineering, AKS Engineering, and Linux Systems Group (Azure Linux) to resolve critical issues, supportability meetings to improving overall platform experience and customer satisfaction for Linux workloads.
- Represented Microsoft Customer Support org. at Microsoft Ignite 2019, delivering a live demo on building a Kubernetes cluster in Azure from scratch using Ansible and Terraform, engaging an audience of 80+ attendees.
Virginia Tech, Blacksburg VA
June 2017 to March 2018
Linux Systems Administrator
- Designed and implemented a Secondary Site failover environment, improving disaster recovery readiness and reducing potential downtime by >90% during failover events.
- Automated VM provisioning via one-click Ansible Tower workflows, cutting deployment time from hours to minutes.
- Configured and deployed PowerDNS for the University Library IT infrastructure, enhancing DNS reliability and management efficiency.
- Provisioned and configured multiple AWS services to support scalable and secure application deployments.
- Applied security hardening and traffic management policies on F5 devices, improving application availability and reducing security vulnerabilities.
- Automated Linux OS patching with Ansible, achieving near-zero manual intervention and ensuring consistent compliance across server fleets.
Amazon Web Services - Contract, Herndon VA
Feb 2017 – June 2017
Systems Engineer
- Ranked #1 in team performance metrics for three consecutive weeks, demonstrating consistent technical excellence and efficiency.
- Resolved a backlog of critical network hardware tickets, restoring service reliability and reducing open issues by 100% within the assigned timeframe.
- Performed bare-metal Linux installations for EC2/S3/EBS hosts, including performance stress testing and physical asset management to ensure optimal infrastructure readiness.
- Configured and troubleshot compute and networking hardware in data centers, ensuring high availability and minimizing downtime for production environments.
Verizon Data Services Pvt. Ltd. - Hyderabad, India
Sept 2014 to July 2015
Systems Engineer
- Resolved automation defects in HP Operations Orchestrator, the core orchestration engine for Verizon Enterprise Cloud, restoring seamless workflow automation.
- Led rapid incident response during a critical outage, delivering immediate solutions to enterprise customers and minimizing business impact.
- Collaborated with enterprise customers to troubleshoot and resolve Linux infrastructure issues, aligning solutions with business-critical requirements.
- Upgraded the entire VMware ESXi infrastructure to the latest version with zero downtime, ensuring uninterrupted customer operations.
- Automated server audits and patch management using Ansible and HPE Server Automation (HPSA), improving compliance and reducing manual effort by >80%.
HCL Technologies Ltd., India
Jan 2013 to Sept 2014
Sr. Analyst Linux/VMware
- VMware and Linux Administration of a large cluster of servers, both physical and virtual environments.
- Incident/Change/Request Management of various issues.
- Worked on complete life-cycle of server from powering them in Rack to Decommissioning.
- L2 level support for all issues related to Linux and ESXi.
HCL Technologies Ltd., India
Aug 2011 to Jan 2013
Analyst Windows/Hyper-V
- Automated Server Audits using PowerShell scripting. Worked extensively on PowerShell to automate routine tasks.
- Created post-validation scripts in PowerShell after Patch activity and scripts for disk monitoring and reporting.
- Physical/Virtual Server builds on Hyper-V
- Patch Management of Windows servers through WSUS, troubleshooting issues related to DHCP and DNS.
- SCOM gateway server installation/configurations, SCOM DB monitoring and maintenance, installed SCOM agents on different Windows 2008/2012 servers.
- DHCP server reservations/maintenance. Configuring/Modifying file permissions for users and groups over file-shares.