Site Reliability Engineer III
As a Site Reliability Engineer III at JPMorgan Chase, you will be responsible for ensuring the reliability, availability, and performance of large-scale distributed systems that power critical financial services. You will design and implement automation frameworks to reduce manual toil, build monitoring and alerting solutions, and drive incident response and root cause analysis for production issues. The role requires close collaboration with software engineering teams to influence system architecture for improved resiliency and scalability. You will develop and maintain CI/CD pipelines, manage infrastructure as code, and establish SLIs/SLOs to measure service health. Additionally, you will lead capacity planning efforts, conduct chaos engineering experiments, and champion SRE best practices across the organization.