Lead Site Reliability Engineer ⚡ Urgent
This is a critical role at ChartIQ: building, maintaining, and scaling the infrastructure that supports Development and QA, while driving new cloud-based solutions.

Design, implement, and manage infrastructure using Terraform or other IaC tools, and use AWS to build scalable, high-performance infrastructure. Implement site reliability practices using DataDog for monitoring and alerting, design high-availability architecture, and manage CI/CD workflows using GitHub Actions. Work with OIDC integrations across Microsoft, AWS, GitHub, and Okta. Contribute to QA testing and light JavaScript programming, including HTML and CSS fixes. Assist with mobile app deployment to the Apple App Store and Google Play Store.

Requires 10-20 years of experience with Terraform, AWS, DataDog, ServiceNow CMDB, GitHub Actions, and OIDC integrations, plus JavaScript/HTML/CSS skills and network management experience. Working hours must overlap with US teams until 12 noon EST.