We are looking for a Cloud Infra Architect with AWS to lead the architecture and implementation for Launch Darkly. This person must have prior experience in leading this implementation and roll out to application teams at large scale.
Launch Darkly Feature Flags with Progressive rollouts, A/B Testing with Zero Downtime & Full Automation
This document outlines the comprehensive requirements for vendor partners to support initiatives aimed at achieving zero downtime, reducing production incidents, improving change failure rate metrics, and enabling full automation. The scope includes feature flags, A/B testing, progressive rollout, and support for deployment patterns across APIs, EKS, OnPrem, Lambdas, and other AWS services.
Functional Scope
Zero Downtime Deployments
Implement blue/green or canary deployment models with seamless traffic switching, rollback capability, and session persistence.
Change Failure Rate Reduction
Integrate root cause tracking, automated rollback, and pre-deployment validation pipelines.
Feature Flags
Enable real-time toggling, secure access control, and auditability. Must support both server-side and client-side toggles.
A/B and B/G Testing
Support traffic segmentation, real-time metrics, rollback, and privacy compliance.
Progressive Rollouts
Automate staged rollouts by region, user cohort, or environment. Include rollback triggers based on metrics.
Automation & CI/CD
Full GitHub Actions integration, dynamic runners, and golden path patterns for EKS, Lambda, and OnPrem.
Environment Patterns
Support for APIs, EKS, OnPrem, Lambdas, Kafka, Glue, RDS, S3, and other AWS services.
Observability & Metrics
Integrate with Grafana, Splunk, and DORA metrics (lead time, change frequency, failure rate, MTTR).
Self-Service Enablement & Onboarding/Migration support for feature flags
Empower teams with Express Lane-style pipelines, role-based access, and audit trails.
Expected Outcomes
- Pilot with at least 5 teams by Nov’2025
- We need enterprise adoption ready by Nov with at least 5 Patterns inclusive of Cloud & OnPrem
- 99.9%+ availability during deployments.
- 99%+ reduction in change failure rate.
- Full automation of provisioning, testing, and deployment pipelines
- Full automation and governance for E2E feature flag lifecycle management
Non-Functional Requirements (NFRs)
Performance
Low-latency toggling, fast rollback, <4 min deployment time.
These cookies are necessary for the website to function and cannot be turned off in our systems. You can set your browser to block these cookies, but then some parts of the website might not work.
Security
User experience
Target group oriented cookies
These cookies are set through our website by our advertising partners. They may be used by these companies to profile your interests and show you relevant advertising elsewhere.
Google Analytics
Google Ads
We use cookies
🍪
Our website uses cookies and similar technologies to personalize content, optimize the user experience and to indvidualize and evaluate advertising. By clicking Okay or activating an option in the cookie settings, you agree to this.
The best remote jobs via email
Join 5'000+ people getting weekly alerts with remote jobs!