Manager, Database Administration at HHAeXchange
HHAeXchange · United States of America · Remote
Essential Job Duties
- Architect and maintain complex replication models (e.g., multi-region, real-time) and high-availability solutions like clustering or multi-AZ deployments.
- Ensure data consistency across distributed SaaS environments and hybrid-cloud infrastructures.
- Design and implement end-to-end Disaster Recovery (DR) strategies, ensuring compliance with Recovery Point Objectives (RPO) and Recovery Time Objectives (RTO).
- Participate in regular DR exercises, automated failover testing, and demonstrations of no-outage resiliency.
- Establish and enforce automated data retention, archiving, and purging policies aligned with industry compliance requirements, including those of the healthcare industry.
- Implement and support security measures, including transparent data encryption (TDE), role-based security, and unified auditing.
- Lead proactive performance tuning, query optimization, and capacity management for high-volume, transactional SaaS workloads.
- Monitor and analyze advanced performance metrics to identify and resolve resource contention.
- Serve as a technical advisor for database modernization roadmaps, transitioning legacy systems to cloud-native or containerized architectures.
- Develop standards for database schema design, storage structures, and automated deployment pipelines (CI/CD).
- Manage a global team of database operators to ensure 24/7 system availability, overseeing shift rotations, on-call schedules, and incident response protocols.
- Establish Standard Operating Procedures for the database operations team, covering routine maintenance, patching, and ticket escalation paths.
- Build and promote our "Operations-as-Code" model, mentoring the team in using tools like Terraform and Ansible and in using CI/CD pipelines to automate repetitive tasks.
- Conduct regular technical reviews and provide career development guidance to junior database operators to bridge the gap between operations and engineering.
Other Job Duties
- Other duties as assigned by supervisor or HHAeXchange leader.
Travel Requirements
- Travel up to 10%, including overnight travel.
Required Education, Experience, Certifications and Skills
- Bachelor’s or Master’s degree in Computer Science, Information Systems, or related field and applicable experience.
- 8+ years in database administration, with at least 4 years in a Principal or Lead capacity within an enterprise SaaS environment.
- 3+ years of experience directly managing or leading a team of database operators or site reliability engineers (SREs).
- Expertise in Cloud Data Platforms (e.g., AWS RDS/Aurora, Azure SQL, OCI) and relational systems (PostgreSQL, MySQL, Oracle, or SQL Server) in both Cloud and On-Prem environments.
- Experience with NoSQL solutions (e.g., Redis, Couchbase, MongoDB, Cassandra) in a distributed environment.
- Advanced Automation/Scripting skills (Python, Bash, Terraform, or Ansible) for infrastructure-as-code and task automation.
- Proven experience with Large-Scale Data Migration and zero-downtime upgrades.
- Proven ability to lead a "War Room" during critical outages, coordinating between operations staff and executive stakeholders.
- Experience defining and reporting on operational KPIs (e.g., Mean Time to Resolve, system uptime, and backup success rates) to senior leadership.
- Experience managing distributed or offshore operations teams in a follow-the-sun support model.
- Experience analyzing and negotiating license and hosting agreements from a performance and cost perspective.
- Strong analytical thinking, complex problem-solving, and the ability to communicate technical strategies to non-technical stakeholders.
- Willingness to explore and adopt AI tools responsibly to enhance productivity and innovation in your role.