Hosting Lead (Senior Task Lead/Systems Engineer - Hosting) chez Node.Digital
Node.Digital · Herndon, États-Unis d'Amérique · Remote
Description
Hosting Lead (Senior Task Lead / Systems Engineer - Hosting)
Location: Remote Work
- Clearance: Must meet DHS/USCIS background investigation/EOD; support $24x7 after-hours response.
Role Summary
Lead day-to-day Operations & Maintenance (O&M) across hybrid multi-cloud enterprise, DHS data centers, Equinix data centers, and cloud environments (AWS, Azure, GCP). Own infrastructure readiness, patching, capacity, performance, and Tier II-III incident/problem resolution. Drive automation-first operations and ensure compliance with EIOSS/eAUTO SLAs and AQLs.
Key Responsibilities
- End-to-End O&M: Own O&M for servers, storage, backup, databases, virtualization, and middleware; ensure operational acceptance with As-Built/VDD/Runbooks.
- Patching & Remediation: Lead patching, image baselines, remediation, and hardening for Windows, Red Hat/CentOS, and Solaris. Run integrated patch IPTs with DBAs, Security, and Field Ops.
- Infrastructure Management: Manage VMware vSphere, Microsoft Hyper-V, Citrix/VDI, FlexPod, NetApp, and enterprise backup (NetBackup, Backup Exec, Veeam).
- Modern Hosting: Run CI/CD-enabled hosting using Jenkins, Harness, Git, and Sonatype; operate OpenShift, Docker, Kubernetes, and UiPath.
- Performance Metrics: Deliver bi-weekly health checks and monthly metrics. Meet SLAs/AQLs (e.g., $\ge99.95\%$ availability) and produce capacity/incident reports.
- Automation & ITSM: Champion automation (Ansible/Chef) and ServiceNow ITSM/ITOM integration; maintain CMDB accuracy.
- Collaboration: Lead collaboration with customers and partners to break down silos and drive automation adoption across Engineering, Security, and the TOC.
Requirements
Required Technology Experience
- OS: Windows Server 2019+, Red Hat/CentOS, Solaris 11 (SPARC).
- Virtualization/VDI: VMware vSphere, Microsoft Hyper-V, Citrix.
- Containers/PaaS: Red Hat OpenShift, Docker, Kubernetes, PCF.
- Storage/Backup: NetApp (OCI), FlexPod, NetBackup, Backup Exec, Veeam.
- Databases: Oracle (Grid/RAC/ASM), Microsoft SQL Server, RMAN/Data Guard.
- Tooling: Jenkins, Git, Sonatype, ServiceNow, SolarWinds/SCOM, Twistlock/Prisma, Harness, UiPath.
Preferred Experience & Certifications
- 10+ years in large-scale engineering & O&M; scripting (PowerShell, Bash, Python) and multi-cloud ops.
- IaC/DevSecOps (Ansible/Terraform) and AIOps; 2-5 years of DHS experience with DHS 4300A knowledge.
- Certifications: Cisco CCNP Data Center, VMware VCP, NetApp NCIE/NCDA, RHCSA/RHCE, Microsoft, ITIL v4.