Cloud Site Reliability Engineer (SRE)


Job Location:

Charlotte, VT - USA

Monthly Salary: Not Disclosed
Posted on: 2 hours ago
Vacancies: 1 Vacancy

Job Summary

We are seeking an experienced Cloud Site Reliability Engineer (Cloud SRE) to support a critical infrastructure transition initiative involving disaster recovery (DR) modernization, backup environment migration, and cloud infrastructure resiliency. This role will focus on ensuring the continuity, availability, recoverability, and operational stability of business-critical applications and services throughout a large-scale infrastructure transformation.

This is a hands-on contract position responsible for planning, executing, validating, and operationalizing disaster recovery and backup solutions while collaborating closely with infrastructure, cloud, application, and security teams.

Key Responsibilities

Disaster Recovery & Infrastructure Migration

  • Plan and execute migrations of disaster recovery and backup environments associated with data center consolidations, infrastructure modernization efforts, or cloud transformation initiatives.
  • Support and improve DR processes and testing initiatives.
  • Support the relocation, re-platforming, or modernization of DR environments for applications, databases, and business-critical services.
  • Ensure recovery architectures meet established recovery objectives, resiliency requirements, and operational standards.

Disaster Recovery Testing & Validation

  • Plan, coordinate, and execute disaster recovery exercises and failover testing activities.
  • Validate backup recovery procedures, restoration processes, recovery sequencing, and operational readiness.
  • Identify recovery gaps, operational risks, and remediation opportunities to improve resiliency and recoverability.
  • Document test results, lessons learned, and recommended improvements.

Cloud & Hybrid Infrastructure Operations

  • Support cloud and hybrid infrastructure platforms related to backup, disaster recovery, and business continuity.
  • Assist in the implementation and operationalization of cloud-based backup and recovery solutions.
  • Execute operational runbooks, standard operating procedures, and recovery processes aligned with approved designs and governance standards.
  • Contribute to infrastructure reliability, automation, monitoring, and operational excellence initiatives.

Operational Coordination & Documentation

  • Collaborate with application owners, infrastructure engineers, cloud teams, security teams, and third-party vendors during migration and testing activities.
  • Track project milestones, dependencies, risks, and execution progress.
  • Maintain accurate documentation of environments, configurations, recovery procedures, and operational outcomes.
  • Provide status updates and technical recommendations to project stakeholders.

Required Qualifications

  • 5 years of experience in Site Reliability Engineering (SRE), Cloud Operations, Infrastructure Engineering, or related disciplines.
  • MUST HAVE Disaster Recovery (DR) testing experience.
  • MUST HAVE experience with automation and reliability engineering.
  • Hands-on experience supporting enterprise disaster recovery environments, backup systems, and business continuity initiatives.
  • MUST HAVE a strong understanding of disaster recovery concepts, resiliency strategies, recovery testing, backup technologies, and infrastructure migration methodologies.
  • Experience operating within cloud, hybrid cloud, or multi-cloud environments.
  • Proven ability to execute complex infrastructure projects within defined timelines and operational constraints.
  • Strong troubleshooting, analytical, and problem-solving skills.
  • Excellent communication, collaboration, and documentation abilities.

Preferred Qualifications

  • Experience supporting data center migrations, colocation exits, or infrastructure modernization projects.
  • Knowledge of cloud-native backup and disaster recovery platforms.
  • Experience with AWS, Microsoft Azure, Google Cloud Platform (GCP), or hybrid cloud environments.
  • Familiarity with infrastructure automation and operational tooling.
  • Experience working within regulated industries such as financial services, healthcare, insurance, or government.
  • Strong operational discipline and a process-oriented mindset.

Technical Skills

  • Disaster Recovery Planning & Execution
  • Backup & Recovery Solutions
  • Cloud Infrastructure (AWS, Azure, GCP)
  • Hybrid Infrastructure Operations
  • Infrastructure Reliability & Resiliency
  • Recovery Testing & Validation
  • Operational Runbooks & Documentation
  • Infrastructure Migration & Transformation
  • Monitoring & Operational Support
  • Incident Response & Problem Resolution
