seeking an experienced Cloud Site Reliability Engineer (Cloud SRE) to support a critical infrastructure transition initiative involving disaster recovery (DR) modernization backup environment migration and cloud infrastructure resiliency. This role will focus on ensuring the continuity availability recoverability and operational stability of business-critical applications and services throughout a large-scale infrastructure transformation.
This is a hands-on contract position responsible for planning executing validating and operationalizing disaster recovery and backup solutions while collaborating closely with infrastructure cloud application and security teams.
Key Responsibilities
Disaster Recovery & Infrastructure Migration
Plan and execute migrations of disaster recovery and backup environments associated with data center consolidations infrastructure modernization efforts or cloud transformation initiatives.
Responsible for supporting and improving DR processes and testing initiatives.
Support the relocation re-platforming or modernization of DR environments for applications databases and business-critical services.
Ensure recovery architectures meet established recovery objectives resiliency requirements and operational standards.
Disaster Recovery Testing & Validation
Plan coordinate and execute disaster recovery exercises and failover testing activities.
Identify recovery gaps operational risks and remediation opportunities to improve resiliency and recoverability.
Document test results lessons learned and recommended improvements.
Cloud & Hybrid Infrastructure Operations
Support cloud and hybrid infrastructure platforms related to backup disaster recovery and business continuity.
Assist in the implementation and operationalization of cloud-based backup and recovery solutions.
Execute operational runbooks standard operating procedures and recovery processes aligned with approved designs and governance standards.
Contribute to infrastructure reliability automation monitoring and operational excellence initiatives.
Operational Coordination & Documentation
Collaborate with application owners infrastructure engineers cloud teams security teams and third-party vendors during migration and testing activities.
Track project milestones dependencies risks and execution progress.
Maintain accurate documentation of environments configurations recovery procedures and operational outcomes.
Provide status updates and technical recommendations to project stakeholders.
Required Qualifications
5 years of experience in Site Reliability Engineering (SRE) Cloud Operations Infrastructure Engineering or related disciplines.
MUST HAVE Disaster Recovery (DR) testing experience.
MUST HAVE experience with automation and reliability engineering
Hands-on experience supporting enterprise disaster recovery environments backup systems and business continuity initiatives.
MUST HAVE strong understanding of disaster recovery concepts resiliency strategies recovery testing backup technologies and infrastructure migration methodologies.
Experience operating within cloud hybrid cloud or multi-cloud environments.
Proven ability to execute complex infrastructure projects within defined timelines and operational constraints.
Strong troubleshooting analytical and problem-solving skills.
Excellent communication collaboration and documentation abilities.
Preferred Qualifications
Experience supporting data center migrations colocation exits or infrastructure modernization projects.
Knowledge of cloud-native backup and disaster recovery platforms.
Experience with AWS Microsoft Azure Google Cloud Platform (GCP) or hybrid cloud environments.
Familiarity with infrastructure automation and operational tooling.
Experience working within regulated industries such as financial services healthcare insurance or government.
Strong operational discipline and process-oriented mindset.
Technical Skills
Disaster Recovery Planning & Execution
Backup & Recovery Solutions
Cloud Infrastructure (AWS Azure GCP)
Hybrid Infrastructure Operations
Infrastructure Reliability & Resiliency
Recovery Testing & Validation
Operational Runbooks & Documentation
Infrastructure Migration & Transformation
Monitoring & Operational Support
Incident Response & Problem Resolution
seeking an experienced Cloud Site Reliability Engineer (Cloud SRE) to support a critical infrastructure transition initiative involving disaster recovery (DR) modernization backup environment migration and cloud infrastructure resiliency. This role will focus on ensuring the continuity availability ...
seeking an experienced Cloud Site Reliability Engineer (Cloud SRE) to support a critical infrastructure transition initiative involving disaster recovery (DR) modernization backup environment migration and cloud infrastructure resiliency. This role will focus on ensuring the continuity availability recoverability and operational stability of business-critical applications and services throughout a large-scale infrastructure transformation.
This is a hands-on contract position responsible for planning executing validating and operationalizing disaster recovery and backup solutions while collaborating closely with infrastructure cloud application and security teams.
Key Responsibilities
Disaster Recovery & Infrastructure Migration
Plan and execute migrations of disaster recovery and backup environments associated with data center consolidations infrastructure modernization efforts or cloud transformation initiatives.
Responsible for supporting and improving DR processes and testing initiatives.
Support the relocation re-platforming or modernization of DR environments for applications databases and business-critical services.
Ensure recovery architectures meet established recovery objectives resiliency requirements and operational standards.
Disaster Recovery Testing & Validation
Plan coordinate and execute disaster recovery exercises and failover testing activities.
Identify recovery gaps operational risks and remediation opportunities to improve resiliency and recoverability.
Document test results lessons learned and recommended improvements.
Cloud & Hybrid Infrastructure Operations
Support cloud and hybrid infrastructure platforms related to backup disaster recovery and business continuity.
Assist in the implementation and operationalization of cloud-based backup and recovery solutions.
Execute operational runbooks standard operating procedures and recovery processes aligned with approved designs and governance standards.
Contribute to infrastructure reliability automation monitoring and operational excellence initiatives.
Operational Coordination & Documentation
Collaborate with application owners infrastructure engineers cloud teams security teams and third-party vendors during migration and testing activities.
Track project milestones dependencies risks and execution progress.
Maintain accurate documentation of environments configurations recovery procedures and operational outcomes.
Provide status updates and technical recommendations to project stakeholders.
Required Qualifications
5 years of experience in Site Reliability Engineering (SRE) Cloud Operations Infrastructure Engineering or related disciplines.
MUST HAVE Disaster Recovery (DR) testing experience.
MUST HAVE experience with automation and reliability engineering
Hands-on experience supporting enterprise disaster recovery environments backup systems and business continuity initiatives.
MUST HAVE strong understanding of disaster recovery concepts resiliency strategies recovery testing backup technologies and infrastructure migration methodologies.
Experience operating within cloud hybrid cloud or multi-cloud environments.
Proven ability to execute complex infrastructure projects within defined timelines and operational constraints.
Strong troubleshooting analytical and problem-solving skills.
Excellent communication collaboration and documentation abilities.
Preferred Qualifications
Experience supporting data center migrations colocation exits or infrastructure modernization projects.
Knowledge of cloud-native backup and disaster recovery platforms.
Experience with AWS Microsoft Azure Google Cloud Platform (GCP) or hybrid cloud environments.
Familiarity with infrastructure automation and operational tooling.
Experience working within regulated industries such as financial services healthcare insurance or government.
Strong operational discipline and process-oriented mindset.