Job Description

Skill: DevOps Automation
Role: Technical Delivery Lead
Grade: Level 4 or 5
Location: New Jersey

We are seeking a highly skilled Senior DevOps Engineer to design, implement, automate, and support enterprise-scale infrastructure and platform solutions across hybrid cloud and on-premises environments. The ideal candidate will have deep expertise in Linux systems administration, container orchestration, CI/CD automation, big data platforms, observability, and distributed data processing technologies. This role requires strong experience with OpenShift, Kubernetes, MongoDB, Kafka, Flink, Spark/Cloudera ecosystems, virtualization platforms, and infrastructure automation using Ansible. The engineer will collaborate closely with development, architecture, security, and operations teams to build highly available, scalable, secure, and automated platforms supporting mission-critical enterprise applications.

Key Responsibilities

Infrastructure & Platform Engineering
Design, deploy, configure, and maintain enterprise Linux-based infrastructure environments.
Administer and optimize Red Hat/OpenShift and Kubernetes container platforms.
Manage Linux virtual machine environments across VMware, KVM, or cloud-based virtualization platforms.
Implement highly available, fault-tolerant, and scalable infrastructure architectures.
Perform capacity planning, performance tuning, and infrastructure optimization.

Kubernetes & OpenShift Administration
Build and maintain Kubernetes/OpenShift clusters for production and non-production environments.
Configure ingress controllers, networking, storage classes, service mesh, operators, and cluster security.
Automate deployment pipelines and container lifecycle management.
Implement GitOps and Infrastructure-as-Code practices.
Troubleshoot cluster performance, node failures, networking issues, and container runtime problems.

Automation & DevOps
Develop automation solutions using Ansible for provisioning, patching, configuration management, and application deployment.
Build and maintain CI/CD pipelines supporting microservices and distributed platforms.
Standardize deployment and operational processes through scripting and automation.
Integrate security, compliance, and operational controls into deployment workflows.

Data & Streaming Platform Support
Administer and support Apache Kafka clusters, including brokers, topics, partitions, replication, and security.
Support Apache Flink streaming data platforms and real-time processing pipelines.
Manage Cloudera/Spark ecosystems for distributed data processing workloads.
Optimize distributed compute and data platforms for performance and resiliency.
Support data ingestion, streaming, and large-scale analytics environments.

Monitoring & Observability
Implement enterprise monitoring, logging, and observability solutions using Splunk and related tooling.
Develop dashboards, alerts, and operational metrics for infrastructure and application monitoring.
Conduct root cause analysis and incident troubleshooting across distributed systems.
Support production operations, incident response, and problem management activities.

Security & Compliance
Implement infrastructure hardening, RBAC, secrets management, and container security best practices.
Support enterprise security standards, vulnerability remediation, and compliance initiatives.
Ensure operational reliability, backup strategies, and disaster recovery readiness.

Required Qualifications
Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field (or equivalent experience).
6 years of experience in DevOps, Infrastructure Engineering, Site Reliability Engineering, or Platform Engineering roles.
Strong expertise in Linux systems administration (RHEL).
Hands-on experience with OpenShift and Kubernetes administration in enterprise environments.
Extensive experience with Ansible automation and Infrastructure-as-Code methodologies.
Strong experience supporting Kafka, Flink, MongoDB, and Spark/Cloudera platforms.
Experience managing Linux virtual machines and virtualization platforms.
Experience with CI/CD tools and automated deployment pipelines.
Strong scripting skills using Bash, Python, or similar languages.
Experience with monitoring and logging platforms such as Splunk.
Strong troubleshooting and performance tuning capabilities across distributed systems.

Preferred Experience Areas
Enterprise-scale distributed systems
Financial services or high-availability environments
Real-time data streaming platforms
Large-scale containerized environments

Behavioral Skills:
Self-starter, experienced in leading junior resources
Hands-on architect with the ability to implement and validate the solution
Good communication skills
Flexible to rotational shifts
5 days WFO
Team player
Ability to work in a changing environment
Strong problem-solving and analytical skills
Ability to work independently or within a team
Manage day-to-day challenges and com