Site Reliability Engineer

Apple


Job Location:

Vancouver, WA - USA

Monthly Salary: Not Disclosed
Posted on: 17 hours ago
Vacancies: 1 Vacancy

Job Summary

The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes tools and automation for managing distributed systems in production environments. Our SRE team combines software and systems engineering and system administration practices to build and run large-scale massively distributed fault-tolerant systems. Our software ensures that Apples services are reliable scalable and secure and we leverage both open source and home-grown technologies to provide managed data infrastructure services. You will help building next generation search infrastructure and platform services collaborating cross-functionally with various ASE teams from store and commerce to search and recommendations. Youll create platforms that can rapidly scale to serve personalized and non-personalized data with very low latencies. You should be someone who is not afraid to question assumptions are a good standout colleague under tight deadlines and can take on problems with elegant technical solutions.

The ASE SRE team develops applications and tooling that are safe reliable scalable and fast. Our Data Reliability Engineering team is responsible for all aspects of managing Voldemort key-value distributed database infrastructure deployment on on-premise bare metal and public cloud platforms including maintenance deployment automation backup observability and telemetry with focus on reliability performance and scaling to deliver continuous data store availability to ASE Media Applications. Success in this role requires expertise in several of the following:nn- Understanding of core SRE concepts - Monitoring Alerting Incident managementnn- Performance engineering (design concepts profile-guided optimization)nn- Service management across bare metal and virtualized (EC2) platforms nn- Prepare alert handling procedures run-books and collaborate with other SRE team members. nn- Excellent communication and a high degree of customer focus when engaging with internal platform customers nn- As a distributed team ability to work optimally with colleagues based in other locations is also essential; experience in this area is a plusnn- Prior experience with development or maintenance of distributed databases and operating systems is recommended nnCome join us at Apple Services Engineering and help us deliver services and applications that are fluid and responsive. You will collaborate with engineers from across Apple to define the metrics set targets uncover optimization opportunities and ship a service that will delight our customers. This role is for engineers who enjoy deep technical engineering that spans large cross-organizational projects. Your openness to learning and implementing new technologies will contribute to the continuous evolution of our organization. Good ideas are valued and rewarded.

Success in this role requires expertise in several of the following:nnBS/MS in Computer Science or Equivalent nAt least 2-5 years in a Reliability Engineering DevOps or infrastructure focused rolenSupport of internet-facing production services and distributed systems via deployments onCall and Incident of distributed database concepts (consistency models isolation levels crash and recovery semantics).nPerformance engineering (design concepts profile-guided optimization).nDatacenter architecture (networking topologies host placement strategies and failure modes); design of multi-datacenter systems; failure domains; and wide-area advocate - prior history of removing operational toil via software.n Self motivated inquisitive and always looking to learn more.

Demonstrated expertise developing distributed systems storage engines distributed systems or performance developing critical internet services and/or platform in one or more of the following programming languages: Java Go (golang) PythonnOptional experience managing services on Kubernetes nOptional experience with EC2 EBS and Terraform

Required Experience:

IC

The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes tools and automation for managing distributed systems in production environments. Our SRE team combines software and systems engineering and system administration practices to b...

About Company

Company Logo

Ask Siri to name the most successful company in the world and it might respond: Apple. And it's not just out of familial pride. Apple consistently ranks highly in profit, revenue, market capitalization, and consumer cachet. In 2018, the company became the first reach a trillion dollar ... View more

View Profile View Profile