Apple's Media Graphics and Compute Technologies Group (MGC) is looking for a talented and dedicated big data engineer to join our Data Engineering team. The Data Engineering team within the MGC organization plays a critical role in supporting data-driven analytics by providing data collection, warehousing, and analytics at big data scale. Our team provides the infrastructure that powers numerous trend and operational dashboards, as well as other ad-hoc use cases, in support of services like Apple TV, Apple Music, and FaceTime. We are leveraging Generative AI and Machine Learning technologies to provide best-in-class data analytics, and this role offers the opportunity to help design, enhance, and develop our very-high-volume processing pipeline. You'll work with talented engineers within our team, as well as with cross-functional teams, in an agile and dynamic environment that values engineering excellence, creativity, and innovation. You will be a key contributor to our next-generation processing pipeline and data analytics platform.
Our team leverages modern Data Engineering, Generative AI, and Machine Learning technologies to deliver actionable insights. You will be:
Collaborating with data scientists across functional teams to define and enhance performance metrics that provide valuable insights for stakeholders
Building and maintaining:
 - Ingestion pipelines for real-time data processing
 - Real-time applications driving operational monitoring
 - Batch ETL/ELT applications populating our data warehouse
Applying Generative AI and Retrieval Augmented Generation (RAG) techniques to enhance data analytics capabilities
Applying Machine Learning technologies for anomaly detection
Tuning and scaling Apache Kafka producer/consumer, Spark Structured Streaming, and Flink applications running in cloud environments
Managing and monitoring large-scale data collection and analytics pipelines at the application level
Performing capacity planning to scale infrastructure and applications running on Kubernetes
Troubleshooting production issues and conducting performance analysis of distributed systems
Collaborating with cross-functional teams to ensure high availability and reliability of data pipelines
Keeping up with the latest data engineering trends and applying the corresponding technologies
Bachelor's degree in Computer Science or equivalent professional experience
Experience in building large-scale distributed systems in Java, Python, or similar languages
Proficiency in SQL
Experience with data warehouse architectures and dimensional modeling
Demonstrated ability to conduct performance analysis and troubleshoot large-scale distributed systems
Strong collaboration skills, with the ability to understand complex architectures and work effectively across teams
Hands-on experience with Docker and Kubernetes
Production experience with Apache Kafka, Spark, or Flink
Working knowledge of Trino or similar distributed query engines
Experience building multi-agent AI systems or agentic workflows
Familiarity with Retrieval Augmented Generation (RAG) techniques working in conjunction with LLMs
Experience with creating and consuming Model Context Protocol (MCP) services
Required Experience:
IC
Ask Siri to name the most successful company in the world and it might respond: Apple. And it's not just out of familial pride. Apple consistently ranks highly in profit, revenue, market capitalization, and consumer cachet. In 2018, the company became the first to reach a trillion dollar ...