Job Description
Summary
We are seeking a passionate and experienced Data Infrastructure Engineer to play a pivotal role in revolutionizing how we process and use the substantial datasets at the heart of Siri, Search and Machine Learning. You will be instrumental in building a unified, groundbreaking data insights framework and data processing framework, powered by technologies such as Spark or Iceberg. You will collaborate closely with teams with varied strengths (e.g. Data Scientists and Analysts, other Engineering teams) to transform massive data into valuable, actionable datasets. You will also build a metrics platform that fuels our innovative features and future machine learning work.
Description
Minimum Qualifications
- Demonstrated expertise in large-scale data processing, with a strong background of working with Spark and Python or Scala.
- Understanding of distributed computing principles, data engineering and DevOps standard processes.
- Proven programming skills in Python and Scala.
- A genuine passion for working with data and solving complex problems at scale, in cloud platforms (AWS, GCP).
- Experience with machine learning and data mining.
- B.S. degree in Computer Science or Data Science.
Preferred Qualifications
- Metrics infrastructure experience, including metrics sharing, management, and version control.
- PhD or MS in Computer Science.