Job Description
Summary
Do you want to make Siri and Apple products more intelligent for our users? The Information Intelligence Infrastructure team is building groundbreaking technology for search, natural language processing, artificial intelligence and machine learning. Our infrastructure is the back-bone of Apple Intelligence. It powers the largest Apple foundation models on servers and a wide gamut of services at Apple including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri and upcoming ever exciting Apple products serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware.
As part of this group, you will work with one of the most exciting high performance computing environments, with petabytes of data, millions of queries per second, and have an opportunity to imagine and build products that delight our customers every single day. You will have a chance to work on optimizing billions of parameter language and vision and speech models using state of the art technologies and make it run at scale of Apple.
Description
Minimum Qualifications
- Strong background in computer science: algorithms, data structures and system design
- 10+ year experience on large scale distributed system design, operation and optimization
- Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow
- Excellent interpersonal skills able to work independently as well as cross-functionally
Preferred Qualifications
- Proficient in building and maintaining systems written in modern languages (e.g. Golang, Python)
- Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.
- Familiarity with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server etc.
- Experience writing custom CUDA kernels using CUDA or OpenAI Triton.