Jan 10, 2024

Machine Learning Engineer

Permanent
On-site
San Francisco
USA

Machine Learning Engineer

Specialism: Training and Inference
Location: US – San Francisco / CA
$200,000 - 250,000
  
We’re partnered with a global giant that is a challenger to some of the major social media platforms and they are looking to add creative and self-driven Staff or Principal Machine Learning Engineers to work on various projects at a massive scale with users well above 300M.
  
This is a unique opportunity to build a platform from the ground up (think Facebook in 2006).
  
To give you insight into the role, they currently support close to 300 million users on their platform who have come online in just the last few years and their vision for the product is to revolve around its algorithmically generated content feeds and ensuring users remain engaged and diverse as it grows on a massive scale.
  
From a team perspective, they operate in a free and experimental way, the major difference from their competitors is they allow staff & engineers complete freedom to conduct experiments by learning from both successes and failures to develop highly scalable and state-of-the-art algorithms that will be used by hundreds of millions of people globally.
  
We’re currently attracting subject matter experts from Facebook, Amazon, Apple & Spotify, and we’re keen to hear from candidates who’ve worked on similar projects scaling up large systems (all applications considered).
  
As part of the Training and Inference Team you will build and maintain the tools and services needed to scale ML models on state of the art hardware across the Platform. You will have free reign to experiment and deploy models at lightening speed.
  
The ML engineer will have a hands-on role in building common tools and services and deploying them across the platform, including inference clusters to streamline future research, and high quality reusable libraries.
  
Person Skills:
- Ph.D/ MSc degree in Computer Science or related quantitative discipline
- Experience in training ML models with frameworks like PyTorch Serving: TFServing, Triton, TorchServe,Seldon 2 years of experience working on large systems at massive scale
- Excellent coding skills in at least one of Python, C/C++, Java, NodeJS, Go, Scala
- Experience working with Kubernetes/Kubeflow
- Experience with distributed systems, scalable data processing frameworks (e.g., Spark,Kafka) and noSQL systems (e.g., HBase, Cassandra) is a plus
- Good knowledge of GPU, TPU accelerators and GCP
  
On offer is an excellent package and, in most cases, we are beating current packages. For example, leading salaries, genuine share options in a company in hyper-growth mode, bonuses, health insurance, and more.

This company is an equal opportunity employer and value diversity. They do not discriminate on the basis of race, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

** If you're interested in this opportunity, please submit your CV via the link provided **

Cubiq Recruitment is recognised as a trusted supplier of permanent, contract and interim recruitment services to AI, Software ERP, Engineering, Manufacturing and Commercial sectors. Our teams of specialist recruiters operate across all core commercial engineering & technology disciplines and specialist areas.

Apply form

Max file size 10MB.
Uploading...
fileuploaded.jpg
Upload failed. Max size for files is 10 MB.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Get in touch with our consultant

Jack Cartlidge
PhoneEmailLinkedin

We want to here from you

Get in touch

Our specialists team are waiting for hear from you whether you're a business looking to hire or looking for your next opportunity!