Talent.com
Machine Learning Data Engineer - Systems & Retrieval
Zyphra • Palo Alto, CA, United States
Posted: 30 days ago
Job type: Full-time
Job Description:

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role:

As a Machine Learning Data Engineer - Systems & Retrieval, you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets, from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the data foundation on which intelligent systems are trained and from which they retrieve and reason.

You’ll work across:

Design and implementation of distributed data ingestion and transformation pipelines

Building retrieval and indexing systems that support RAG and other LLM-based methods

Mining and organizing large unstructured datasets, both in research and production environments

Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability

Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements:

Strong software engineering background with fluency in Python

Experience designing, building, and maintaining data pipelines in production environments

Deep understanding of data structures, storage formats, and distributed data systems

Familiarity with indexing and retrieval techniques for large-scale document corpora

Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics

Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)

Excellent debugging, observability, and logging practices to support reliability at scale

Strong communication skills and experience collaborating across ML, infra, and product teams

Bonus Skill Set:

Experience building or maintaining LLM-integrated retrieval systems (e.g., RAG pipelines)

Academic or industry background in data mining, search, recommendation systems, or IR literature

Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar

Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval

Understanding of data validation and quality assurance in machine learning workflows

Experience working on cross-functional infra and MLOps teams

Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops

Comfort working across raw, unstructured data, structured databases, and model-ready formats

Why Work at Zyphra:

Our research methodology is to take grounded, methodical steps toward ambitious goals. Deep research and engineering excellence are equally valued

We strongly value new and crazy ideas and are very willing to bet big on them

We move as quickly as we can; we aim to keep the bar to impact as low as possible

We all enjoy what we do and love discussing AI

Benefits and Perks:

Comprehensive medical, dental, vision, and FSA plans

Competitive compensation and 401(k)

Relocation and immigration support on a case-by-case basis

On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

In-person team in Palo Alto, CA, with a collaborative, high-energy environment

If you're excited by the challenge of high-scale, high-performance data engineering in the context of cutting-edge AI, you’ll thrive in this role. Apply Today!
