Kalpa Labs

Scaling Generalist Speech models

Founding ML Research Engineer - Training Infrastructure

$140K - $200KSan Francisco, CA, US
Job type
Full-time
Role
Engineering, Machine learning
Experience
Any (new grads ok)
Visa
Will sponsor
Connect directly with founders of the best YC-funded startups.
Apply to role ›
Prashant Shishodia
Prashant Shishodia
Founder

About the role

We’re hiring a Founding ML Research Engineer to build the pre-training and post-training infrastructure for training some of the largest speech models in the world. You’ll own the training stack end-to-end with a small team, tons of compute, high autonomy, and the mandate to push toward 100B+ scale as we scale generalist speech models.

What you’ll do

  • Design and implement a production-grade training stack for large-scale speech model pre-training and post-training (SFT/RLHF-style, distillation, preference optimization, etc.).
  • Build scalable data + compute pipelines: dataset curation, filtering, mixing, tokenization/feature pipelines, evaluation harnesses, and experiment tracking.
  • Own distributed training: performance profiling, stability, fault tolerance, checkpointing, resumption, and high-throughput I/O.

What we’re looking for

  • Strong ML systems and engineering depth (distributed training, performance, reliability).
  • Practical experience training large models (speech/audio is a plus but not required; language/vision experience is also relevant).
  • Comfort operating in ambiguity: you can spec, build, debug, and ship.

To apply
Reply with your best paper / blog post / arXiv link and a short note on what you built and what you want to build next.

About Kalpa Labs

Kalpa Labs
Founded:2025
Batch:F25
Team Size:2
Status:
Active
Founders
Gautam Jha
Gautam Jha
Founder
Prashant Shishodia
Prashant Shishodia
Founder