Chai is one of the fastest-growing generative AI startups in Silicon Valley. Think YouTube, but for LLMs: we have over 1 million active users.
Who we are looking for:
We need a relentless engineer with 3+ years of experience to oversee and take responsibility for optimizing our LLMs, ensuring they are performant, scalable, and cost-efficient. You will work alongside equally talented and driven teammates implementing cutting-edge AI inference engines. We need someone who is reliable and has high standards.
Here’s why we might not be the right fit for you:
• We work hard and have a high-velocity environment with lots of growth opportunities.
• We value exceptional performance and continuous improvement. We believe that if you aren’t constantly learning, you aren’t growing.
• You will be responsible and accountable for making high-impact decisions that determine Chai’s future.
Here are the top 2 reasons why you should join us:
• Exponential growth. We have 1 million MAU. Join the team that gets us to 100 million MAU.
• Craftsmanship. Build something beautiful.
Requirements:
• Familiar with vLLM, quantization, and current techniques of LLM optimization
• 3+ years of experience in software engineering
• Bachelor's or Master's degree from a leading academic institution
Here is our tech stack:
• Front end: Python, Flutter, Dart
• Back end: Python, GCP, Redis, Kubernetes
Process:
Exceptionally fast: application to offer within 7 days.
1. Apply here
2. First round video interview, system design interview, then onsite
3. Reference checks, negotiation, and offer