Summary
This role can be either fully in-person or remote.
As a machine learning engineer, you’ll work very closely with a senior member of our research team on cutting-edge deep learning research, infrastructure, and tooling towards the goal of creating general human-like machine intelligence.
Example projects
• Read and implement various papers that describe improvements to architecture efficiency, training speed, loss metrics, or fine-tuning methods.
• Devise scientifically rigorous experiments to determine whether a proposed adjustment is actually helpful for the problems Imbue wants to use LLMs to solve.
• Improve our understanding of the fundamentals of LLM knowledge and inference, especially around chain-of-thought, reasoning, generalization, hallucination, or grokking
• Leverage our existing framework to perform hyperparameter sweeps, and maintain and contribute new features to the sweep code
• Work with our infrastructure engineers to develop monitoring, informative logging and maintenance guides for long-running experiments
• Work with our data and product engineers to find areas of improvement in our training or evaluation datasets
• In summary: do science and make LLM models that perform better in practice.
You are
• Very comfortable writing Python.
• Familiar with PyTorch and training deep neural networks.
• Excited to work on open source code.
• Passionate about engineering best practices.
• Self-directed and independent.
• Excellent at getting things done.
Compensation and Benefits
• Work directly on creating software with human-like intelligence
• Flexible working hours
• Time and budget for learning and self improvement
• Compensation packages are highly variable based on a variety of factors. If your salary requirements fall outside of the stated range, we still encourage you to apply. The range for this role is $140,000–$350,000 cash, $10,000–$5,000,000 in equity.
How to apply
All submissions are reviewed by a person, so we encourage you to include notes on why you're interested in working with us. If you have any other work that you can showcase (open source code, side projects, etc.), certainly include it! We know that talent comes from many backgrounds, and we aim to build a team with diverse skillsets that spike strongly in different areas.
We try to reply either way within a week or two at most (usually much sooner).
Learn more about our full interview process here.
About us
Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world. We train our own foundation models optimized for reasoning and prototype agents on top of these models. By using these agents extensively, we gain insights into improving both the capabilities of the underlying models and the interaction design for agents.
We aim to rekindle the dream of the *personal* computer, where computers become truly intelligent tools that empower us, giving us freedom, dignity, and agency to pursue the things we love.