About HyperDX
HyperDX helps engineers figure out why production is broken faster by centralizing and correlating logs, metrics, traces, exceptions and session replays in one place. We're building the open source and developer friendly alternative to tools like Datadog and New Relic.
About the role
Skills: JavaScript, Kubernetes, Node.js, TypeScript, SQL, Distributed Systems, Docker, Amazon Web Services (AWS)Hi there! We’re building an open source platform for engineers to monitor, debug, and scale their production applications. What this looks like is we help developers collect and ingest terabytes of telemetry from their application (ex. logs, metrics, traces, session replays) and give them the tools to easily monitor their application/infra health and search through all that data to get to the root cause of what’s gone wrong.
If you’ve ever been on call and been frustrated at 2am over why your Grafana/Datadog/New Relic/Elastic/etc. isn’t giving you straightforward answers to what’s gone wrong - you know exactly the problem we’re solving for.
We’re incredibly early in our journey, but already captured a huge amount of interest from the wider developer community, from reaching 6k Github stars, deployed by enterprises such as ARM, to thousands of teams on HyperDX cloud.
We’re hiring for a founding engineer who is excited to build with us a PB-scale high performance observability streaming & analytics with a focus on crafting an amazing developer experience on top (the DX in HyperDX).
The Role
- You'll be hands-on building out any and all parts of the product (broadly focused on the backend), with a focus on building the best developer experience possible to bring an engineer from incident or bug, to root cause and resolution.
- You'll be directly engaging our amazing community of customers, open source contributors, and users (who are all engineers) - closely listening to their feedback and actioning on them quickly or giving them a hand when they get stuck. Most of them live directly in our Slack or Discord community.
- You'll be solving hard technical problems, from building durable SDKs for a variety of languages/platforms (from Node, Deno, RoR, Java, etc.) to helping effectively scale out our ingestion pipeline and Clickhouse cluster.
About You
- You love to move quickly and ship often to solve real customer-impacting problems.
- You embrace ambiguity - you love to take big ideas and execute on them independently, but can collaborate with the team and customers when needed.
- 3+ years of experience as an engineer, bonus points for experience building developer tools.
- Strong proficiency in Typescript for both our Next.js frontend and Express backend.
- Proficient in SQL to do query generation for our Clickhouse DB.
- Comfortable with Docker/containers and familiar with Kubernetes or other container orchestration platforms.
- You love building alongside an open source community.
- You love learning new technology and approaches, and love to push the boundaries of what you know.
Bonus Points
- Deep experience with distributed systems, particularly scaling high-volume event ingestion pipelines and columnar data stores (ex. Clickhouse, Druid).
- Opinionated in observability tooling today - and know where they can get 10x better.
- Curious about other languages and platforms, to help build/maintain our next iteration of integrations for anything from Rust to PHP.
About Us
It's currently just us the founders (Michael and Warren) - we're both deeply technical with a passion for building great developer tools on top of solid and scalable infrastructure.
We plan to continue scaling up the team to meet the incredible amount of customer demand we've had - and are well capitalized with many years of runway as part of that. On the technical end, you can learn more about our setup here in our contributing docs. In production we're running on AWS and Kubernetes.
We think there's an incredibly rewarding experience ahead in re-thinking how developers are empowered when they're tasked with their next bug ticket or paged for their next incident. If you've read this far, drop us a note about the most memorable incident you were part of!
Technology
StackApp: Typescript, AWS, Kubernetes, Next/React, Node, Express, Clickhouse, OpenTelemetry, Vector, PythonMaintained Integrations: Typescript, Python, Ruby, Deno, and Go (many more to come)
Get an overview and take our stack for a spin here via our contributing.md
Technical ChallengesHyperDX needs to stay up when our customers are down, massively scale when our customers scale, and trawl through TBs of data in seconds. On top of all this, we need to build a world-class developer experience so developers can actually make sense of the telemetry they're sending our way.