About the RoleFeatherless.ai is building the world’s most reliable and comprehensive open-model inference platform — the infrastructure powering the next generation of AI creators, startups, and enterprises. Our serverless approach to inference unlocks the best GPU utilization in AI infrastructure.We’re hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, which is responsible forauthentication and inference to all modelssubscription management and subscription entitlement (e.g. context-length, concurrency limits)and providing the necessary API surface for applications and buildersAPI Gateway is constantly evolving in response to the unending stream of new models, modalities, clients and inference load.What you'll doThe API gateway is managed by the Platform Team, who aim to make Featherless the best place to find and use models. As a member of the platform team, you willundertake feature development and bug fixes to keep up with clients, resolve user issues, and onboard new modelsimprove the reliability of the existing API (increasing instrumentation and monitoring, right-sizing infrastructure)respond to availability incidentstriage and resolve issues of inference quality and reliabilitymanage the infrastructure on which our gateway runsWhat you'll bringfirst-hand experience of the user’s we’re building for (familiarity with popular open LLMs, common clients, and experience building with LLM)experience with the technologies and paradigms of the web (REST, websockets, DNS, networking, opentelemetry)experience with significant components of our stack (k8s, node, mikro-orm, fastify, redis, mongodb, python, elastic cloud, cloudflare, sentry, otel)ability to debug complex issues across a wide stack and build instrumentation as necessarydesire to work collaboratively as part of a skilled teamAlignment with team and company values, includingbias to actionresponsiveness to users (bug-fixes over features)instinct to iteratesubscribing to that done means proven by usage dataOtherThis team operates on Eastern Time. We are remote, but with a preference to hire in Toronto, Canada.