Software Engineer - Full Stack

San Francisco | Remote | April 21, 2026 | Full Time

About the Job

We’re seeking a Full-Stack Engineer to design, build, and scale our web platform, which serves as the core interface for deploying multimodal models, observing workloads, and building agent workflows. In this role, you’ll work closely with product, infrastructure, and design teams to create high-performance, developer-friendly, and enterprise-ready tools.

We are looking for a hands-on engineer who is eager to work at the intersection of infrastructure, developer experience, and AI applications. The ideal candidate is a talented full-stack developer and strong collaborator who enjoys working across the stack, cares deeply about developer workflows, and is excited to help define the future of AI adoption.

Key Responsibilities

Design, build, and maintain web applications and tools for AI model deployment, monitoring, and performance optimization
Develop clean, scalable, and robust APIs powering AI agents, workflows, and user-facing systems
Collaborate with infrastructure engineers to integrate backend systems with deployment and orchestration pipelines
Optimize the performance and usability of web interfaces
Drive code quality through automated testing, CI/CD, and code reviews
Contribute to architecture and design decisions that shape our platform’s long-term direction
Identify and resolve technical debt and improve the reliability of production systems

Qualifications

5+ years of industry experience in full-stack or backend engineering
Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or equivalent
Fluent in TypeScript and Python; expert with React/Next.js
Strong backend experience with FastAPI or similar Python frameworks
Proven expertise in delivering production-scale full-stack applications
Proficiency in designing data models, writing SQL, and working with PostgreSQL
Deep understanding of modern web frameworks and component-driven architecture
Strong API design experience across gRPC/REST/GraphQL in production systems
Solid foundation in cloud-native development
Familiarity with OpenTelemetry tracing, metrics, and structured logging
Knowledge of web security, authentication, RBAC, and multi-tenant SaaS systems

Preferred Experience

Familiarity with LLM-based workflows, tool invocation, or agentic systems
Familiarity with Kubernetes for container orchestration, including deploying, scaling, and managing containerized applications in production environments
Experience working in a startup or other fast-paced environment with high ownership and ambiguity
Experience building developer-facing SDKs/CLIs
Passion for developer experience and enabling AI adoption

Benefits

Flexible working hours
Daily lunch and dinner provided; unlimited snacks and beverages
Supportive and highly collaborative work environment
Health check-up support and top-tier equipment/hardware support
A front-row seat to the generative AI infrastructure revolution
Competitive compensation, startup equity, health insurance, and other benefits

About FriendliAI

FriendliAI is building the world’s best AI inference platform, one that makes large language and multimodal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.

We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.