Job Description
Note: The job is a remote job and is open to candidates in USA. Baseten powers mission-critical inference for leading AI companies and is seeking an Applied AI Inference Engineer. In this role, you will partner with customers to architect, build, and deploy high-scale production AI applications, driving impact throughout the customer journey from initial exploration to production deployment. Responsibilities β’ Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects β’ Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing β evaluation β production deployment β monitoring). This involves working with customersβ engineering teams at every stage of the customer journey including: sales, implementation, and expansion β’ Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers β’ Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs β’ Own products and customer projects end-to-end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution β’ Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity β’ Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates Skills β’ Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field β’ 1+ years of professional work experience in a fast-paced, high-growth environment β’ Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python β’ Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment β’ Strong communication skills, particularly on complex technical topics β’ Experience in building or optimizing AI/ML projects is highly valued Benefits β’ Competitive compensation, including meaningful equity. β’ 100% coverage of medical, dental, and vision insurance for employee and dependents β’ Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!) β’ Paid parental leave β’ Company-facilitated 401(k) β’ Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities. Company Overview β’ Baseten is an AI infrastructure company that integrates machine learning into business operations, production, and processes. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees. Its website is Company H1B Sponsorship β’ Baseten has a track record of offering H1B sponsorships, with 6 in 2025, 8 in 2024, 1 in 2023, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role. Apply tot his job