Job Description
<strong>Overview<br><br></strong>Core AI is at the forefront of Microsoft's mission to redefine how software is built and experienced in the AI era. Our product portfolio includes vital developer tools like VS Code, Visual Studio, GitHub, AI Foundry, and others. Our work enables developers and enterprises to harness the full potential of AI to create intelligent, adaptive, and transformative software.<br><br>Guidance is an applied research and development team and technology stack focused on providing more precise control over language models. The team operates across the full research and development lifecycle, from research ideation through production deployment, with a primary focus on language model engines. Guidance advances engineālevel capabilities by developing new techniques that improve model accuracy, speed, reliability, and expressivity across a wide range of execution environments. An industry example of this work is llguidance, which originated structured output capabilities and helped drive their adoption across firstāparty Microsoft engines and thirdāparty model providers.<br><br>You will work as a Senior Research Software Development Engineer focused on advancing language model engineālevel capabilities through applied research to integration. This role is responsible for integrating ināhouse techniques and stateāofātheāart research into a variety of firstāparty (1P) Microsoft engines and thirdāparty (3P) industry engines. You will translate research ideas into highāperformance, productionāready implementations, contributing directly to new engine capabilities that improve model correctness, efficiency, robustness, and expressive control.<br><br><strong>Responsibilities<br><br></strong><ul><li>Advance language model engine capabilities through applied research and production engineering, integrating ināhouse innovations and stateāofātheāart techniques to improve model accuracy, speed, reliability, and expressivity across firstāparty and thirdāparty engines.</li><li>Design, implement, and review performanceācritical engine code (primarily in Python and Rust), ensuring high standards for correctness, test coverage, security, diagnosability, and maintainability, while coaching peers through rigorous and timely code reviews.</li><li>Apply AIānative development practices across the full SDLC, using AI tools responsibly for design, coding, testing, and analysis, and taking ownership of the quality and correctness of AIāassisted outputs while helping establish best practices across the team.</li><li>Develop and evolve advanced inference techniques (e.g., speculative decoding, constrained decoding, structured generation), validating design choices through experimentation, benchmarking, and production telemetry.</li><li>Own engineālevel design and integration decisions, producing clear design documents, evaluating tradeāoffs across multiple architectural options, and collaborating across teams to ensure solutions meet requirements for performance, scalability, reliability, security, and cost.</li><li>Drive engineering excellence in production environments, including comprehensive testing strategies, observability, liveāsite readiness, incident response, and postāincident learning, with a focus on reducing operational risk in multiātenant inference systems.</li><li>Contribute to and leverage openāsource LM infrastructure where appropriate, responsibly reusing and extending external code, sharing learnings with the broader community, and continuously staying current with emerging research, tools, and engineālevel techniques. <br><br></li></ul><strong>Qualifications<br><br></strong><strong>Required Qualifications <br><br></strong><ul><li>Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to Rust or C++, and Python</li><ul><li>OR equivalent experience. <br></li></ul></ul><strong>Other Requirements<br><br></strong>Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:<br><br><ul><li>Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years<br><br></li></ul><strong>Preferred Qualifications<br><br></strong><ul><li>Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, Rust or C++, and Python</li><ul><li>OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, Rust or C++, and Python </li><li>OR equivalent experience. </li></ul><li>5+ years of professional software engineering experience, including ownership of complex, productionāquality systems</li><li>Strong proficiency in Python and at least one systems programming language (e.g., Rust, C++, or equivalent), with experience writing and maintaining performanceācritical code.</li><li>Openāsource contributions or industry experience in language model infrastructure (e.g., vLLM, sglang, llguidance, or comparable LM libraries), including work on core engine logic rather than application layers.</li><li>Handsāon familiarity with advanced inference techniques, such as speculative decoding, constrained decoding, or related inferenceātime capabilities.<br></li></ul>Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.<br><br>Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:<br><br>https://careers.microsoft.com/us/en/us-corporate-pay<br><br>This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.<br><br>Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about <strong>requesting accommodations.</strong>