Job Description
Location: Remote (US)
Who we are:
Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are the AI technology solutions provider-of-choice to 4 out of 5 of the world's biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.
By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we're helping usher in the promise of clean and optimized digital data to all industries. Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms.
Our global workforce includes over 3,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We're poised for a period of explosive growth over the next few years.
About the Role
(Combined Technical + Customer-Facing Role)
We are building a Robotics & Physical AI team focused on systems that perceive, reason, and act in the physical world. Our mission is to design, collect, and evaluate the data that powers frontier robotics and humanoid foundation models. We are looking for a solutions architect who can translate real-world robotic and embodied-AI problems into high-value datasets, data pipelines, and evaluation strategies, while also acting as a trusted technical partner for our customers' model and product teams.
Key Responsibilities
- Design Physical AI problem formulations that map real-world robotic behavior into concrete data and evaluation requirements for training policies, world models, and perception systems.
- Prototype perception, world-model, and action-representation pipelines (e.g., VLMs, VLAs, world models) to understand what data is needed, why it matters, and how quality will be measured.
- Use simulation and synthetic environments (digital twins, Isaac/Omniverse-style tools) to generate, stress-test, and scale datasets for robotics and humanoid systems, grounded in real sensors, tasks, and constraints.
- Work directly with customers' robotics and ML teams to define data specifications, collection strategies (egocentric capture, teacher-follower demonstrations, imitation learning), and evaluation benchmarks that tie to model performance and business outcomes.
- Lead technical discovery and pre-sales pilots: scope projects, design experiments, and secure the "technical win" by demonstrating uplift from our data, annotations, and pipelines.
- Collaborate with internal data-collection and platform teams to design robust data pipelines, annotation workflows (including affordances and advanced CV labels), and QA processes that generalize across customers.
- Develop reusable playbooks, reference architectures, and demos for common Physical-AI use cases (manipulation, mobile navigation, teleoperation, human-robot interaction) to accelerate future engagements.
- Influence the product and tooling roadmap by bringing structured feedback from frontier robotics customers, and help shape a scalable "Robotics & Physical AI data platform."
- Represent the company at key industry events and workshops, evangelizing best practices for robotics data, simulation, and evaluation and helping build a broader data and partner ecosystem.
What we're looking for
- Strong background in robotics, computer vision, or embodied / Physical AI, with experience building or training real robotic or simulation-based systems.
- Systems-level mindset: able to move from physical task → model behavior → data representation → metrics, and explain trade-offs clearly to both engineers and product leaders.
- Familiarity with world models, imitation-learning or teleoperation pipelines, simulation-based workflows, or synthetic data generation for robotics.
- 3+ years of hands-on development in Python and at least one of C++/Java or similar languages used in robotics or ML engineering.
- Comfort operating in ambiguous, early-stage problem spaces; you can rapidly scope MVP solutions and iterate with customers.
- Clear technical communication and a customer-facing solution-architect mindset: able to run whiteboard sessions, lead workshops, and collaborate directly with foundation-model builders and robotics teams on data and technical requirements.
- Strong project-management and ownership skills: you can drive pilots from idea to delivery, coordinate across internal teams, and keep technical and commercial goals aligned.
Nice to have
- Experience with real robotic platforms (humanoids, manipulators, mobile robots) or advanced simulators and digital-twin platforms.
- Experience designing large-scale datasets, annotation schemes (e.g., affordances, action labels, dense CV annotations), or evaluation pipelines for robotics/Physical-AI models.
- Prior solutions-engineering, pre-sales, or consulting experience with technical customers, especially in frontier robotics or autonomous systems.
- Contributions to open-source robotics/ML projects, technical blogs, or publications that demonstrate hands-on prototyping, dataset design, or applied research.
Application note
We strongly encourage candidates to include links to GitHub repositories, technical blog posts, open-source contributions, or publications that showcase their work in robotics, Physical AI, data pipelines, and evaluation frameworks.
Please be aware of recruitment scams involving individuals or organizations falsely claiming to represent employers. Innodata will never ask for payment, banking details, or sensitive personal information during the application process. To learn more about how to recognize job scams, please visit the Federal Trade Commission's guide at https://consumer.ftc.gov/articles/job-scams.
If you believe you've been targeted by a recruitment scam, please report it to Innodata at [email protected] and consider reporting it to the FTC at ReportFraud.ftc.gov.