[Remote] Llama Developer (Generative AI / LLM Engineer)

🌍 Remote, USA 🎯 Full-time 🕐 Posted Recently

Job Description

Note: This is a remote position open to candidates in the USA.

Tranzeal Incorporated is a company focused on developing AI applications. They are seeking a Llama Developer to build AI applications using Llama models, fine-tune them, and implement Retrieval-Augmented Generation pipelines while collaborating with various teams to deploy scalable AI products.

Responsibilities

• Build AI applications using Llama (Llama 3 / Llama Stack / Llama API / local LLM inference).
• Fine-tune and evaluate Llama models on proprietary and domain-specific datasets.
• Implement Retrieval-Augmented Generation (RAG) pipelines using vector databases.
• Develop conversational agents, copilots, or knowledge assistants for business workflows.
• Optimize model performance via quantization, prompt engineering, and latency reduction.
• Integrate LLM capabilities into back-end services, microservices, APIs, or cloud platforms.
• Ensure compliance, safety, and responsible-AI standards for generated content.
• Collaborate with Data Science, MLOps, and Product teams to deploy scalable AI products.

Skills

• 3–8+ years of software engineering or machine-learning experience.
• Proven experience with Llama models (self-hosted or via Meta API).
• Proficiency in Python (FastAPI, LangChain, LlamaIndex, Hugging Face ecosystem).
• Experience with vector databases (FAISS, Pinecone, Weaviate, ChromaDB).
• Strong understanding of prompt engineering, model fine-tuning, LoRA / QLoRA.
• Hands-on experience with GPU computing, PyTorch, Docker, Kubernetes.
• Familiarity with MLOps practices: CI/CD for ML, model monitoring, logging.
• Experience deploying models on AWS / GCP / Azure / on-prem GPU clusters.
• Knowledge of RAG architectures, knowledge graphs, and document parsing pipelines.
• Understanding of model safety, hallucination mitigation, red-team testing.
• Experience with llama.cpp, vLLM, Ollama, or NVIDIA Triton.
• Contributions to open-source LLMs or AI frameworks.

Company Overview

• Tranzeal is an industry-leading global Business Transformation Service Provider. It is headquartered in San Jose, California, USA, with a workforce of 201-500 employees.

Company H1B Sponsorship

• Tranzeal Incorporated has a track record of offering H1B sponsorships, with 3 in 2025, 6 in 2024, 4 in 2023, 6 in 2022, 7 in 2021, and 15 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Apply to this job

