Full-stack AI developer to set up a secure local GPT-like assistant with doc analysis and memory.

🌍 Remote, USA 🎯 Full-time 🕐 Posted Recently

Job Description

Seeking a highly competent AI engineer to build a fully local, modular AI assistant designed for legal analysis, document synthesis, clinical planning, and research-intensive writing. This is a mission-critical cognitive environment, not a chatbot or LLM toy.

The system must run entirely offline, preserve compartmentalized memory, and support vector-based document ingestion — all deployed on a MacBook Pro M4 Max (64GB RAM, 2TB SSD). It must be auditable, inspectable, and fully under user control at every layer.

Build Scope (Three Phases – One Builder Only)

You will be contracted to deliver the entire architecture across three phases:

LM Studio / llama.cpp optimized for Apple Silicon
Vector DB setup (Chroma/Qdrant/Weaviate) with local document parsing
UI (Streamlit, Electron, or lightweight shell)
PDF/DOCX ingestion with embedded citation + source trace
Memory persistence: prompt saving, tone tagging, output control
Output styles: affidavit, brief, memo, etc.
No cloud sync. No logging. Fully local.

Add support for clinical workflows, estate planning, strategic writing
Improved memory containers (per domain)
API access wrapper to GPT-4 or Claude 3 (via user-owned keys ONLY)
Enhanced persona switching, tone locks, exportable sessions

Separate vault for high-context reasoning and longform writing
Firewalled from Office; no cross-contamination
Persistent memory, stylistic voice preservation, session archiving
Local-only inference with optional narrative assistive tools
Optional future integration: multimodal parsing, encryption layers

You must consent to third-party code audit after each phase
No proprietary wrappers
No API calls unless explicitly declared and authorized
All configuration paths, prompts, vector stores, and memory logs must be accessible to the user
You will provide:
Full install scripts
Summary.txt documenting build logic, parameters, dependencies
Notes to replicate install from clean OS

Payment Structure

This is an hourly project with a firm "not-to-exceed" ceiling, payable at the end of each phase upon satisfactory completion and code audit.

You (the builder) are committing to the entire 3-phase build, unless mutually terminated.
I (the client) commit to paying for each completed phase promptly and fully once:
Deliverables are met
The audit review is cleared
I reserve the right to end the project after any phase, for any reason, without obligation to proceed further.
You agree that no partial or hidden ownership exists — all build components, logic, and configuration must be shared in full and are under the client’s control.

Summary:

PhaseDeliverable Payment Exit Clause

Phase 1 Inference + UI Paid after audit Client may exit

Phase 2 Vector Store + API logic Paid after audit Client may exit

Phase 3 Final Memory Shell + Vault Split Paid after audit Final

Please submit your hourly rate and your not-to-exceed cap (USD) for the full build.

System Context

You’ll be working on a clean Apple M4 Max MacBook Pro (64GB RAM, 2TB SSD), macOS Sonoma.