Senior AI Engineer (Full-Stack) (Remote)
Note: This is a remote position open to candidates in the USA. Codvo.ai is seeking a Senior AI Engineer with strong full-stack capabilities to join its engineering team. The role involves designing and building AI agent pipelines, integrating Large Language Models, and developing end-to-end product features while collaborating with clients to translate business requirements into scalable software solutions.
Responsibilities
- Design and build AI agent pipelines, including multi-node LangGraph graphs
- Implement intent routing, multi-turn conversational context, session state management, and tool integrations
- Develop multi-step reasoning pipelines and graph-based agent workflows
- Build and maintain Retrieval-Augmented Generation (RAG) systems
- Design vector search architectures, embedding pipelines, retrieval grounding, and chunking strategies
- Implement hallucination mitigation techniques and retrieval evaluation frameworks
- Integrate and optimize Large Language Models (LLMs), including those from OpenAI, Google (Gemini), and Anthropic
- Develop structured output workflows using JSON schemas
- Create effective prompt engineering strategies, few-shot examples, and context window management solutions
- Build provider-neutral client architectures to support multiple LLM vendors
- Design, develop, and deploy end-to-end product features
- Build scalable FastAPI backends and React/Next.js frontends
- Implement Server-Sent Events (SSE) streaming and REST API contracts
- Deliver production-ready UI features independently without requiring dedicated frontend support
- Own LLM observability, including token usage logging, cost tracking, fallback detection, performance monitoring, and regression test suites
- Build evaluation pipelines and golden test suites to ensure AI quality and consistency
- Collaborate directly with clients and stakeholders to understand business requirements
- Translate requirements into scalable, maintainable software solutions
- Keep technical documentation, specifications, and test coverage aligned with product changes
Skills
- LangGraph or equivalent graph-based agent frameworks
- Multi-step reasoning pipelines
- Tool usage and orchestration
- State management and conversational workflows
- End-to-end RAG pipeline design and implementation
- Experience with vector databases such as Pinecone, Qdrant, pgvector, Weaviate
- Chunking strategies and retrieval optimization
- Retrieval evaluation methodologies
- OpenAI, Gemini, and Anthropic SDKs
- Prompt engineering and prompt optimization
- Structured JSON outputs
- Context window management
- Multi-provider LLM integrations
- Python 3.12
- FastAPI
- Async Python
- Pydantic
- SQLite
- PostgreSQL
- Redis
- Pytest
- React
- Next.js
- TypeScript
- Modern frontend architecture
- API integration and state management
- Evaluation pipelines
- Golden datasets and test suites
- Regression tracking
- Model performance monitoring
- ArcGIS REST APIs
- GeoJSON
- MapLibre GL JS
- Spatial queries
- Recharts, D3.js, or equivalent charting libraries
- Docker
- Azure
- AWS
- CI/CD pipelines
- OIDC Authentication
- Ability to understand and interpret Figma designs
- Evaluate trade-offs between engineering effort and business value
- Deliver solutions aligned with business objectives