Tech
LLM Operations Consultant
LLM Operations
Turning a chatbot into a reliable, accurate, and cost-effective service using Retrieval-Augmented Generation (RAG) and cloud architecture best practices.
Highlights
- Replaced long-running containers with serverless functions, where applicable.
- Created RAG pipelines to access data from a number of public and proprietary sources.
This was a fun role, as it combined my interest in AI with my established architectural knowledge. By working with the developers to use containers and serverless frameworks where each fit best, we significantly cut costs, reduced response time, eliminated hallucinations, and improved response accuracy.
Tagged
AWS · Kubernetes · Terraform