
May 2025 - Present
AsterGaze Technologies
Kathmandu, Nepal / Part-time
AI Engineer
- Designed a multi-agent AI system for a study-abroad CRM platform using LangGraph and FastAPI.
- Built hybrid RAG over PostgreSQL and pgvector HNSW, reducing vector search from about 10s to under 4s.
- Reduced LLM inference cost by up to 55% through prompt design, context compression, and token optimization.
- Tracked traces, latency, and failure points with LangSmith for better production visibility.

























