Enterprise RAG
Multilingual retrieval systems, grounded responses, evaluation loops, and production safety on AWS.
Senior Generative AI Engineer building RAG, agents, computer vision, search, and recommendation systems.
I build production AI systems across RAG, agentic workflows, computer vision, search, and recommendation platforms. My work focuses on reliability, measurable impact, cost control, and practical automation for enterprise teams.
42%
Cost reduction
43%
Engagement lift
100+
Concurrent jobs
95%+
Success rate
Impact summary
Measured outcomes
Multilingual retrieval systems, grounded responses, evaluation loops, and production safety on AWS.
Tool-using agents for logistics, pricing, lesson planning, quiz generation, and operational workflows.
Inspection automation, object detection, video analysis, and structured reporting pipelines.
Semantic search, vector retrieval, personalization, and ranking systems tied to measurable product impact.
Selected production systems with clear technical scope and measurable business impact.
Architected AI-powered logistics platform using MCP agents & Strands framework with intelligent job assignment algorithms that automate delivery coordination and optimize driver efficiency.
Impact: 100+ concurrent jobs, automated delivery coordination, dynamic route optimization
Production-grade RAG pipeline with AWS Bedrock, OpenSearch, and multi-model LLM support featuring agentic tools and real-time streaming.
Impact: Multi-model LLM support, real-time streaming, agentic framework, production safety
Event-driven AWS SageMaker pipeline for offshore inspection analysis using Claude Sonnet 4.5 and TwelveLabs for image/video processing.
Impact: 30-40% cost reduction, 95%+ success rate, 100+ concurrent executions
Enterprise-grade multilingual chatbot for a major utility company using Claude 3 & Cohere v3 embeddings with AWS SageMaker integration for Arabic/English document processing.
Impact: Arabic/English support, enterprise-grade RAG, optimized response accuracy
Multi-agent AI system for real-time airline pricing with autonomous agents for price adjustments, market analysis & revenue forecasting across North America, Europe & Asia.
Impact: Significant revenue optimization, real-time price adjustments, global deployment
RAG-enhanced spell correction using Llama3-8B & Databricks with semantic retrieval, achieving 7.5% conversion rate increase and 40% token reduction.
Impact: 7.5% conversion increase, 40% token reduction, 18s latency improvement