Available for new opportunities

Hi, I'm Saurav👋

AI Engineer

Specializing in building scalable AI systems, advanced Machine Learning models, RAG pipelines, and agentic workflows. I bridge the gap between research and production to deliver high-impact solutions. Focus areas: Machine Learning, Generative AI, Deep Learning, Data Science.
(And yes, this website was created with AI, blending my prompt engineering and software engineering skills 🤖)

20+
Projects Built
3+
Years Of Experience
Ideas to Code

About Me

Professional Profile

About Saurav

I am an AI Engineer with a strong foundation in Python, Machine Learning, and Generative AI. I specialize in developing production-ready AI solutions, including RAG systems, autonomous agents, and fine-tuned LLMs. My expertise extends to end-to-end application development, allowing me to build robust applications that integrate complex AI logic with user-friendly interfaces.

My focus is on delivering measurable business value through automation and optimization. Whether it's reducing operational costs with intelligent workflows or enhancing user engagement with context-aware chatbots, I approach every project with a problem-solving mindset and a commitment to engineering excellence.

I thrive in collaborative environments and am adept at translating technical concepts for diverse stakeholders. I am continuously expanding my skill set to stay at the forefront of AI innovation.

Key Achievements

🚀
20+
Projects Delivered
🤖
10+
Models Deployed
100%
Client Satisfaction

🚀 Professional Journey

2022

Generative AI Research

Started with fine-tuning and RAG techniques for specialized domains.

2023

Machine Learning Specialization

Deepened expertise in ML algorithms and data science methodologies.

2024

End-to-End AI Integration

Developed end-to-end AI applications with Python backends and modern frontends.

2025

Enterprise MLOps

Deployed scalable LLM solutions using vLLM and Ollama for production environments.

Skills & Expertise

AI Engineering & Development

Core Competencies

AI Engineering Arsenal

Building production-ready AI systems with proven impact

Building impactful AI solutions for:

  • Advanced Machine Learning & Predictive Modeling
  • Deep Learning & Neural Network Architectures
  • AI-powered chatbots with RAG
  • Automated document generation
  • Retrieval-Augmented Generation systems
  • Agentic and multi-agent AI frameworks
  • Scalable LLM deployments with vLLM & Ollama
🤖

AI Engineering

🧠 Large Language Models
🔍 RAG Pipelines
🤵 AI Agents
🔧 Fine-tuning
⛓️ LangChain
📊 Vector Databases
🎯 Prompt Engineering
🤗 Hugging Face
⚙️ Machine Learning Models
📈 Exploratory Data Analysis
🛠️ Feature Engineering
🏋️ Model Training & Evaluation
🚀 MLOps
☁️

Cloud & MLOps

☁️ AWS (Bedrock, EC2, S3)
☁️ GCP (Vertex AI)
🚀 vLLM & Ollama
🐳 Docker
Vercel
📝 Git/GitHub
🐧 Linux

Leadership & Soft Skills 💡

🧩

Problem Solving

Complex issue resolution & analysis

👥

Team Leadership

Leading & mentoring dev teams

🎯

Critical Thinking

Strategic decision making

💬

Communication

Technical & stakeholder alignment

Projects & Portfolio

Development Journey

Open Source

AI Code Generation Agent (Python Package - 3500+ Downloads)

With 3500+ organic downloads on PyPI, CodeFabric is an AI-powered Python package that automates codebase generation with 85% accuracy, covering 95% of the development lifecycle, all via a user-friendly CLI.

Python, SQLite, LangChain, LangGraph, RAG, +2 more
Open Source

Fine-Tuned LLM with 10M+ tokens (HuggingFace - 550+ Downloads)

A fine-tuned 1B-parameter model trained on 10M+ tokens of psychologist conversational data for empathetic, contextually relevant mental health support. It ships in an Ollama-compatible GGUF format for easy use on any device with a CPU.

Python, LangChain, Transformers, Unsloth, Gemma, +1 more
Open Source

SerenAI: AI Chatbot Platform

Self-hosted AI chatbot platform that turns your documents into intelligent conversational assistants using RAG. Build unlimited chatbots with customizable personalities and extensive RBAC.

LangChain, RAG, OpenAI, HuggingFace, Qdrant, +5 more
Open Source

Notebook CrewAI Agent

AI-powered EDA agent built with CrewAI and programmable Jupyter Notebook tools, allowing LLMs to directly control notebooks for data analysis pipelines.

Python, CrewAI, Jupyter Notebook, LangChain, LLMs
Open Source

FallSafe AI: On-Device Fall Detection (Bidirectional LSTM & TFLite)

End-to-end on-device fall detection system using mobile inertial sensors and deep learning. Features multi-task TFLite models, real-time inference (<20 ms), and robust false-positive control.

Python, TensorFlow, TFLite, Android Sensors, Machine Learning, +1 more
Open Source

On-Device AI LLM App (Flutter)

Run small open-source LLMs fully on-device. Privacy-first, offline inference app using Flutter and Cactus runtime.

On-Device AI, llama.cpp, Cactus, Flutter
Closed Source

JusticeAI: Indian Legal AI Advisor (justiceai.in)

JusticeAI: India's first AI-powered legal advisor, built on authentic Indian law books and data, leveraging RAG for accurate, context-aware legal guidance.

JavaScript, LangChain.js, RAG, AWS, Node.js, +2 more
Closed Source

Legacy Code Transformation & Documentation

Built an AI-powered platform to document legacy applications (COBOL, RPG) and accelerate their migration to modern architectures, reducing planning and documentation time by up to 90%.

Python, FastAPI, LangChain, RAG, HFEmbeddings, +1 more
Closed Source

Custom LLM Deployment and Optimization

Deployed and integrated LLaMA 3 and LLaMA 4 models using Ollama and vLLM for scalable inference. Enhanced performance and reduced latency with advanced KV caching.

vLLM, Python, Ollama, AWS EC2, Azure
Closed Source

U.S. Job & Salary Prediction Machine Learning Model

Designed a hybrid ML-AI system for predicting job titles and salaries using EDA, feature engineering, Random Forest, and RAG-based generative models on 1.5M+ records, achieving up to 95% accuracy.

Python, scikit-learn, FastAPI, EDA, Feature Engineering, +2 more

Get In Touch

Contact Me

Let's Connect!

I'm always open to discussing new projects, creative ideas, or opportunities to be part of your vision. Feel free to reach out through any of these channels!

Email
sksrivastava.me@gmail.com
Location
Noida, Uttar Pradesh, India
Response Time
Usually within 24 hours
Availability
Open to opportunities

Looking for a passionate AI Engineer? Let's build something amazing together! 🚀