Gaurav Gada

Hello! I'm an Applied Scientist with over 10 years of industry experience in NLP, with a recent focus on Generative AI and agentic automation. In the past I've worked on content moderation, AI safety, conversational agents, shift scheduling, and labor cost optimization. I love combining creative and scientific approaches to solve complex problems in practically useful ways, and I intend to keep pushing the boundaries of what's possible with AI and its applications. Thanks for dropping by; I'm glad you're here! Let's explore what's next, together. Do drop a note: apart from deep learning (pun intended), I'm always down for deep conversations with practitioners in the field.

Posts

The 10% You Should Never Automate

November 8, 2025

Everyone's asking what AI can do. The better question is what you shouldn't let it do. Frameworks for deciding what to automate and what to protect.

When Should You Build an AI Agent? A Practical Decision Framework

November 5, 2025

Practical framework to determine when AI agents make sense for your use case. Learn when to build agents and when simpler approaches like prompt engineering or RAG work better.

Mistral 7B on consumer hardware

July 21, 2024

Run Mistral 7B locally on Mac with Ollama for fast seed data generation. Learn CLI setup, prompt formatting, and downstream parsing to generate thousands of samples on consumer hardware.

Finding the right words

July 6, 2024

Understand how LLMs choose words during generation. Learn temperature, top-k, and top-p sampling strategies to balance coherence, diversity, and task-appropriateness in generated text.

Paper Review - Embers of Autoregression

June 29, 2024

Critical review of LLM limitations in low-probability situations. Explores why AI practitioners should understand autoregressive training pressures before deploying LLMs for tasks requiring precise reasoning or uncommon patterns.

Multi-label text classification

February 16, 2024

Learn to build a multi-label text classifier using DistilBERT with imbalanced classes. Covers binary cross-entropy loss, multi-hot encoding, and practical implementation strategies for handling multiple labels.

Library version mismatches declared not safe

February 2, 2024

Critical lessons on matching Python package versions between model development and inference. Learn about safetensors format advantages and why version mismatches cause production failures.

Mining word collocations

February 1, 2024

Extract common bigrams and trigrams from text using Gensim and NPMI scoring. Learn to mine jargon, phrases, and collocations from customer reviews, feedback, and text corpora.

Science Talk: Generative LLMs

September 1, 2023

Comprehensive introduction to generative LLMs covering basics, training processes, and real-world applications. Slides from talk delivered to 70+ attendees.

Projects

Skill Quality Coach

July 20, 2022

Amazon Alexa announced Skill Quality Coach (SQC), a personalized guide that helps skill developers build high-quality skills on Alexa.

Data Science: Analyzing crime stats in Seattle and San Francisco

April 26, 2017

Analysis in R of the periodicity of criminal activity and its geospatial distribution by district.
