gdg chennai hackfest '26 · feb 2026

build with ai:
what, why & how

ideas, apis, and everything you need to ship something incredible today

sanju

founder, thisux · droidclaw · unitedby.ai

what's this talk about

not another "ai is cool" talk

this is a practical walkthrough — from what ai actually is, to what you can build with it today.

by the end, you'll have real project ideas and the tools to build them.

who's talking

design engineer working across the stack

thisux

design studio

dunsocial

ai product

dun

ai-native tools

droidclaw

went super viral

unitedby.ai

ai community

i build with ai every day. agents, automations, products. let me show you how.

who's used gemini before?

and who's built something with an api? keep your hand up.

what is ai, really?

let's start from scratch. no jargon.

the simple version

pattern recognition at scale

ai sees billions of examples and learns to predict the next thing.

text? it predicts the next word.

images? it predicts the next pixel.

code? it predicts the next line.

that's it. really.

the landscape

three flavors of ai

narrow ai (ANI)

one task, really well

chatgpt, gemini, dall-e. this is where we are today.

general ai (AGI)

human-level thinking

can do any intellectual task a human can. not here yet.

super ai (ASI)

beyond human

smarter than all humans combined. science fiction... for now.

the agi question

are we close to agi?

short answer: no one really knows.

2027-2030

optimistic estimates (openai, google deepmind)

2040+

conservative estimates (most researchers)

but what we have today? already incredibly powerful for building things.

the model landscape

models you should know

gemini

google

multimodal, free tier

gpt-4o

openai

reasoning, tools

claude

anthropic

coding, analysis

llama 3

the automation spectrum

from simple scripts to fully autonomous agents

let's define it

what is real automation?

it's not just "make a chatbot." it's removing human effort from repetitive, predictable tasks.

real automation =

trigger → decision → action → feedback loop

email comes in → ai reads it → drafts response → you approve (or don't)

the spectrum

5 levels of ai automation

L1

simple rules

if-then automation. zapier, ifttt. no ai needed.

L2

ai-assisted

ai suggests, human decides. copilot, grammarly, smart compose.

L3

human-in-the-loop

ai does the work, human approves. best for most hackathon projects.

L4

supervised autonomy

ai acts independently, human monitors. ai agents with guardrails.

L5

full autonomy

ai handles everything end-to-end. self-driving cars. very few systems here.

the sweet spot

human-in-the-loop

ai does the heavy lifting

drafts, analyzes, summarizes, generates code

human validates

reviews, approves, rejects, or edits the output

system improves

feedback loop makes the ai better over time

this is where 90% of useful ai products live. and it's perfect for a hackathon.

why it matters

ai hallucinations are real

27%

of ai-generated code has bugs (github study)

15-20%

of factual claims by llms are wrong

human oversight isn't optional. it's the product feature.

the frontier

fully autonomous ai agents

agents that can plan, use tools, and execute multi-step tasks on their own.

what they can do

browse web, write code, call apis, manage files, send emails, book meetings

what they struggle with

long-term planning, ambiguous goals, error recovery, knowing when to ask for help

think of agents as interns: capable but need clear instructions and supervision.

how agents work

the agent loop

// simplified agent architecture

while (task_not_complete) {'{'}

observe() // read context

think() // plan next step

act() // call tool or api

evaluate() // did it work?

{'}'}

this is the mental model. every agent framework follows this pattern.

sectors to hack

where ai is making the biggest impact right now

sector 1

healthcare

diagnosis support

ai that reads medical images, flags anomalies, assists doctors

patient engagement

appointment scheduling, symptom checkers, follow-up reminders

drug discovery

protein folding, molecule screening, clinical trial matching

hackathon idea

ai-powered medicine reminder + interaction checker using gemini

sector 2

education

personalized tutoring

adapts to each student's learning pace and style

content generation

quizzes, flashcards, lesson plans from any source material

accessibility

translating content, sign language, text-to-speech, simplification

hackathon idea

ai study buddy that reads your notes (drive api) and quizzes you (gemini)

sector 3

productivity & work

email automation

smart replies, categorization, priority inbox on steroids

document processing

summarize contracts, extract data, auto-fill forms

meeting assistant

transcribe, summarize, extract action items, schedule follow-ups

hackathon idea

gmail + gemini agent that auto-drafts responses based on your style

sector 4

local business & logistics

route optimization

delivery routes, traffic-aware scheduling using maps api

local discovery

ai-powered "what's near me" with reviews and recommendations

inventory management

predict demand, auto-reorder, waste reduction

hackathon idea

ai trip planner: maps api + gemini for personalized chennai itineraries

and there's more

every sector is up for grabs

agriculture

crop monitoring, soil analysis

fintech

fraud detection, budgeting

entertainment

content recs, music gen

real estate

price prediction, virtual tours

cybersecurity

threat detection, phishing

e-commerce

product recs, support bots

legal

contract review, research

transport

routing, demand prediction

your toolkit

free gcp credits. powerful apis. let's build.

the good news

free gcp credits

you have google cloud credits. use them. build something real.

gemini api

gmail api

maps api

drive api

api deep dive

gemini api

the brain of your project

what it does

text generation, code writing, image understanding, audio processing, function calling

why it's great for hackathons

generous free tier, multimodal out of the box, google ecosystem integration

// get started in 3 lines

import google.generativeai as genai

model = genai.GenerativeModel('gemini-pro')

response = model.generate_content("hello!")

api deep dive

gmail api

automate the inbox chaos

read emails

fetch, search, filter, label — programmatically

send emails

compose, reply, forward — with ai-generated content

organize

auto-label, categorize, archive — based on ai analysis

combine with gemini: ai reads your email, understands context, drafts the perfect reply.

api deep dive

google maps api

location intelligence for your app

places

search nearby, get details, photos, reviews

directions

routes, distance, traffic, eta, waypoints

geocoding

address ↔ coordinates, reverse geocoding

pair with gemini for natural language: "find me the best biryani within 2km that's open now"

api deep dive

google drive api

your ai's file system

read files

access docs, sheets, pdfs — feed them to gemini

create & update

generate reports, update spreadsheets, create docs

organize

auto-organize files, smart search, permissions

imagine: upload a pdf → ai extracts insights → creates a summary doc → emails it to your team

putting it all together

build an agent system

combine apis into something greater than the sum of its parts

1

gemini as the brain — understands intent, generates responses

2

gmail as the communication layer — reads and sends emails

3

drive as the knowledge base — stores and retrieves documents

4

maps as the location layer — adds spatial awareness

the secret sauce

function calling

how ai talks to your apis

// tell gemini what tools it has

tools = [

search_emails(query),

send_email(to, subject, body),

find_nearby_places(location, type),

read_drive_file(file_id)

]

// gemini decides which tool to use

user: "email the team about tomorrow's meeting venue"

gemini → calls find_nearby_places() → then send_email()

the model decides which api to call. you just define the tools.

the architecture

your hackathon stack

user input

↓

gemini (brain)

↓

gmail api

maps api

drive api

gcloud services enable drive.googleapis.com

# 3. get api key

gcloud auth application-default login

# 4. install sdk

pip install google-generativeai google-api-python-client

resources

bookmarks for today

🤖

gemini api docs

ai.google.dev

📧

gmail api quickstart

developers.google.com/gmail/api

🗺️

maps platform

developers.google.com/maps

📂

drive api quickstart

developers.google.com/drive/api

☁️

google cloud console

console.cloud.google.com

your challenge

build something that makes one person's life easier.

not "revolutionary ai platform." not "gpt wrapper."

something specific. something useful. something you'd actually use tomorrow.

now go build

questions? ideas? come talk to me.

sanju

sanju.sh — personal

thisux.com — design studio

droidclaw.ai — went viral

dunsocial.com — ai product

unitedby.ai — ai community

gdg chennai hackfest '26

let's build with ai. together.

build with ai:what, why & how

not another "ai is cool" talk

design engineer working across the stack

who's used gemini before?

what is ai, really?

pattern recognition at scale

three flavors of ai

are we close to agi?

models you should know

the automation spectrum

what is real automation?

5 levels of ai automation

human-in-the-loop

ai hallucinations are real

fully autonomous ai agents

the agent loop

sectors to hack

healthcare

education

productivity & work

local business & logistics

every sector is up for grabs

your toolkit

free gcp credits

gemini api

gmail api

google maps api

google drive api

build an agent system

function calling

your hackathon stack

hackathon ideas

smart email assistant

ai study companion

chennai explorer agent

smart document hub

multimodal accessibility tool

personal ai ops agent

can't decide?

hackathon survival guide

quick start in 5 minutes

bookmarks for today

your challenge

now go build

build with ai:
what, why & how