Author: Sekhar Emmela

  • Simple Steps on how we are building Chatbots.


    Lately, we have been experimenting with chatbot development. Every day we make progress on one front or another: speed, ease of use, or search quality. Adding a high-level summary here so the larger audience can benefit.

    Build the Ask Zeneral chatbot first with sample data

    Create a small sample knowledge base, in Notion or even simple text files initially.

    Convert this content into embeddings. Each document will be converted into vector embeddings.

    Text document
    ↓
    Embedding model
    ↓
    Vector representation
    ↓
    Stored in vector database

    If the sample data is already in Notion, the next immediate step is to connect that data to the AI system so the chatbot can read, embed, and search it.

    Turn your Notion content into searchable AI knowledge.


    1. Get Notion API access
    2. Give the integration access to your pages
    3. Extract Notion content

    from notion_client import Client

    # Authenticate with your Notion integration token
    notion = Client(auth="NOTION_API_KEY")

    # Query every page in the knowledge-base database
    response = notion.databases.query(
        database_id="YOUR_DATABASE_ID"
    )

    print(response)

    Split Content Into Chunks

    AI works better with small pieces of text. AI models cannot efficiently search large documents directly.

    If we store this entire page as one document, problems occur:

    • retrieval becomes inaccurate
    • AI gets too much context
    • answers become vague

    Instead we split it.

    Suppose your Notion page looks like this:

    AI Services

    Zeneral provides several AI solutions including knowledge assistants,
    document intelligence, customer support agents, workflow automation,
    data insights systems and market intelligence platforms.

    Knowledge Assistants

    These systems allow employees to query company knowledge.

    Document Intelligence

    AI can extract structured data from documents such as invoices.

    If this entire page is stored as one entry, then when someone asks:

    “What is AI document processing?”

    The vector search might return the whole page, which includes many unrelated topics.

    AI then gets too much mixed information.

    Instead we split the page into chunks:

    Chunk 1: AI Services
    Chunk 2: Knowledge Assistants
    Chunk 3: Document Intelligence

    Now if the user asks:

    What is AI document processing?

    Vector search retrieves Chunk 3 only.

    The AI receives exactly the right context.

    Typical chunk sizes:

    Size     Words     When to use
    Small    100–200   FAQs
    Medium   200–400   best general size
    Large    500–700   long technical docs

    Chunks should slightly overlap so meaning is not lost.

    Split content by idea, not randomly.

    Good chunks represent:

    • one concept

    • one explanation

    • one use case

    ✅ Good chunking = smart chatbot

    Bad chunking = confused chatbot
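The splitting step above can be sketched as a small word-count chunker with overlap. The size and overlap values here just mirror the "Medium" row in the table and are illustrative, not fixed rules:

```python
def chunk_text(text, max_words=300, overlap=40):
    """Split text into word-based chunks with a small overlap.

    max_words roughly matches the 'Medium' size above; the overlap
    keeps meaning from being cut off at chunk borders.
    """
    words = text.split()
    if not words:
        return []
    chunks = []
    step = max_words - overlap
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break
    return chunks

# A 700-word document becomes overlapping ~300-word chunks
doc = "word " * 700
pieces = chunk_text(doc)
```

In practice you would split at paragraph or heading boundaries first (one concept per chunk, as above) and only fall back to word counts for very long sections.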

    Designing the Notion structure correctly will make your chatbot much easier to build and maintain. If the structure is clean, chunking becomes almost automatic.

    For a scalable chatbot knowledge base, a Notion database is much better.

    A Notion database is like a spreadsheet + pages combined.

    Example database entry:

    Title

    AI Document Processing

    Category

    AI Service

    Content

    AI document processing systems automatically extract
    information from business documents such as invoices,
    contracts and reports.

    These systems reduce manual work and improve
    processing accuracy.

    This becomes a perfect knowledge chunk.


    ✅ Python can read your Notion database

    ✅ Title, Category, Content are extracted correctly

    ✅ Data is ready for the Zeneral knowledge base
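As a sketch, extracting those three properties from a page object returned by `notion.databases.query` could look like the following. The nested structure follows the Notion API's property format (title / select / rich_text); the property names match the example database above:

```python
def extract_entry(page):
    """Pull Title, Category and Content out of one Notion page object."""
    props = page["properties"]
    title = "".join(t["plain_text"] for t in props["Title"]["title"])
    category = props["Category"]["select"]["name"]
    content = "".join(t["plain_text"] for t in props["Content"]["rich_text"])
    return {"title": title, "category": category, "content": content}

# A fake page in the same shape the API returns, for illustration:
page = {
    "properties": {
        "Title": {"title": [{"plain_text": "AI Document Processing"}]},
        "Category": {"select": {"name": "AI Service"}},
        "Content": {"rich_text": [{"plain_text": "AI document processing systems extract information from business documents."}]},
    }
}
entry = extract_entry(page)
```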

    A complete, simple chatbot script that:

    1️⃣ Reads your Notion database

    2️⃣ Builds the Zeneral Knowledge Base

    3️⃣ Sends it to the AI model

    4️⃣ Lets you ask questions in the terminal

    This gives you a working chatbot immediately (no vector DB yet).
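A minimal sketch of such a script, assuming the notion-client and openai packages are installed: with no vector DB yet, the whole knowledge base is stuffed into the system prompt, which is fine for small sample data. The model name and prompt wording are illustrative choices, not requirements:

```python
def build_system_prompt(entries):
    """Join all knowledge entries into one system prompt (no vector DB yet)."""
    knowledge = "\n\n".join(
        f"{e['title']} ({e['category']}):\n{e['content']}" for e in entries
    )
    return "You are Ask Zeneral. Answer ONLY from the knowledge below.\n\n" + knowledge

def ask(question, entries):
    """Send the knowledge plus the question to the model (network call)."""
    from openai import OpenAI  # assumes the openai>=1.x client
    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[
            {"role": "system", "content": build_system_prompt(entries)},
            {"role": "user", "content": question},
        ],
    )
    return resp.choices[0].message.content

# Terminal loop:
# while True:
#     print(ask(input("You: "), entries))
```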

    pip install notion-client openai

    Yes — OpenAI requires authentication. The credential is an API key.

    OpenAI APIs require billing to be enabled. Set a usage limit.

    Go to:

    Billing → Usage Limits

    Never commit your API key to GitHub.

    Store the key in a .env file or in environment variables instead.

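A small sketch of reading the key from the environment instead of hard-coding it. The helper name is ours; with python-dotenv installed, calling load_dotenv() first can populate os.environ from a local .env file:

```python
import os

def load_api_key(name="OPENAI_API_KEY"):
    """Read an API key from the environment (set via .env or shell export)."""
    # With python-dotenv installed you could call load_dotenv() first
    # to populate os.environ from a local .env file.
    key = os.environ.get(name)
    if not key:
        raise RuntimeError(f"{name} is not set; export it or add it to .env")
    return key
```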
    Notion Database
            ↓
    Python fetch
            ↓
    Zeneral Knowledge Base
            ↓
    AI Model
            ↓
    Ask Zeneral Chatbot

    Once this works, the next improvements are:

    1️⃣ Add vector search

    2️⃣ Add conversation memory

    3️⃣ Deploy on zeneral.ai website

    4️⃣ Add consultation lead capture

    Notion database
            ↓
    Python script
            ↓
    Zeneral knowledge base
            ↓
    OpenAI API
            ↓
    AI-generated answer

    Moving to a vector database is the right next step. This will make Ask Zeneral much smarter and scalable, especially when your knowledge base grows.

    Notion database
    ↓
    Create embeddings
    ↓
    Store in vector database
    ↓
    User question → embedding
    ↓
    Vector search
    ↓
    Retrieve relevant knowledge
    ↓
    AI generates answer

    This approach is called RAG (Retrieval Augmented Generation).

    Database          Difficulty   Notes
    Pinecone          Easy         Most common for RAG
    Supabase Vector   Medium       SQL + vectors
    Weaviate          Advanced     full vector platform
    Chroma            Very easy    local development

    pip install openai pinecone-client notion-client

    Each knowledge entry must become a vector embedding.

    Vector databases do not store text for reading.

    They store semantic meaning.
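A hedged sketch of the ingestion step: embed each knowledge entry with OpenAI's text-embedding-3-small model and upsert the vectors into Pinecone. The index name is hypothetical, and the code assumes the openai>=1.x and pinecone-client>=3.x SDKs:

```python
def embed_texts(texts):
    """Create embeddings via the OpenAI API (network call)."""
    from openai import OpenAI
    client = OpenAI()
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return [d.embedding for d in resp.data]

def build_vectors(entries, embed_fn=embed_texts):
    """Turn knowledge entries into Pinecone-ready vector records."""
    texts = [f"{e['title']}\n{e['content']}" for e in entries]
    return [
        {"id": f"entry-{i}", "values": vec,
         "metadata": {"title": e["title"], "text": e["content"]}}
        for i, (e, vec) in enumerate(zip(entries, embed_fn(texts)))
    ]

def ingest(entries, index_name="ask-zeneral"):  # index name is illustrative
    """Upsert all vectors into a Pinecone index (network call)."""
    from pinecone import Pinecone  # assumes pinecone-client>=3
    index = Pinecone().Index(index_name)  # reads PINECONE_API_KEY from env
    index.upsert(vectors=build_vectors(entries))
```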

    ✅ Once vector search works, your chatbot becomes a real production-grade RAG system.

    Understanding embeddings is crucial because they are the foundation of vector search and RAG chatbots like Ask Zeneral.

    An embedding converts text into a list of numbers that represent its semantic meaning.

    Why Embeddings Are Powerful

    Because similar meanings produce similar vectors. Even though the words differ, the embeddings are close in vector space.

    Create Embeddings in Python

    Using the OpenAI embeddings API.
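Before calling the API, "close in vector space" can be illustrated with plain cosine similarity. The vectors below are made-up toy numbers, not real model output (real embeddings have on the order of 1536 dimensions):

```python
import math

def cosine_similarity(a, b):
    """How closely two vectors point in the same direction (1.0 = identical)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy 3-dimensional vectors standing in for real embeddings:
v_invoice = [0.9, 0.1, 0.2]   # "extract data from invoices"
v_document = [0.8, 0.2, 0.3]  # "process business documents"
v_weather = [0.1, 0.9, 0.1]   # "tomorrow's weather forecast"

sim_related = cosine_similarity(v_invoice, v_document)
sim_unrelated = cosine_similarity(v_invoice, v_weather)
# Similar meanings score noticeably higher than unrelated ones
```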

    Yes — you noticed correctly 👍

    Earlier LangChain came up in the discussion, but it’s important to understand the difference:

    Embeddings are created by models (like OpenAI), not by LangChain.

    LangChain is just a framework that helps orchestrate the steps.

    Provider      Embedding Model
    OpenAI        text-embedding-3-small
    Cohere        embed-english-v3
    Google        textembedding-gecko
    HuggingFace   sentence-transformers

    What LangChain Actually Is

    LangChain is a developer framework for building AI pipelines.

    It helps manage things like:

    documents
    ↓
    chunking
    ↓
    embeddings
    ↓
    vector database
    ↓
    retrieval
    ↓
    LLM response

    It does not create embeddings itself.

    Instead it calls embedding providers for you.

    LangChain becomes useful when your system grows and you need:

    • document loaders

    • automatic chunking

    • retrieval pipelines

    • agent workflows

    • multi-step reasoning

    LangChain is optional.

    Many large production systems use plain Python + APIs.

    Summary:

    Component          Role
    OpenAI             creates embeddings
    LangChain          manages AI pipeline
    Pinecone           stores vectors
    Your Python code   orchestrates everything

    LLMs are language models, not databases.

    They do not automatically apply exact filtering logic unless prompted clearly.

    Since ingestion to Pinecone is done, the next step is the retrieval + answer generation script. This script will:

    1️⃣ Convert the user question → embedding

    2️⃣ Search Pinecone vector database

    3️⃣ Retrieve the most relevant knowledge entries

    4️⃣ Send them to OpenAI

    5️⃣ Generate the final answer

    This completes the RAG pipeline for Ask Zeneral.
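The five steps above can be sketched as follows. The pure build_prompt helper is separated out so it can be tested on its own; the model and index names are illustrative:

```python
def build_prompt(question, chunks):
    """Step 4 input: retrieved knowledge plus the user's question."""
    context = "\n\n".join(chunks)
    return (
        "Answer the question using ONLY this knowledge:\n\n"
        f"{context}\n\nQuestion: {question}"
    )

def answer(question, top_k=3):
    """Steps 1-5: embed the question, search Pinecone, ask the model."""
    from openai import OpenAI
    from pinecone import Pinecone  # assumes pinecone-client>=3
    client = OpenAI()
    index = Pinecone().Index("ask-zeneral")  # illustrative index name

    # 1) question -> embedding
    q_vec = client.embeddings.create(
        model="text-embedding-3-small", input=[question]
    ).data[0].embedding

    # 2-3) vector search; keep the matched text stored in metadata
    hits = index.query(vector=q_vec, top_k=top_k, include_metadata=True)
    chunks = [m["metadata"]["text"] for m in hits["matches"]]

    # 4-5) send the retrieved context to the model, return the answer
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[{"role": "user", "content": build_prompt(question, chunks)}],
    )
    return resp.choices[0].message.content
```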

    Notion Database
            ↓
    Ingestion Script
            ↓
    Embeddings
            ↓
    Pinecone Vector DB
            ↓
    Retrieval Script
            ↓
    OpenAI Model
            ↓
    Ask Zeneral Chatbot

    If an image is uploaded into the Notion DB, can the chatbot look for answers in the image too?

    Short answer: Yes, but not automatically with the current setup.

    Your current pipeline (Notion → embeddings → Pinecone → RAG chatbot) only processes text. Images require an extra step to convert them into text or embeddings.

    Notion database
          ↓
    Fetch rows
          ↓
    Check if image exists
          ↓
    Extract image URL
          ↓
    OCR / Vision model
          ↓
    Convert image → text
          ↓
    Create embedding
          ↓
    Store in Pinecone
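A sketch of that pipeline: one pure helper collects image URLs from Notion block objects (following the API's file/external block shape), and a second function sends each image to a vision-capable model to turn it into text. The model choice and prompt wording are illustrative:

```python
def image_urls_from_blocks(blocks):
    """Collect image URLs from Notion block objects.

    Notion serves uploaded images under block["image"]["file"]["url"]
    and linked images under block["image"]["external"]["url"].
    """
    urls = []
    for block in blocks:
        if block.get("type") != "image":
            continue
        image = block["image"]
        source = image.get("file") or image.get("external") or {}
        if "url" in source:
            urls.append(source["url"])
    return urls

def image_to_text(url):
    """Describe an image with a vision-capable model (network call)."""
    from openai import OpenAI
    client = OpenAI()
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative vision-capable model
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": "Transcribe all text and describe this image."},
                {"type": "image_url", "image_url": {"url": url}},
            ],
        }],
    )
    return resp.choices[0].message.content
```

The returned description then goes through the normal pipeline: chunk, embed, store in Pinecone.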

    Enterprise RAG systems often support:

    PDF
    Images
    PowerPoint
    Web pages
    Notion
    Google Docs

    All converted to text chunks before embedding.

    Let us say I have data in Excel sheets and want to add it to the knowledge base so the chatbot can process the data and answer questions.

    Yes — Excel data can definitely be added to your knowledge base, and this is actually a very common requirement in AI assistants. The key point is:

    The chatbot does not read Excel directly.

    We convert Excel rows into text knowledge entries or structured metadata before creating embeddings.

    Since you already have the pipeline (Notion → embeddings → Pinecone → retrieval), Excel can be integrated into the same ingestion pipeline.

    Excel file
          ↓
    Python reads rows
          ↓
    Convert rows → text
          ↓
    Create embeddings
          ↓
    Store in Pinecone
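A sketch of the rows-to-text step. The column names below are hypothetical; the rows themselves would typically come from pandas.read_excel:

```python
def row_to_text(row):
    """Turn one spreadsheet row (as a dict) into a knowledge entry string.

    Column names here are hypothetical; adapt them to your sheet.
    Rows can come from: pandas.read_excel("data.xlsx").to_dict("records")
    """
    return ". ".join(f"{col}: {val}" for col, val in row.items() if val is not None)

# Example row, in the shape pandas would hand it to us:
row = {"Service": "Document Intelligence", "Price": "Custom", "Notes": "Invoices, contracts"}
text = row_to_text(row)
# text is now ready for the usual pipeline: chunk -> embed -> store in Pinecone
```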

    In Notion I added the founder of Zeneral. In the chatbot I asked for “fonder”. Unfortunately it was unable to give an answer. Can the chatbot understand that “fonder” is misspelled and should check for “founder”?

    Yes — a well-designed RAG chatbot can usually handle misspellings like “fonder” vs “founder.” But whether it works depends on where the correction happens in the pipeline. In your case, the failure likely happened before the LLM step — during vector retrieval.

    User question
    ↓
    Create embedding
    ↓
    Vector search in Pinecone
    ↓
    Retrieve context
    ↓
    Send context to LLM
    ↓
    Generate answer

    Solution: Use a Query Rewrite Step

    Before doing vector search, send the user question to the LLM to rewrite it clearly.

    User question
    ↓
    LLM query rewrite
    ↓
    Create embedding
    ↓
    Vector search
    ↓
    Retrieve context
    ↓
    Generate answer
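A sketch of that rewrite step. The prompt wording and model name are illustrative; the key idea is that the LLM call happens before the embedding and vector-search steps:

```python
def rewrite_messages(question):
    """Build the chat messages that ask the LLM to clean up the query."""
    return [
        {"role": "system",
         "content": "Rewrite the user's question clearly, fixing any "
                    "spelling mistakes. Return only the rewritten question."},
        {"role": "user", "content": question},
    ]

def rewrite_query(question):
    """Run the rewrite step before embedding + vector search (network call)."""
    from openai import OpenAI
    client = OpenAI()
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=rewrite_messages(question),
    )
    return resp.choices[0].message.content.strip()

# Pipeline use: embed rewrite_query("who is the fonder of zeneral?")
# instead of the raw question, so "fonder" can retrieve "founder" entries.
```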

  • AI Workshops at 9FAQs: Launch your first website with AI


    Learners and developers often want to try out AI tools and future-proof their careers with AI, whether it is using ChatGPT, Gemini, or Claude to write code or design something!

    At 9faqs, we have opened AI Workshops in the areas below. Take your first step to advance your career with easy and simple AI Workshops.

    Launch your website with AI

    1. Goal: Using AI launch your first website.
    2. Who can join: Anyone who wants to use and learn AI and advance in their career.
    3. Prerequisites: Familiarity with a computer and the Internet.
    4. Duration: 4Hrs.
      1. Introduction to the program: 50 mins
      2. Developing Webpage: 50 mins
      3. Domain booking and Hosting: 50 mins
      4. Q&A and Closing note: 50 mins.
    5. Takeaway: know how to
      1. Book a domain
      2. Create a Responsive, multi-page HTML & CSS webpage that has a header, footer, Nav bar/ dropdown Menu and content.
    6. Hands on Experience with:
      1. AI Chatbot: ChatGPT
      2. IDE: Cursor
      3. Domain: GoDaddy
      4. Hosting platform: Linode
      5. Programming Skills: HTML, CSS
    7. Workshop Fee
      1. Introductory offer INR 1999/- only for the first 100 slots.
      2. INR 3999/- only from slot 101 onwards.
    8. FAQs
      • Is domain and hosting charges included in the program fee?
        • No, it is to be paid by the participant and shall stay with the participant.
      • Why should we join 9faqs
        • 9faqs hosts only instructor based online training programmes.
        • You’ll be part of the 9FAQs Alumni, a vibrant community that is thriving in computer technologies.
          • 10% discount in future courses for 9FAQs Alumni.

  • Python Training at 9faqs!

    This blog post helps our community understand how we train and help build careers in Programming/Software Development.

    Python Weekend Workshop / Crash Course

    Who can join?

    Anyone interested in learning Python and building a career in Programming as a Developer in the Software Industry.

    I never coded before. Can I join?

    Yes, you can join our Python Bootcamp, which is a 90-day program. Talk to the instructor to tailor the program to your needs.

    No for the Python Weekend Crash Course, as it is a fast-paced 2-day workshop.

    Is it interactive live training?

    Yes, an expert in Python programming runs these sessions completely online, with live hands-on practice.

    I do not have a Python environment in my Laptop. Can you help?

    Yes, we provide a cloud-based lab facility throughout the Python Bootcamp program. You can log in and practice anytime.

    As the weekend workshop is short, you will not get cloud support.

    Can I get any Internship?

    Yes, the Python Bootcamp is designed with an internship in mind. Running Mon–Fri, 12 noon to 6 pm, the 90-day Bootcamp gives you plenty of practice time. After the 90 days you can enrol for an additional 3-month internship if you’re looking for a 6-month period.

    How do I register for these programs?

    Navigate to 9faqs.com Python Training pages and enter your details. One of our experts will walk you through the process.

    Can I join Python training anytime?

    Yes, anytime for the Python Bootcamp. For the Python Weekend Workshop you need to select available dates while filling in the form on 9faqs.com.

    Any job assistance provided?

    Under the Python Bootcamp we extend support to all candidates in building their careers. We conduct 1-1 sessions, analyse profiles, support with LinkedIn, and help with mock interviews during the internship. After the internship, interested candidates can buy support for an extended month or as needed.

    Can we get support with general tools used in Software Industry?

    Yes, we provide complete support, including LinkedIn profile help and job assistance. Git, Jira, and Teams are part of your Internship program.

    Folks who join the Python Weekend Workshop/crash course will be introduced to these tools too.

    Any Job Guarantee program available?

    As of now we are not providing any job guarantee program but assistance is provided as mentioned earlier with analysing your profile & mock interviews.

    Can online programs be trusted?

    You’ll love our online programs. They have the flexibility of attending sessions from anywhere. These are not video-based training sessions. You’ll be talking to industry-experienced Python programmers & software developers during your training sessions.

  • Hello Learners. Welcome to 9faqs!

    Welcome to our blog!

    Here is where we keep sharing our thoughts.

    9faqs.com

    9faqs is for the average guys out there like me. I am neither a brilliant guy nor a hard working student. At times I learn myself or search online to up-skill myself.

    Know the basic Python syntax/topics, and 9faqs can drive you through the rest with FAQs/MCQs.