Semantic Chunks for RAG

Last updated: February 8, 2026Reviewed date: February 8, 2026Reading time: 7 min read

Patient Tools

Read, save, and share this guide

Use these quick tools to make this medical article easier to read, print, save, or share with a family member.

Medical guide Rx Data Science and Artificial Intelligence Feb 8, 2026 65 reads

Understand this article easily

Switch between simple English and easy Bangla patient notes. This is for education and does not replace a doctor consultation.

For severe symptoms, danger signs, pregnancy, child illness, or sudden worsening, seek urgent medical care.

বাংলা রোগী নোট এখনো যোগ করা হয়নি। পোস্ট এডিটরে “RX Bangla Patient Mode” বক্স থেকে সহজ বাংলা সারাংশ যোগ করুন।

এই তথ্য শিক্ষা ও সচেতনতার জন্য। এটি ডাক্তারি পরীক্ষা, রোগ নির্ণয় বা প্রেসক্রিপশনের বিকল্প নয়।

Article Summary

In order to abide by the context window of the LLM , we usually break text into smaller parts / pieces which is called chunking. LLMs, although capable of generating text that is both meaningful and grammatically correct, these LLMs suffer from a problem called hallucination. Hallucination in LLMs is the concept where the LLMs confidently generate wrong answers, that is they make up wrong...

Key Takeaways

This article explains Comparison of methods steps: in simple medical language.
This article explains System Integration in simple medical language.
This article explains Semantic Memory in simple medical language.
This article explains Embeddings in Practice in simple medical language.

Educational health guideWritten for patient understanding and clinical awareness.

Reviewed content workflowUse writer and reviewer profiles for stronger trust.

Emergency safety firstUrgent warning signs are highlighted below.

Seek urgent medical care if you notice

These warning signs are general safety guidance. Local emergency numbers and clinical judgment should always come first.

Severe symptoms, breathing difficulty, fainting, confusion, or rapidly worsening illness.
New weakness, severe pain, high fever, or symptoms after a serious injury.
Any symptom that feels urgent, unusual, or unsafe for the patient.

Emergency now

Use emergency care for severe, sudden, rapidly worsening, or life-threatening symptoms.

See a doctor

Book a professional medical evaluation if symptoms persist, worsen, recur often, affect daily activities, or occur in a high-risk patient.

Learn safely

Use this article to understand possible causes, tests, treatment options, prevention, and questions to ask your clinician.

Before reading

RX Patient Tools

Use these quick guides before reading the article, or return to them when you need help preparing questions for a doctor.

Start here Choose the right pathway for symptoms, reports, medicines, or urgent warning signs. Disease article roadmap Read this topic step by step: meaning, symptoms, warning signs, diagnosis, treatment, prevention, and follow-up. Treatment planner Prepare questions about treatment choices, benefits, risks, side effects, and follow-up. Family & caregiver guide Organize symptoms, reports, medicines, questions, and follow-up safely. Nutrition & diet guide Prepare food, hydration, supplement, and medicine-timing questions safely. Prevention guide Organize risk factors, protective habits, screening, and warning signs. Recovery guide Prepare a safe plan for activity, rehabilitation, warning signs, and follow-up.

Definition

In order to abide by the context window of the LLM , we usually break text into smaller parts / pieces which is called chunking.

LLMs, although capable of generating text that is both meaningful and grammatically correct, these LLMs suffer from a problem called hallucination. Hallucination in LLMs is the concept where the LLMs confidently generate wrong answers, that is they make up wrong answers in a way that makes us believe that it is true. This has been a major problem since the introduction of the LLMs. These hallucinations lead to incorrect and factually wrong answers. Hence Retrieval Augmented Generation was introduced.

In RAG, we take a list of documents/chunks of documents and encode these textual documents into a numerical representation called vector embeddings, where a single vector embedding represents a single chunk of document and stores them in a database called vector store. The models required for encoding these chunks into embeddings are called encoding models or bi-encoders. These encoders are trained on a large corpus of data, thus making them powerful enough to encode the chunks of documents in a single vector embedding representation.

The retrieval greatly depends on how the chunks are manifested and stored in the vectorstore. Finding the right chunk size for any given text is a very hard question in general.

Improving Retrieval can be done by various retrieval method. But it can also be done by better chunking strategy.

Different chunking methods:

Fixed size chunking
Recursive Chunking
Document Specific Chunking
Semantic Chunking
Agentic Chunking

Fixed Size Chunking: This is the most common and straightforward approach to chunking: we simply decide the number of tokens in our chunk and, optionally, whether there should be any overlap between them. In general, we will want to keep some overlap between chunks to make sure that the semantic context doesn’t get lost between chunks. Fixed-sized chunking will be the best path in most common cases. Compared to other forms of chunking, fixed-sized chunking is computationally cheap and simple to use since it doesn’t require the use of any NLP libraries.

Recursive Chunking : Recursive chunking divides the input text into smaller chunks in a hierarchical and iterative manner using a set of separators. If the initial attempt at splitting the text doesn’t produce chunks of the desired size or structure, the method recursively calls itself on the resulting chunks with a different separator or criterion until the desired chunk size or structure is achieved. This means that while the chunks aren’t going to be exactly the same size, they’ll still “aspire” to be of a similar size. Leverages what is good about fixed size chunk and overlap.

Document Specific Chunking: It takes into consideration the structure of the document . Instead of using a set number of characters or recursive process it creates chunks that align with the logical sections of the document like paragraphs or sub sections. By doing this it maintains the author’s organization of the content thereby keeping the text coherent. It makes the retrieved information more relevant and useful, particularly for structured documents with clearly defined sections. It can handle formats such as Markdown, Html, etc.

Sematic Chunking: Semantic Chunking considers the relationships within the text. It divides the text into meaningful, semantically complete chunks. This approach ensures the information’s integrity during retrieval, leading to a more accurate and contextually appropriate outcome. It is slower compared to the previous chunking strategy

Agentic Chunk: The hypothesis here is to process documents in a fashion that humans would do.

We start at the top of the document, treating the first part as a chunk.
We continue down the document, deciding if a new sentence or piece of information belongs with the first chunk or should start a new one
We keep this up until we reach the end of the document.

This approach is still being tested and isn’t quite ready for the big leagues due to the time it takes to process multiple LLM calls and the cost of those calls. There’s no implementation available in public libraries just yet.

Here we will experiment with Semantic chunking and Recursive Retriever .

Comparison of methods steps:

Load the Document
Chunk the Document using the following two methods: Semantic chunking and Recursive Retriever .
Assess qualitative and quantitative improvements with RAGAS

Semantic Chunks

Semantic chunking involves taking the embeddings of every sentence in the document, comparing the similarity of all sentences with each other, and then grouping sentences with the most similar embeddings together.

By focusing on the text’s meaning and context, Semantic Chunking significantly enhances the quality of retrieval. It’s a top-notch choice when maintaining the semantic integrity of the text is vital.

The hypothesis here is we can use embeddings of individual sentences to make more meaningful chunks. Basic idea is as follows :-

Split the documents into sentences based on separators(.,?,!)
Index each sentence based on position.
Group: Choose how many sentences to be on either side. Add a buffer of sentences on either side of our selected sentence.
Calculate distance between group of sentences.
Merge groups based on similarity i.e. keep similar sentences together.
Split the sentences that are not similar.

Technology Stack Used

Langchain :LangChain is an open-source framework designed to simplify the creation of applications using large language models (LLMs). It provides a standard interface for chains, lots of integrations with other tools, and end-to-end chains for common applications.
LLM: Groq’s Language Processing Unit (LPU) is a cutting-edge technology designed to significantly enhance AI computing performance, especially for Large Language Models (LLMs). The primary goal of the Groq LPU system is to provide real-time, low-latency experiences with exceptional inference performance.
Embedding Model: FastEmbed is a lightweight, fast, Python library built for embedding generation.
Evaluation: Ragas offers metrics tailored for evaluating each component of your RAG pipeline in isolation.

Semantic chunking is a crucial technique in natural language processing that enhances the efficiency of information retrieval and understanding. By breaking down text into manageable pieces, or chunks, systems can better analyze and respond to user queries. This method is particularly effective when integrated with various retrieval strategies, allowing for both granular and broad searches.

System Integration

Efficient chunking aligns with system capabilities. For example:

Full-Text Search: Use larger chunks to allow algorithms to explore broader contexts effectively. This is useful for searching books based on extensive excerpts or chapters.
Granular Search Systems: Employ smaller chunks to precisely retrieve information relevant to user queries. For instance, if a user asks, “How do I reset my password?”, the system can retrieve a specific sentence or paragraph addressing that action directly.

Semantic Memory

Semantic memory functions similarly to how the human brain stores and retrieves knowledge. It utilizes embeddings to create a semantic memory by representing concepts or entities as vectors in a high-dimensional space. This approach allows models to learn relationships between concepts and make inferences based on the similarity or distance between vector representations. For example, the semantic memory can be trained to understand that “Word” and “Excel” are related concepts because they are both document types and Microsoft products, despite differing file formats and features.

Embeddings in Practice

Software developers can leverage pre-trained embedding models or train their own with custom datasets. Pre-trained models are beneficial as they have been trained on extensive data and can be utilized immediately for various applications. However, custom embedding models may be necessary when dealing with specialized vocabularies or domain-specific language.

Considerations for Retrieval Methods

There are various retrieval strategies to consider:

Similarity Search: A simple method that uses embeddings to find relevant text chunks.
Metadata Filtering: When metadata is available, filtering data based on it before performing a similarity search can yield better results.
Statistical Retrieval Methods: Techniques like TF-IDF and BM25 utilize term frequency and distribution to identify relevant text chunks.

Contextual Retrieval

Not all retrieved text chunks are taken as they are. Sometimes, it is beneficial to include more context around the actual retrieved text chunk. The actual retrieved text chunk is referred to as a “child chunk”, while the larger context it belongs to is called a “parent chunk”. Additionally, providing weights to retrieved documents can enhance relevance; for example, a time-weighted approach can help prioritize the most recent documents.

Safety note: This is not a prescription or diagnosis. For severe symptoms, pregnancy danger signs, children with serious illness, chest pain, breathing difficulty, stroke-like weakness, or major injury, seek urgent care.

Which doctor may help?

Start with a registered doctor or the nearest qualified health center.

What to tell the doctor

Write when the problem started and how it changed.
Bring old prescriptions, investigation reports, and current medicines.
Write allergies, pregnancy status, diabetes, kidney/liver disease, and major past illnesses.
Bring one family member if the patient is weak, elderly, confused, or a child.

Questions to ask

What is the most likely cause of my symptoms?
Which danger signs mean I should go to hospital quickly?
Which tests are necessary now, and which can wait?
How should I take medicines safely and what side effects should I watch for?
When should I come for follow-up?

Tests to discuss

Vital signs: temperature, pulse, blood pressure, oxygen saturation
Basic physical examination by a clinician
CBC, urine test, blood sugar, or imaging only when clinically needed

Avoid these mistakes

Do not use antibiotics, steroid tablets/injections, or strong painkillers without proper medical advice.
Do not hide pregnancy, kidney disease, ulcer, allergy, or blood thinner use.
Do not delay emergency care when danger signs are present.

Medicine safety and first-aid guide

This section is for patient education only. It does not replace a doctor, pharmacist, or emergency care.

Safe first steps

Rest, drink safe water, and observe symptoms carefully.
Keep a written note of symptoms, duration, temperature, medicines already taken, and allergy history.
Seek medical care quickly if symptoms are severe, worsening, or unusual for the patient.

OTC medicine safety

For mild pain or fever, ask a registered pharmacist or doctor before using common over-the-counter pain/fever medicines.
Do not combine multiple pain medicines without advice, especially if you have kidney disease, liver disease, stomach ulcer, asthma, pregnancy, or take blood thinners.
Do not give adult medicines to children unless a qualified clinician advises it.

Avoid these mistakes

Do not start antibiotics without a proper medical decision.
Do not use steroid tablets or injections casually for quick relief.
Do not delay emergency care because of home remedies.

Get urgent help if

Severe symptoms, confusion, fainting, breathing difficulty, chest pain, severe dehydration, or sudden weakness need urgent medical care.

Medicine names, dose, and timing must be decided by a qualified clinician or pharmacist after checking age, pregnancy, allergy, other diseases, and current medicines.

For rural patients and family caregivers

Patient health record and symptom diary

Write your symptoms, medicines already taken, test results, and questions before visiting a doctor. This note stays on your device unless you print or copy it.

Doctor to discuss: Doctor / qualified healthcare provider

Tests to discuss with doctor

Basic vital signs: temperature, pulse, blood pressure, oxygen level if needed
Relevant blood, urine, imaging, or specialist tests only after clinical assessment

Questions to ask

What is the most likely cause of my symptoms?
Which warning signs mean I should go to emergency care?
Which tests are really needed now?
Which medicines are safe for my age, pregnancy status, allergy, kidney/liver/stomach condition, and current medicines?

Emergency warning signs such as chest pain, severe breathing difficulty, sudden weakness, confusion, severe dehydration, major injury, or loss of bladder/bowel control need urgent medical care. Do not wait for online information.

Go to emergency care if you notice:

Severe or rapidly worsening symptoms
Breathing difficulty, chest pain, fainting, confusion, severe weakness, major injury, or severe dehydration

Doctor / service to discuss: Qualified healthcare provider; specialist depends on symptoms and examination.

Step 1
Check danger signs first

If danger signs are present, seek emergency care and do not wait for online information.
Step 2
Record the symptom story

Write when symptoms started, severity, medicines already taken, allergies, pregnancy status, and test results.
Step 3
Visit a qualified clinician

A doctor, nurse, or qualified healthcare provider can examine you and decide which tests or treatment are needed.
Step 4
Do only useful tests

Do tests after clinical assessment. Avoid unnecessary tests, random antibiotics, or repeated medicines without diagnosis.
Step 5
Follow up and return early if worse

If symptoms worsen, new warning signs appear, or treatment is not helping, return for review quickly.

Rural patient practical tips

Take a written symptom diary and all previous prescriptions/test reports.
Do not hide medicines already taken, even herbal or over-the-counter medicines.
Ask which warning signs mean urgent referral to hospital.

This roadmap is for education. A real diagnosis and treatment plan requires history, examination, and clinical judgment.

RX Patient Help

Ask a health question safely

Write your symptom story. A health professional or site editor can review it before any answer is prepared. This box is not for emergency care.

Emergency first: Severe chest pain, breathing trouble, unconsciousness, stroke signs, severe injury, heavy bleeding, or rapidly worsening symptoms need urgent local medical care now.

Website

Name or nickname Phone / contact District / area Age group Sex Severity today

Main symptoms How long has it been happening? Medicines already taken Tests already done Your question 0/1200

Frequently Asked Questions

Is this article a replacement for a doctor?

No. It is educational content only. Patients should consult a qualified clinician for diagnosis and treatment.

When should I seek urgent care?

Seek urgent care for severe symptoms, rapidly worsening condition, breathing difficulty, severe pain, neurological changes, or any emergency warning sign.

A global war against illness

Help this medical guide reach someone who may need it

Share reliable health information with a patient, family member, caregiver, or colleague. Reading and awareness can help people ask better questions and seek appropriate care.

Continue exploring

Explore this topic across the RX Medical Library

Open a focused A–Z pathway or continue with closely related indexed articles. These links are educational and do not replace personal medical care.

Search this topic

Diseases A–Z Drugs A–Z Lab Tests A–Z Cancer A–Z

Diseases A–Z

Read, save, and share this guide

Understand this article easily