Chapter Notes: LLM Engineer’s Handbook - RAG Feature Pipeline
Published:
Book: LLM Engineer’s Handbook
Chapter 4: RAG Feature Pipeline
- Retrieval-augmented generation (RAG) Feature Pipeline: Qdrant vector DB for online serving and ZenML artifacts for offline training.
- Naive RAG
- Chunking
- Embedding
- Vector DBs
- The chapter teaches you what RAG is and how to implement it.
- The main sections of this chapter are:
- Understanding RAG
- An overview of advanced RAG
- Exploring the LLM Twin’s RAG feature pipeline architecture
- Implementing the LLM Twin’s RAG feature pipeline
- A RAG system is composed of three main modules that are independent of each other:
- Ingestion pipeline: A batch or streaming pipeline used to populate the vector DB
- Retrieval pipeline: A module that queries the vector DB and retrieves relevant entries to the user’s input
- Generation pipeline: The layer that uses the retrieved data to augment the prompt and an LLM to generate answers
Ingestion Pipeline
- For the ingestion pipeline, first we need to collect the data.
- The data can come from DBs, APIs, or web pages, and depending on the source, the cleaning step might differ.
- The cleaned data is then chunked (the chunk size depends on the embedding model’s input size).
- Then the chunks are embedded.
- The chunked data, along with its metadata, is then taken by the loading module.
So the flow for the ingestion pipeline is: Collect -> Clean -> Chunk -> Embed -> Load
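A minimal sketch of this flow, assuming sentence-transformers for embeddings and an in-memory Qdrant instance; the model name, collection name, and the cleaning/chunking logic are illustrative, not the book’s exact implementation:

```python
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, PointStruct, VectorParams
from sentence_transformers import SentenceTransformer

# Hypothetical raw documents collected from DBs, APIs, or web pages.
raw_docs = ["<collected article text>", "<collected post text>"]

def clean(text: str) -> str:
    # Minimal cleaning: normalize whitespace (real pipelines do much more).
    return " ".join(text.split())

def chunk(text: str, size: int = 500, overlap: int = 50) -> list[str]:
    # Naive fixed-size character chunks with a small overlap.
    return [text[i : i + size] for i in range(0, len(text), size - overlap)]

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
client = QdrantClient(":memory:")                # in-memory Qdrant for illustration
client.create_collection(
    collection_name="chunks",
    vectors_config=VectorParams(
        size=model.get_sentence_embedding_dimension(),
        distance=Distance.COSINE,
    ),
)

points, point_id = [], 0
for doc in raw_docs:
    for piece in chunk(clean(doc)):
        vector = model.encode(piece).tolist()  # embed each chunk
        points.append(PointStruct(id=point_id, vector=vector,
                                  payload={"text": piece}))  # chunk + metadata
        point_id += 1

client.upsert(collection_name="chunks", points=points)  # load into the vector DB
```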
Retrieval Pipeline
- A retrieval pipeline uses the user input to retrieve the most similar chunks of data.
- For this, the user input first needs to be projected into the same vector space as the chunks of data.
- Then we use a distance metric to get the K nearest elements to it.
- Those elements are used to augment the prompt.
Here, cosine distance is one of the most popular distance metrics used for retrieval. But it is said that the right metric depends on the data and the embedding model we have. How do we decide on the best distance metric?
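Cosine similarity between a query vector q and a chunk vector c is q·c / (‖q‖‖c‖). Continuing the Qdrant sketch above (the query text is illustrative), retrieval reduces to embedding the query into the same space and asking the DB for the K nearest vectors:

```python
query = "What is a RAG feature pipeline?"    # hypothetical user input
query_vector = model.encode(query).tolist()  # same embedding space as the chunks

# K-nearest-neighbour search under the collection's distance metric (cosine here).
hits = client.search(collection_name="chunks", query_vector=query_vector, limit=3)
for hit in hits:
    print(hit.score, hit.payload["text"][:80])  # similarity score + chunk preview
```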
Generation Pipeline
- The final prompt results from a system and prompt template populated with the user’s query and retrieved context. You might have a single prompt template or multiple prompt templates, depending on your application. Usually, all the prompt engineering is done at the prompt template level.
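As a minimal illustration of that template level (the wording is an assumption, not the book’s exact prompt):

```python
SYSTEM_PROMPT = "You are a helpful assistant. Answer only from the provided context."

PROMPT_TEMPLATE = """Context:
{context}

Question: {question}

Answer:"""

def build_prompt(question: str, retrieved_chunks: list[str]) -> str:
    # Join the retrieved chunks into the context section of the template.
    context = "\n\n".join(retrieved_chunks)
    return PROMPT_TEMPLATE.format(context=context, question=question)
```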
Critical aspects affecting the accuracy of RAG systems:
- The embedding model used
- The similarity function used
Embeddings
- Algorithms for creating vector indexes: Random Projection, Hierarchical Navigable Small World (HNSW), Product Quantization (PQ), and Locality-Sensitive Hashing (LSH).
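For example, Qdrant builds an HNSW index for each collection; its parameters can be tuned at collection-creation time (the values below are illustrative, not recommendations):

```python
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, HnswConfigDiff, VectorParams

client = QdrantClient(":memory:")
client.create_collection(
    collection_name="chunks_hnsw",
    vectors_config=VectorParams(size=384, distance=Distance.COSINE),
    hnsw_config=HnswConfigDiff(
        m=16,              # max number of links per node in the HNSW graph
        ef_construct=100,  # size of the candidate list used while building the index
    ),
)
```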
The vanilla RAG framework we just presented doesn’t address many fundamental aspects that impact the quality of the retrieval and answer generation, such as:
- Are the retrieved documents relevant to the user’s question?
- Is the retrieved context enough to answer the user’s question?
- Is there any redundant information that only adds noise to the augmented prompt?
- Does the latency of the retrieval step match our requirements?
- What do we do if we can’t generate a valid answer using the retrieved information?
Therefore, for RAG we need two things:
- robust evaluation of retrieval
- retrieval limitations should be addressed in the algorithm itself
Advanced RAG
The vanilla RAG design can be optimized at three different stages:
- Pre-retrieval
- Retrieval
- Post-retrieval
Pre-retrieval
Most of the data indexing techniques focus on better preprocessing and structuring the data to improve retrieval efficiency, such as:
- Sliding Window
- Enhancing Data Granularity
- Metadata
- Optimizing index structures
- Small-to-big
For query optimization (a query-expansion sketch follows this list), common techniques include:
- Query routing
- Query rewriting
- Paraphrasing
- Synonym substitution
- Sub-queries
- Hypothetical document embeddings (HyDE)
- Query Expansion
- Self-Query
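As a hedged sketch of one of these techniques, query expansion, the snippet below asks an LLM for paraphrases of the user query and retrieves with all of them; the OpenAI model name and prompt wording are assumptions:

```python
from openai import OpenAI

llm_client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def expand_query(query: str, n: int = 3) -> list[str]:
    # Ask an LLM for alternative phrasings of the same question.
    response = llm_client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[{
            "role": "user",
            "content": f"Rewrite the following question in {n} different ways, "
                       f"one per line, keeping the meaning:\n{query}",
        }],
    )
    rewrites = response.choices[0].message.content.strip().splitlines()
    return [query] + [r.strip() for r in rewrites if r.strip()]

# Each expanded query is embedded and searched separately; the result sets
# are then merged and deduplicated before augmenting the prompt.
```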
Retrieval Pipeline Optimization
There are two ways:
- Improve the embedding model
  - by fine-tuning the pre-trained model (very computationally costly, even financially)
  - by using instruction models (less costly)
- Leveraging the DB’s filter and search features (a filtered-search sketch follows below)
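A sketch of the second option, continuing the Qdrant client, embedding model, and `chunks` collection from the ingestion sketch; the `author_id` payload field and its value are illustrative:

```python
from qdrant_client.models import FieldCondition, Filter, MatchValue

# Restrict the nearest-neighbour search to chunks whose payload matches a
# given author; the filter is applied inside the DB, alongside vector scoring.
hits = client.search(
    collection_name="chunks",
    query_vector=model.encode("What did I write about RAG?").tolist(),
    query_filter=Filter(
        must=[FieldCondition(key="author_id", match=MatchValue(value="maxime"))]
    ),
    limit=5,
)
```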
Post-Retrieval Pipeline Optimization
- Re-ranking (see the cross-encoder sketch below)
- Prompt compression
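A hedged sketch of re-ranking with a cross-encoder from sentence-transformers (the model name and top-k value are assumptions):

```python
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")  # assumed model

def rerank(query: str, chunks: list[str], top_k: int = 3) -> list[str]:
    # Score every (query, chunk) pair with the cross-encoder, then keep only
    # the highest-scoring chunks for the final augmented prompt.
    scores = reranker.predict([(query, chunk) for chunk in chunks])
    ranked = sorted(zip(chunks, scores), key=lambda pair: pair[1], reverse=True)
    return [chunk for chunk, _ in ranked[:top_k]]
```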
Exploring LLM Twin’s RAG feature pipeline
To implement the RAG feature pipeline, we have two design choices:

| Batch pipeline | Streaming pipeline |
| --- | --- |
| Runs at regular intervals | Runs continuously |
| Simple to implement | More complex to implement |
| When real-time data processing is not critical | When real-time data processing is critical |
| Handles large volumes of data efficiently | Handles single data points as they arrive |
Core steps for the RAG feature pipeline (a ZenML skeleton follows the list):
- Data Extraction
- Cleaning
- Chunking
- Embedding
- Data Loading: Embedding + Metadata + Chunks
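A skeleton of how these steps could be wired together as a ZenML pipeline, assuming a recent ZenML version where steps and pipelines are plain decorated functions; the step bodies are placeholders, not the book’s actual implementation:

```python
from zenml import pipeline, step

@step
def extract() -> list[str]:
    # Pull raw documents from the source DB / data warehouse.
    return ["<raw document>"]

@step
def clean(raw_docs: list[str]) -> list[str]:
    # Minimal cleaning placeholder.
    return [" ".join(doc.split()) for doc in raw_docs]

@step
def chunk_and_embed(cleaned_docs: list[str]) -> list[dict]:
    # Chunking and embedding combined for brevity; returns chunk + vector + metadata.
    return [{"text": doc, "embedding": [0.0], "metadata": {}} for doc in cleaned_docs]

@step
def load(records: list[dict]) -> None:
    # Upsert embeddings, chunks, and metadata into the vector DB.
    ...

@pipeline
def rag_feature_pipeline():
    raw = extract()
    cleaned = clean(raw)
    records = chunk_and_embed(cleaned)
    load(records)

if __name__ == "__main__":
    rag_feature_pipeline()
```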
Change data capture (CDC)
- A strategy that allows you to optimally keep two or more types of data storage in sync without unnecessary compute and I/O overhead.
- It captures any CRUD operation done on the source DB and replicates it on a target DB.
- Optionally, you can add preprocessing steps in between the replication.
The CDC (Change Data Capture) pattern addresses these issues using two main strategies:
- Push: The source DB actively sends changes to targets, enabling real-time updates. A messaging system buffers changes to prevent data loss if targets are unavailable.
- Pull: The source DB logs changes, and targets fetch them periodically. This reduces source load but introduces delays; a messaging buffer ensures reliability.
The main CDC patterns that are used in the industry:
- Timestamp-based: adds overhead to the source, as we have to query the whole table/dataset.
- Trigger-based: similar overhead.
- Log-based: no overhead on the source system; however, since logs are not standardized, we need vendor-specific implementations (see the MongoDB change-stream sketch below).
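For instance, MongoDB exposes its replication log through change streams, which is one way to implement log-based CDC; the database/collection names and the RabbitMQ queue below are assumptions, not necessarily the book’s exact setup:

```python
import json

import pika
from pymongo import MongoClient

mongo = MongoClient("mongodb://localhost:27017")
collection = mongo["llm_twin"]["documents"]  # hypothetical source collection

# RabbitMQ acts as the messaging buffer between the source and the target.
connection = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = connection.channel()
channel.queue_declare(queue="cdc_events")

# watch() tails MongoDB's oplog-backed change stream (requires a replica set):
# every insert/update/delete on the collection is emitted as an event,
# without polling the source data itself.
with collection.watch() as stream:
    for change in stream:
        channel.basic_publish(
            exchange="",
            routing_key="cdc_events",
            body=json.dumps(change, default=str),
        )
```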
Why is the data stored in two snapshots?
- After the data is cleaned: For fine-tuning LLMs
- After the documents are chunked and embedded: For RAG