
To Embed or Not to Embed? That Is the Question.

Another story from my series about BookMind, my grammar RAG assistant, and this time it really annoyed me. A student asked: "Explain the Past Simple tense." The system gave a decent explanation. Then the student said: "Give me an exercise on this topic." Instead of pulling an exercise from the same unit, the model brought back something from a completely different section. The conversation broke. That was the moment I finally added a proper reranker.

What changed in the pipeline

```python
from sentence_transformers import CrossEncoder

# Stage 1: Hybrid retrieval (25 candidates)
candidates = retriever.invoke(question)

# Stage 2: Cross-Encoder reranking
scores = CrossEncoder('cross-encoder/ms-marco-MiniLM-L-6-v2') \
    .predict([[question, doc.page_content] for doc in candidates])

# Stage 3: Only the best 5 go to the LLM
# (sort by score alone, so tied scores never try to compare Document objects)
final_context = [doc for _, doc in sorted(zip(scores, candidates),
                                          key=lambda pair: pair[0],
                                          reverse=True)][:5]
```

Real conversation after adding reranker

Student asks for the rule → system correctly
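To make the rerank-and-truncate step concrete, here is a minimal, self-contained sketch. A dummy word-overlap scorer stands in for the cross-encoder call (so no model download is needed); the sorting and truncation mirror the pipeline above. The `dummy_score` and `rerank` names, and the sample candidate texts, are illustrative, not part of BookMind.

```python
def dummy_score(question: str, passage: str) -> float:
    # Hypothetical relevance score: fraction of question words found in the
    # passage. A real cross-encoder scores the (question, passage) pair jointly.
    q_words = set(question.lower().split())
    p_words = set(passage.lower().split())
    return len(q_words & p_words) / max(len(q_words), 1)

def rerank(question: str, candidates: list[str], top_k: int = 5) -> list[str]:
    scores = [dummy_score(question, doc) for doc in candidates]
    # Sort by score only (key=...), so ties never compare the documents themselves.
    ranked = sorted(zip(scores, candidates), key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in ranked[:top_k]]

# Toy candidate pool, as hybrid retrieval might return it.
candidates = [
    "The Past Simple tense describes finished actions.",
    "Irregular verbs in the Present Perfect.",
    "Exercise: put the verbs in the Past Simple tense.",
    "Articles: a, an, the.",
    "Past Simple tense: negative forms and questions.",
    "Modal verbs of obligation.",
]

top = rerank("Explain the Past Simple tense", candidates, top_k=3)
```

With the dummy scorer, the three surviving passages are exactly the Past Simple ones, which is the behavior the reranker is meant to guarantee before the context reaches the LLM.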
Continue reading on Dev.to



