
Embeddings shouldn't need a notebook
I kept running into the same annoyance whenever I needed embeddings. The retrieval part of a RAG pipeline was hard enough already. Generating vectors should've been the easy part. But every time I needed to embed something, the workflow looked like this: Open a notebook or write a throwaway script Import the SDK, set up the client Figure out the right model name (was it text-embedding-004 or text-embedding-3-small ?) Write the call, handle the response format Copy the vector out of the output For text that's annoying. For images or audio it's worse. Different SDKs, different input formats, different response shapes. I kept thinking: I can curl an API in seconds. I can jq a JSON response without writing a script. Why can't I just embed something from the terminal? The tool I wanted Something like httpie but for embeddings. Type a command, get a vector back. vemb text "hello world" # {"model": "gemini-embedding-2-preview", "dimensions": 3072, "values": [0.0123, -0.0456, ...]} vemb text "
Continue reading on Dev.to Python
Opens in a new tab




