
Gemini: Building a LINE E-commerce Chatbot That Can "Tell Stories" from Images
References:
Gemini API - Function Calling with Multimodal
GitHub: linebot-gemini-multimodel-funcal
Vertex AI - Multimodal Function Response
Complete code: GitHub

Background

I believe many people have already used the combination of LINE Bot + Function Calling. When a user asks "What clothes did I buy last month?", the Bot calls a database query function, retrieves the order data, and Gemini then answers based on the returned JSON.

The traditional flow designed by developers:

User: "Help me take a look at the jacket I bought before"
Bot: [Call get_order_history()]
Function returns: {"product_name": "Brown pilot jacket", "order_date": "2026-01-15", ...}
Gemini: "You bought a brown pilot jacket on January 15th for NT$1,890."

The answer is completely correct, but something always feels missing: the user is talking about "that jacket", while Gemini is only restating the text in the JSON and has no way to "confirm" what that piece of clothing actually looks like. If there happen to be three jackets in the
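
The traditional text-only flow described above can be sketched roughly as follows. This is a minimal illustration, not the article's actual code: the in-memory `ORDERS` list, the `keyword` parameter, the `price_twd` field, and the `restate` helper (a stand-in for Gemini's text answer) are all assumptions made for the example, and no real Gemini or LINE API call is involved.

```python
import json

# Hypothetical in-memory stand-in for the order database.
# "price_twd" is an assumed field; the source JSON elides the remaining fields.
ORDERS = [
    {"product_name": "Brown pilot jacket", "order_date": "2026-01-15", "price_twd": 1890},
]


def get_order_history(keyword: str) -> list[dict]:
    """Return orders whose product name contains the keyword (case-insensitive)."""
    return [o for o in ORDERS if keyword.lower() in o["product_name"].lower()]


# Registry mapping the function names a model may request to Python callables.
TOOLS = {"get_order_history": get_order_history}


def handle_function_call(name: str, args: dict) -> str:
    """Run the requested tool and serialize its result as JSON text."""
    result = TOOLS[name](**args)
    return json.dumps({"orders": result}, ensure_ascii=False)


def restate(function_response_json: str) -> str:
    """Stand-in for the model's answer: it can only restate text fields from the JSON."""
    orders = json.loads(function_response_json)["orders"]
    if not orders:
        return "I could not find a matching order."
    first = orders[0]
    return (
        f"You bought a {first['product_name'].lower()} "
        f"on {first['order_date']} for NT${first['price_twd']:,}."
    )


if __name__ == "__main__":
    payload = handle_function_call("get_order_history", {"keyword": "jacket"})
    print(restate(payload))
```

Note that the function response here carries only text fields, so whatever consumes it has nothing visual to check the user's "that jacket" against, which is the gap the background section points at.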
Continue reading on Dev.to Tutorial



