![[Gemini] Building a LINE E-commerce Chatbot That Can "Tell Stories from Images"](/_next/image?url=https%3A%2F%2Fmedia2.dev.to%2Fdynamic%2Fimage%2Fwidth%3D800%252Cheight%3D%252Cfit%3Dscale-down%252Cgravity%3Dauto%252Cformat%3Dauto%2Fhttps%253A%252F%252Fdev-to-uploads.s3.amazonaws.com%252Fuploads%252Farticles%252Fuc7ulj3k2ehr5j0fwdch.png&w=1200&q=75)
# [Gemini] Building a LINE E-commerce Chatbot That Can "Tell Stories from Images"
**Reference articles:**

- Gemini API - Function Calling with Multimodal
- GitHub: linebot-gemini-multimodel-funcal
- Vertex AI - Multimodal Function Response
- Complete code: GitHub

## Background

I believe many people have used the combination of LINE Bot + Function Calling. When a user asks "What clothes did I buy last month?", the Bot calls the database query function, retrieves the order data, and Gemini then answers based on that JSON.

The traditional process designed by developers:

> User: "Help me see the jacket I bought before"
> Bot: [ Call `get_order_history()` ]
> Function returns: `{ "product_name": "Brown pilot jacket", "order_date": "2026-01-15", ... }`
> Gemini: "You bought a brown pilot jacket on January 15th for NT$1,890."

The answer is completely correct, but it always feels like something is missing: the user is talking about "that jacket," while Gemini merely restates the text in the JSON, with no way to "confirm" what the jacket actually looks like. If there happen to be three jackets in the database, t…
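The round trip described above can be sketched in plain Python. This is a minimal illustration, not the article's implementation: the `ORDERS` data, its fields, and the `get_order_history` helper are hypothetical stand-ins for the real database function, and the model's decision to call the tool is hard-wired here rather than made by Gemini's Function Calling.

```python
import json

# Hypothetical order database; names and fields are illustrative only.
ORDERS = [
    {"product_name": "Brown pilot jacket", "order_date": "2026-01-15", "price_ntd": 1890},
]

def get_order_history(keyword: str) -> str:
    """The tool the model would call: return matching orders as a JSON string."""
    hits = [o for o in ORDERS if keyword.lower() in o["product_name"].lower()]
    return json.dumps(hits, ensure_ascii=False)

def handle_message(user_text: str) -> str:
    """Simulate one Function Calling round trip for a single user message."""
    # In a real bot, Gemini inspects user_text and decides to call the tool;
    # here that decision is hard-wired to show the data flow.
    orders = json.loads(get_order_history("jacket"))
    if not orders:
        return "No matching orders found."
    o = orders[0]
    # Gemini would normally phrase this answer itself from the returned JSON.
    return f"You bought a {o['product_name']} on {o['order_date']} for NT${o['price_ntd']}."

print(handle_message("Help me see the jacket I bought before"))
# → You bought a Brown pilot jacket on 2026-01-15 for NT$1890.
```

The key point the article builds on is visible even in this sketch: everything Gemini can say comes from the JSON string, so the model has no visual grounding for "that jacket."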
Continue reading on Dev.to
