elsewhere, a text-to-3D studio
This is a submission for the Built with Google Gemini: Writing Challenge What I Built with Google Gemini I built a high performance text-to-3D model studio that works straight from the browser! A user describes what they want in natural language from “cute cat” to “floating pizza with laser eyes” and Gemini generates an interactive 3D model (in THREE.js). Asset generation is a two-phase pipeline with: ## Planning phase — Gemini receives the user prompt + PLANNING_SYSTEM_PROMPT_V4 as systemInstruction. Temperature 0.5, thinkingLevel: 'low', max 8192 output tokens. It returns a v3 schema JSON: an array of 3-6 materials (color, roughness, metalness) and 4-12 parts, each specifying geometry type (Box|Sphere|Cylinder|Cone|Torus|Lathe|Tube|Dome), parent reference, priority (1-3), material index, geometry parameters, and instance transforms (position/rotation/scale arrays). The LLM never writes executable code and rather it describes geometry in a constrained JSON vocabulary. ## Compilation p
Continue reading on Dev.to
Opens in a new tab



