Explorable World Systems

A comparison of AI systems that generate or interact with 3D explorable worlds.

High-Level Comparison

System	Input	Output	Runtime	Primary Focus
Genie 3	Text, Image	Interactive 3D	Cloud	Game world generation
SIMA 2	Gameplay video	Agent behavior	Cloud	Game-playing AI agent
WorldGen	Text	3D Worlds (USD)	Cloud	Structured 3D world generation
Marble	Text, Image, Video, 3D	Gaussian Splats, 3D	Cloud	World model for 3D generation
Artcraft	Text	Images, Video	Cloud	Creative IDE for AI media
Intangible	Text	3D Scenes	Cloud	Camera-centric scene composition
SceneCraft	Text	Interactive narratives	Cloud	AI storytelling for education
Unity AI Beta	Text	3D Scenes	Cloud + Local	AI-assisted game development
Roblox AI	Text	Game Objects	Cloud	AI-generated game objects
Moonlake Reverie	Text	Interactive 2D/3D	Cloud	Generative game engine
LocalGPT Gen	Text	Interactive 3D (glTF)	Local	Open-source world building

Feature Comparison

Feature	Genie 3	SIMA 2	WorldGen	Marble	Artcraft	Intangible	SceneCraft	Unity AI	Roblox AI	Moonlake	LocalGPT Gen
Text-to-3D	✓	—	✓	✓	—	✓	—	✓	✓	✓	✓
Image-to-3D	✓	—	—	✓	—	—	—	—	—	—	—
Interactive playback	✓	✓	—	—	—	✓	✓	✓	✓	✓	✓
Real-time simulation	✓	✓	—	—	—	—	—	✓	✓	✓	✓
Structured generation	—	—	✓	—	—	—	✓	✓	✓	✓	✓
Local execution	—	—	—	—	—	—	—	✓	—	—	✓
Open source	—	—	—	—	—	—	—	—	—	—	✓
Procedural audio	—	—	—	—	—	—	—	—	—	—	✓
glTF/USD export	—	—	✓	✓	—	✓	—	✓	—	—	✓
Agent control	—	✓	—	—	—	—	—	—	—	✓	✓

System Highlights

Genie 3 (DeepMind)

Foundation world model that generates interactive 3D environments from a single text prompt or image. Designed for rapid game prototyping and synthetic data generation.

SIMA 2 (DeepMind)

Gemini-powered agent that learns to play 3D games by watching gameplay video. Self-improving through experience, it reasons about game objectives and adapts to new environments.

WorldGen (Meta Reality Labs)

Structured 3D world generation from Meta Research. Uses a multi-stage pipeline — LLM generates high-level layout parameters, then procedural systems handle actual geometry placement. Outputs USD scenes with terrain, structures, vegetation, and props. Key insight: LLMs should generate parameters, not geometry directly. LocalGPT Gen's WorldGen pipeline implements a similar blockout-first architecture locally.

Marble (World Labs)

Multimodal world model that creates 3D scenes from text, images, video, or 3D layouts. Exports as Gaussian splats for high-fidelity rendering.

Artcraft

IDE for AI-assisted creative work. Combines image generation, video creation, 3D compositing, character posing, and scene blocking in a unified interface.

Intangible

Spatial intelligence platform focused on camera-centric 3D composition. Designed for creative industries needing precise camera control and scene layout.

SceneCraft (EngageAI Institute)

AI-powered storytelling platform that generates interactive, narrative-based learning experiences from natural language prompts. Developed by the EngageAI Institute (NSF AI Institute for Engaged Learning, award DRL-2112635) across NC State, UNC, Indiana University, Vanderbilt, and Digital Promise. Teachers input prompts describing desired story foundations; SceneCraft generates scenes, characters, and dialogue that educators can fully customize to align with instructional goals.

Unity AI Beta

Unity's 2026 AI Beta integrates AI-powered tools directly into the Unity Editor for generating and modifying game objects, scenes, and assets from natural language prompts. Combines cloud AI services with local editor execution.

Unity AI Beta 2026 announcement

Roblox AI

Roblox brings AI-generated game objects to its developer tools, enabling creators to generate 3D objects, textures, and game assets from text descriptions directly within Roblox Studio.

Roblox AI-generated game objects

Moonlake Reverie

Generative Game Engine (GGE) from Moonlake AI that transforms text descriptions into playable 2D and 3D interactive worlds. Founded by Fan-Yun Sun and Sharon Lee (Stanford AI Lab), backed by $28M seed from AIX Ventures, Threshold, and NVIDIA Ventures. Combines multimodal reasoning with program synthesis and simulation layers — spatial layout, physics, and agent behaviors are generated structurally, then a real-time diffusion model conditioned on 3D signals provides visual reskinning. Unlike video-only generation, Reverie maintains world state across interactions, enabling consistent interactive sessions.

LocalGPT Gen

Open-source, local-first 3D world generation powered by Bevy. Features procedural audio synthesis, data-driven behaviors, and full glTF export. Runs entirely on your machine without cloud dependencies.

Showcases:

localgpt-gen-workspace — "World as skill" examples: complete explorable worlds saved as reusable, shareable skills
proofof.video — Video gallery comparing world generations across different models using the same or similar prompts

See the Gen documentation for details on LocalGPT's world generation capabilities.

High-Level Comparison​

Feature Comparison​

System Highlights​

Genie 3 (DeepMind)​

SIMA 2 (DeepMind)​

WorldGen (Meta Reality Labs)​

Marble (World Labs)​

Artcraft​

Intangible​

SceneCraft (EngageAI Institute)​

Unity AI Beta​

Roblox AI​

Moonlake Reverie​

LocalGPT Gen​