This is the Immersive Book 2.0 project, a team project built at the Immerse The Bay 2024 hackathon at Stanford, November 8-10, 2024.
The original project idea is described here
Devpost from the hackathon is here: https://devpost.com/software/immersivebook2-0
- David Kordsmeier
- Kazuki Yoda
- Victor Arroyo
- Youssif Seisa
cd ImmersiveBook2\prototypes\elixr-poc1
npm install
npm run serve
- On a Meta Quest 3 (or a similar headset), connect to the same local WiFi network as the web server (`XR Hackathon 5GHz` at the venue)
- Open the browser and enter the web server's local IPv4 address and port, for example https://10.20.30.40:8081/ (a minimal HTTPS-serving sketch follows these steps)
- Allow VR mode in the browser (enable it in the browser settings if it is not already allowed)
- Click the gray `Enter VR` button at the top-left with the Meta Quest controller
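WebXR only runs in a secure context, which is why the example address above is an https URL. Below is a minimal sketch of serving the prototype over HTTPS with a self-signed certificate; the actual `npm run serve` script may use a different server, and the `key.pem`/`cert.pem` file names are assumptions.

```js
// Minimal HTTPS static file server sketch. WebXR requires a secure context:
// navigator.xr is unavailable over plain HTTP, so VR mode will not work there.
// Assumes self-signed key.pem/cert.pem generated beforehand (e.g. with openssl).
import https from "https";
import fs from "fs";
import path from "path";

const MIME = {
  ".html": "text/html",
  ".js": "text/javascript",
  ".json": "application/json",
  ".glb": "model/gltf-binary",
  ".jpg": "image/jpeg",
};

const server = https.createServer(
  { key: fs.readFileSync("key.pem"), cert: fs.readFileSync("cert.pem") },
  (req, res) => {
    // Map the request URL to a file in the current directory, defaulting to index.html.
    const urlPath = req.url === "/" ? "/index.html" : req.url.split("?")[0];
    const filePath = path.join(process.cwd(), urlPath);
    fs.readFile(filePath, (err, data) => {
      if (err) {
        res.writeHead(404);
        res.end("Not found");
        return;
      }
      res.writeHead(200, {
        "Content-Type": MIME[path.extname(filePath)] || "application/octet-stream",
      });
      res.end(data);
    });
  }
);

server.listen(8081, () => console.log("Serving on https://0.0.0.0:8081/"));
```

The Quest browser will typically warn about the self-signed certificate the first time the page loads and must be told to proceed.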
- Meta Quest 3/3s (VR)
- Chrome Browser/PC/Mac/Linux
- NVIDIA GPU
- Node.js (Web server)
- Three.js (3D rendering)
- WebXR (VR mode on WEB browser)
- Python (backend/ML)
- Stable Diffusion (SDXL) (Image Generation)
- A1111 WebUI (Pipeline/API for Stable Diffusion)
- ControlNet (Guided image generation on Stable Diffusion)
- DUSt3R: Geometric 3D Vision Made Easy (image to multi-view-consistent point cloud)
- LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes (point cloud)
- DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
- OpenAI API (GPT-4o/4V) (preprocessing of book text and prompt generation for Stable Diffusion; see the sketch after this list)
- RunPod (cloud GPU server)
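Here is a minimal sketch of the GPT-4o preprocessing step noted above: condensing a book sentence into a short, visual Stable Diffusion prompt. It assumes the official `openai` npm client and an `OPENAI_API_KEY` in the environment; the real preprocessing code in the pipeline may differ.

```js
// Minimal sketch: turn a book sentence into a short Stable Diffusion prompt with GPT-4o.
// Assumes the official "openai" npm package and OPENAI_API_KEY set in the environment.
import OpenAI from "openai";

const client = new OpenAI(); // picks up OPENAI_API_KEY automatically

async function sentenceToPrompt(sentence) {
  const res = await client.chat.completions.create({
    model: "gpt-4o",
    messages: [
      {
        role: "system",
        content:
          "Rewrite the user's sentence as a short, concrete, visual image prompt for Stable Diffusion.",
      },
      { role: "user", content: sentence },
    ],
  });
  return res.choices[0].message.content.trim();
}

// Example usage:
// const prompt = await sentenceToPrompt("The ship drifted past the glowing reef at dusk.");
```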
- src: the final source code
- assets: a JSON file containing links to any static assets
- pipeline: the generative AI pipeline, used in combination with the graphics pipeline
- prototypes: early prototypes that may later be thrown out or kept for historical reasons
- node 16 is fine
The current design is WebXR-based, so there is not really anything to build, but setting up a build helps if you want to prep for cloud deployment.
For testing:
npm run serve
The goal for the hackathon was to show off this workflow:
- A Reading Room, where a user can choose between 2-3 books
- The books load a PDF
- In "reader" mode, the user can read line by line from a real PDF
- The reading can simply show the text flowing in space
- As each line is read, a generative pipeline kicks off
- The generative pipeline uses the sentence as a prompt to a Stable Diffusion endpoint. This produces one image, which is fed into DUSt3R. The resulting point cloud is exported as a .glb file (see the sketch after this list)
- The .glb is loaded into the scene and appears outside the window
- Continue until the user puts the book away
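Below is a minimal sketch of the per-sentence generation step in the list above. The A1111 txt2img endpoint and its request/response shape are real, but the server address, the `/dust3r` backend route that wraps DUSt3R, and that route's response fields are assumptions for illustration; in the hackathon design the DUSt3R step runs on the Python/RunPod backend, so the browser only deals with the finished .glb.

```js
// Minimal sketch: sentence -> Stable Diffusion image -> DUSt3R point cloud (.glb) -> scene.
// The /dust3r route and SD_URL are illustrative assumptions, not the project's actual API.
import { GLTFLoader } from "three/examples/jsm/loaders/GLTFLoader.js"; // path varies by three.js version

const SD_URL = "http://127.0.0.1:7860"; // A1111 WebUI started with --api

async function sentenceToGlbUrl(sentence) {
  // 1. Sentence -> image via the A1111 txt2img endpoint (returns base64-encoded PNGs).
  const sdRes = await fetch(`${SD_URL}/sdapi/v1/txt2img`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ prompt: sentence, steps: 20, width: 1024, height: 1024 }),
  });
  const { images } = await sdRes.json();

  // 2. Image -> multi-view-consistent point cloud via DUSt3R, wrapped here by a
  //    hypothetical backend route that converts the result to .glb and returns its URL.
  const dustRes = await fetch("/dust3r", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ image: images[0] }),
  });
  const { glbUrl } = await dustRes.json();
  return glbUrl;
}

// 3. Load the .glb into the Three.js scene so it appears outside the reading-room window.
function addGlbToScene(scene, glbUrl) {
  new GLTFLoader().load(glbUrl, (gltf) => scene.add(gltf.scene));
}
```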
TODO
TODO
- Elixr Prototype: https://elixrjs.io/index.html
- Meta WebXR example for Quest: https://github.com/meta-quest/webxr-first-steps
- AFrame examples: https://glitch.com/~aframe-basic-guide
- Content: https://sketchfab.com/3d-models/cartoon-lowpoly-small-city-free-pack-edd1c604e1e045a0a2a552ddd9a293e6
- sky.jpg from Basic Scene - A-Frame
- dat.gui https://github.com/dataarts/dat.gui
- AR.js prototype: https://ar-js-org.github.io/AR.js-Docs/
MIT/X