Text-to-Scene Generation

Vision language agents as text-to-scene generators
Dec 2024

course

This was the course project for COS 429 (Computer Vision) at Princeton.

Advisors: Prof. Olga Russakovsky and Prof. Vikram Ramaswamy.

Collaborator: Cyrus Vachha.

Qualitative results

Project Report