Text-to-Scene Generation
Vision language agents as text-to-scene generators
Dec 2024
course
It was the course project of COS 429 (Computer Vision) at Princeton.
Advisor: Prof. Olga Russakovsky and Prof. Vikram Ramaswamy.
Collaborator: Cyrus Vachha.
Qualitative results