Research Scientist · NVIDIA Research
Email: jindongj at nvidia.com · Google Scholar · GitHub
I am a Research Scientist at NVIDIA Research, where I work on model architectures for vision-language models (VLMs) and large language models (LLMs).
Rutgers University.
Token-Efficient Long Video Understanding for Multimodal LLMs
Slot State Space Models
SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Object-Centric Slot Diffusion
Generative Neurosymbolic Machines
Improving Generative Imagination in Object-Centric World Models
SCALOR: Generative World Models with Scalable Object Representations
SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition