Publications

(2026). Nemotron-Labs-Diffusion: A Tri-Mode Language Model Unifying Autoregressive, Diffusion, and Self-Speculation Decoding. NVIDIA Research.

(2026). Not All Prefills Are Equal: PPD Disaggregation for Multi-turn LLM Serving. ICML 2026.

(2026). Scaling Beyond Masked Diffusion Language Models. ICML 2026.

(2025). Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed. ICML 2026.

(2025). TiDAR: Think in Diffusion, Talk in Autoregression. MLSys 2026 (Oral).

Talks: ASAP Seminar; Discrete Diffusion Reading Group.

(2024). How Far Are We From AGI? TMLR 2024.

(2024). Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning. WACV 2025.

(2023). Effective Long-Context Scaling of Foundation Models. NAACL 2024 Main Conference.

(2023). Code Llama: Open Foundation Models for Code. Meta AI.

(2023). Text-guided 3D Human Generation from 2D Collections. EMNLP Findings 2023.

(2023). CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding.
