Dhwanil R. Chauhan

Graduate Research Assistant · CIVS · Purdue University Northwest

chauha56@purdue.edu

Hammond, Indiana

I am a graduate researcher at the Center for Innovation Through Visualization and Simulation (CIVS) at Purdue University Northwest, advised by Yang Ni. My research sits at the intersection of multimodal AI, audio-visual learning, and multi-agent systems, with a focus on building intelligent systems that can perceive, reason, and act in real-world environments.

My work is grounded in an active industrial research partnership through the Steel Manufacturing Simulation and Visualization Consortium (SMSVC), where I develop AI systems that address real safety and operational challenges in manufacturing environments. This applied context shapes how I think about research: reliability, deployment constraints, and robustness are not afterthoughts but design requirements.

Current research threads:

  • Audio-visual spatial reasoning — I contributed to a new feed-forward framework for novel-view acoustic synthesis that bypasses explicit 3D reconstruction, presented at CVPR Workshop 2026.
  • Industrial AI safety — I have built and deployed conversational AI systems for safety incident management and multi-camera spatial reasoning systems for dynamic hazard detection in active melt shop environments (AISTech 2025, 2026).
  • VLM robustness — I am leading a benchmark evaluating 20 vision-language models under simultaneous visual and linguistic degradation conditions, targeting IEEE TPAMI / IJCV.
  • Multi-agent agentic systems — I am rebuilding our industrial safety AI from a monolithic pipeline into a modular multi-agent architecture, targeting ACL.

Before Purdue, I completed my undergraduate studies at Charotar University of Science and Technology (CHARUSAT), India, where I published across a range of ML domains, including security, medical imaging, and natural language. That work gave me a foundation in the research process before I found the problems that would define my direction.

I am applying to PhD programs for Fall 2027, seeking to work on multimodal agentic systems that are robust, grounded, and deployable in high-stakes real-world settings. If our work overlaps, I would love to connect.

news

Mar 31, 2026 Our paper Visual Geometry Grounded Novel-View Acoustic Synthesis has been accepted at the CVPR Workshop 2026. First unified framework for novel-view acoustic synthesis bypassing explicit 3D reconstruction via feed-forward visual geometry grounding.
Feb 10, 2026 Our paper Development of Trialing Image Detection for a Melt Shop Safety Tool has been accepted at AISTech 2026. Multi-camera spatial reasoning system for real-time dynamic safety zone reconfiguration in active industrial environments.

selected publications

  1. Visual Geometry Grounded Novel-View Acoustic Synthesis
    Jay Polra, Dhwanil Chauhan, Wenjun Huang, and 3 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2026
    Purdue University Northwest · UC Irvine · San Diego State University
  2. Development of Trialing Image Detection for a Melt Shop Safety Tool
    Kyle Toth, Jay Polra, Dhwanil Chauhan, and 4 more authors
    In AISTech 2026 — Iron and Steel Technology Conference Proceedings, 2026
    Purdue University Northwest · CIVS · SMSVC
  3. VLM Robustness Benchmark Under Simultaneous Multimodal Degradation
    Dhwanil Chauhan and others
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2026
    In Preparation — Targeting IEEE TPAMI / IJCV
  4. A Dual-Model Approach to Industrial Safety: Computer Vision for PPE Compliance and Hazard-Zone Monitoring in Steel Production
    Kyle Toth, Monika Singhal, Dhwanil Chauhan, and 4 more authors
    Integrating Materials and Manufacturing Innovation, May 2026
    Purdue University Northwest · CIVS · SMSVC