B.Tech ECE
IIT Roorkee
Jul 2014 - May 2018
Research Intern
Caltech
Oct 2018 - Dec 2020
MS Robotics
Carnegie Mellon University
Aug 2021 - Aug 2023
ML Engineer Intern
Cruise
May 2022 - Aug 2022
ML Scientist
Tesla
Aug 2023 - Present

Publications

  • Reasoning with Latent Diffusion in Offline Reinforcement Learning
    Presented at International Conference on Learning Representations (ICLR) 2024

    PDF BibTeX

  • Distributional Distance Classifiers for Goal-Conditioned Reinforcement Learning
    Presented at New Frontiers in Learning, Control, and Dynamical Systems, ICML Workshop 2023

    PDF BibTeX

  • Deep Bayesian Quadrature Policy Optimization
    Proceedings of the 35th AAAI Conference on Artificial Intelligence 2021
    Presented at the NeurIPS Deep RL & Real-World RL Workshops 2020

    Video Project PDF Poster BibTeX Blog Slides Code

  • Reinforced Multi-task Approach for Multi-hop Question Generation
    Proceedings of the 28th International Conference on Computational Linguistics (COLING) 2020 (Long Paper)

    PDF BibTeX

  • Enhancing Perceptual Loss with Adversarial Feature Matching for Super-Resolution
    Proceedings of the 31st International Joint Conference on Neural Networks (IJCNN) 2020 (Oral)

    PDF BibTeX

  • A Randomized Kernel-Based Secret Image Sharing Scheme
    Presented at the IEEE International Workshop on Information Forensics and Security (WIFS) 2018

    PDF BibTeX

Recent Posts

Bayesian Quadrature for Policy Gradient

An introduction to Bayesian quadrature for policy gradient estimation, and the computation methods required for scaling it to high-dimensional settings. Introduces two policy gradient estimators, (i) Deep Bayesian Quadrature Policy Gradient (DBQPG) and (ii) Uncertainty Aware Policy Gradient (UAPG), that serve as a drop-in replacement for Monte-Carlo estimation in most policy gradient algorithms while offering more accurate policy gradient estimates, superior sample complexity, and higher average return.

Latest Updates

[Apr 2024] "Reasoning with Latent Diffusion in Offline Reinforcement Learning" accepted to ICLR 2024.
[Sept 2023] Started as a Machine Learning Scientist at Tesla AI.
[Jul 2023] "Distributional Distance Classifiers for Goal-Conditioned Reinforcement Learning" accepted to the New Frontiers in Learning, Control, and Dynamical Systems Workshop, ICML 2023.
[May 2022] Starting as an ML Engineer Intern in the Maneuver Planning team at Cruise.
[Aug 2021] Starting my M.S. Robotics (MSR) degree at the Robotics Institute, Carnegie Mellon University (CMU).
[Aug 2021] Attended Machine Learning Summer School (MLSS) Taipei.
[Dec 2020] "Deep Bayesian Quadrature Policy Optimization" accepted to AAAI 2021.
[Oct 2020] "Deep Bayesian Quadrature Policy Optimization" accepted to NeurIPS Deep RL and Real-World RL workshops.
[Sep 2020] "Reinforced Multi-task Approach for Multi-hop Question Generation" accepted to COLING 2020.
[May 2020] " Enhancing Perceptual Loss with Adversarial Feature Matching for Super-Resolution" accepted to IJCNN 2020.
[Oct 2019] Serving as Secondary Reviewer for AAAI 2020.
[Sept 2019] Serving as Secondary Reviewer for Machine Learning and the Physical Sciences (ML4PS), NeurIPS (2019) workshop.