ArXiv Manuscripts
- Xinlei Chen, Hao Fang, Tsung-Yi Lin, Ramakrishna Vedantam, Saurabh Gupta, Piotr Dollar, C. Lawrence Zitnick
Microsoft COCO Captions: Data Collection and Evaluation Server
[April, 2015]
Publications
- Ramakrishna Vedantam, Ian Fischer, Jonathan Huang, Kevin Murphy
Generative Models of Visually Grounded Imagination
International Conference on Learning Representations (ICLR), 2018
[Code]
- Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra
Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization
International Conference on Computer Vision (ICCV), 2017
[Code][Blog][Demo] - Ashwin K. Vijayakumar, Ramakrishna Vedantam, Devi Parikh
Sound-Word2Vec: Learning Word Representations Grounded in Sounds
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017 (Short)
- Prithvijit Chattopadhyay*, Ramakrishna Vedantam*, Ramprasaath RS, Dhruv Batra, Devi Parikh
Counting Everyday Objects in Everyday Scenes
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight)
[Code] - Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik
Context-aware Captions from Context-agnostic Supervision
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight)
[Project Page] [arXiv]
- Ramprasaath R. Selvaraju, Abhishek Das, Ramakrishna Vedantam, Michael Cogswell, Devi Parikh, Dhruv Batra
Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization
NIPS Workshop on Interpretable Machine Learning in Complex Systems, 2016
- C. Lawrence Zitnick, Ramakrishna Vedantam, Devi Parikh
Adopting Abstract Images for Semantic Scene Understanding
Special Issue on the best papers at the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2016 -
Satwik Kottur*, Ramakrishna Vedantam*, Jose“ Moura, Devi Parikh
Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
[Project Page] [Code] [arXiv] - Ramakrishna Vedantam*, Xiao Lin*, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh
Learning Common Sense Through Visual Abstraction
IEEE International Conference on Computer Vision (ICCV), 2015
[Project page] [Code]
- Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh
CIDEr: Consensus-based Image Description Evaluation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
[Project Page] [Code][arXiv]
* Equal Contribution