Publications

Latest arXiv Manuscripts

topics

Xiaoliang Dai*, Ji Hou*, Chih-Yao Ma*, Sam Tsai*, Jialiang Wang*, Rui Wang*, Peizhao Zhang*, Simon Vandenhende, Xiaofang Wang, Abhimanyu Dubey, Matthew Yu, Abhishek Kadian, Filip Radenovic, Dhruv Mahajan, Kunpeng Li, Yue Zhao, Vladan Petrovic, Mitesh Kumar Singh, Simran Motwani, Yi Wen, Yiwen Song, Roshan Sumbaly^, Vignesh Ramanathan^, Zijian He^, Peter Vajda^, Devi Parikh^

* equal contribution, alphabetical order, ^ equal last authors

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

arXiv:2309.15807, 2023

AI + Creativity


2024 [back to top]

topics

Uriel Singer*, Amit Zohar*, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman

* equal contribution

Video Editing via Factorized Diffusion Distillation (a.k.a. Emu Video Edit)

European Conference on Computer Vision (ECCV), 2024 (Oral)

[project page]

AI + Creativity


topics

Rohit Girdhar*^, Mannat Singh*^, Andrew Brown*, Quentin Duval*, Samaneh Azadi*, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

* equal first authors, ^ equal technical contributions

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

European Conference on Computer Vision (ECCV), 2024

[project page]

AI + Creativity


topics

Shelly Sheynin*, Adam Polyak*, Uriel Singer*, Yuval Kirstain*, Amit Zohar*, Oron Ashual, Devi Parikh, Yaniv Taigman

* equal contribution

Emu Edit: Precise Image Editing via Recognition and Generation Tasks

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 (Highlight)

[project page]

AI + Creativity


2023 [back to top]

topics

Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta

Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation

International Conference on Computer Vision (ICCV), 2023

[project page]

AI + Creativity


topics

Samaneh Azadi*, Thomas Hayes*, Akbar Shah, Guan Pang, Devi Parikh, Sonal Gupta

* equal contribution

Text-Conditional Contextualized Avatars For Zero-Shot Personalization

arXiv:2304.07410, 2023

AI + Creativity


topics

Uriel Singer*, Shelly Sheynin*, Adam Polyak*, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman

* equal contribution

Text-To-4D Dynamic Scene Generation

International Conference on Machine Learning (ICML), 2023

[project page]

AI + Creativity


topics

Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin

SpaText: Spatio-Textual Representation for Controllable Image Generation

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023

[project page]

AI + Creativity


topics

Uriel Singer*, Adam Polyak*, Thomas Hayes*, Xi Yin*, Jie An, Songyang Zhang, Qiyuan (Isabelle) Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh*, Sonal Gupta*, Yaniv Taigman*

* Core contributors

Make-A-Video: Text-to-Video Generation without Text-Video Data

International Conference on Learning Representations (ICLR), 2023

[project page]

AI + Creativity


topics

Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi

AudioGen: Textually Guided Audio Generation

International Conference on Learning Representations (ICLR), 2023

[project page]

AI + Creativity


2022 [back to top]

topics

Thomas Hayes*, Songyang Zhang*, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh

* equal contribution

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

European Conference on Computer Vision (ECCV), 2022

[project page]

AI + Creativity


topics

Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

European Conference on Computer Vision (ECCV), 2022

[Illustrated story: The Little Red Boat][Illustrated story: New Adventures] [Blog post]

AI + Creativity


topics

Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

European Conference on Computer Vision (ECCV), 2022

[project page]

AI + Creativity


topics

Ramya Srinivasan, Devi Parikh

Building Bridges: Generative Artworks to Explore AI Ethics

Ethical Considerations in Creative Applications of Computer Vision (EC3V) Workshop at CVPR, 2022

AI + Creativity


topics

Samyak Datta, Sameer Dharur, Vincent Cartillier, Ruta Desai, Dhruv Batra, Devi Parikh

Episodic Memory Question Answering

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 (Oral)


topics

Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gokhan Tür, Devi Parikh, Dilek Hakkani-Tür

VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator

Findings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2022


2021 [back to top]

topics

Safinah Ali, Devi Parikh

Telling Creative Stories Using Generative Visual Aids

Machine Learning for Creativity and Design Workshop at Neural Information Processing Systems (NeurIPS), 2021

AI + Creativity


topics

Gunjan Aggarwal, Devi Parikh

Dance2Music: Automatic Dance-driven Music Generation

Machine Learning for Creativity and Design Workshop at Neural Information Processing Systems (NeurIPS), 2021

AI + Creativity


topics

Sasha Sheng*, Amanpreet Singh*, Vedanuj Goswami, Jose Alberto Lopez Magana, Wojciech Galuba, Devi Parikh, Douwe Kiela

* equal contribution

Human-Adversarial Visual Question Answering

Neural Information Processing Systems (NeurIPS), 2021


topics

Songwei Ge, Devi Parikh

Visual Conceptual Blending with Large-scale Language and Vision Models

International Conference on Computational Creativity (ICCC), 2021 (Oral)

AI + Creativity


topics

Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal

Contrast and Classify: Alternate Training for Robust VQA

International Conference on Computer Vision (ICCV), 2021


topics

Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, C. Lawrence Zitnick

ForceNet: A Graph Neural Network for Large-Scale Quantum Calculations

ICLR workshop on Deep Learning for Simulation, 2021 (Best Paper Award)


topics

Lowik Chanussot*, Abhishek Das*, Siddharth Goyal*, Thibaut Lavril*, Muhammed Shuaibi*, Morgane Rivière, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, Zachary Ulissi

* equal contribution

The Open Catalyst 2020 (OC20) Dataset and Community Challenges

ACS Catalysis, 2021

[dataset][code][opencatalystproject.org]


topics

C. Lawrence Zitnick, Lowik Chanussot, Abhishek Das, Siddharth Goyal, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Thibaut Lavril, Aini Palizhati, Morgane Rivière, Muhammed Shuaibi, Anuroop Sriram, Kevin Tran, Brandon Wood, Junwoong Yoon, Devi Parikh, Zachary Ulissi

An Introduction to Electrocatalyst Design using Machine Learning for Renewable Energy Storage

arXiv:2010.09435, 2020

[dataset][code][opencatalystproject.org]


topics

Sameer Dharur, Purva Tendulkar, Dhruv Batra, Devi Parikh, Ramprasaath R. Selvaraju

SOrT-ing in VQA: Contrastive Gradient Learning for Improved Consistency

Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021


topics

Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021


topics

Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani

VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021


topics

Songwei Ge, Vedanuj Goswami, C. Lawrence Zitnick, Devi Parikh

Creative Sketch Generation

International Conference on Learning Representations (ICLR), 2021

[demo][code and datasets][project page]

AI + Creativity


2020 [back to top]



topics

Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James Rehg, Stefan Lee, Peter Anderson

Where Are You? Localization from Embodied Dialog

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020


topics

Purva Tendulkar, Abhishek Das, Ani Kembhavi, Devi Parikh

Feel The Music: Automatically Generating A Dance For An Input Song

International Conference on Computational Creativity (ICCC), 2020 (Oral)

[dances][code][demo][Tech@Facebook article]

AI + Creativity


topics

Devi Parikh, C. Lawrence Zitnick

Exploring Crowd Co-creation Scenarios for Sketches

International Conference on Computational Creativity (ICCC), 2020

[sketching interface]

AI + Creativity


topics

Gunjan Aggarwal, Devi Parikh

Neuro-Symbolic Generative Art: A Preliminary Study

International Conference on Computational Creativity (ICCC), 2020

[examples][demo]

AI + Creativity


topics

X. Alice Li, Devi Parikh

Lemotif: An Affective Visual Journal Using Deep Neural Networks

International Conference on Computational Creativity (ICCC), 2020 (Oral)

[demo][code]

AI + Creativity


topics

Devi Parikh

Predicting A Creator’s Preferences In, and From, Interactive Generative Art

International Conference on Computational Creativity (ICCC), 2020

[art interface]

AI + Creativity







topics

Devendra Singh Chaplot, Lisa Lee, Ruslan Salakhutdinov, Devi Parikh, Dhruv Batra

Embodied Multimodal Multitask Learning

International Joint Conference on Artificial Intelligence (IJCAI), 2020

[webpage]



topics

Jiasen Lu*, Vedanuj Goswami*, Marcus Rohrbach, Devi Parikh, Stefan Lee

* equal contribution

12-in-1: Multi-Task Vision and Language Representation Learning

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

[demo]


topics

Ramprasaath R. Selvaraju, Purva Tendulkar, Devi Parikh, Eric Horvitz, Marco Ribeiro, Besmira Nushi, Ece Kamar

SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 (Oral)


topics

Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra

Decentralized Distributed PPO: Solving PointGoal Navigation

International Conference on Learning Representations (ICLR), 2020


2019 [back to top]


topics

Peter Anderson*, Ayush Shrivastava*, Devi Parikh, Dhruv Batra, Stefan Lee

* equal contribution

Chasing Ghosts: Instruction Following as Bayesian State Tracking

Neural Information Processing Systems (NeurIPS), 2019


topics

Jianwei Yang, Zhile Ren, Hongyuan Zhu, Ji Lin, Chuang Gan, Devi Parikh

Cross-Channel Communication Networks

Neural Information Processing Systems (NeurIPS), 2019


topics

Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das

Improving Generative Visual Dialog by Answering Diverse Questions

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019





topics

Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman

Fashion++: Minimal Edits for Outfit Improvement

International Conference on Computer Vision (ICCV), 2019


topics

Jianwei Yang*, Zhile Ren*, Mingze Xu, Xinlei Chen, David Crandall, Devi Parikh, Dhruv Batra

* equal contribution

Embodied Visual Recognition

International Conference on Computer Vision (ICCV), 2019




topics

Harsh Agrawal*, Karan Desai*, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson

* equal contribution

nocaps: novel object captioning at scale

International Conference on Computer Vision (ICCV), 2019

www.nocaps.org


topics

Purva Tendulkar, Kalpesh Krishna, Ramprasaath R. Selvaraju, Devi Parikh

Trick or TReAT: Thematic Reinforcement for Artistic Typography

International Conference on Computational Creativity (ICCC), 2019 (Oral)

[demo]

AI + Creativity




topics

Yash Goyal, Ziyan Wu, Jan Ernst, Dhruv Batra, Devi Parikh, Stefan Lee

Counterfactual Visual Explanations

International Conference on Machine Learning (ICML), 2019


topics

Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Michael Rabbat, Joelle Pineau

TarMAC: Targeted Multi-Agent Communication

International Conference on Machine Learning (ICML), 2019



topics

Meet Shah, Xinlei Chen, Marcus Rohrbach, Devi Parikh

Cycle-Consistency for Robust Visual Question Answering

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral)

[webpage]


topics

Amanpreet Singh, Vivek Natarajan, Meet Shah, Yu Jiang, Xinlei Chen, Dhruv Batra, Devi Parikh, Marcus Rohrbach

Towards VQA Models That Can Read

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019

www.textvqa.org


topics

Erik Wijmans*, Samyak Datta*, Oleksandr Maksymets*, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra

* equal contribution

Embodied Question Answering in Photorealistic Environments with Point Cloud Perception

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral)


topics

Huda Alamri, Vincent Cartillier, Abhishek Das, Jue Wang, Stefan Lee, Peter Anderson, Irfan Essa, Devi Parikh, Dhruv Batra, Anoop Cherian, Tim K. Marks, Chiori Hori

Audio-Visual Scene-Aware Dialog

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019

www.video-dialog.com


topics

Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach

CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog

Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2019




2018 [back to top]


topics

Jianwei Yang*, Jiasen Lu*, Stefan Lee, Dhruv Batra, Devi Parikh

* equal contribution

Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition

Conference on Robot Learning (CoRL), 2018 (Oral)


topics

Abhishek Das, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra

Neural Modular Control for Embodied Question Answering

Conference on Robot Learning (CoRL), 2018 (Spotlight)



topics

Arjun Chandrasekaran*, Viraj Prabhu*, Deshraj Yadav*, Prithvijit Chattopadhyay*, Devi Parikh

* equal contribution

Do Explanations Make VQA Models More Predictable To A Human?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018




topics

Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh

Graph R-CNN for Scene Graph Generation

European Conference on Computer Vision (ECCV), 2018


topics

Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra

Visual Dialog

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018

[www.visualdialog.org]


topics

Arjun Chandrasekaran, Devi Parikh, Mohit Bansal

Punny Captions: Witty Wordplay in Image Descriptions

Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018


topics

Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra

Embodied Question Answering

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 (Oral)


topics

Jiasen Lu*, Jianwei Yang*, Dhruv Batra, Devi Parikh

* equal contribution

Neural Baby Talk

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 (Spotlight)


topics

Aishwarya Agrawal, Dhruv Batra, Devi Parikh, Ani Kembhavi

Don’t Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018


2017 [back to top]

topics

Aishwarya Agrawal*, Jiasen Lu*, Stanislaw Antol*, Margaret Mitchell, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra

* equal contribution

VQA: Visual Question Answering

Special Issue on Combined Image and Language Understanding

International Journal of Computer Vision (IJCV), 2017

[www.visualqa.org]




topics

Prithvijit Chattopadhyay*, Deshraj Yadav*, Viraj Prabhu, Arjun Chandrasekaran, Abhishek Das, Stefan Lee, Dhruv Batra, Devi Parikh

* equal contribution

Evaluating Visual Conversational Agents via Cooperative Human-AI Games

AAAI Conference on Human Computation and Crowdsourcing (HCOMP), 2017


topics

Mike Lewis, Denis Yarats, Yann N. Dauphin, Devi Parikh, Dhruv Batra

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017


topics

Ashwin Kalyan, Ramakrishna Vedantam, Devi Parikh

Sound-Word2Vec: Learning Word Representations Grounded in Sounds

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017


topics

Alexander Miller, Will Feng, Adam Fisch, Jiasen Lu, Dhruv Batra, Antoine Bordes, Devi Parikh, Jason Weston

ParlAI: A Dialog Research Software Platform (Demo)

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017


topics

Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra

Visual Dialog

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight)

[www.visualdialog.org] [video]


topics

Jiasen Lu*, Caiming Xiong*, Devi Parikh, Richard Socher

* equal contribution

Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight)

[code]


topics

Ramakrishna Vedantam, Samy Bengio, Kevin P Murphy, Devi Parikh, Gal Chechik

Context-aware Captions from Context-agnostic Supervision

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight)


topics

Prithvijit Chattopadhyay*, Ramakrishna Vedantam*, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh

* equal contribution

Counting Everyday Objects in Everyday Scenes

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight)


topics

Yash Goyal*, Tejas Khot*, Douglas Summers-Stay, Dhruv Batra, Devi Parikh

* equal contribution

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering (a.k.a. The VQA v2.0 Dataset)

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

[www.visualqa.org] [video]


topics

Jianwei Yang, Anitha Kannan, Dhruv Batra, Devi Parikh

LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation

International Conference on Learning Representations (ICLR), 2017


topics

Rogerio Schmidt Feris, Christoph H. Lampert, Devi Parikh (Editors)

Visual Attributes (Book)

Series on Advances in Computer Vision and Pattern Recognition, Springer, 2017

[springer link]


topics

Tanmay Batra, Devi Parikh

Cooperative Learning with Visual Attributes

arxiv.org/abs/1705.05512, 2017





2016 [back to top]

topics

Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh

Hierarchical Question-Image Co-Attention for Visual Question Answering

Neural Information Processing Systems (NIPS), 2016


topics

Harsh Agrawal, Arjun Chandrasekaran, Dhruv Batra, Devi Parikh, Mohit Bansal

Sort Story: Sorting Jumbled Images and Captions into Stories

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016


topics

Arijit Ray, Gordon Christie, Mohit Bansal, Dhruv Batra, Devi Parikh

Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016


topics

Aishwarya Agrawal, Dhruv Batra, Devi Parikh

Analyzing the Behavior of Visual Question Answering Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016


topics

Abhishek Das*, Harsh Agrawal*, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra

* equal contribution

Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016

Also presented at:

Workshop on Visualization for Deep Learning at

International Conference on Machine Learning (ICML), 2016

(Best student paper)


topics

Yash Goyal, Akrit Mohapatra, Devi Parikh, Dhruv Batra

Towards Transparent AI Systems: Interpreting Visual Question Answering Models

Workshop on Visualization for Deep Learning at

International Conference on Machine Learning (ICML), 2016

(Best student paper)


topics

Xiao Lin, Devi Parikh

Leveraging Visual Question Answering for Image-Caption Ranking

European Conference on Computer Vision (ECCV), 2016



topics

C. Lawrence Zitnick, Aishwarya Agrawal, Stanislaw Antol, Margaret Mitchell, Dhruv Batra, Devi Parikh

Measuring Machine Intelligence Through Visual Question Answering

AI Magazine, 2016

[www.visualqa.org]



topics

Peng Zhang*, Yash Goyal*, Douglas Summers-Stay, Dhruv Batra, Devi Parikh

* equal contribution

Yin and Yang: Balancing and Answering Binary Visual Questions

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016


topics

Satwik Kottur*, Ramakrishna Vedantam*, José M. F. Moura, Devi Parikh

* equal contribution

Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016

[project page (including code)]


topics

Jianwei Yang, Devi Parikh, Dhruv Batra

Joint Unsupervised Learning of Deep Representations and Image Clusters

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016


topics

Ting-Hao Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Aishwarya Agrawal, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell

Visual Storytelling

Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016

[project page with dataset]


topics

Nasrin Mostafazadeh, Nate Chambers, Xiaodong He, Devi Parikh, Dhruv Batra, Lucy Vanderwende, Pushmeet Kohli, James F. Allen

A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories

Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016 (Oral)

[project page with data and evaluation]



topics

C. Lawrence Zitnick, Ramakrishna Vedantam, Devi Parikh

Adopting Abstract Images for Semantic Scene Understanding

Special Issue on the best papers at the

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2016

[project page, data, slides, video, etc.]


topics

Roozbeh Mottaghi, Sanja Fidler, Alan L. Yuille, Raquel Urtasun, Devi Parikh

Human-Machine CRFs for Identifying Bottlenecks in Scene Understanding

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2016

[supplementary material]


2015 [back to top]

topics

Stanislaw Antol*, Aishwarya Agrawal*, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh

* equal contribution

VQA: Visual Question Answering

International Conference on Computer Vision (ICCV), 2015

[www.visualqa.org]


topics

Ramakrishna Vedantam*, Xiao Lin*, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh

* equal contribution

Learning Common Sense Through Visual Abstraction

International Conference on Computer Vision (ICCV), 2015

[supplementary material] [project page]



topics

Mainak Jas, Devi Parikh

Image Specificity

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015 (Oral)

[extended abstract] [talk (video)] [project page with code, data, slides, etc.]


topics

Arturo Deza, Devi Parikh

Understanding Image Virality

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015

[extended abstract] [project page with code, data, etc.]


topics

Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh

CIDEr: Consensus-based Image Description Evaluation

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015

[extended abstract] [project page with code, data, etc.]


topics

Adriana Kovashka, Devi Parikh, Kristen Grauman

WhittleSearch: Interactive Image Search with Relative Attribute Feedback

International Journal of Computer Vision (IJCV), 2015

[project page and data][poster][video]


2014 [back to top]

topics

Shrenik Lad, Devi Parikh

Interactively Guiding Semi-Supervised Clustering via Attribute-based Explanations

European Conference on Computer Vision (ECCV), 2014

[project page]


topics

Aayush Bansal, Ali Farhadi, Devi Parikh

Towards Transparent Systems: Semantic Characterization of Failure Modes

European Conference on Computer Vision (ECCV), 2014

[project page]


topics

Phillip Isola, Devi Parikh, Jianxiong Xiao, Antonio Torralba, Aude Oliva

What makes a photograph memorable?

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2014


topics

Peng Zhang, Jiuling Wang, Ali Farhadi, Martial Hebert, Devi Parikh

Predicting Failures of Vision Systems

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014

[project page]


topics

Gordon Christie, Amar Parkash, Ujwal Krothapalli, Devi Parikh

Predicting User Annoyance Using Visual Attributes

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014

[project page]


topics

Xiao Lin, Michael Cogswell, Devi Parikh, Dhruv Batra

Propose and Re-rank Semantic Segmentation via Deep Image Classification

Big Vision workshop

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014

[project page]


2013 [back to top]

topics

Aayush Bansal, Adarsh Kowdle, Devi Parikh, Andrew C. Gallagher, C. Lawrence Zitnick

Which Edges Matter?

Workshop on 3D Representation and Recognition (3dRR)

International Conference on Computer Vision (ICCV), 2013


topics

Devi Parikh

Visual Attributes for Enhanced Human-Machine Communication (Invited paper)

Allerton Conference on Communication, Control and Computing, 2013 (Oral)


topics

Naman Turakhia, Devi Parikh

Attribute Dominance: What Pops Out? 

International Conference on Computer Vision (ICCV), 2013

[project page and data] [poster]



topics

Devi Parikh, Kristen Grauman

Implied Feedback: Learning Nuances of User Behavior in Image Search

International Conference on Computer Vision (ICCV), 2013

[supp material] [poster]



topics

C. Lawrence Zitnick, Devi Parikh

Bringing Semantics Into Focus Using Visual Abstraction

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013 (Oral)

[project page, data, slides, video, etc.]


topics

Roozbeh Mottaghi, Sanja Fidler, Jian Yao, Raquel Urtasun, Devi Parikh

Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013

[poster]


topics

Arijit Biswas, Devi Parikh

Simultaneous Active Learning of Classifiers & Attributes via Relative Feedback

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013

[project page and data] [poster] [demo]


topics

Mohammad Rastegari, Ali Diba, Devi Parikh, Ali Farhadi

Multi-Attribute Queries: To Merge or Not to Merge?

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013

[poster]


topics

Naman Agrawal, Arijit Biswas, Adriana Kovashka, Kristen Grauman, Devi Parikh

Relative Attributes for Enhanced Human-Machine Communication (Demo)

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013

[classifier feedback demo]


2012 [back to top]

topics

Amar Parkash, Devi Parikh

Attributes for Classifier Feedback

European Conference on Computer Vision (ECCV), 2012 (Oral)

[slides] [talk (video)] [project page and data] [demo]


topics

Devi Parikh, Adriana Kovashka, Amar Parkash, Kristen Grauman

Relative Attributes for Enhanced Human-Machine Communication (Invited paper)

AAAI Conference on Artificial Intelligence (AAAI), 2012 (Oral)

[Relative description demo] [Classifier feedback demo]


topics

Congcong Li, Devi Parikh, Tsuhan Chen

Automatic Discovery of Groups of Objects for Scene Understanding

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012

[project page] [poster]


topics

Adriana Kovashka, Devi Parikh, Kristen Grauman

WhittleSearch: Image Search with Relative Attribute Feedback

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012

[project page and data] [poster] [demo] [video]


topics

Kun Duan, Devi Parikh, David J. Crandall, Kristen Grauman

Discovering Localized Attributes for Fine-grained Recognition

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012

[project page] [poster]


topics

C. Lawrence Zitnick, Devi Parikh

The Role of Image Understanding in Contour Detection

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012

[project page] [data] [poster]


topics

Devi Parikh, Phillip Isola, Antonio Torralba, Aude Oliva

Understanding the Intrinsic Memorability of Images (Abstract)

Vision Sciences Society (VSS), 2012

[project page] [MIT news]



2011 [back to top]

topics

Dhruv Batra, Adarsh Kowdle, Devi Parikh, Jiebo Luo, Tsuhan Chen

Interactive Co-segmentation of Objects in Image Collections (Book)

SpringerBriefs in Computer Science, 2011

[springer link]


topics

Phillip Isola, Devi Parikh, Antonio Torralba, Aude Oliva

Understanding the Intrinsic Memorability of Images

Neural Information Processing Systems (NIPS), 2011

[project page] [MIT news]


topics

Devi Parikh, Kristen Grauman

Relative Attributes

International Conference on Computer Vision (ICCV), 2011 (Oral)

Marr Prize (Best Paper Award) Winner

[project page] [data] [code] [slides] [talk (video)] [poster] [relative description demo] [classifier feedback demo]


topics

Devi Parikh

Recognizing Jumbled Images: The Role of Local and Global Information in Image Classification

International Conference on Computer Vision (ICCV), 2011

[poster] [slides]


topics

Congcong Li, Devi Parikh, Tsuhan Chen

Extracting Adaptive Contextual Cues from Unlabeled Regions

International Conference on Computer Vision (ICCV), 2011

[project page]


topics

Devi Parikh, C. Lawrence Zitnick

Finding the Weakest Link in Person Detectors

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011

[project page] [data] [poster] [slides]


topics

Devi Parikh, Kristen Grauman

Interactively Building a Discriminative Vocabulary of Nameable Attributes

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011

[supplementary material] [project page] [poster] [slides]


topics

Devi Parikh, Kristen Grauman

Interactive Discovery of Task-Specific Nameable Attributes (Abstract)

First Workshop on Fine-Grained Visual Categorization (FGVC) 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011 (Best Poster Award)

[project page] [poster]


topics

Andrew Gallagher, Dhruv Batra, Devi Parikh

Inference for Order Reduction in MRFs

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011



topics

C. Lawrence Zitnick, Devi Parikh

Color Source Separation for Enhanced Pixel Manipulations

MSR-TR-2011-98, Microsoft Research, 2011


2010 [back to top]

topics

Devi Parikh, C. Lawrence Zitnick

The Role of Features, Algorithms and Data in Visual Recognition

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010

[poster] [slides]


topics

Dhruv Batra, Andrew Gallagher, Devi Parikh, Tsuhan Chen

Beyond Trees: MRF Inference via Outer-Planar Decomposition

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010

[poster]


topics

Dhruv Batra, Adarsh Kowdle, Devi Parikh, Jiebo Luo, Tsuhan Chen

iCoseg: Interactive Co-segmentation with Intelligent Scribble Guidance

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010 

[poster] [project page and dataset]


2009 [back to top]

topics

Devi Parikh

Modeling Context for Image Understanding: When, For What, How?

Ph.D. Thesis, Carnegie Mellon University, 2009


topics

Dhruv Batra, Devi Parikh, Adarsh Kowdle, Tsuhan Chen, Jiebo Luo

Seed Image Selection in Interactive Cosegmentation

IEEE International Conference on Image Processing (ICIP), 2009


topics

Devi Parikh, C. Lawrence Zitnick, Tsuhan Chen

Unsupervised Learning of Hierarchical Spatial Structures in Images

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009

[poster] [slides]


topics

Dhruv Batra, Adarsh Kowdle, Devi Parikh, Tsuhan Chen

Cutout-Search: Putting a Name to the Picture

Workshop on Internet Vision

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009


topics

Dhruv Batra, Adarsh Kowdle, Kevin Tang, Devi Parikh, Jiebo Luo, Tsuhan Chen

Interactive Cosegmentation by Touch (Demo)

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009

[project page]


topics

Ching-Hao Mao, Hahn-Ming Lee, Devi Parikh, Tsuhan Chen, Si-Yu Huang

Semi-Supervised Cotraining and Active Learning based Approach for Multi-view Intrusion Detection

ACM Symposium on Applied Computing (SAC), 2009


2008 [back to top]

topics

Devi Parikh, Tsuhan Chen

Unsupervised Modeling of Objects and their Hierarchical Contextual Interactions

EURASIP Journal on Image and Video Processing

Special Issue on Patches in Vision, 2008

[slides]


topics

Devi Parikh, Tsuhan Chen

Data Fusion and Cost Minimization for Intrusion Detection

IEEE Transactions on Information Forensics and Security

Special Issue on Statistical Methods for Network Security and Forensics, 2008


topics

Robi Polikar, Apostolos Topalis, Devi Parikh, Deborah Green, Jennifer Frymiare, John Kounios, Christopher M. Clark

An Ensemble Based Data Fusion for Early Diagnosis of Alzheimer’s Disease

Information Fusion, Special Issue on Applications of Ensemble Methods, 2008


topics

Devi Parikh, C. Lawrence Zitnick, Tsuhan Chen

Determining Patch Saliency Using Low-Level Context

European Conference on Computer Vision (ECCV), 2008

[poster] [slides]


topics

Devi Parikh, C. Lawrence Zitnick, Tsuhan Chen

From Appearance to Context-Based Recognition: Dense Labeling in Small Images

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008

[poster]


topics

Devi Parikh, Tsuhan Chen

Bringing Diverse Classifiers to Common Grounds: dtransform

International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2008

[slides]


topics

Devi Parikh, Gavin Jancke

Localization and Segmentation of a 2D High Capacity Color Barcode

Workshop on Applications of Computer Vision (WACV), 2008

[slides]


2007 [back to top]

topics

Devi Parikh, Robi Polikar

An Ensemble Based Incremental Learning Approach to Data Fusion

IEEE Transactions on Systems, Man and Cybernetics, 2007



topics

Devi Parikh, Tsuhan Chen

Hierarchical Semantics of Objects (hSOs)

IEEE International Conference on Computer Vision (ICCV), 2007

[poster] [slides]


topics

Devi Parikh, Tsuhan Chen

Classification-Error Cost Minimization Strategy: dCMS

IEEE Statistical Signal Processing Workshop, 2007

[poster]


topics

Devi Parikh, Tsuhan Chen

Unsupervised Learning of Hierarchical Semantics of Objects (hSOs)

Beyond Patches Workshop

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2007

(Best Paper Award)

[slides]


topics

Devi Parikh, Rahul Sukthankar, Tsuhan Chen, Mei Chen

Feature-based Part Retrieval for Interactive 3D Reassembly

IEEE Workshop on Applications of Computer Vision (WACV), 2007

[poster] [slides]


2006 [back to top]

topics

Robi Polikar, Devi Parikh, Shreekanth Mandayam

Multiple Classifiers System for Multisensor Data Fusion

IEEE Proceedings on Sensors Applications Symposium, 2006


2005 [back to top]

topics

Yusuf A. Mehta, Kauser Jahan, Jim Laicovsky, Laura Miller, Devi Parikh, Alicia Licon Lozano

Evaluate the Effect of Coarse and Fine Rubber Particles on Laboratory Rutting Performance of Asphalt Concrete Mixtures

The Journal of Solid Waste Technology And Management, 2005


topics

Devi Parikh, Nick Stepenosky, Apostolos Topalis, Deborah Green, John Kounios, Christopher Clark, Robi Polikar

Ensemble Based Data Fusion for Early Diagnosis of Alzheimer’s Disease

IEEE Proceedings on Engineering in Medicine and Biology, 2005


topics

Devi Parikh, Robi Polikar

A Multiple Classifier Approach for Multisensor Data Fusion

IEEE International Conference on Information Fusion (FUSION), 2005


2004 [back to top]

topics

Devi Parikh, Min T. Kim, Joseph Oagaro, Shreekanth Mandayam, Robi Polikar

Combining Classifiers for Multisensor Data Fusion

IEEE International Conference on Systems, Man and Cybernetics, 2004


topics

Devi Parikh, Min T. Kim, Joseph Oagaro, Shreekanth Mandayam, Robi Polikar

Ensemble of Classifiers Approach for NDT Data Fusion

IEEE Proceedings on Ultrasonics, Ferroelectrics and Frequency Control, 2004


2002 [back to top]

Webpage design courtesy Abhishek Das.