Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Much Ado About Time: Exhaustive Annotation of Temporal Data

Gunnar A. SigurdssonOlga RussakovskyAli Farhadiand Abhinav Gupta

2016

HCOMP

Large-scale annotated datasets allow AI systems to learn from and build upon the knowledge of the crowd. Many crowdsourcing techniques have been developed for collecting image annotations. These…

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images

Roozbeh MottaghiHessam BagherinezhadMohammad Rastegariand Ali Farhadi

2016

CVPR

In this paper, we study the challenging problem of predicting the dynamics of objects in static images. Given a query object in an image, our goal is to provide a physical understanding of the…

PDFFigures 2.0: Mining Figures from Research Papers

Christopher Clark and Santosh Divvala

2016

JCDL

Figures and tables are key sources of information in many scholarly documents. However, current academic search engines do not make use of figures and tables when semantically parsing documents or…

Situation Recognition: Visual Semantic Role Labeling for Image Understanding

Mark YatskarLuke Zettlemoyerand Ali Farhadi

2016

CVPR

This paper introduces situation recognition, the problem of producing a concise summary of the situation an image depicts including: (1) the main activity (e.g., clipping), (2) the participating…

Stating the Obvious: Extracting Visual Common Sense Knowledge

Mark YatskarVicente Ordonezand Ali Farhadi

2016

NAACL

Obtaining common sense knowledge using current information extraction techniques is extremely challenging. In this work, we instead propose to derive simple common sense statements from fully…

Toward a Taxonomy and Computational Models of Abnormalities in Images

Babak SalehAhmed ElgammalJacob Feldmanand Ali Farhadi

2016

AAAI

The human visual system can spot an abnormal image, and reason about what makes it strange. This task has not received enough attention in computer vision. In this paper we study various types of…

Unsupervised Deep Embedding for Clustering Analysis

Junyuan XieRoss Girshickand Ali Farhadi

2016

ICML

Clustering is central to many data-driven application domains and has been studied extensively in terms of distance functions and grouping algorithms. Relatively little work has focused on learning…

"What happens if..." Learning to Predict the Effect of Forces in Images

Roozbeh MottaghiMohammad RastegariAbhinav Guptaand Ali Farhadi

2016

ECCV

What happens if one pushes a cup sitting on a table toward the edge of the table? How about pushing a desk against a wall? In this paper, we study the problem of understanding the movements of…

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks

Mohammad RastegariVicente OrdonezJoseph Redmonand Ali Farhadi

2016

ECCV

We propose two efficient approximations to standard convolutional neural networks: Binary-Weight-Networks and XNOR-Networks. In Binary-Weight-Networks, the filters are approximated with binary…

You Only Look Once: Unified, Real-Time Object Detection

Joseph RedmonSantosh DivvalaRoss Girshickand Ali Farhadi

2016

CVPR

We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to…

Previous111-120Next