Papers

Learn more about AI2's Lasting Impact Award
Viewing 501-510 of 991 papers
  • Evaluating the Evaluation of Diversity in Natural Language Generation

    Guy Tevet, Jonathan BerantEACL2021 Despite growing interest in natural language generation (NLG) models that produce diverse outputs, there is currently no principled method for evaluating the diversity of an NLG system. In this work, we propose a framework for evaluating diversity metrics…
  • First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT

    Benjamin Muller, Yanai Elazar, Benoît Sagot, Djamé SeddahEACL2021 Multilingual pretrained language models have demonstrated remarkable zero-shot crosslingual transfer capabilities. Such transfer emerges by fine-tuning on a task of interest in one language and evaluating on a distinct language, not seen during the fine…
  • CLIPScore: A Reference-free Evaluation Metric for Image Captioning

    Jack Hessel, Ari Holtzman, Maxwell Forbes, R. L. Bras, Yejin ChoiEMNLP2021 Image captioning has conventionally relied on reference-based automatic evaluations, where machine captions are compared against captions written by humans. This is in contrast to the reference-free manner in which humans assess caption quality. In this paper…
  • Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines (preprint)

    Saadia Gabriel, Skyler Hallinan, Maarten Sap, Pemi Nguyen, Franziska Roesner, Eunsol Choi, Yejin ChoiACL2021 Even to a simple and short news headline, readers react in a multitude of ways: cognitively (e.g., inferring the writer's intent), emotionally (e.g., feeling distrust), and behaviorally (e.g., sharing the news with their friends). Such reactions are…
  • Contrasting Contrastive Self-Supervised Representation Learning Pipelines

    Klemen Kotar, Gabriel Ilharco, Ludwig Schmidt, Kiana Ehsani, R. MottaghiIEEE/CVF International Conference on Computer Vision (ICCV)2021 In the past few years, we have witnessed remarkable breakthroughs in self-supervised representation learning. Despite the success and adoption of representations learned through this paradigm, much is yet to be understood about how different training methods…
  • GridToPix: Training Embodied Agents with Minimal Supervision

    Unnat Jain, Iou-Jen Liu, S. Lazebnik, Aniruddha Kembhavi, Luca Weihs, A. SchwingICCV2021 While deep reinforcement learning (RL) promises freedom from hand-labeled data, great successes, especially for Embodied AI, require significant work to create supervision via carefully shaped rewards. Indeed, without shaped rewards, i.e., with only terminal…
  • “I’m Not Mad”: Commonsense Implications of Negation and Contradiction

    Liwei Jiang, Antoine Bosselut, Chandra Bhagavatula, Yejin ChoiNAACL2021 Natural language inference requires reasoning about contradictions, negations, and their commonsense implications. Given a simple premise (e.g., “I’m mad at you”), humans can reason about the varying shades of contradictory statements ranging from…
  • Learning Curves for Analysis of Deep Networks

    Derek Hoiem, Tanmay Gupta, Zhizhong Li, Michal Shlapentokh-Rothman arXiv2021 A learning curve models a classifier's test error as a function of the number of training samples. Prior works show that learning curves can be used to select model parameters and extrapolate performance. We investigate how to use learning curves to analyze…
  • Visual Semantic Role Labeling for Video Understanding

    Arka Sadhu, Tanmay Gupta, Mark Yatskar, R. Nevatia, Aniruddha Kembhavi CVPR2021 We propose a new framework for understanding and representing related salient events in a video using visual semantic role labeling. We represent videos as a set of related events, wherein each event consists of a verb and multiple entities that fulfill…
  • Visual Room Rearrangement

    Luca Weihs, Matt Deitke, Aniruddha Kembhavi, R. MottaghiarXiv2021 There has been a significant recent progress in the field of Embodied AI with researchers developing models and algorithms enabling embodied agents to navigate and interact within completely unseen environments. In this paper, we propose a new dataset and…