William W. Cohen's Papers: Deep Learning
- Hexiang Hu, Kelvin C.K. Chan, Yu-Chuan Su, Wenhu Chen, Yandong Li, Kihyuk Sohn, Yang Zhao, Xue Ben, William W. Cohen, Ming-Wei Chang, Xuhui Jia (2023): Instruct-Imagen: Image Generation with Multi-modal Instruction in CVPR.
- Accepted as an oral presentation (one of 90 orals out of 11,500 submissions).
- Chung-Ching Chang, William W. Cohen, Yun-Hsuan Sung (2023): Characterizing Tradeoffs in Language Model Decoding with Informational Interpretations in progress.
- Tal Schuster, Adam D. Lelkes, Haitian Sun, Jai Gupta, Jonathan Berant, William W. Cohen, Donald Metzler (2024): SEMQA: Semi-Extractive Multi-Source Question Answering in NAACL-2024.
- Yury Zemlyanskiy, Michiel de Jong, Luke Vilnis, Santiago Ontañón, William W. Cohen, Sumit Sanghai, Joshua Ainslie (2024): MEMORY-VQ: Compression for Tractable Internet-Scale Memory in NAACL-2024.
- Haitian Sun, William W. Cohen, Ruslan Salakhutdinov (2023): Answering Ambiguous Questions with a Database of Questions, Answers, and Revisions in progress.
- Following up on the 'QA is the new KR' paper, we present a new collection of question-answer pairs automatically generated from Wikipedia which are more specific and less ambiguous than the generated questions used in prior work, and show that this collection can be used to answer ambiguous questions. On the challenging ASQA benchmark, which requires generating long-form answers that summarize the multiple answers to an ambiguous question, our method improves performance by 10-15%. The new question DB can also be used to improve diverse passage retrieval.
- Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Sumit Sanghai, William W. Cohen, Joshua Ainslie (2023): GLIMMER: generalized late-interaction memory reranker in progress.
- Wenhu Chen, Hexiang Hu, Yandong Li, Nataniel Ruiz, Xuhui Jia, Ming-Wei Chang, William W. Cohen (2023): Subject-driven Text-to-Image Generation via Apprenticeship Learning in NeurIPS-2023.
- Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Joshua Ainslie, Sumit Sanghai, Fei Sha, William W. Cohen (2023): Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute in ICML-2023.
- Wenhu Chen, Hexiang Hu, Xi Chen, Pat Verga, William W. Cohen (2023): MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text in EACL-2023.
- Haitian Sun, William W. Cohen, Ruslan Salakhutdinov (2022): Reasoning over Logically Interacted Conditions for Question Answering in progress.
- Michiel de Jong, Yury Zemlyanskiy, Joshua Ainslie, Nicholas FitzGerald, Sumit Sanghai, Fei Sha, William Cohen (2023): FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference in ACL-2023 (Findings).
- Wenhu Chen, Hexiang Hu, Chitwan Saharia, William W. Cohen (2023): Re-Imagen: Retrieval-Augmented Text-to-Image Generator in ICLR-2023.
- Julian Martin Eisenschlos, Jeremy R. Cole, Fangyu Liu, William W. Cohen (2023): WinoDict: Probing language models for in-context word acquisition in EACL-2023.
- One of two winners of an Outstanding Paper Award at EACL.
- John Wieting, Jonathan H. Clark, William W. Cohen, Graham Neubig, Taylor Berg-Kirkpatrick (2023): Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval in ACL-2023.
- Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen (2022): Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks in progress.
- Wenhu Chen, William W. Cohen, Michiel De Jong, Nitish Gupta, Alessandro Presta, Pat Verga, John Wieting (2023): QA Is the New KR: Question-Answer Pairs as Knowledge Bases in AAAI-2023.
- Proposes that symbolic KBs can be replaced with a collection of question-answer pairs automatically generated from a corpus, augmented with entity-linking annotations. Like a symbolic KB, this representation is well-suited to structured queries involving joins and aggregation, and can support 'multi-hop' reasoning. However, it has the advantage that the information in it is closely aligned to likely user information needs, as modeled by the question generation process.
- Vidhisha Balachandran, Hannaneh Hajishirzi, William Cohen, Yulia Tsvetkov (2022): Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling in EMNLP-2022.
- Haitian Sun, William W. Cohen, Ruslan Salakhutdinov (2023): Scenario-based Question Answering with Interacting Contextual Properties in ICLR-2023.
- Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor, Livio Baldini Soares, Jacob Eisenstein, Kuzman Ganchev, Jonathan Herzig, Kai Hui, Tom Kwiatkowski, Ji Ma, Jianmo Ni, Tal Schuster, William W. Cohen, Michael Collins, Dipanjan Das, Donald Metzler, Slav Petrov, and Kellie Webster (2022): Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models in progress.
- Wenhu Chen, Pat Verga, Michiel de Jong, John Wieting, William W. Cohen (2022): Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering in EACL-2022.
- Extends the techniques of Mention Memory in several important ways. (1) The memory is a memory of generated question-answer pairs, which is more interpretable than neural entity-mention encodings; (2) it is based on pre-trained T5, not a custom Transformer; and (3) it allows use of the token-level encoding of retrieved QA pairs as well as neural encodings of them for reasoning. Using QA pairs instead of passages allows a clever pre-training trick for learning to retrieve, and the model greatly outperforms a prior similar model (i.e., RePAQ) on smaller QA benchmarks.
- Yi Tay, Vinh Q. Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Gupta, Tal Schuster, William W. Cohen and Donald Metzler (2022): Transformer Memory as a Differentiable Search Index in NeurIPS 2022.
- Siddhant Arora, Danish Pruthi, Norman Sadeh, William W. Cohen, Zachary C. Lipton, Graham Neubig (2022): Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations in AAAI 2022.
- Vidhisha Balachandran and Bhuwan Dhingra and Haitian Sun and Michael Collins and William W. Cohen (2021): Investigating the Effect of Background Knowledge on Natural Questions in DeeLIO-2021.
- Proceedings of Deep Learning Inside Out (DeeLIO): The 2nd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures
- Haitian Sun, William W. Cohen, Ruslan Salakhutdinov (2021): ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers in ACL 2022.
- A novel dataset with (1) long context documents containing information that is related in logically complex ways; (2) multi-hop questions that require compositional logical reasoning. Intended as a more realistic version of ShARC, a QA task considered in 'End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents'.
- Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Fei Sha, William Cohen (2021): Mention Memory: incorporating textual knowledge into Transformers through entity mention attention in ICLR 2021.
- Similar to the Entities-as-Experts model, but uses a much larger memory of entity mentions, which allows the model to potentially provide meaningful provenance for information. The model, called TOME, outperforms Entities-as-Experts on several tasks, and required some non-trivial technical innovations relating to memory pre-training and efficient retrieval.
- Keshav Kolluru, Martin Rezk, Pat Verga, William W. Cohen, Partha Talukdar (2021): Multilingual Fact Linking in AKBC-2021.
- Julian Martin Eisenschlos, Maharshi Gor, Thomas Muller, William W. Cohen (2021): MATE: Multi-view Attention for Table Transformer Efficiency in EMNLP-2021.
- Bhuwan Dhingra, Jeremy R. Cole, Julian Martin Eisenschlos, Daniel Gillick, Jacob Eisenstein, William W. Cohen (2021): Time-Aware Language Models as Temporal Knowledge Bases in preparation.
- Haitian Sun, William W. Cohen, Ruslan Salakhutdinov (2021): End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents in preparation.
- Adapts many of the ideas used for multihop KBQA to a new task - answering multihop questions over a large document. Retrieval steps in this "DocHopper" system retrieve passages of a document, and the retrieved items are combined with a question neurally: i.e., rather than appending text to a question and re-encoding that discrete object, what is retrieved is a vector summary of the document, which is mixed with the previous question encoding. This is fast, fully differentiable, allows retrieval of large document subsections, and gets a new SOTA on three datasets.
- Avishai Zagoury, Einat Minkov, Idan Szpektor, William W. Cohen (2021): What's the best place for an AI conference, Vancouver or ______: Why completing comparative questions is difficult in AAAI2021.
- Haitian Sun, Pat Verga, Bhuwan Dhingra, Ruslan Salakhutdinov, William W. Cohen (2021): Reasoning Over Virtual Knowledge Bases With Open Predicate Relations in ICML2021.
- Modifies the FILM model by using a virtual KB of small text passages containing pairs of entities. This required adding a Matching-the-Blanks pretraining phase, but got strong results on a number of QA-from-corpora tasks.
- Wenhu Chen, Ming-Wei Chang, Eva Schlinger, William Wang, William W. Cohen (2021): Open Question Answering Over Tables and Text in ICLR-2021.
- Answering open QA multi-hop questions over tables and text with a clever "early fusion" idea, which proposes and indexes likely reasoning chains, and uses large-document Transformers to merge these noisy evidence chains.
- Pat Verga, Haitian Sun, Livio Baldini Soares, and William W. Cohen (2021): Adaptable and Interpretable Neural Memory Over Symbolic Knowledge in NAACL-2021.
- Most recent paper on Fact-Injected Language Model (FILM), which includes an Entities-as-Experts style memory of neural entity encodings, plus a second "fact memory" of KG triples. FILM has good results on KBQA tasks, and allows one to use an edited KB without retraining.
- Danish Pruthi, Bhuwan Dhingra, Livio Baldini Soares, Michael Collins, Zachary C. Lipton, Graham Neubig, William W. Cohen (2020): Evaluating Explanations: How Much Do Explanations From the Teacher Aid Students? in preparation.
- Bill Yuchen Lin, Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Xiang Ren, William W. Cohen (2020): Differentiable Open-Ended Commonsense Reasoning in NAACL-2021.
- Extends DrKIT's virtual KB to a corpus of documents of common-sense statements ("facts"). In DrFact, entities are replaced by noisy and ambiguous concepts, and navigation is between documents with overlapping sets of mentions. Also introduces new "open" tasks for common-sense QA.
- Haitian Sun, Andrew O. Arnold, Tania Bedrax-Weiss, Fernando Pereira, William W. Cohen (2020): Faithful Embeddings for Knowledge Base Queries in NeurIPS2020.
- An extension to Neural Query Language (NQL) which extends the query language to work with a "centroid-sketch" representation of sets. The centroid encodes a geometric area, and the sketch is a randomized data structure that adds capacity to the representation, allowing faithful differentiable logical reasoning to be combined with good generalization.
- Pat Verga, Haitian Sun, Livio Baldini Soares, and William W. Cohen (2020): Facts as Experts: Adaptable and Interpretable Neural Memory over Symbolic Knowledge in arxiv.
- Earlier draft of the NAACL paper on FILM (Fact-Injected LM).
- William W. Cohen, Fan Yang, and Kathryn Rivard Mazaitis (2020): TensorLog: A Probabilistic Database Implemented Using Deep-Learning Infrastructure in JAIR.
- Most complete paper on TensorLog, a predecessor of NQL/EmQL that was a Prolog-like logic, not a dataflow query language.
- William W. Cohen, Haitian Sun, R. Alex Hofer, Matthew Siegler (2020): Scalable Neural Methods for Reasoning With a Symbolic Knowledge Base in ICLR-2020.
- Paper on Neural Query Language (NQL), a differentiable dataflow query language. NQL is useful for building KBQA systems that can be trained from denotations, but relies heavily on sparse-matrix operations that are not implemented in all accelerators.
- Bhuwan Dhingra, Manzil Zaheer, Vidhisha Balachandran, Graham Neubig, Ruslan Salakhutdinov, William W. Cohen (2020): Differentiable Reasoning over a Virtual Knowledge Base in ICLR-2020.
- Describes DrKIT, which allows one to answer multihop chain queries on a "virtual KB", i.e., a corpus of entity-linked documents. In DrKIT, entity mentions are indexed for neural retrieval with a rich representation of their context, and reasoning consists of navigating between co-occurring mentions.
- Yifeng Tao, Chunhui Cai, William W. Cohen, Xinghua Lu (2020): From genome to phenome: Predicting multiple cancer phenotypes based on somatic genomic alterations via the genomic impact transformer in PSB-2020.
- Andrew O. Arnold, William W. Cohen (2019): Instance-based Transfer Learning for Multilingual Deep Retrieval in arxiv.
- Qiao Jin, Bhuwan Dhingra, Zhengping Liu, William W Cohen, and Xinghua Lu (2019): PubMedQA: A Dataset for Biomedical Research Question Answering in EMNLP-2019.
- Bhuwan Dhingra, Manaal Faruqui, Ankur Parikh, Ming-Wei Chang, Dipanjan Das, William W. Cohen (2019): Handling Divergent Reference Texts when Evaluating Table-to-Text Generation in ACL-2019.
- William W. Cohen, Haitian Sun, Alex Hofer, Matthew Siegler (2019): Differentiable Representations For Multihop Inference Rules in arxiv.
- Earlier version of ICLR paper on NQL.
- William W. Cohen, Matthew Siegler, Alex Hofer (2019): Neural Query Language: A Knowledge Base Query Language for Tensorflow in arxiv.
- Earlier version of ICLR paper on NQL focusing on the language constructs used.
- Haitian Sun, Tania Bedrax-Weiss, William W. Cohen (2019): PullNet: Open Domain Question Answering with Iterative Retrieval on Knowledge Bases and Text in EMNLP-2019.
- Qiao Jin, Bhuwan Dhingra, William W. Cohen, Xinghua Lu (2019): Probing Biomedical Embeddings from Language Models in NAACL-2019.
- Haohan Wang, Xiang Liu, Yifeng Tao, Wenting Ye, Qiao Jin, William W. Cohen and Eric P. Xing (2019): Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning in Biocomputing.
- Haitian Sun, William W. Cohen, Lidong Bing (2018): Semi-Supervised Learning with Declaratively Specified Entropy Constraints in NIPS-2018.
- Zhilin Yang, Jake (Junbo) Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann LeCun (2018): GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations in NIPS-2018.
- Qiao Jin, Bhuwan Dhingra, William W. Cohen, and Xinghua Lu (2018): AttentionMeSH: Simple, Effective and Interpretable Automatic MeSH Indexer in BioASQ-2018.
- Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William W. Cohen, Ruslan Salakhutdinov, Christopher D. Manning (2018): HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering in EMNLP-2018.
- Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Kathryn Mazaitis, Ruslan Salakhutdinov, and William W. Cohen (2018): Open Domain Question Answering Using Early Fusion of Knowledge Bases and Text in EMNLP-2018.
- Bhuwan Dhingra, Qiao Jin, Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov (2018): Neural Models for Reasoning over Multiple Mentions using Coreference in NAACL-2018.
- Vidhisha Balachandran and Dheeraj Rajagopal and Rose Catherine Kanjirathinkal and William W. Cohen (2018): Learning to Define Terms in the Software Domain in W-NUT 2018.
- Fan Yang, Jiazhong Nie, William W. Cohen, Ni Lao (2017): Learning to Organize Knowledge with N-Gram Machines in arxiv.org/abs/1711.06744.
- Zhilin Yang, Zihang Dai, Ruslan Salakhutdinov, and William W. Cohen (2017): Breaking the Softmax Bottleneck: A High-Rank RNN Language Model in arxiv.org 1711.03953.
- Fan Yang, Zhilin Yang, William W. Cohen (2017): Differentiable Learning of Logical Rules for Knowledge Base Reasoning in NIPS-2017.
- Zihang Dai, Zhilin Yang, William W. Cohen, and Ruslan Salakhutdinov (2017): Good Semi-supervised Learning that Requires a Bad GAN in NIPS-2017.
- William W. Cohen and Fan Yang (2017): TensorLog: Deep Learning Meets Probabilistic Databases in arxiv.org 1707.05390.
- Rose Catherine, Kathryn Mazaitis, Maxine Eskenazi, William W. Cohen (2017): Explainable Entity-based Recommendations with Knowledge Graphs (poster paper) in RecSys-2017.
- Bhuwan Dhingra, Kathryn Mazaitis, William W. Cohen (2017): Quasar: Datasets for Question Answering by Search and Reading in arxiv 1707.03904.
- Bhuwan Dhingra, Zhilin Yang, William W. Cohen, and Ruslan Salakhutdinov (2017): Linguistic Knowledge as Memory for Recurrent Neural Networks in arxiv 1703.02620.
- Bhuwan Dhingra, Hanxiao Liu, Ruslan Salakhutdinov, and William W. Cohen (2017): A Comparative Study of Word Embeddings for Reading Comprehension in arxiv 1703.00993.
- Rose Catherine, William W. Cohen (2017): TransNets: Learning to Transform for Recommendation in RecSys-2017.
- Bhuwan Dhingra, Hanxiao Liu, William W. Cohen, and Ruslan Salakhutdinov (2017): Gated-Attention Readers for Text Comprehension in ACL-2017.
- Zhilin Yang, Junjie Hu, Ruslan Salakhutdinov, William W. Cohen (2017): Semi-Supervised QA with Generative Domain-Adaptive Nets in ACL-2017.
- Zhilin Yang, Bhuwan Dhingra, Ye Yuan, Junjie Hu, William W. Cohen, Ruslan Salakhutdinov (2017): Words or Characters? Fine-grained Gating for Reading Comprehension in ICLR 2017.
- Zhilin Yang, Ruslan Salakhutdinov, William W. Cohen (2017): Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks in ICLR 2017.
- William W. Cohen (2016): TensorLog: A Differentiable Deductive Database in arxiv.org 1605.06523.
- Zhilin Yang, Ye Yuan, Yuexin Wu, Ruslan Salakhutdinov, William W. Cohen (2016): Encode, Review, and Decode: Reviewer Module for Caption Generation in NIPS-2016.
- Bhuwan Dhingra, Zhong Zhou, Dylan Fitzpatrick, Michael Muehl and William W. Cohen (2016): Tweet2Vec: Character-Based Distributed Representations for Social Media in ACL-2016 (short paper).
- William Yang Wang and William W. Cohen (2016): Learning First-Order Logic Embeddings via Matrix Factorization in IJCAI-2016.
- Zhilin Yang, Jie Tang, and William W. Cohen (2016): Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs in IJCAI-2016.
- Zhilin Yang, Ruslan Salakhutdinov, William Cohen (2016): Revisiting Semi-Supervised Learning with Graph Embeddings in ICML-2016.
- Zhilin Yang, Ruslan Salakhutdinov, William Cohen (2016): Multi-Task Cross-Lingual Sequence Tagging from Scratch in arxiv 1603.06270.