William W. Cohen's Papers: Matching/Data Integration
- William Yang Wang, Kathryn Mazaitis, Ni Lao, Tom Mitchell, and William W. Cohen (2015): Efficient Inference and Learning in a Large Knowledge Base: Reasoning with Extracted Information using a Locally Groundable First-Order Probabilistic Logic in Machine Learning, 2015.
- Bhavana Dalvi, Einat Minkov, Partha P. Talukdar, and William W. Cohen (2015): Automatic Gloss Finding for a Knowledge Base using Ontological Constraints in WSDM-2015.
- Jay Pujara, Hui Miao, Lise Getoor and William W. Cohen (2014): Using Semantics & Statistics to Turn Data into Knowledge in AI Magazine 2014.
- William Yang Wang, Kathryn Mazaitis, William W. Cohen (2013): Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic in CIKM-2013.
- Honorable Mention for Best Paper at CIKM-2013
- William Yang Wang, Kathryn Mazaitis, William W. Cohen (2013): Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic in arxiv 1305.2254.
- William Yang Wang, Kathryn Mazaitis, William W. Cohen (2013): Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic in ICML 2103 Workshop on Inferning.
- Einat Minkov and William W. Cohen (2010): Improving Graph-Walk Based Similarity with Reranking: Case Studies for Personal Information Management in TOIS-2010.
- William W. Cohen, Natalie Glance, Charles Schafer, Roy Tromble, Yuk Wah Wong (2009): Data Integration for Many Data Sources using Context-Sensitive Similarity Metrics in limbo.
- Einat Minkov and William Cohen (2007): Learning to Rank Typed Graph Walks: Local and Global Approaches in WebKDD-2007.
- Sarah Zelikovitz, William Cohen, and Haym Hirsh (2007): Extending WHIRL with background knowledge for improved text classification in Information Retrieval 10(1) pp 35-67.
- Einat Minkov and William W. Cohen (2006): An Email and Meeting Assistant using Graph Walks in CEAS-2006.
- Einat Minkov, Andrew Ng and William W. Cohen (2006): Contextual Search and Name Disambiguation in Email using Graphs in SIGIR-2006.
- William W. Cohen (2006): A Graph-Search Framework for GeneId Ranking (Extended Abstract) in BioNLP'06.
- William W. Cohen & Einat Minkov (2006): A Graph-Search Framework for Associating Gene Identifiers with Documents in BMC Bioinformatics.
- Einat Minkov, Richard Wang & William Cohen (2004): Extracting Personal Names from Emails: Applying Named Entity Recognition to Informal Text in NAACL-2005.
- Pradeep Ravikumar, William W. Cohen, Stephen E. Fienberg (2004): A Secure Protocol for Computing String Distance Metrics in PSDM-2004.
- Pradeep Ravikumar & William W. Cohen (2004): A Hierarchical Graphical Model for Record Linkage in UAI 2004.
- William W. Cohen & Sunita Sarawagi (2004): Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods in KDD 2004: 89-98.
- Mikael Bilenko, Ray Mooney, William W. Cohen, Pradeep Ravikumar & Steve Fienberg (2003): Adaptive Name-Matching in Information Integration in IEEE Intelligent Systems 18(5): 16-23 (2003).
- William W. Cohen, Pradeep Ravikumar & Stephen Fienberg (2003): A Comparison of String Metrics for Matching Names and Records in KDD Workshop on Data Cleaning and Object Consolidation.
- William W. Cohen, Pradeep Ravikumar & Stephen Fienberg (2003): A Comparison of String Distance Metrics for Name-Matching Tasks in IIWeb 2003: 73-78.
- William W. Cohen & Jacob Richman (2002): Learning to Match and Cluster Large High-Dimensional Data Sets For Data Integration in KDD 2002: 475-480.
- William W. Cohen & Jacob Richman (2001): Learning to Match and Cluster Entity Names in Proc. of the ACM SIGIR-2001 Workshop on Mathematical/Formal Methods in IR.
- Chumki Basu, Haym Hirsh, William W. Cohen & Craig Neville-Manning (2001): Technical Paper Recommendation: A Study in Combining Multiple Information Sources in J. Artif. Intell. Res. (JAIR) 14: 231-252 (2001).
- William W. Cohen, David McAllester, and Henry Kautz (2000): Hardening Soft Information Sources in KDD 2000: 255-259.
- William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in Computer Networks 33(1-6): 685-698 (2000).
- William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in WWW 2000.
- William W. Cohen (2000): Data Integration using Similarity Joins and a Word-based Information Representation Language in ACM Trans. Inf. Syst. 18(3): 288-321 (2000).
- William W. Cohen (1999): What Can We Learn from the Web in ICML 1999.
- William W. Cohen (1999): Recognizing Structure in Web Pages using Similarity Queries in AAAI/IAAI 1999: 59-66.
- William W. Cohen (2000): WHIRL: A Word-based Information Representation Language in Artif. Intell. 118(1-2): 163-196 (2000).
- William W. Cohen and Wei Fan (1999): Learning Page-Independent Heuristics for Extracting Data from Web Pages in Computer Networks 31(11-16): 1641-1652 (1999).
- William W. Cohen and Wei Fan (1999): Learning Page-Independent Heuristics for Extracting Data from Web Pages in WWW 1999.
- William W. Cohen (1999): Reasoning about Textual Similarity in a Web-Based Information Access in Autonomous Agents and Multi-Agent Systems 2(1): 65-86 (1999).
- William W. Cohen (1999): A Demonstration of WHIRL (demonstration abstract) in SIGIR 1999: 327.
- William W. Cohen (1999): Some Practical Observations on Integration of Web Information in WebDB (Informal Proc.) 1999: 55-60.
- William W. Cohen (1998): The WHIRL Approach to Information Integration in IEEE Intelligent Systems, Sept/Oct 1998, pp 20--23.
- William W. Cohen & Haym Hirsh (1998): Joins that Generalize: Text Classification Using WHIRL in KDD 1998: 169-173.
- William W. Cohen (1998): Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity in SIGMOD Conference 1998: 201-212.
- Won Ten-Year "Test of Time" Award at SIGMOD 2008
- William W. Cohen (1998): Providing Database-like Access to the Web Using Queries Based on Textual Similarity (demonstration abstract) in SIGMOD Conference 1998: 558-560.
- William W. Cohen (1998): A Web-based Information System that Reasons with Structured Collections of Text in Agents 1998: 400-407.
- William W. Cohen (1998): The WHIRL Approach to Integration: An Overview in IIWeb 1998 (informal proceedings).
- William W. Cohen (1997): Knowledge Integration for Structured Information Sources Containing Text (Extended Abstract) in SIGIR Workshop on Networked IR (informal proceedings).
[Selected papers| By topic: GNAT System| Retrieval Augmented LMs| Applications| Collaborative Filtering| Intelligent Tutoring| Explanation-Based Learning| Formal Results| Learning in Graphs| Inductive Logic Programming| Neural Knowledge Representation| Topic Modeling| Matching/Data Integration| Deep Learning| Prediction-powered inference| Rule Learning| Text Categorization| Info Extraction/Reading/QA| By year: All papers]