• Research
  • Teaching
  • Data (current)
  • Contact


  • PPDB 2.0: Most recent release of the paraphrase database
    Download Paper Citation

    PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification

    @InProceedings{PavlickEtAl-2015:ACL:Semantics,
      author =  {Ellie Pavlick and Pushpendre Rastogi and Juri Ganitkevich and Ben Van Durme, Chris Callison-Burch},
      title =   {PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification}
      booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015)},
      month     = {July},
      year      = {2015},
      address   = {Beijing, China},
      publisher = {Association for Computational Linguistics},
      }
      
  • SimplePPDB: Subset of PPDB customized for performing text simplification
    Download Paper Citation

    Simple PPDB: A Paraphrase Database for Simplification

    @InProceedings{PavlickAndCallisonBuch-2016:ACL:Simple,
      author =  {Ellie Pavlick and Chris Callison-Burch},
      title =   {Simple PPDB: A Paraphrase Database for Simplification},
      booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)},
      month     = {August},
      year      = {2016},
      address   = {Berlin, Germany},
      publisher = {Association for Computational Linguistics},
    }
    
  • Add-One RTE data: 5,560 RTE sentence pairs involving the insertion of a single adjective.
    Download Paper Citation

    Most babies are little and most problems are huge: Compositional Entailment in Adjective Nouns

    @article{PavlickAndCallisonBurch-2016:ACL:Adjectives,
      author =  {Ellie Pavlick and Chris Callison-Burch},
      title =   {Most baies are little and most problems are huge: Compositional Entailment in Adjective Nouns},
      booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)},
      month     = {August},
      year      = {2016},
      address   = {Berlin, Germany},
      publisher = {Association for Computational Linguistics},
    }
    
  • Human Paraphrase Judgements: Phrase pairs scored on a 5-point scale
    Download Paper Citation

    PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification

    @InProceedings{PavlickEtAl-2015:ACL:Reranking,
      author =  {Ellie Pavlick and Pushpendre Rastogi and Juri Ganitkevich and Ben Van Durme, Chris Callison-Burch},
      title =   {PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification}
      booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015)},
      month     = {July},
      year      = {2015},
      address   = {Beijing, China},
      publisher = {Association for Computational Linguistics},
      }
      
  • Human Lexical Entailment Judgements: Phrase pairs classified based on natural logic relations
    Download Paper Citation

    Adding Semantics to Data-Driven Paraphrasing

      @InProceedings{pavlick-EtAl:2015:ACL-IJCNLP,
      author    = {Pavlick, Ellie  and  Bos, Johan  and  Nissim, Malvina  and  Beller, Charley  and  Van Durme, Benjamin  and  Callison-Burch, Chris},
      title     = {Adding Semantics to Data-Driven Paraphrasing},
      booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)},
      month     = {July},
      year      = {2015},
      address   = {Beijing, China},
      publisher = {Association for Computational Linguistics},
      pages     = {1512--1522},
      url       = {http://www.aclweb.org/anthology/P15-1146}
      }
      
  • FrameNet+: Expanded FrameNet LU index, built via automatic paraphrasing and crowdsourcing
    Download Paper Citation

    FrameNet+: Fast Paraphrastic Tripling of FrameNet

    @InProceedings{pavlick-EtAl:2015:ACL-IJCNLP2,
      author    = {Pavlick, Ellie  and  Wolfe, Travis  and  Rastogi, Pushpendre  and  Callison-Burch, Chris  and  Dredze, Mark  and  Van Durme, Benjamin},
      title     = {FrameNet+: Fast Paraphrastic Tripling of FrameNet},
      booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
      month     = {July},
      year      = {2015},
      address   = {Beijing, China},
      publisher = {Association for Computational Linguistics},
      pages     = {408--413},
      url       = {http://www.aclweb.org/anthology/P15-2067}
    }
    
  • Style Lexicons: Human and automatic scores of formality and complexity for words, phrases, and sentences
    Download Paper Citation

    Inducing Lexical Style Properties for Paraphrase and Genre Differentiation

    @InProceedings{pavlick-nenkova:2015:NAACL-HLT,
      author    = {Pavlick, Ellie  and  Nenkova, Ani},
      title     = {Inducing Lexical Style Properties for Paraphrase and Genre Differentiation},
      booktitle = {Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
      month     = {May--June},
      year      = {2015},
      address   = {Denver, Colorado},
      publisher = {Association for Computational Linguistics},
      pages     = {218--224},
      url       = {http://www.aclweb.org/anthology/N15-1023}
    }
    
  • Formality Annotations: Sentence-level formality annotations for four different genres
    Download Paper Citation

    An Empirical Analysis of Formality in Online Communication

    Please cite both of the below papers:
    
    @article{PavlickAndTetreault-2016:TACL,
      author =  {Ellie Pavlick and Joel Tetreault},
      title =   {An Empirical Analysis of Formality in Online Communication},
      journal = {Transactions of the Association for Computational Linguistics},
      year =    {2016},
      publisher = {Association for Computational Linguistics}
    }
    
    @article{Lahiri-2015:arXiv,
      title={{SQUINKY! A} Corpus of Sentence-level Formality, Informativeness, and Implicature},
      author={Lahiri, Shibamouli},
      journal={arXiv preprint arXiv:1506.02306},
      year={2015}
    }
    
  • Bilingual dictionaries in 100 languages: High-confidence translations collected via crowdsourcing
    Download Paper Citation

    The Language Demographics of Amazon Mechanical Turk

    @article{Pavlick-EtAl-2014:TACL,
       author =  {Ellie Pavlick and Matt Post and Ann Irvine and Dmitry Kachaev and Chris Callison-Burch},
       title =   {The Language Demographics of {Amazon Mechanical Turk}},
       journal = {Transactions of the Association for Computational Linguistics},
       volume =  {2},
       number =  {Feb},
       year =    {2014},
       pages = {79--92},
       publisher = {Association for Computational Linguistics},
       url = {http://cis.upenn.edu/~ccb/publications/language-demographics-of-mechanical-turk.pdf}
     }
    
  • Code for extracting dictionaries: Code and data for building dictionaries. Download this file if you want to change the default quality thresholds, or if you are interested in the demographic information about our MTurk translators.
    Download Paper Citation

    The Language Demographics of Amazon Mechanical Turk

    @article{Pavlick-EtAl-2014:TACL,
       author =  {Ellie Pavlick and Matt Post and Ann Irvine and Dmitry Kachaev and Chris Callison-Burch},
       title =   {The Language Demographics of {Amazon Mechanical Turk}},
       journal = {Transactions of the Association for Computational Linguistics},
       volume =  {2},
       number =  {Feb},
       year =    {2014},
       pages = {79--92},
       publisher = {Association for Computational Linguistics},
       url = {http://cis.upenn.edu/~ccb/publications/language-demographics-of-mechanical-turk.pdf}
     }
    

Copyright © Ellie Pavlick 2017