CSCI 2952-D: Computational Semantics

Course Goals

Natural language understanding is a holy grail of AI. And with the machine learning advancing at such a rapid pace, breakthroughs in automatic language understanding seem to be just around the corner. But what exactly are the current barriers in automating human-like language capabilities? This course will dissect what makes language understanding so challenging, including both theoretical aspects (logic, formal semantics, pragmatics, knowledge representation) and practical methods (graphical models, game theory, neural networks). The course will be project-based, and will emphasize reading and critiquing current research in computer science, linguistics, and cognitive science.

Learning Objectives

The focus of this course is on understanding the state of the art in compoutational representations of natural language. The material is intended to provide a survey of theories of semantic representations and the ways that those theories have been operationalized as computational models for automatic natural language understanding. The goal of the course is to help you develop the following:

A deep understanding of the theoretical basis for the representations in use in modern NLP systems, including the linguistic and congitive science models to which they relate
The ability to compare representations to one another on a fundamental level, in terms of the assumptions they make about language and the world

This is not meant as a deep learning course or an engineering course. Although there will be applied assignments, the goal of the course is not to teach you how to build NLP systems. This is a graduate seminar with a heavy reading component–you’ll be expected to do the required reading and should be prepared to fill in background information on your own when needed. Class time will be spent discussing connections between papers and comparing current theories and models at a high level, not explaining the technical details of individual papers.

Prerequisites

Machine Learning (CSCI 1420) or Computational Linguistics (CSCI 1460)

Grading

Points

Grades will be based primarily on assignments and participation in in-class discussions. There will be no quizzes or exams. There will be a final project consisting of a coding and a written component, worth 20% of the final grade. The final grade will be out of 100 points, as follows. Grading rubrics will allow for fractional points.

Deliverable	Points
HW1: Word Vectors	15
HW2: Semantic Parsing	15
HW3: Neural Sentence Representations	15
HW4: Computational Pragmatics	15
Discussion Questions	10
Discussion Lead/Deep Dive	10
Final Project	20

Rubrics will be released at the same time as the assignment is released. We (Instructor and TA) reserve the right to alter the rubric after assignments are turned in if needed, but only if it is in the students’ favor (e.g. down-weighting a topic ex post facto if it appears we didn’t prepare the students sufficiently well). Don’t bank on this happening, though, i.e. no colluding to make it seem like you all understand things less well than you do. :)

Regrade Policy

Regrades will be available upon request and with thorough justification. Students who want their assignment regraded must email the me with an explanation of why the regrade is necessary. If approved, the student will come to the my office hours and walk through the problem and the regrade in person. Even if I accept the justification and aggrees to review the assignment in office hours, it does not guarentee that the grade will be changed.

Deadlines and Late Day Policy

Assignments are due at 11:59pm EST on their listed due date. Discussion questions for readings are due at 11:59 the day before the lecture in which the readings will be covered. Students are allowed 4 “no questions asked” late days each which can be used at any point during the semester (all on one assignment, or distributed across assignments). Late days can only be used on the programming assignments (HW1, HW2, HW3, and HW4). Late days cannot be used on the final project, or on the discussion-related deliverables (questions or deep-dive writeup) since these aren’t relevant unless completed before the day of the discussion. Note that there is leniency in the grading to allow for weeks when students fall behind on reading (see Assignments section).

Working in Groups

Regular assignments must be completed individually–i.e. every student must turn in their own assignment. It is expected and encouraged that you all discuss the assignments and work through ideas and concepts together, but the work that each student turns in must be their own. If there is doubt about whether or not an assignment was completed independently, the student will be asked to meet with me to walk through their code and demonstrate a line-by-line understanding of the assignment that they submitted.

The final project can be completed individually or in groups of two. Groups are expected to undertake two people worth of work.

Final Grades

Points will be allocated as laid out in the individual assignment/project rubrics, and final grades will be determined based on overall points. Grades will not be curved to fit a particular distribution. In other words, you are evaluated relative to our learning objectives, not relative to each other. Letter grades will be assigned based on the point values below. To save time and energy, there will be no “rounding up”, since it happens that, no matter where the line is, someone is always just below it. The one exception will be for students who are on the border between F and F+. In this case, we will entertain arbitrarily complex arguments for why the student deserves an F+, because, at that point, why not.

Grade	Points
A	[93, 100]
A-	[90, 93)
B+	[87, 90)
B	[83, 87)
B-	[80, 83)
C+	[77, 80)
C	[73, 77)
C-	[70, 73)
D+	[67, 70)
D	[63, 67)
D-	[60, 63)
F+	[57, 60)
F	[0, 57)

Assignments and Deliverables

There will be four technical assignments throughout the semester, which will make up 60% of the grade (15 points each). In addition, students are expected to do the readings and participate in the in-class discussions. The specific deliverables for the course are described below.

HW1: Word Vectors

Due September 25, 11:59pm

This assignment focuses on building and manipulating distributional representations of words–i.e. representations or words as vectors determined by the context in which the word is used. There will be two pieces: one looking at sparse, high-dimensional vectors and one looking at dense, low-dimensional vectors.

HW2: Semantic Parsing

Due October 11, 11:59pm

Description and rubric TDB

HW3: Neural Sentence Representations

Due October 30, 11:59pm

Description and rubric TDB

HW4: Computational Pragmatics

Due November 15, 11:59pm

Description and rubric TDB

Discussion Questions

Due: Bi-Weekly (11:59pm the night before each lecture)

You are expected to do the readings each week and participate in in-class discussions. To incentivize this, we will ask you to submit at least one discussion question per reading before the class in which the reading will be discussed. These questions should focus on high-level, conceptual questions (e.g. questions regarding assumptions made by a model, or limitations of a technical approach) rather than simply clarification questions (unless the clarification questions are especially salient/relevant to understanding the work). Questions will be submitted via a Google form linked from the website alongside the reading.

It is okay not to submit questions for every reading. Grades will be assigned hollistically at the end of the semester. Every student who submits questions for at least 70% of the readings will recieve 7/10 for the discussion questions. The remaining 3 points will be assigned based on the quality and depth of the questions and/or student’s in-class contributions. Submitting questions for 100% of the readings will not necessarily guarentee you 10/10. On weeks that you are not able to complete the readings, please just own it and leave the form blank rather than trying to BS it. (Yes, I’ll be able to tell. State-of-the-art language models can produce questions that sound beautifully fluent but are devoid of content. Own your humanity. Don’t be automatable.)

Discussion Lead/Deep Dive

Due: Varies (11:59pm night before chosen paper is discussed)

In addition to submitting questions weekly, each student will serve as a “discussion lead” for one paper during the semester. This will require choosing a paper to read in depth and submit a short (two page) write up describing the main contributions of the work and situating it in relation to the material discussed in the course so far. Students will be expected to lead the discussion on the day that their paper is being covered. Students can choose which paper they want to lead and should pick the topic that they find most exciting and interesting. It is okay if multiple students are “leading” on the same paper, however we reserve the right to cap the number of students on a paper if things get out of hand. Students will not give a formal in-class presentation of their paper, and grading will be based on the quality of the two-page writeup and/or the student’s in-class contributions on they day of their discussion.

Final Project

Due December 17, 2:00pm (Final Exam Time)

For the final project, you will devise and implement an approach to solving one of the 2019 SemEval Shared Tasks. These represent a range of challenging problems for modern NLP, and will allow students to apply the ideas of representation discussed in the course to a current open research problem. The final deliverable will be an 8 page conference-style description of the motivation, approach, and results. Code must also be submitted, although the grade will be based on the final writeup. Grades on the final project will follow the spirit of the course: i.e. good final projects will devise an approach to the problem that is well-motivated and situated in relation to the semantic theories discussed in the course. It is less important that students acheive state-of-the-art performance on the task.

I will entertain custom final project proposals if students have ideas of projects/problems they are particularly excited to pursue that don’t fit within the SemEval tasks. However, designing your own final project will mean developing a clear train-dev-test loop (with data and evaluation metric) akin to what is provided by SemEval. Students who want to pitch a custom final project must have it approved by me before November 17. There is a 100% chance I will not approve the idea on the first meeting, so please talk to me well before the 17th so you have time to iterate on the concept.

Diversity and Inclusion

This is a discussion-based course, in which the goal is to debate and critically analyze models and methods that are currently at the very forefront of NLP research. This class will only work if everyone is open-minded, respectful, and willing to take all ideas seriously. This means giving every student’s opinions equal attention and consideration, regardless of their background (social or academic). No one here (myself included!) knows everything there is to know about this topic–that is why we are studying it. It is expected that you feel over your head at some point: struggling to understand a paper doesn’t mean you don’t belong in this class. Don’t judge your colleagues or yourself for finding material difficult.

Your ability to learn and thrive in this class is my priority, and anything that prevents you from being fully present, physically and mentally, is my concern. Everyone should feel comfortable speaking freely both in class and outside of class (e.g. working in groups on assignments). If, for any reason–academic, personal, social, health, or other–you do not feel like you are able to be your unabridged, best self, please talk to me.

Course Schedule

September 6: Overview

September 11: Distributional Semantics Crash Course

Homework 1 Assigned: Distributional Word Representations
A Synopsis of Linguistic Theory, 1930-1955 (Firth, 1957)
From Frequency to Meaning: Vector Space Models of Semantics (Turney and Patel, 2010)
Distributed Representations of Words and Phrases,and their Compositionality (Mikolov et al, 2013)

September 13: Model Theory and Compositionality

Model Theory (Stanford Encyclopedia of Philosophy)
Frege’s Theory of Sense and Denotation (Stanford Encyclopedia of Philosophy)

September 18: Montague Semantics and Generative Grammar

Pages XX-XX of Semantics in Generative Grammar (Heim and Kratzer, 1998)
[Optional/Supplementary] Montague Semantics (Stanford Encyclopedia of Philosophy)

October 30: Evaluation: What is a good task?

Homework 3 Due: Neural Sentence Representations
The PASCAL Recognising Textual Entailment Challenge (Dagan et al., 2006)
Inherent Disagreements in Human Judgements of Entailment (Pavlick and Kwiatkowski, 2018) (Yes, sorry, I am selfishly making you read one of my own papers)

November 1: TBD

Ellie gone for EMNLP, Arun’s Day

November 6: Intro to Pragmatics and the Cooperative Principle

Homework 4 Assigned: Computational Pragmatics
Logic and Conversation (Grice, 1975)
Implicature (Stanford Encyclopedia of Philosophy)

November 8: Bayesian Models of Pragmatics

Pragmatic language interpretation as probabilistic inference (Goodman and Frank, 2016)
Learning in the Rational Speech Acts Model (Monroe and Potts, 2015)

November 13: Prototype Theory and Default Logic

Principles of Categorization (Rosch, 1978)
A Tutorial on Default Logics (Antoniou, 1999)

November 15: Frames and Scripts

Homework 4 Due: Computational Pragmatics
Frame Semantics (Fillmore, 1981)
Excerpts from Scripts, Plans, Goals, and Understanding: An Inquiry Into Human Knowledge Structures (Shanck, 1977)

November 20: Language and Vision

From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions (Young et al., 2014)
Visually Grounded Meaning Representations (Silberer et al., 2016)

November 22: Situated Language Learning

Learning Language Games through Interaction (Wang et al., 2016)
Understanding Grounded Language Learning Agents (Hill et al., 2017)

November 27, November 29, December 4: Miscellaneous Topics in Knowledge Representation

TBD based on students’ interests. Some possible readings below.
Extensional Versus Intuitive Reasoning: The Conjunction Fallacy in Probability Judgment (Tversky and Kahnemann, 1983)
Mental Models and Human Reasoning (Johnson-Laird, 2010)
Cognitive Grammar (Langacker, 2008)
Spatial language and spatial representation (Hayward and Tarr, 1995)
How Linguistic Metaphor Scaffolds Reasoning (Thibodeau et al. 2017)
This Summer’s Twitter Fight (Twitterverse, 2018)

CSCI 2952-D: Computational Semantics

Course Goals

Learning Objectives

Prerequisites

Grading

Points

Regrade Policy

Deadlines and Late Day Policy

Working in Groups

Final Grades

Assignments and Deliverables

HW1: Word Vectors

HW2: Semantic Parsing

HW3: Neural Sentence Representations

HW4: Computational Pragmatics

Discussion Questions

Discussion Lead/Deep Dive

Final Project

Diversity and Inclusion

Course Schedule

September 6: Overview

September 11: Distributional Semantics Crash Course

September 13: Model Theory and Compositionality

September 18: Montague Semantics and Generative Grammar

September 20: Lexicons (Nothing but Nets :))

September 25: CCG and Semantic Parsing

September 27: Probabilistic Logics

October 2: Elsewhere in Symbolic Representations of Semantics: DRT and AMR

October 4: Matrix Factorization, Word Embeddings Revisited

October 9: Deep Learning for NLP

October 11: Language Modeling for Representation Learning

October 16: Multilingual Data for Representation Learning

October 18: Distributional Compositional Semantics

October 23: Combining Distributional and Logical Approaches

October 25: Evaluation: What is a good representation?

October 30: Evaluation: What is a good task?

November 1: TBD

November 6: Intro to Pragmatics and the Cooperative Principle

November 8: Bayesian Models of Pragmatics

November 13: Prototype Theory and Default Logic

November 15: Frames and Scripts

November 20: Language and Vision

November 22: Situated Language Learning

November 27, November 29, December 4: Miscellaneous Topics in Knowledge Representation