A Minimal Span-Based Neural Constituency Parser. Mitchell Stern, Jacob Andreas, Dan Klein UC Berkeley

Similar documents
Syntax and Parsing II

Syntax and Parsing II

Joint Parsing and Translation

Fun Neural Net Demo Site. CS 188: Artificial Intelligence. N-Layer Neural Network. Multi-class Softmax Σ >0? Deep Learning II

Uninformed Search (Ch )

Uninformed Search (Ch )

Princess Nora University Faculty of Computer & Information Systems ARTIFICIAL INTELLIGENCE (CS 370D) Computer Science Department

Potentials for combining walking and public transport at the agglomeration scale

1.1 The size of the search space Modeling the problem Change over time Constraints... 21

Predicting Horse Racing Results with Machine Learning

Predicting Horse Racing Results with TensorFlow

Optimization and Search. Jim Tørresen Optimization and Search

Problem Solving Agents

A Novel Dependency-to-String Model for Statistical Machine Translation

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag

DATA MINING ON CRICKET DATA SET FOR PREDICTING THE RESULTS. Sushant Murdeshwar

EE582 Physical Design Automation of VLSI Circuits and Systems

Deriving an Optimal Fantasy Football Draft Strategy

Complete Streets and Context Classification

i) Linear programming

Phrase-based Image Captioning

Planning and Acting in Partially Observable Stochastic Domains

CT PET-2018 Part - B Phd COMPUTER APPLICATION Sample Question Paper

SEARCH SEARCH TREE. Node: State in state tree. Root node: Top of state tree

SEARCH TREE. Generating the children of a node

Caltrain Bike Parking Management Plan

Introduction to Pattern Recognition

Relative Vulnerability Matrix for Evaluating Multimodal Traffic Safety. O. Grembek 1

Basketball field goal percentage prediction model research and application based on BP neural network

Harmonisation of Work Health and Safety Legislation

Solving MINLPs with BARON. Mustafa Kılınç & Nick Sahinidis Department of Chemical Engineering Carnegie Mellon University

Prudhoe Bay Oil Production Optimization: Using Virtual Intelligence Techniques, Stage One: Neural Model Building

A IMPROVED VOGEL S APPROXIMATIO METHOD FOR THE TRA SPORTATIO PROBLEM. Serdar Korukoğlu 1 and Serkan Ballı 2.

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG

Biomechanics and Models of Locomotion

Benefits of Detailed Compressor Modeling in Optimizing Production from Gas-Lifted Fields. Manickam S. Nadar Greg Stephenson

The Cross-Calibrated Multi-Platform (CCMP) Ocean Vector Wind Analysis (V2.0)

CENG 466 Artificial Intelligence. Lecture 4 Solving Problems by Searching (II)

Generating None-Plans in Order to Find Plans 1

Resource Guide. Copyright 2017 Institute of Certified Management Accountants. Updated 8/30/17

Robot Walking with Genetic Algorithms

Programming For Open Water Swimming Success. Greg Towle National Open Water Coach Swimming Australia

Uninformed search methods II.

A REAL-TIME SIGNAL CONTROL STRATEGY FOR MITIGATING THE IMPACT OF BUS STOPS ON URBAN ARTERIALS

CS472 Foundations of Artificial Intelligence. Final Exam December 19, :30pm

CHAPTER 7 ACCESS MANAGEMENT. Background. Principles of Access Management. Hennepin County Transportation Systems Plan (HC-TSP)

Lucintel. Publisher Sample

TENNIS BC PROVINCIAL REGROUPINGS APPLICATION FORM

VLSI Design 12. Design Styles

Dealing with Dependent Failures in Distributed Systems

Standing Committee on Policy and Strategic Priorities. Mount Pleasant Industrial Area Parking Strategy and Access Improvements

A living laboratory on the cutting edge of Enterprise Scrum

Nuclear Division. Symposium Grand Targhee, Wyoming. 1 September 26, 2018 Curtiss-Wright

Visual Traffic Jam Analysis Based on Trajectory Data

NORTH TURNAROUND. Recommended Design: Expand the existing transit terminal

Neural Nets Using Backpropagation. Chris Marriott Ryan Shirley CJ Baker Thomas Tannahill

Nicolas Tricot & François Esch June 14th 2017

AGA Swiss McMahon Pairing Protocol Standards

Roundabouts In Colorado

beestanbul RoboCup 3D Simulation League Team Description Paper 2012

OIMB GK12 CURRICULUM

Impact of Climate Change on Bees in the Eastern Forest: Diversity and Adaptations of Organisms

Coupling distributed and symbolic execution for natural language queries. Lili Mou, Zhengdong Lu, Hang Li, Zhi Jin

2017 Float and Sensor Workshop. Conveners: Brian King, Steve Riser, Susan Wijffels Local Host: Steve Riser, U. Washington

2016 Tennis Regulations League Section

Product Decomposition in Supply Chain Planning

Caltrain Bike Parking Management Plan

838. Improved aerodynamic optimization for the design of wind turbine blades

INSIDE VOLLEY TENNIS

Pocatello Regional Transit Master Transit Plan Draft Recommendations

U11 Small Sided Competition Guidelines

SAFE HARBOR STATEMENT

GUIDE TO ESTIMATING TOTAL ALLOWABLE CATCH USING SIZE FREQUENCY IN CATCH, EFFORT DATA, AND MPAS

Satoshi Yoshida and Takuya Kida Graduate School of Information Science and Technology, Hokkaido University

Traffic Improvements for the Transit Oriented Development District (TODD) Meriden Public Library March 11, 2014

Pedestrian Behaviour Modelling

University of Notre Dame Department of Finance Economics of the Firm Spring 2012

2018 Grandville Calvin Christian Summer Camps and Descriptions:

A Brief History of the Development of Artificial Neural Networks

3 ROADWAYS 3.1 CMS ROADWAY NETWORK 3.2 TRAVEL-TIME-BASED PERFORMANCE MEASURES Roadway Travel Time Measures

Basic guiding principles. Guiding principles (cont.) 4-2 Offense (setter middle or right) 5-1 Offense. Different types of offensive systems

SPE Fuel gas supply (at the flowstations and gathering centers) and artificial lift gas supply for the lift gas compressors at GC1 are

ROSE-HULMAN INSTITUTE OF TECHNOLOGY Department of Mechanical Engineering. Mini-project 3 Tennis ball launcher

Static Extended Trailing Edge for Lift Enhancement: Experimental and Computational Studies

Intelligent Traffic Light Control Using Distributed Multi-agent Q Learning

The system design must obey these constraints. The system is to have the minimum cost (capital plus operating) while meeting the constraints.

FURY 2D Simulation Team Description Paper 2016

LABORATORY EXERCISE 1 CONTROL VALVE CHARACTERISTICS

Design and Evaluation of Adaptive Traffic Control System for Heterogeneous flow conditions. SiMTraM & CosCiCost2G

U11 Small Sided Competition Guidelines

APPLICATION OF PUSHOVER ANALYSIS ON EARTHQUAKE RESPONSE PREDICATION OF COMPLEX LARGE-SPAN STEEL STRUCTURES

Robust Task Execution: Procedural and Model-based. Outline. Desiderata: Robust Task-level Execution

Irrigation System Renovation. Your golf course. Our commitment.

Case Study. PayPal s Sparkline Case Study. About Sparkline

REACHING EVERY KIWI SKY NETWORK TELEVISION ANNUAL RESULTS 2018

Adaptability and Fault Tolerance

World Handicap System Club Information Session

Introduction to Pattern Recognition

MARKET AND CAPACITY UPDATE. Matthew Marsh September 2016

A Big Data approach to understand Central Banks Big Data Spain 2018

Transcription:

A Minimal pan-based Neural Constituency Parser Mitchell tern, Jacob Andreas, Dan Klein UC Berkeley

Parsing as pan Classification. he enjoys playing tennis

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. he enjoys playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. X he enjoys Y playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Parsing as pan Classification. X he enjoys Y playing tennis he enjoys playing tennis. 0 1 2 3 4 5

Grammar: Minimality in Parsing

Minimality in Parsing Grammar: (decided) (workers) (decided) [Collins (1999)]

Grammar: (decided) Minimality in Parsing (workers) (decided) ^ ^ [Collins (1999)] [Klein and Manning (2003)]

Grammar: (decided) Minimality in Parsing (workers) (decided) ^ ^ [Collins (1999)] [Klein and Manning (2003)] [Hall et al. (2014)]

Grammar: (decided) Minimality in Parsing (workers) (decided) ^ ^ [Collins (1999)] [Klein and Manning (2003)] [Hall et al. (2014)] [Vinyals et al. (2015)]

coring: Minimality in Parsing

Minimality in Parsing coring: score( ^ ^) [Klein and Manning (2003)]

Minimality in Parsing coring: score( ^ ^) score(i, k, j, ) [Klein and Manning (2003)] [Hall et al. (2014)]

Minimality in Parsing coring: score( ^ ^) score(i, k, j, ) score(i, j, ) and score action (i, k, j) [Klein and Manning (2003)] [Hall et al. (2014)] [Cross and Huang (2016)]

Minimality in Parsing coring: score( ^ ^) score(i, k, j, ) score(i, j, ) and score action (i, k, j) score(i, j, ) [Klein and Manning (2003)] [Hall et al. (2014)] [Cross and Huang (2016)] [This work]

Decoding: Minimality in Parsing

Minimality in Parsing Decoding: Chart-based Globally optimal, O(n 3 ) time complexity

Minimality in Parsing Decoding: Chart-based Globally optimal, O(n 3 ) time complexity Transition-based Greedy, O(n) or O(n 2 ) time complexity

Tree coring Function

Tree coring Function he enjoys playing tennis. 0 1 2 3 4 5

Tree coring Function he enjoys playing tennis. 0 1 2 3 4 5

Tree coring Function he enjoys playing tennis. 0 1 2 3 4 5

Tree coring Function he enjoys playing tennis. 0 1 2 3 4 5

Tree coring Function he enjoys playing tennis. 0 1 2 3 4 5

Tree coring Function he enjoys playing tennis. 0 1 2 3 4 5

Dynamic Program: Base Case

Dynamic Program: Base Case Pick best label

Dynamic Program: General Case

Dynamic Program: General Case Pick best label

Dynamic Program: General Case Pick best label Pick best split point

coring Function Implementation [Inspired by Cross and Huang (2016)]

coring Function Implementation he enjoys playing tennis. [Inspired by Cross and Huang (2016)]

coring Function Implementation Bidirectional LTM he enjoys playing tennis. [Inspired by Cross and Huang (2016)]

coring Function Implementation (f j - f i, b i - b j ) (f i, b i ) (f j, b j ) pan Difference Bidirectional LTM he enjoys playing tennis. [Inspired by Cross and Huang (2016)]

coring Function Implementation s s(i, j, X) (f j - f i, b i - b j ) (f i, b i ) (f j, b j ) Feedforward Network pan Difference Bidirectional LTM he enjoys playing tennis. [Inspired by Cross and Huang (2016)]

Training

Training Want for all

Training Want for all Require larger margin for higher loss:

Training Want for all Require larger margin for higher loss: Use hinge penalty function:

Training Use loss-augmented decoding during training:

Training Use loss-augmented decoding during training: Loss-augmented decoding for Hamming loss: Replace with

Initial Results Parser F1 core Hall et al. (2014) 89.2 Vinyals et al. (2015) 88.3 Cross and Huang (2016) 91.3 Dyer et al. (2016) 91.7 Liu and Zhang (2017) 91.7 Our Chart Parser 91.7

Top-Down Parsing. he enjoys playing tennis

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Top-Down Parsing. he enjoys playing tennis he enjoys playing tennis.

Greedily Label and plit

Greedily Label and plit

Greedily Label and plit

Top-Down Training Margin constraint for each decision: score(gold) 1 + score(other)

Top-Down Training Margin constraint for each decision: score(gold) 1 + score(other) Train with exploration using a dynamic oracle [Goldberg and Nivre (2012), Cross and Huang (2016)]

Initial Results Parser F1 core Hall et al. (2014) 89.2 Vinyals et al. (2015) 88.3 Cross and Huang (2016) 91.3 Dyer et al. (2016) 91.7 Liu and Zhang (2016) 91.7 Our Chart Parser 91.7 Our Top-Down Parser 91.6

Extensions

Extensions Label scoring for unary chains: plit unary chains into top-middle-bottom

Extensions Label scoring for unary chains: plit unary chains into top-middle-bottom tructured label loss for unary chains: Hamming distance on labels (vs. 0-1 loss)

Extensions Label scoring for unary chains: plit unary chains into top-middle-bottom tructured label loss for unary chains: Hamming distance on labels (vs. 0-1 loss) plit-based (vs. span-based) scoring: Left-right, concatenate, deep biaffine [Cross and Huang (2016)] [Dozat and Manning (2016)]

Final Results Parser F1 core Hall et al. (2014) 89.2 Vinyals et al. (2015) 88.3 Cross and Huang (2016) 91.3 Dyer et al. (2016) 91.7 Liu and Zhang (2016) 91.7 Our Best Chart Parser 91.8 Our Best Top-Down Parser 91.8

Conclusion

Conclusion A minimal span-based parser can achieve state-ofthe-art results.

Conclusion A minimal span-based parser can achieve state-ofthe-art results. Little is lost going from global to greedy decoding.

Conclusion A minimal span-based parser can achieve state-ofthe-art results. Little is lost going from global to greedy decoding. Various extensions yield only minimal gains beyond the core system.

Thanks!