The Santa Cruz Sluicing Dataset
Pranav Anand, Daniel Hardt, James McCloskey
August 2020
 

This paper describes a new research resource -- a searchable database of 4700 naturally occurring instances of sluicing in English, annotated so as to shed light on the questions which have shaped research on ellipsis since the 1960's. The paper describes the dataset and how it can be obtained, how it was constructed, how it is organized, and how it can be queried. It also highlights some initial empirical findings, first describing general characteristics of the data, then focusing more closely on issues concerning antecedents and possible mismatches between antecedents and ellipsis sites
Format: [ pdf ]
Reference: lingbuzz/005673
(please use that when you cite this article)
Published in: to appear
keywords: ellipsis sluicing english corpus annotation, semantics, syntax
Downloaded:112 times

 

[ edit this article | back to article list ]