%0 Generic %A Salimzadeh, Sara %A Gadiraju, Ujwal %A Hauff, Claudia %A Van Deursen, Arie %D 2024 %T SpiderDec, the decomposed version of the Spider dev data set %U %R 10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1 %K Text-to-SQL %K Semantic Parsing %K Natural Language Interface to Databases %K Corpus Annotation %X

SpiderDec is an extension of the Spider Dataset. The original Spider dataset split the data into training, development, and a hidden test set. For this new dataset, we manually decomposed the questions and corresponding queries within the development set of the Spider dataset, focusing on those with hard and extra hard SQL queries. The result of this effort is the creation of SpiderDec.

%I 4TU.ResearchData