SpiderDec, the decomposed version of the Spider dev data set

doi:10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future. For a link that will always point to the latest version, please use
doi: 10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8
Datacite citation style:
Salimzadeh, Sara; Gadiraju, Ujwal; Hauff, Claudia; Arie Van Deursen (2024): SpiderDec, the decomposed version of the Spider dev data set. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset

SpiderDec is an extension of the Spider Dataset. The original Spider dataset split the data into training, development, and a hidden test set. For this new dataset, we manually decomposed the questions and corresponding queries within the development set of the Spider dataset, focusing on those with hard and extra hard SQL queries. The result of this effort is the creation of SpiderDec.

history
  • 2024-07-01 first online, published, posted
publisher
4TU.ResearchData
format
SQL file
funding
organizations
TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Sciences, Department of Software Technology
AI for Fintech Research Lab at ING Group

DATA

files (1)