SpiderDec, the decomposed version of the Spider dev data set
doi:10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
doi: 10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8
Datacite citation style:
Salimzadeh, Sara; Gadiraju, Ujwal; Hauff, Claudia; Van Deursen, Arie (2024): SpiderDec, the decomposed version of the Spider dev data set. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
usage stats
49 views
13 downloads
licence
Apache-2.0
SpiderDec is an extension of the Spider dataset. The original Spider dataset is split into a training set, a development set, and a hidden test set. For SpiderDec, we manually decomposed the questions and their corresponding SQL queries in the Spider development set, focusing on those whose SQL queries are rated hard or extra hard.
history
- 2024-07-01 first online, published, posted
publisher
4TU.ResearchData
format
SQL file
associated peer-reviewed publication
Exploring the Feasibility of Crowd-Powered Decomposition of Complex User Questions in Text-to-SQL Tasks
funding
organizations
TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Sciences, Department of Software Technology
AI for Fintech Research Lab at ING Group
DATA
files (1)
- SpiderDec.zip, 39,986 bytes, MD5: 9f4474cdd23cdb956fa0e0392cc43e27
total: 39,986 bytes unzipped
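The MD5 digest listed for SpiderDec.zip can be used to check that a downloaded copy is intact. The following is a minimal sketch of such a check, not part of the dataset itself. The local path `SpiderDec.zip` and the helper names `md5_of_file` and `verify_download` are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# MD5 digest as listed for SpiderDec.zip on this page.
EXPECTED_MD5 = "9f4474cdd23cdb956fa0e0392cc43e27"


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected_md5.lower()


if __name__ == "__main__":
    # Hypothetical local path; adjust to wherever the archive was saved.
    archive = Path("SpiderDec.zip")
    if archive.exists():
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

A mismatch usually indicates an incomplete or corrupted download, or that a newer version of the archive was fetched than the one this digest describes.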