The Maven Dependency Dataset
Datacite citation style:
Steven Raemaekers; van Deursen, A. (Arie); Joost Visser (2013): The Maven Dependency Dataset. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/uuid:68a0e837-4fda-407a-949e-a159546e67b6
Dataset
The Maven Dependency Dataset contains the data as described in the paper "Mining Metrics, Changes and Dependencies from the Maven Dependency Dataset".
NOTE: See the README.TXT file for more information on the data in this dataset.
The dataset consists of multiple parts:
- a snapshot of the Maven repository dated July 30, 2011 (maven.tar.gz);
- a MySQL database (complete.tar.gz) containing information on individual methods, classes and packages of different library versions;
- a Berkeley DB database (berkeley.tar.gz) containing metrics on all methods, classes and packages in the repository;
- a Neo4j graph database (graphdb.tar.gz) containing a call graph of the entire repository;
- scripts and analysis files (scriptsAndData.tar.gz);
- source code and a binary package of the analysis software (fullmaven-sources.jar and fullmaven.jar);
- text dumps of the data in these databases (graphdump.tar.gz, processed.tar.gz, calls.tar.gz and units.tar.gz).
history
- 2013-01-10 first online, published, posted
publisher
Software Engineering Research Group (SERG), TU Delft
format
media types: application/java-archive, application/x-tar-gz, text/html
organizations
Software Engineering Research Group (SERG), TU Delft; Software Improvement Group (SIG), Amsterdam; TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Science, Department of Software Technology
DATA
files (12)
- README.TXT (5,286 bytes), MD5: 1dfb7d3371198ac2380f01217de58cbe
- berkeley.tar.gz (19,519,425,071 bytes), MD5: 10f86bc1d3c3bb0f6e3fa0b0cd936d5a
- calls.tar.gz (1,049,626,301 bytes), MD5: 04038daad0a69a6c50f5dac6f909285c
- complete.tar.gz (54,581,015 bytes), MD5: e9d2a9609bf80b24cdd7ff52f20371d7
- fullmaven-sources.jar (44,378,988 bytes), MD5: 8d6e41551591e9445f0cecc8fe0d2008
- fullmaven.jar (48,254,185 bytes), MD5: 12b8b7f114d4a231f06f9d1dc8e351f3
- graphdb.tar.gz (4,491,341,089 bytes), MD5: d8a36080d343bdb130de782b9ac20df6
- graphdump.tar.gz (395,701,529 bytes), MD5: d74b6b4b13b63fb841328ca7fdc0c617
- maven.tar.gz (273,070,009,909 bytes), MD5: 4f820e763fd0ab9792bc57e5973cd0e4
- processed.tar.gz (1,394,053,103 bytes), MD5: b454704c4e6dbac0a550deef82d8268f
- scriptsAndData.tar.gz (46,872,095 bytes), MD5: 69fce692dfe9c7e84676e40d719dc524
- units.tar.gz (458,418,799 bytes), MD5: 8ba8a3bc6c94247bf3f67f5b053e4d43
300,572,667,370 bytes unzipped
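Because each file is published with an MD5 digest and several archives are many gigabytes in size, it can be worth checking a download before unpacking it. The following is a minimal sketch of such a check, not part of the dataset's own tooling; the function names md5_of_file and verify_md5 are hypothetical and chosen here for illustration. The file is read in chunks so that large archives do not have to fit in memory.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected_md5):
    """Return True if the file's MD5 equals the expected digest (case-insensitive)."""
    return md5_of_file(path) == expected_md5.strip().lower()
```

To use it, pass the path of a downloaded file together with the MD5 string listed for that file; a result of False suggests an incomplete or corrupted download.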