Cracks in Steel Bridges (CSB) dataset: data underlying the publication: Deep Learning for Segmentation of Cracks in High-Resolution Images of Steel Bridges
The presented dataset used for the experiments is described in the article "Deep Learning for Segmentation of Cracks in High-Resolution Images of Steel Bridges" (doi:https://doi.org/10.48550/arXiv.2403.17725). The dataset consists of images of steel bridge structures and pixel-wise fatigue crack annotations. Some of the images contain bridge structures with cracks or corrosion, while others capture structures without any defect.
The images are provided by bridge infrastructure owners "Rijkswatersaat" and "ProRail" and by "Nebest" engineering company. The annotation of images was made using a semi-automatic annotation tool described in the article "Segmentation Tool for Images of Cracks" (doi:https://doi.org/10.1007/978-3-031-35399-4_8) and which implementation is available at https://github.com/akomp22/crack-segmentation-tool.
The dataset consists of high-resolution images and is stored in the folder "entire images". The images are divided into test and train sets. Images that capture cracks are stored in the folder "crack_train" and "crack_test". Images capturing structure without a crack are stored in folders "nocrack_train" and "nocrack_test". For each image, a .json file is stored in the same folder and under the same name as the corresponding image. The .json file stores the position (x,y) of pixels on the image, which lie in a crack region. An example of a code to generate a binary segmentation map from the .json files is given in the "read_json_annotation.py" file.
Additional patch datasets were generated from the entire images. The patch datasets are stored in the “patch dataset” folder. The multiple patch datasets differ by the patch size, number of patches, and fraction of patches that do not contain cracks among all patches of the particular dataset. Furthermore, we provide segmentation maps in file "predictions.rar" for entire test images which are given by the method proposed in our research article.
For more explanations, please refer to the article: https://doi.org/10.48550/arXiv.2403.17725
- 2024-04-16 first online
- 2024-09-30 published, posted
DATA
- 3,106 bytesMD5:
8b7e8bed0489c6e03efd51cccac0a7a7
README.MD - 10,218,410,544 bytesMD5:
4913d09082b319c2a30c161d6cd0049f
CSB_dataset.rar - 1,528,703 bytesMD5:
b8e3dc52b55482ec45a4d67eced2e67c
predictions.rar - 825 bytesMD5:
40b082b0c7bff22b5f5a2df8a013d649
read_json_annotation.py -
download all files (zip)
10,219,943,178 bytes unzipped