TY - DATA T1 - Cracks in Steel Bridges (CSB) dataset: data underlying the publication: Loss function inversion for improved crack segmentation in steel bridges using a CNN framework PY - 2024/12/05 AU - Andrii Kompanets AU - Davide Leonetti AU - Remco Duits AU - Bert Snijder UR - DO - 10.4121/6162a9b6-2a20-4600-8207-e9dcd53a264a.v3 KW - image segmentation KW - fatigue crack KW - steel bridge inspection KW - computer vision KW - crack segmentation KW - crack detection N2 -

The presented dataset used for the experiments is described in the article "Loss function inversion for improved crack segmentation in steel bridges using a CNN framework" (doi:https://doi.org/10.1016/j.autcon.2024.105896). The dataset consists of images of steel bridge structures and pixel-wise fatigue crack annotations. Some of the images contain bridge structures with cracks or corrosion, while others capture structures without any defect. 

The images are provided by bridge infrastructure owners "Rijkswatersaat" and "ProRail" and by "Nebest" engineering company. The annotation of images was made using a semi-automatic annotation tool described in the article "Segmentation Tool for Images of Cracks" (doi:https://doi.org/10.1007/978-3-031-35399-4_8) and which implementation is available at https://github.com/akomp22/crack-segmentation-tool.

The dataset consists of high-resolution images and is stored in the folder "entire images". The images are divided into test and train sets. Images that capture cracks are stored in the folder "crack_train" and "crack_test". Images capturing structure without a crack are stored in folders "nocrack_train" and "nocrack_test". For each image, a .json file is stored in the same folder and under the same name as the corresponding image. The .json file stores the position (x,y) of pixels on the image, which lie in a crack region. An example of a code to generate a binary segmentation map from the .json files is given in the "read_json_annotation.py" file.

Additional patch datasets were generated from the entire images. The patch datasets are stored in the “patch dataset” folder. The multiple patch datasets differ by the patch size, number of patches, and fraction of patches that do not contain cracks among all patches of the particular dataset. Furthermore, we provide segmentation maps in file "predictions.rar" for entire test images which are given by the method proposed in our research article.

For more explanations, please refer to the article: https://doi.org/10.1016/j.autcon.2024.105896

ER -