Process Discovery Contest 2017
doi:10.4121/14625948.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
doi: 10.4121/14625948
doi: 10.4121/14625948
Datacite citation style:
J. (Josep) Carmona; Massimiliano de Leoni; Benoît Depaire; Toon Jouck (2021): Process Discovery Contest 2017. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/14625948.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
This is the data set that was used for the Process Discovery Contest of 2017 (PDC 2017). The data set contains 10 training logs, 10 corresponding test logs, and 10 corresponding ground truth logs. The logs are all stored using the IEEE XES file format (see either https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858).
In each ground truth log, the additional boolean “pdc:isPos” attribute denotes whether the trace is positive (fits the model, true) or negative (does not fit the model, false).
In each ground truth log, the additional boolean “pdc:isPos” attribute denotes whether the trace is positive (fits the model, true) or negative (does not fit the model, false).
history
- 2021-05-21 first online, published, posted
publisher
4TU.ResearchData
format
IEEE XES
organizations
Task Force on Process Mining (https://tf-pm.org)
DATA
files (4)
- 717 bytesMD5:
60887e34ed3fce1e27f6b3bd7a02de75
readme.txt - 14,193 bytesMD5:
e050651edf3903e71411500abf9efd07
Ground Truth Logs.zip - 13,285 bytesMD5:
5146828a16b1c0ac8c114bed3701f15d
Test Logs.zip - 284,840 bytesMD5:
fa3ed944b6190eb431c1285a43bd4c4e
Training Logs.zip -
download all files (zip)
313,035 bytes unzipped