Process Discovery Contest 2025

DOI:10.4121/7212a73a-1eac-4a08-8c01-973dca020822.v1
The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future. For a link that will always point to the latest version, please use
DOI: 10.4121/7212a73a-1eac-4a08-8c01-973dca020822

Datacite citation style

Eric Verbeek (2025): Process Discovery Contest 2025. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/7212a73a-1eac-4a08-8c01-973dca020822.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite

Dataset

This folder contains the data set as was used for the Process Discovery Contest of 2025 (PDC 2025).

The data set contains 288 training logs, 96 corresponding test logs and base logs, 96 corresponding

ground truth logs, and 96 models. The logs are all stored using the IEEE XES file format (see either

https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858), while the models are

workflow nets (a subclass of Petri nets) stored in the PNML fileformat (see

https://www.iso.org/obp/ui/#iso:std:iso-iec:15909:-2:ed-1:v1:en).


History

  • 2025-09-08 first online, published, posted

Publisher

4TU.ResearchData

Format

IEEE XES, ISO PNML

Organizations

Task Force on Process Mining (https://tf-pm.org)

DATA

Files (6)