cff-version: 1.2.0 abstract: "In model driven engineering (MDE), meta-models are the central artifacts. As a complement, the Object Constraint Language (OCL) is a language used to express constraints and operations on meta-models. The Eclipse Modeling Framework (EMF) provides an implementation of OCL, enabling OCL annotated meta-models. Existing empirical studies of the OCL have been conducted on small collections of data. To facilitate empirical research into the OCL on a larger scale, we present the first publicly available data set of OCL expressions. The data set contains 9188 OCL expressions originating from 504 EMF meta-models in 245 systematically selected GitHub repositories. Both the original meta models and the generated abstract syntax trees are included, allowing for a variety of empirical studies of the OCL. To illustrate the applicability of this data set in practice, we performed three case studies." authors: - family-names: Noten given-names: J. (Jeroen) - family-names: Mengerink given-names: J.G.M. (Josh) orcid: "https://orcid.org/0000-0002-1199-2311" - family-names: Serebrenik given-names: Alexander orcid: "https://orcid.org/0000-0002-1418-0095" title: "A Data Set of OCL Expressions on GitHub" keywords: version: 1 identifiers: - type: doi value: 10.4121/uuid:83317fd5-91f5-4e4b-b475-3e2d8ff12d1c license: 4TU General Terms of Use date-released: 2017-08-31