cff-version: 1.2.0
message: "If you use this data set, please cite it using the metadata from this file."
abstract: "In model-driven engineering (MDE), meta-models are the central artifacts. As a complement, the Object Constraint Language (OCL) is a language used to express constraints and operations on meta-models. The Eclipse Modeling Framework (EMF) provides an implementation of OCL, enabling OCL-annotated meta-models. Existing empirical studies of the OCL have been conducted on small collections of data. To facilitate empirical research into the OCL on a larger scale, we present the first publicly available data set of OCL expressions. The data set contains 9188 OCL expressions originating from 504 EMF meta-models in 245 systematically selected GitHub repositories. Both the original meta-models and the generated abstract syntax trees are included, allowing for a variety of empirical studies of the OCL. To illustrate the applicability of this data set in practice, we performed three case studies."
authors:
  - family-names: Noten
    given-names: J. (Jeroen)
  - family-names: Mengerink
    given-names: J.G.M. (Josh)
    orcid: "https://orcid.org/0000-0002-1199-2311"
  - family-names: Serebrenik
    given-names: Alexander
    orcid: "https://orcid.org/0000-0002-1418-0095"
title: "A Data Set of OCL Expressions on GitHub"
type: dataset
version: 1
identifiers:
  - type: doi
    value: 10.4121/uuid:83317fd5-91f5-4e4b-b475-3e2d8ff12d1c
license: 4TU General Terms of Use
date-released: 2017-08-31