The Qualitas Corpus is a curated collection of software systems intended to be used for empirical studies of code artefacts. The primary goal is to provide a resource that supports reproducible studies of software. The current release of the Corpus contains open-source Java software systems, often in multiple versions.
The current release is version 20101126. It contains 106 systems, 13 of which have 10 or more versions, and 585 versions in total. There are two main distributions: the "r" (recent) release, containing the most recent version we have of each system (106 systems), and the "e" (evolution) release, containing all versions of the 13 systems with 10 or more versions, a total of 414 versions. Other distributions are also available. In publications that use the Corpus, please either cite the specific release that was used, or cite the APSEC paper and give the release identifier.
Updated: 26-Aug-2011, Managed by Ewan Tempero