Personal data from source data sets is part of the git trees
Currently we stored personal data directly in the subfolder, without recognising for the need to not having it available public prior to individual consent.
Since the dataset will be partly made available, we could move all data mangling into its own private repository and only provide the distribution assets from there to the repository here, i.e. by adding it as a submodule or by commiting those here.