
Social Science Computing Unit - Budapest

A unit dedicated to aiding the production of high-quality, reproducible, and extensible social science research. We intend to use and extend open-source tools that have matured and gained popularity mainly in commercial data analysis and data science.


Here is what we are doing:

Providing a Platform for Exploration

We created a Data Exploration Catalog prototype, where we publish datasets and present some basic analysis and possible avenues of research. The datasets in the exploration catalog can be subsets of larger sets, or even intermediate outputs of research projects, that researchers can freely download and test ideas on. Most of them update periodically. If an idea explored on a smaller subset looks promising, but the complexity or scale of the full dataset makes it hard to pursue, we can assist in implementing a solution that produces results on the full set.

Engineering Reproducible Research

We build formalized, machine-actionable pipelines in which every intermediate step can be checked out and reused with minimal effort and without duplicating code. A list of these can be found on the projects page.
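As a rough illustration of the idea, the following minimal Python sketch stores the output of every pipeline step in a file, so that a later run (or another researcher) can pick up any intermediate result without recomputing it. It is not the unit's actual tooling: the helper `run_step`, the step functions, and the file layout are all hypothetical and chosen only for this example.

```python
import json
import tempfile
from pathlib import Path


def run_step(name, func, inputs, workdir):
    """Run `func` on `inputs` unless its output is already stored (hypothetical helper)."""
    out_path = Path(workdir) / f"{name}.json"
    if out_path.exists():
        # Reuse the stored intermediate result instead of recomputing it.
        return json.loads(out_path.read_text())
    result = func(*inputs)
    out_path.write_text(json.dumps(result))
    return result


def load_raw():
    # Stand-in for reading a real dataset.
    return [3, 1, 4, 1, 5, 9, 2, 6]


def keep_large(values):
    # Intermediate step: filter the raw values.
    return [v for v in values if v >= 3]


def summarize(values):
    # Final step: compute simple summary statistics.
    return {"count": len(values), "total": sum(values)}


def run_pipeline(workdir):
    raw = run_step("raw", load_raw, [], workdir)
    large = run_step("large", keep_large, [raw], workdir)
    return run_step("summary", summarize, [large], workdir)


workdir = tempfile.mkdtemp()
summary = run_pipeline(workdir)
```

After one run, `raw.json`, `large.json` and `summary.json` exist in `workdir`, and each can be loaded on its own. This sketch only checks whether an output file exists; it does not detect changed inputs or code, which is the kind of dependency tracking that dedicated tools such as dvc provide.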

Developing Research Software

A tested, documented main library that handles most of the above. It has published documentation, code quality and test coverage reports (Code Climate, Codecov), and releases on PyPI.

Open Source Contributions

We contribute to closely or loosely related projects, such as aswan, colassigner, dvc and others.