Social Science Computing Unit - Budapest
A unit dedicated to supporting high-quality, reproducible and extendable social science research. We intend to use and extend open-source tools that have matured and gained popularity mainly in commercial data analysis and data science.
Here is what we are doing:
Providing a Platform for Exploration
We created a prototype Data Exploration Catalog, where we publish datasets together with some basic analysis and possible avenues of research. The datasets in the catalog can be subsets of larger sets, or even intermediate outputs of research projects, which researchers can freely download and test ideas on. Most of them are updated periodically. If an idea explored on a smaller subset looks promising but the complexity or scale of the full dataset makes it hard to pursue, we can assist in implementing a solution that produces results on the full set.
Engineering Reproducible Research
Formalized, machine-actionable pipelines in which every intermediate step can be checked out and reused with minimal effort and without duplicating code. A list of these can be found on the projects page.
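To make the idea of reusable intermediate steps concrete, here is a minimal Python sketch. It is not the unit's own library or any specific tool's API: the `run_step` helper, the step names and the JSON storage format are all hypothetical, chosen only for illustration. Each step writes its output to disk, and a later run picks up the stored result instead of recomputing it.

```python
import json
import tempfile
from pathlib import Path


def run_step(name, func, inputs, workdir):
    """Run one pipeline step, or reuse its stored output if it already exists.

    Hypothetical helper: results are keyed by step name only, so this
    sketch does not detect changed inputs or changed code.
    """
    out_path = Path(workdir) / f"{name}.json"
    if out_path.exists():
        # The intermediate result is already on disk: reuse it.
        return json.loads(out_path.read_text())
    result = func(*inputs)
    out_path.write_text(json.dumps(result))
    return result


def clean(raw):
    # Drop missing observations.
    return [value for value in raw if value is not None]


def summarize(rows):
    # Compute a small summary of the cleaned data.
    return {"n": len(rows), "mean": sum(rows) / len(rows)}


workdir = tempfile.mkdtemp()  # stand-in for a versioned project directory
raw = [1, None, 2, 3]         # made-up example data

cleaned = run_step("clean", clean, [raw], workdir)
summary = run_step("summarize", summarize, [cleaned], workdir)
```

Anyone with access to the working directory can load `clean.json` directly and build on it without rerunning or copying the cleaning code. A production pipeline would also need to track inputs and code versions so that stale results are invalidated, which this sketch deliberately leaves out.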
Developing Research Software
A tested, documented main library that handles most of the above:
Open Source Contributions
Contributions to closely or loosely related projects, such as aswan, colassigner and dvc.