Python Package Index - Pypi
Introduction
Python has a rich and versatile standard library immediately available after installation. In other words, Python follows the batteries included philosophy.
At the same time, Python has an active community that contributes an even more extensive set of packages that users can utilize. Python Package Index (PyPI) is the primary Python software repository. PyPI hosts an extensive collection of packages, including development frameworks, tools, and libraries.
One readily available information about packages stored at PyPi is their download statistics. These could be useful in a variety of research. For example, examining package popularity or looking at connections between community engagements and the number of downloads.
Availability and Limitations
PyPI Stats is the easiest way to access individual package information or monitor ceratin packages for a more extended period. Download stats obtained from the Python Software Foundation’s publicly available data on Google BigQuery. A JSON API is also available. Pypistats Python package provides the API client and CLI tool.
For a more comprehensive study it is preferable to use Google BigQuery. Using PyPI Stats, collecting historical data for every python package or a significant portion could be problematic. For further information visit.
Daily download statistics include:
- Number of downloads
- Used Python Major version
- Used Python Minor version
- Used Operating System
- With/Without mirror
PyPI Stats FAQs addresses some data limitations, such as mirrors and data reliability before 2018 July.
Related works
- An Empirical Analysis of the Python Package Index (PyPI)
- Ecosystem-level determinants of sustained activity in open-source projects: a case study of the PyPI ecosystem
- On the impact of using trivial packages: an empirical case study on npm and PyPI
- A Large-Scale Security-Oriented Static Analysis of Python Packages in PyPI