Are there any external (open) data we could use to enrich the analysis?

Hey @KKulma,

you asked this question in the title within you comments of the `ost_analysis.ipynb` notebook! I think it's worth starting a separate issue on this.The most important missing is about usage of open source projects. In combination with the analysis we already did this could help us to identify projects that are used a lot but do not have enough support. We could get this data in two way:

## Dependents and Dependencies 
For some programming languages GitHub integrated dependency trees into the platform. For Python and Java Script we still get this data. Here an example: 
https://github.com/pvlib/pvlib-python

The data mining script extracts this data from the website, because GitHub has not integrated this data into the API. If you are interested into the data we could still plot it. We also have it for the dependencies. We could create a plot like "The most used Python dependencies used in the listed GitHub projects".

## Download Statistics
Since most projects are in R and Python we could use the Package index platforms to get this data:

This package could get use the data for R:
https://github.com/GuangchuangYu/dlstats

This packages could help use get the download number from Python:
https://github.com/hugovk/pypistats 
https://github.com/asadmoosvi/pypi-search

The problem is how to find out the package name via the repo URL as input?
One projects exists, that gives you that:
https://github.com/librariesio/bibliothecary

Maybe we also could regex for pip install in the README. That could be simple work around that works with most projects.
What is your opinion?




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Are there any external (open) data we could use to enrich the analysis? #19

Dependents and Dependencies

Download Statistics

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Are there any external (open) data we could use to enrich the analysis? #19

Description

Dependents and Dependencies

Download Statistics

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions