directorate for applied fediverse research


  • independently tries to verify fediverse statistics
  • draws conclusions from that


Currently the script starts from a seed instance and queries /api/v1/instance/peers to find the servers it is peering with. For each peer it has not seen before, it does the same and additionally queries /api/v1/instance for metadata.
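The crawl described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual script: the names `crawl` and `fetch_json` are hypothetical, only the two endpoint paths come from the description, and for simplicity the sketch also asks the seed itself for metadata. The `fetch` parameter is injectable so the logic can be tried without network access.

```python
import json
import urllib.request
from collections import deque


def fetch_json(url, timeout=10):
    """Fetch a URL and decode its JSON body (hypothetical helper)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def crawl(seed, fetch=fetch_json):
    """Breadth-first crawl over announced peers, starting from `seed`.

    Returns a dict mapping each visited host to its /api/v1/instance
    metadata, or to the string "error" when that request failed.
    """
    seen = {seed}
    queue = deque([seed])
    results = {}
    while queue:
        host = queue.popleft()
        # Ask the instance which servers it peers with.
        try:
            peers = fetch(f"https://{host}/api/v1/instance/peers")
        except Exception:
            peers = []
        if not isinstance(peers, list):
            peers = []
        # Ask for metadata; this endpoint is voluntary, so failures are expected.
        try:
            results[host] = fetch(f"https://{host}/api/v1/instance")
        except Exception:
            results[host] = "error"
        # Queue every peer that has not been seen before.
        for peer in peers:
            if isinstance(peer, str) and peer not in seen:
                seen.add(peer)
                queue.append(peer)
    return results
```

Calling `crawl("some.instance.example")` would walk every reachable announced peer, so a real run probably wants a limit on the number of hosts and a polite delay between requests.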

This method is a bit lacking because providing /api/v1/instance is voluntary and specific to later versions of Mastodon within the ActivityPub fediverse. We should study the methodology of other fediverse statistics projects for better results.

When a request to a given instance fails, the script currently just records it as ‘error’.
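One possible way to keep more detail than a bare ‘error’ is to store the exception type, its message and, where available, the HTTP status. The sketch below assumes Python's standard `urllib`; `describe_error` and `fetch_instance` are hypothetical names, not functions from the existing script.

```python
import json
import urllib.error
import urllib.request


def describe_error(exc):
    """Turn an exception into a small JSON-serializable record."""
    record = {"error": type(exc).__name__, "message": str(exc)}
    # HTTPError carries the status code returned by the server.
    if isinstance(exc, urllib.error.HTTPError):
        record["status"] = exc.code
    return record


def fetch_instance(host, opener=urllib.request.urlopen, timeout=10):
    """Return instance metadata, or an error record if the request fails."""
    url = f"https://{host}/api/v1/instance"
    try:
        with opener(url, timeout=timeout) as response:
            return json.loads(response.read().decode("utf-8"))
    except (OSError, ValueError) as exc:
        # OSError covers URLError, HTTPError and timeouts;
        # ValueError covers malformed JSON and bad encodings.
        return describe_error(exc)
```

The returned record can be written into the results file in place of the plain string, so that timeouts, DNS failures and HTTP 404 responses can be told apart later.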

The latest scrape results (rerun in May 2019) can be found in instance_scrape.json.


  • add a detailed error message to the JSON when we get one
  • abstract the functions so we can multithread them
  • find a way to also scrape for instances that don’t announce themselves
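
For the multithreading item, one option is to keep each per-instance request in a single function and hand it to a thread pool. The sketch below uses `concurrent.futures` from the standard library; `scrape_many` and the `scrape_one` callback are hypothetical names, and the worker count of 16 is an arbitrary assumption.

```python
from concurrent.futures import ThreadPoolExecutor


def scrape_many(hosts, scrape_one, max_workers=16):
    """Run `scrape_one(host)` for every host on a thread pool.

    Returns a dict mapping each host to its result. A host whose
    scrape raises is mapped to a small error record instead.
    """
    hosts = list(hosts)

    def safe(host):
        # Catch errors per host so one failure does not abort the batch.
        try:
            return scrape_one(host)
        except Exception as exc:
            return {"error": type(exc).__name__, "message": str(exc)}

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as `hosts`.
        return dict(zip(hosts, pool.map(safe, hosts)))
```

Threads suit this workload because the time is spent waiting on the network rather than computing, but the crawl frontier still has to be fed to the pool in batches, since new hosts are only discovered as results come back.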