Inspired by Simon Willison.

Download a file regularly using Git Actions, check it into Git and see how it changes over time.

Scraping Project

Projects Notes Updated
Toronto Lobbyist Registry Data Daily at 21:30 UTC ie 5:30 pm EST
(The data source is updated around 4:30 pm everyday)
City of Toronto Contracts Data Daily at midnight UTC
Ontario Marriage Officiants Daily at midnight UTC
City of Toronto Short Term Rentals Registration Daily at 16:30 UTC ie 12:30 pm EST
(The data source is updated around 12 pm everyday)
WHOIS TLDs
  • Downloads a list of WHOIS servers for top-level domains
  • Unlike other git-scaping projects, updates to the repo require me to manually approve a PR as opposed to being automatically merged in
Daily at midnight UTC
Ontario Legislature Bills
  • Downloads a list of bills for each session of the Ontario legislature since the 36th Parliament, 1st Session (September 26, 1995)
  • This is not particularly useful. I wrote this code so I see a list of private bills, but since I have it I thought why not hook it up to a GitHub action
Monthly on the 1st

Future scraping pages:

TODO: