This is a collection of scripts to download, convert, sort and clean submissions to the Zero Carbon Bill so that they can be analysed.
The scripts are written in Ruby and Python:
- python download.py to download the PDFs listed in urls.json
- text.rb to convert the PDFs in pdfs/ to text
- sorter.rb to sort the text submissions into types
- clean_text.rb to turn submissions that follow the online submission format into JSON files
The pdfs and txt folders are ignored by git, but the sorted folder is committed.
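For readers who want a feel for the download step, here is a minimal sketch of what a script like download.py might do. It is not the repository's actual code: it assumes urls.json holds a flat JSON array of PDF URLs, and the function names (`load_urls`, `target_path`, `fetch_bytes`, `download_all`) are hypothetical.

```python
import json
import os
import urllib.request
from urllib.parse import urlparse


def load_urls(path="urls.json"):
    """Read the list of URLs. Assumption: the file is a flat JSON array of strings."""
    with open(path) as handle:
        return json.load(handle)


def target_path(url, out_dir="pdfs"):
    """Map a URL to a local file path, using the last path segment as the file name."""
    name = os.path.basename(urlparse(url).path) or "submission.pdf"
    return os.path.join(out_dir, name)


def fetch_bytes(url):
    """Fetch the body of a URL over HTTP(S)."""
    with urllib.request.urlopen(url) as response:
        return response.read()


def download_all(urls, out_dir="pdfs", fetch=fetch_bytes):
    """Save each URL into out_dir, skipping files that already exist."""
    os.makedirs(out_dir, exist_ok=True)
    saved = []
    for url in urls:
        path = target_path(url, out_dir)
        if os.path.exists(path):
            continue  # already downloaded on an earlier run
        with open(path, "wb") as handle:
            handle.write(fetch(url))
        saved.append(path)
    return saved
```

Calling `download_all(load_urls())` would fill pdfs/ and return the paths written. Skipping existing files makes the script safe to re-run after an interrupted download, and the `fetch` parameter lets the logic be exercised without network access.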