Skip to content

Latest commit

 

History

History
5 lines (3 loc) · 465 Bytes

README.md

File metadata and controls

5 lines (3 loc) · 465 Bytes

illegible-us is a scraper for the hearing archive of the Senate Select Committee on Intelligence (SSCI)

the scraper collects hearing-related media (PDF documents and video) and metadata (location, time, witnesses, media-associated metadata).

illegible-us is written in node and has a number of dependencies beyond npm's scope: ffmpeg, youtube-dl, exiftool, and puppeteer (a headless chrome; fwiw i'd prefer puppeteer-firefox but setting up proxy is too annoying)