Prometheus 1.8 and Grafana 4.6.2 upgrades
This release of prometheus-support includes minor version upgrades to the Prometheus and Grafana servers, as well as an update of the bigquery-exporter to v0.3.
This release also includes multiple bigquery exporter query updates: ipv6 bais, ndt server metrics, and ndt test counts (which was running in mlab-oti as-hoc).
Resource changes:
- the blackbox exporter CPU alloc is now 1 CPU to resolve suspected overload.
Alert changes:
- ParserDailyVolumeTooLow added to track the pipeline daily volume.
- DownloaderDownOrMissing added to report if the downloader is not running at all. Coincides with the production deployment of downloader.
- ScraperMostRecentArchivedFileTimeIsTooOld will now fire only after 56 hours (instead of 36) to allow for rsyncd config updates. This is to reduce redundancy with the ParserDailyVolumeTooLow alert and reduce the frequency of this scraper alert, which is currently our most common one, which has a reputation for auto-closing without doing anything else.