Campbell, Matthew: Scaling to a Million Machines with Prometheus (PromCon 2016): https://promcon.io/2016-berlin/talks/scaling-to-a-million-machines-with-prometheus Consul: Secure service networking: https://consul.io Docker: Enterprise container platform: https://www.docker.com Grafana: The open observability platform: https://grafana.com/ Graphite: An enterprise-ready monitoring tool that runs equally well on cheap hardware or a cloud infrastructure: https://graphiteapp.org/ InfluxDB: A time-series database designed to handle high write and query loads: https://www.influxdata.com/products/influxdb-overview Nagios: The industry standard In IT infrastructure monitoring: https://www.nagios.org Prometheus: Configuration options: ...