Operations
Setup
The monitoring projects themselves are created by the Terraform project factory. However, some steps have yet to be automated and need to be carried out manually. Specifically:
- The non-prod projects have been added as monitored projects.
- The Google Ops Agent has been deployed on all instances, using Terraform.
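If the monitored-project step were moved into Terraform, it might look roughly like the sketch below. This is an illustration only, not the configuration used here: the variable names `scoping_project_id` and `non_prod_project_ids` are hypothetical, and it assumes the Google provider's `google_monitoring_monitored_project` resource.

```hcl
# Hypothetical inputs: the monitoring (scoping) project and the
# non-prod projects whose metrics it should see.
variable "scoping_project_id" {
  type        = string
  description = "Project that hosts the metrics scope (assumed name)."
}

variable "non_prod_project_ids" {
  type        = set(string)
  description = "Non-prod projects to add as monitored projects (assumed name)."
}

# One monitored-project entry per non-prod project.
resource "google_monitoring_monitored_project" "non_prod" {
  for_each = var.non_prod_project_ids

  metrics_scope = var.scoping_project_id
  name          = each.value
}
```

Using `for_each` over a set keeps each project as its own resource instance, so adding or removing one project does not disturb the others.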
Alerting Policies and Uptime Checks
- An alerting policy has been configured to monitor the URL that points to the external load balancer. It currently sends email notifications to the org admins group.
- CPU, memory and disk alerts have been set, each triggering at a high threshold.
- An uptime check has been defined, which alerts if the Ghost service is unavailable.
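For reference, an uptime check with an email alert can be expressed in Terraform along the lines below. This is a minimal sketch under stated assumptions, not the configuration actually deployed: the variable names, display names, HTTPS on port 443, the `/` path and the one-minute period are all placeholders.

```hcl
# Hypothetical inputs; none of these names or values come from this page.
variable "monitoring_project_id" {
  type = string
}

variable "ghost_host" {
  type        = string
  description = "Hostname that resolves to the external load balancer."
}

variable "org_admins_email" {
  type        = string
  description = "Address of the org admins group."
}

# Probe the Ghost URL over HTTPS once a minute (assumed settings).
resource "google_monitoring_uptime_check_config" "ghost" {
  display_name = "ghost-uptime-check"
  timeout      = "10s"
  period       = "60s"

  http_check {
    path         = "/"
    port         = 443
    use_ssl      = true
    validate_ssl = true
  }

  monitored_resource {
    type = "uptime_url"
    labels = {
      project_id = var.monitoring_project_id
      host       = var.ghost_host
    }
  }
}

# Email channel for the org admins group.
resource "google_monitoring_notification_channel" "org_admins" {
  display_name = "Org admins"
  type         = "email"
  labels = {
    email_address = var.org_admins_email
  }
}

# Alert when more than one checker location reports a failed check.
resource "google_monitoring_alert_policy" "ghost_uptime" {
  display_name = "Ghost uptime check failing"
  combiner     = "OR"

  conditions {
    display_name = "Uptime check failing"

    condition_threshold {
      filter = join(" AND ", [
        "metric.type=\"monitoring.googleapis.com/uptime_check/check_passed\"",
        "resource.type=\"uptime_url\"",
        "metric.label.check_id=\"${google_monitoring_uptime_check_config.ghost.uptime_check_id}\"",
      ])
      comparison      = "COMPARISON_GT"
      threshold_value = 1
      duration        = "60s"

      aggregations {
        alignment_period     = "1200s"
        per_series_aligner   = "ALIGN_NEXT_OLDER"
        cross_series_reducer = "REDUCE_COUNT_FALSE"
        group_by_fields      = ["resource.label.*"]
      }

      trigger {
        count = 1
      }
    }
  }

  notification_channels = [google_monitoring_notification_channel.org_admins.name]
}
```

Counting failures across checker locations, rather than alerting on a single failed probe, is one way to reduce noise from transient regional network issues. The threshold of one location is an assumption and can be tuned.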
Custom Dashboard
A custom dashboard has been created, showing some key indicators.