Monitoring
We have 3 monitoring tools for alerting.
- LogDNA: All errors from reOS are combined into batches every 5 minutes and submitted to a slack channel the engineers are part of.
- Sentry: All frontend errors are e-mailed to the engineers.
- Apex Ping: We are notified via Slack whenever there is downtime above 0 minutes.
Logging
If it moves log it, if it doesn't move log it until it does.
- anonymous
All logs are recorded in LogDNA but also kept on Google cloud through Stackdriver. We also log certain services, namely Banking
to Slack to monitor funds being moved around.
Todo
Alerting an on-call engineer if downtime occurs after office hours via an SMS.