[PSA] Automatic Incident Creation for AWS ALB Healthy Host Count

What is this about?

To improve our automatic incident detection, the BEI team will add a centralized Datadog system monitor for the ALB healthy host count metric (aws.applicationelb.healthy_host_count). The monitor will automatically create a Datadog incident and page the respective team when the ALB healthy host count is < 1.
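For anyone curious what such a monitor might look like, here is a rough sketch of a Datadog monitor definition built in Python. Only the metric name and the `< 1` threshold come from this announcement. The evaluation window, the aggregators, the `loadbalancer` grouping tag, the monitor name, and the message text are illustrative assumptions, not the BEI team's actual configuration.

```python
import json

# Metric name taken from the announcement.
METRIC = "aws.applicationelb.healthy_host_count"


def build_monitor_definition(window="last_5m", group_by=("loadbalancer",), threshold=1):
    """Build a hypothetical Datadog metric monitor payload.

    Assumptions (not from the announcement): the evaluation window,
    the max/min aggregators, the grouping tag, the name, and the message.
    """
    groups = ",".join(group_by)
    # max over the window: alert only if every point in the window is below the threshold.
    query = f"max({window}):min:{METRIC}{{*}} by {{{groups}}} < {threshold}"
    return {
        "name": "[Sketch] ALB healthy host count below 1",  # hypothetical name
        "type": "query alert",
        "query": query,
        "message": "ALB has no healthy hosts. Paging the owning team.",  # placeholder text
        "options": {"thresholds": {"critical": threshold}},
    }


if __name__ == "__main__":
    print(json.dumps(build_monitor_definition(), indent=2))
```

With the defaults above, the generated query evaluates each load balancer separately, so one ALB dropping to zero healthy hosts would alert without being masked by healthy ALBs elsewhere.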

Who is this announcement for?

All teams whose services run behind an AWS Application Load Balancer (ALB).

Impacted AWS Accounts

What do you need to do?

As of now, no action is required on your side.

Timeline

Questions/Concerns?

If you have any questions or concerns, feel free to reach out to @U7FC1KYER, @U02UU1QE7, and @U08G58KL3 in #C03A4ENFK, #techops-<vertical>, or directly in this thread.