Sample playbooks for dealing with common operational issues in production:
Large number of incidents reported to Site-Infra team after work hours are quite easy to mitigate using Ansible Tower and a simple playbook. Common operational issues like: overly verbose logs consuming all avialble disk space, need to truncate or delete large files, restart a database service, etc. are a reality of life that any proffesional service team should be prepared to handle before taking their service to production.
In order to show product teams "how to fish", we have began and will continue creating short and simple playbook samples, showcasing solutions to "oh-so-common" problems, with hope that pro-active teams will use these samples to build their own arsenal of troubleshooting playbooks and are able to mitigate future inciedents quickly and with ease.
Please treat thease as samples, not as prescriptive instructions or unquestionable best practices. We are not responsible for any damage resulting from use of these samples without proper testing :wink: