Hisbl-mongod-02 Re-sync

h1. Issue Details
h2. Summary
Hisbl-mongod-02 got sudden high incoming query request on November 14th. Because of this, the mongo was suddenly stuck and could not handle any traffic, such as insert, update, delete, and sync with primary.

h2. Chronology

Timezone: GMT +7

h3. 2017-11-14

h3. 2017-11-15

h3. 2017-11-16

h3. 2017-11-17

h2. Symptoms

h2. Impact
First of all, the pricing operation team and other product managers were disrupted because they could not check or analyze pricing correctness. Even though it had been mentioned in the code for secondaryReadPreference = true, the queries were still pointed to hisbl-mongod-02, and resulted in failed data retrievals.

Other than that, this mongo turned to the stale mode when trying to sync with mongo primary. It was caused by late-handling mitigation. The incident was realized on the morning around 09:40, but the the first system already started the day before at 18:30; which are more than 12 hours. Because of that, the secondary failed to sync with mongo primary.

h2. Mitigation


h2. References