Major Incident Review - Unable to Send and Receive Emails to/from Outside

Major Incident Causing Product - Active Directory DNS Service
Affected Users - All email users except Office 365 users
Date - February 26, 2019
Tests Performed
      Scenario 1 - Send a test email from inside the organization - Failed
      Scenario 2 - Send a test email from outside the organization - Failed
      Scenario 3 - Check the Exchange servers' queues - Emails sent to outside email addresses were held in the queue
      Scenario 4 - Run nslookup for www.yahoo.com against SVHQADS01 / 10.0.225.44 - nslookup failed
      Scenario 5 - Run nslookup for www.yahoo.com against SVHQADS02 / 10.0.225.22 - nslookup successful
Why it Happened - The DNS service on SVHQADS01 (10.0.225.44) was unable to resolve external DNS names (see Scenario 4)
How it was Solved - Changed the Active Directory and DNS server setting on the Symantec Gateway servers to SVHQADS02 / 10.0.225.22
How to Avoid Recurrence - We are working with Microsoft to review the overall health of Active Directory, including its DNS services.
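
Scenarios 4 and 5 above could be repeated as a scripted check rather than by running nslookup by hand. The following is a minimal sketch in Python, not the procedure used during the incident: it sends a plain DNS A-record query over UDP port 53 to a given server and reports whether an answer came back. The function names build_query and can_resolve are hypothetical, and the 3-second timeout is an assumption.

```python
import socket
import struct

QUERY_ID = 0x1234  # arbitrary transaction ID used to match the reply (assumption)


def build_query(hostname: str, txid: int = QUERY_ID) -> bytes:
    """Build a minimal DNS query for an A record with recursion desired."""
    # Header: ID, flags (RD bit set), 1 question, 0 answer/authority/additional.
    header = struct.pack(">HHHHHH", txid, 0x0100, 1, 0, 0, 0)
    # QNAME: each label is prefixed by its length, terminated by a zero byte.
    qname = b"".join(
        bytes([len(label)]) + label.encode("ascii")
        for label in hostname.rstrip(".").split(".")
    ) + b"\x00"
    # QTYPE = 1 (A record), QCLASS = 1 (IN).
    return header + qname + struct.pack(">HH", 1, 1)


def can_resolve(server_ip: str, hostname: str, timeout: float = 3.0) -> bool:
    """Return True if server_ip answers the query with at least one record."""
    query = build_query(hostname)
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        try:
            sock.sendto(query, (server_ip, 53))
            reply, _ = sock.recvfrom(512)
        except OSError:
            # Covers timeouts and unreachable servers alike.
            return False
    if len(reply) < 12:
        return False
    txid, flags, _qd, ancount, _ns, _ar = struct.unpack(">HHHHHH", reply[:12])
    rcode = flags & 0x000F  # 0 means "no error"
    return txid == QUERY_ID and rcode == 0 and ancount > 0
```

For example, calling can_resolve("10.0.225.44", "www.yahoo.com") and can_resolve("10.0.225.22", "www.yahoo.com") would correspond to Scenarios 4 and 5. Run on a schedule against every DNS server in use, a check like this might flag a resolver that has stopped answering external names before mail flow is affected.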

Beza Melaku
