Internet Network Issues
Incident Report for OIT Services
Postmortem

ACS University of Alaska Network Outage

Physical Defect in ACS’ Fiber Results in Outage

Event Occurrence: April 24, 2024

Background

The University of Alaska receives network services from ACS.  The ISP provides the long haul WAN circuits that comprise the University of Alaska’s core network from/to Fairbanks, Anchorage, Juneau, Seattle, and Portland.  The circuits from Fairbanks <-> Seattle and Anchorage <-> Portland comprise the University’s connection to Internet2, AWS, and some commodity internet sites.

Break Down of the Problem

On April 22nd, one of ACS’ fibers that supply Wide Area Network (WAN) Connectivity (to Internet2, Google, Microsoft and AWS) to the University of Alaska Fairbanks Main Campus began generating faults and errors resulting in lost and delayed packets, thereby impacting internet performance for the vast majority of users at UAF and UA Statewide Fairbanks Information Technology services.

Target State / Goal

Restore services as soon as possible.

Root Cause Analysis

One of the two ACS fibers supplying connectivity from the University of Alaska to Internet2 and the Lower 48 became severely impacted resulting in poor and lost performance.  Due to the nature of the issue, it was not immediately clear as to what the exact root of the problem was. An emergency maintenance outage was published for Apr 24th at 1000.  During this window the ACS rebooted some network equipment and replaced fiber jumpers, but it did not fix the underlying issue.  The location and root cause of the fiber errors still eluded ACS’ ability to isolate and rectify.  In order to stabilize services, this fiber was disabled, rerouting all traffic onto the single remaining fiber.  This restored UA to proper functional status until fiber repairs could be accomplished.  ACS has not yet corrected the issue on the impacted fiber.

Develop Countermeasures

This issue was caused by a hardware fault in ACS’ gear, there are not many countermeasures that we can take to prevent this issue from happening again.  Short of provider diversification, which has historically been cost-prohibitive.

Posted May 15, 2024 - 10:25 AKDT

Resolved
This incident has been resolved.
Posted Apr 26, 2024 - 13:05 AKDT
Monitoring
After ACS's emergency maintenance our network has been operational with no interruptions. We'll continue monitoring the stability of our connection throughout the day.
Posted Apr 25, 2024 - 11:55 AKDT
Update
ACS is still working on solving the network path issue. We are still getting reports of the internet still being affected. Users may experience slowness and drops when using the internet.
Posted Apr 24, 2024 - 09:35 AKDT
Identified
ACS will be performing emergency maintenance this evening from 10pm to 1am. During this time services such as Google and Microsoft may be impacted and temporarily become unavailable. It is expected for unavailability to be in short bursts and not down for the entire window.
Posted Apr 23, 2024 - 16:12 AKDT
Investigating
There is currently an outage with the Fairbanks to Seattle internet network path which ACS is investigating. We may experience some internet slowness or network drops as a different route is being switched over too. Departments such as RCS and ASF who use internet2 may experience more of an impact. We thank you for your patience and apologize for the inconveniences.
Posted Apr 23, 2024 - 13:58 AKDT
This incident affected: UA Network Connectivity (Statewide Network Connectivity).