2017-06-05: Crane /work filesystem unplanned downtime
Category: System Failure
The /work filesystem for Crane is partially unavailable. One of the storage servers crashed and rebooted. We are now running a filesystem check before placing the server back online. Pending jobs will be held until the maintenance is complete.
The filesystem check has completed with no errors found, and the /work filesystem is back online. Jobs that were running during the outage may have been affected, but no data was lost.
We believe the storage server crash was triggered by I/O delays as the RAID controller was rebuilding a failed disk drive. The rebuild is still running and we are monitoring the system.