Messages & Announcements

  • 2012-10-01:  Firefly: /work deletions needed
    Category:  General Announcement

    2012-10-01 Firefly only: usage of the shared /work filesystem is nearing capacity. Users are requested to make voluntary deletions of unneeded files. HCC Staff will be required to make deletions if space is not freed up soon.


    Due to space becoming limited on /work on Firefly, deletions of files will begin soon. The group with largest usage will be selected first, and the oldest files owned by that group will be deleted first. This will be repeated until usage is reduced to a manageable amount. It is preferred that this be done voluntarily, but the first priority is to provide space for new work. /work is not for long-term archiving or precious files. If you wish to acquire storage space for your group, contact David Swanson (dswanson@cse.unl.edu).

  • 2012-02-01:  SANDHILLS Outage Announcement
    Category:  Maintenance

    Sandhills nodes will be rebooted in the early afternoon of 02/01/2012 to perform maintenance on the Infiniband infrastructure.


    SANDHILLS nodes will be rebooted in the early afternoon of 02/01/2012 to perform maintenance on the Infiniband infrastructure. We expect the outage to last no more than 30 minutes, however all running jobs will be killed. Please let us know if this severely impacts your research so that we can try to accommodate you.

    Thanks!
    Tom

  • 2011-12-10:  Unexpected PRAIRIEFIRE/SANDHILLS /home filesystem outage
    Category:  System Failure

    There was an unexpected system failure of /home NFS fileserver on PRAIRIEFIRE/SANDHILLS evening of 12/09/2011. The server was rebooted and recovery of cluster functionality restored. Users are advised to check jobs for errors and resubmit if necessary.


    There was an unexpected system failure of /home NFS fileserver on PRAIRIEFIRE/SANDHILLS in the evening of 12/09/2011. The server was rebooted the morning of 12/10/2011. Cluster functionality was restored. Users are advised to check jobs for I/O errors and resubmit if necessary. Please contact us with any questions or concerns.

    Tom Harvill
    2-5660

  • 2011-12-02:  Partial Firefly disassembly
    Category:  General Announcement

    Firefly users: a partial disassembly of Firefly has begun in preparation for a new machine to be installed and ready for use by next semester. 352 nodes of Firefly were drained of jobs, powered off, and removed. Please check your Firefly jobs and notify HCC staff if you notice any issues. Other than reduced capacity, most users will be unaffected by these actions; new hardware is scheduled to begin arriving as soon as next week.


    POD 3 of Firefly was decommissioned this week in preparation for a new 6016 core machine, which is scheduled to arrive starting next week. This will reduce the available cores on Firefly to 3264, roughly 3/4 its capacity earlier this month. The remainder of Firefly will remain available through next summer to aid users in the transition to the new machine.

    The new machine will consist of 94 nodes with 256 GB RAM, 4 processors (AMD 6272 ) -- 64 cores, and interconnected with QDR (40 gbps) Infiniband. It will be connected to 400 TB of TeraScale (Lustre) storage. 2 "fat" nodes will be equipped with an extra 256 GB RAM for 512 GB RAM each. Shipping of components and installation will begin next week, with full implementation and testing scheduled to be completed in time for availability at the beginning of spring semester. The full new machine will have a peak capability of 50 TeraFlops, roughly twice the capability of Firefly, but occupy less than one-fourth the space. It is 8-times as energy efficient.

    Further details will be posted here as they become available.

  • 2011-09-13:  Brief network outages Sept. 14-15 evening for PKI/Firefly
    Category:  Maintenance

    On September 14th and 15th, between 9:00pm and 5:00am on both nights, UofN Networking will be upgrading the connection between Nebraska Hall in Lincoln and PKI in Omaha. This will cause multiple short (< 1 minute) drops as connections are moved.


Pages