Upcoming Plato Downtime Oct 30 through ?

Just a heads up that plato will be down starting October 30th and might not be back up until November 30th. We expect to be back up earlier, but can't guarantee it. This is because plato depends on Wynton's BeeGFS network filesystem, and that is going to be upgraded then. For reference, the Wynton announcement is included below. Since plato has already migrated to Rocky 8 from CentOS 7, we don't anticipate any plato specific issues. Scooter and I are directly involved in the Wynton upgrade, so we'll know when is safe to come back up. So during the downtime: * email will be unavailable * web sites hosted by plato will be unavailable o CGL/RVBI, ChimeraX toolshed, SPOKE, SFLD o web services (/e.g/., for ChimeraX) will be unavailable You should consider taking a few vacation days. 🙂 -- Greg -------- Forwarded Message -------- Subject: [Wynton-Announce] Upcoming Wynton Downtime Oct 30 Date: Fri, 13 Oct 2023 18:08:37 +0000 From: Ellestad, Erik <0000090458849058-dmarc-request@LISTSRV.UCSF.EDU> Reply-To: support@WYNTON.UCSF.EDU To: WYNTON@LISTSRV.UCSF.EDU TLDR: What: Full Wynton downtime, including no access to login, dev, data transfer, or app nodes When: 9am Monday October 30 through End of Business November 3rd Why: To update the Wynton HPC OS to Rocky 8 Linux, Update BeeGFS, Replace aging hardware, and to accommodate work by UCSF Facilities. How: The downtime has been added to SGE's calendar. If your job's runtime limit (h_rt) extends into the maintenance window, it won't start before the maintenance. The longer story: Rocky Linux 8 Migration. Wynton HPC is currently based on CentOS 7 Linux. The CentOS 7 operating system will reach its end of life in early 2024. To allow for security patches, newer versions of libraries and applications, and continued support we need to upgrade to a newer Linux Operating System. We have identified Rocky Linux 8 as being the most compatible with our current needs which has an end of life in 2029. NOTE: All local partitions WILL BE ERASED during the OS Upgrade to Rocky 8 Linux, (including /scratch,) on app, dev, and compute nodes (unless the node is already running Rocky 8 Linux before the downtime). BeeGFS Upgrade. We will update the version of the underlying shared file system, aka BeeGFS, to enable newly implemented features, increase reliability, and implement optimizations. NOTE: NO DATA ON WYNTON IS BACKED UP. While we have tested the BeeGFS upgrade and expect no problems or data loss due to the upgrade of the shared file system version, before the downtime, be sure you have migrated or backed up any working data from /wynton to its canonical storage location. Hardware Replacement. As part of the downtime we will be replacing several older components in our infrastructure. We have done our best to plan ahead and test for this downtime, but due to the number of systems which need to be updated and have their configurations migrated, we expect Wynton HPC to be unavailable until Friday Novemeber 3rd. More information about the Rocky 8 Linux migration project is available on our website: https://wynton.ucsf.edu/hpc/software/rocky-8-linux.html -- Erik Ellestad Wynton Cluster SysAdmin UCSF ------------------------------------------------------------------------ This list is used to keep users of the Wynton cluster updated on outages, system upgrades etc. List membership is automatically generated based on registered cluster users. To unsubscribe, please email the cluster admins at <support@wynton.ucsf.edu> to close your account.

Sorry to alarm you, I meant to type November 3rd, not the 30th. A request was made for email to give a vacation-like autoresponse during the downtime. That is a good idea, and is technically possible because cgl.ucsf.edu's incoming mail servers will still be up while plato is down. The idea would be for users to setup an autoresponse with https://mail.cgl.ucsf.edu/ -- you can do that regardless to automatically forewarn your correspondents while plato is still up. Then, in a way yet to be determined, that setup would be replicated on cgl.ucsf.edu's incoming mail servers. An easy alternative, is to forward all of your email to another email address. If that appeals to you, send me your alternative email addresses and whether you want (A) your email to be permanently forwarded there, (B) you want your email temporarily forwarded there during the downtime, or (C) you want your email temporarily forwarded there with a copy sent to plato when it comes up. If the autoresponse replication idea works, it would replace options B and C (because you choose the forwarding you want in the autoresponse setup). -- Greg On 10/21/2023 3:30 AM, Greg Couch via Plato-users wrote:
Just a heads up that plato will be down starting October 30th and might not be back up until November 30th. We expect to be back up earlier, but can't guarantee it. This is because plato depends on Wynton's BeeGFS network filesystem, and that is going to be upgraded then. For reference, the Wynton announcement is included below. ....
participants (1)
-
Greg Couch