[Issues] Peregrine maintenance extended
Ewout M. Helmich
e.m.helmich at rug.nl
Thu Nov 8 11:52:16 CET 2018
Hi everyone,
Please note that there are issues with the planned Peregrine
maintenance. It will be extended at least to tomorrow morning, see below.
Regards,
Ewout
----
Dear Peregrine users,
The process of copying /data from the Data Handling storage to the new
Peregrine storage is going much slower than anticipated, due to a few
crashes of the copying process. This forces us to restart the
synchronization process from the beginning, and this will take some time.
Because of the way this process works, we have decided not to allow
access to the cluster, since doing so can result not only in jobs
crashing (because of missing files), but also in file conflicts if new
files are created on the Peregrine storage with the same names as files
that still need to be copied over from the Data Handling storage.
We understand that the prolonged downtime of the cluster is adversely
impacting your work, but, on the whole, it is the least of two evils. We
apologize once again for this, and hope that you understand our reasoning.
As regards an estimate of when Peregrine will be back online, the
earliest time would be tomorrow morning, if everything works as it
should. We are also preparing for a worse case scenario. We will have
more information for you tomorrow.
Best regards,
Fokke Dijkstra
The HPC Team
--
Regards,
Ewout
More information about the Issues
mailing list