TSD Operational Log - Page 18

Published Nov. 4, 2015 12:22 AM

Dear TSD-users,

there will be a maintenance stop of the TSD infrastructure on Thursday 5/11 from 15:00 to 15:30 CET. During the downtime, users will not be able to access TSD. The VMs will probably be rebooted at the end of the downtime, so all running processes will be stopped. The TSD downtime coincides with the Colossus maintenance stop, so no jobs will be running on the cluster during the downtime. The short notice is due to our decision to merge two maintenance stops, namely HNAS and Colossus, to minimise the number of outages.

Update: the downtime lasted from 15:00 to 15:10, and everything is back up and working. Performance should be better.

Sorry for the inconvenience.
Regards,
Francesca

Published Oct. 20, 2015 10:02 PM

Dear TSD-user,

tomorrow there will be an upgrade of the Cerebrum instance in TSD. The outage will last for the entire day. As a consequence of the maintenance stop, brukerinfo will not work.
You will receive an informative email when the maintenance is finished.
Sorry for the inconvenience. 

Regards,

TSD team

Published Sep. 16, 2015 3:58 PM

Dear TSD-users,

the issue regarding the broken communication between Colossus and the Domain Controllers has been solved, and Colossus is now back in production as usual.
We expect that very few (if any) jobs failed during the unplanned outage.

Sorry for the inconvenience.

Happy computing!

Francesca@TSD

Published Sep. 16, 2015 1:50 PM

Dear TSD user

We have a problem with Colossus: our Domain Controller update caused an unwanted situation. We hope to get things back on track today; we'll keep you posted.

For those of you paying for CPU hours who have had jobs killed, please email us at tsd-drift@usit.uio.no to get this refunded with interest.

Sorry for the inconvenience.

Gard

Published June 8, 2015 10:38 AM

Dear TSD user,

on 10 June 2015 at 12:00 CEST there will be an update of the TSD disk. We expect the upgrade to last for an hour. During this period the system might hang for about one minute at 30-minute intervals (the first time at 12:00, the second at 12:30, and so on).

For the Colossus users: all the jobs on Colossus nodes will keep running as usual during the upgrade. Notice, however, that jobs finishing during the upgrade might crash because the processes writing data back to the VMs may fail. We therefore advise you to schedule your jobs (when possible) so that they finish well after the upgrade period; see the sketch below.
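As an illustration only, here is a minimal sketch of one way to stay clear of the window with a Slurm job script on Colossus; the account name, wall time, and program are hypothetical placeholders:

    #!/bin/bash
    # Sketch of a job that cannot finish inside the upgrade window:
    # ask Slurm not to start it before the window has closed.
    # Account name, wall time, and program are hypothetical placeholders.
    #SBATCH --account=pXX
    #SBATCH --time=04:00:00
    #SBATCH --begin=2015-06-10T13:00:00

    ./my_analysis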

Regards,

TSD@USIT

Published May 18, 2015 1:34 PM

Dear TSD-user,

the Linux VMs will now be shut down for maintenance, as previously announced.

You will be informed when we are finished.

Regards,

Francesca@TSD

Published May 18, 2015 12:46 PM

Dear User,

the login problem we experienced this morning has just been solved.

Regards,

TSD@USIT

Published May 18, 2015 10:55 AM

Dear Users

We have an issue with the two-factor login. The problem occurs the second time you try to log in today. We are on the case; hopefully it will be solved quite soon.

Best

Gard@TSD

Published May 11, 2015 10:16 PM

Dear TSD user,

the maximum wall-time limit for jobs running on Colossus has now been increased to 28 days. This will facilitate the execution of long simulations/calculations. However, we strongly advise you not to run jobs for more than 7 days unless you have enabled checkpointing (...
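As a rough sketch of what a long job could look like under the new limit (the account name, program, and restart flag are hypothetical placeholders):

    #!/bin/bash
    # Request the new 28-day maximum wall time
    # (Slurm format: days-hours:minutes:seconds).
    # The account name is a hypothetical placeholder.
    #SBATCH --account=pXX
    #SBATCH --time=28-00:00:00

    # For runs this long, application-level checkpointing is strongly
    # advised: restarting from the latest checkpoint means an outage does
    # not cost the whole run. Program name and flag are hypothetical.
    ./my_simulation --restart-from latest_checkpoint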

Published May 11, 2015 10:02 AM

Dear TSD user,

due to maintenance, on 18 May 2015 at 13:30 CEST all the Linux machines in TSD will be shut down. We expect the downtime to last for an hour. You will receive a notification by mail when the maintenance stop is over.

For the Colossus users: all the jobs on Colossus nodes will keep running as usual during the downtime. Notice, however, that jobs finishing during the downtime might crash because the processes writing data back to the VMs may fail. We therefore advise you to schedule your jobs (when possible) so that they finish well after the downtime period.

Regards,

TSD@USIT

Published Apr. 30, 2015 8:23 PM

- This work has been finished and we are back in production (08:50, 6/5-15)

Because USIT is starting to use a new certificate when talking to MinID (ID-porten/Difi), there will be a restart of Nettskjema on 6/5-15 at 08:30. Estimated downtime is about 20 minutes.

Nettskjema will not work during this downtime. We will try to update here when it is back online again.

Best,

TSD@USIT

Published Apr. 14, 2015 3:51 PM

The SPICE proxy was accidentally rebooted at 15:40 today. This incident caused a 5-minute downtime for remote connections to Linux machines in TSD using SPICE. We are sorry for any inconvenience this may have caused.

Published Mar. 27, 2015 11:05 AM

The SPICE proxy is now up and running again.

Dear TSD users

As usual, things do not go as planned. The machine has been moved, but we cannot get the SPICE connection working. We will come back to this shortly. As a workaround, log in to a Windows machine first and then use PuTTY to reach your Linux VM; see the sketch below.
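A minimal sketch of the workaround, assuming PuTTY is available on the Windows machine; the username and VM hostname are hypothetical placeholders:

    rem On the Windows login machine (Command Prompt):
    rem username and VM hostname below are hypothetical placeholders.
    putty.exe -ssh myuser@my-linux-vm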

We are moving the SPICE proxy machine to VMware today at 14:00, so there will be a short downtime; the proxy should be back up no later than 14:30. All SPICE connections will be lost. If you log in on Windows and then use SSH to your machine, you will not be affected.

We will update this logpost when done.

Published Mar. 12, 2015 5:02 PM

Dear TSD users

We have fixed the LDAP issue in TSD. Everything should work, except for a known problem with p21 in the file-sluice.

We are very sorry for the downtime. If there are any more issues, please report to tsd-drift@usit.uio.no.

Best regards

Gard@TSD

Published Mar. 11, 2015 3:33 PM

Dear TSD-users

NB: the amount of data inside the file-sluice was so large that we must extend the downtime for the file-sluice until 17:00 today.

We are moving one of the file-sluice machines to VMware tomorrow morning; thus, no files can be imported or exported tomorrow morning from about 09:00 to 12:00. Data will not be lost, but all jobs and connections will be cut off at 09:00 tomorrow morning. Nettskjema answers from this period will show up inside TSD once the server is back online.

Nettskjema will be stopped on 19/3-15 from 08:30 to 10:30. No answers can be handed in during this time, and you cannot log in to create or change forms at nettskjema.uio.no. This downtime is due to a major upgrade to Nettskjema version 15.

Best regards

Gard

Published Feb. 27, 2015 10:23 AM

We had a DNS issue yesterday at about 15:30. It was quick-fixed yesterday afternoon, and a permanent fix was in place today at 10:15.

The reason was that after migrating machines to VMware, the old machines were left in RHEV, turned off. Some eager users had restarted these machines (we totally understand why, as you believed they were down), and this resulted in duplicate machines with the same names and IP addresses, which in turn caused DNS to fail.

Sorry for the inconvenience.

Gard

Published Feb. 24, 2015 3:59 PM

We have solved the issue with import/export.
Sorry for the delay with the fix.

Gard

Published Feb. 19, 2015 1:21 PM

Dear TSD-users

You may log in to TSD now. Problem solved.

TSD-team

Published Feb. 19, 2015 12:50 PM

Dear TSD-users

You may experience problems with logging in to TSD. We are on the case and working to solve it as soon as possible.

Sorry for the inconvenience.

TSD-Team

Published Feb. 16, 2015 8:53 AM

We had a disk run full on one of our login machines late last night. This caused logins to fail. The case has been solved, and login is now enabled again.

Gard

Published Feb. 11, 2015 8:48 AM

Dear TSD users

TSD is now back up. All Windows machines have been, or will be, restarted. We are restarting a few Linux boxes now, assuming most users can start their own.

Best regards

Gard@TSD

Published Feb. 2, 2015 3:37 PM

We have detected a problem with the copying of data in and out of TSD. We know what is wrong, but not yet why. We are working on it and will update here once it is solved.

Update: the problem is now solved. If you still see errors, please report them to tsd-drift@usit.uio.no with the filename and project number.

Published Feb. 2, 2015 1:04 PM

The HPC resource Colossus is still down due to a security update.

We hope that it will be back up again during Monday 2/2-15.

Update: our work with Colossus yesterday unfortunately had to give way to an emergency maintenance stop of Abel. The new ETA is late today, Tuesday the 3rd.

Update: Colossus came back up yesterday at about 16:00. Unfortunately, queued jobs did not manage to start and have to be resubmitted.

Published Jan. 30, 2015 1:56 PM

We are glad to inform you that TSD services, except Colossus, are now open. We will update you once Colossus is online.

Thank you for your patience,

The TSD team

Published Jan. 29, 2015 4:20 PM

As the glibc error showed up worldwide yesterday, we also had to fix this issue during our downtime.
Things have not gone as smoothly as planned, so we must prolong our downtime until further notice.

We have several people working on the case right now so we hope to see a solution fairly soon.

We will post updates on the email list and the website as soon as we know more.

We are sorry for the inconvenience.