Fix #334, add some doc. on how to replace the Manager in case of failure by giuseppe-carboni · Pull Request #335 · discos/deployment

giuseppe-carboni · 2020-05-05T14:42:29Z

No description provided.

marco-buttu · 2020-05-05T15:38:27Z

doc/production.rst

 Production
 **********

+Unlike the Development environment, that uses Vagrant pre-configured virtual


In development.rst we wrote "development environment"; to be consistent, we should use the same convention here.

marco-buttu · 2020-05-05T15:41:02Z

doc/production.rst

+Replace the Manager in case of failure
+--------------------------------------
+In case the Manager machine suffers a failure of some sort, it has to be
+replaced. In order to do this, the first thing to do is perform again the


"is perform" or "is to perform" ?

I'm checking this with an English-speaking friend; I'll post the correct version ASAP.

Regarding the point below, it is not clear what "all station systems" are.

marco-buttu · 2020-05-05T15:44:41Z

doc/production.rst

+- Make sure that all the station systems and machines accept incoming
+  connections from the newly allocated Manager's IP address. Specifically, the
+  ``TotalPower`` backend and the ``CalMux`` machines have to be tweaked in
+  order to allow them to be controlled by the new manager.


Where is the procedure?

This procedure involves logging in to the said machines as root; if it has to be documented, this is not the place to do it. One suggestion is to perform this step in advance by allowing a range of addresses to control these machines, so that, in case of failure, this step can be skipped.
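As a rough illustration of the "range of addresses" idea, here is a minimal sketch of checking whether a replacement Manager's IP falls inside a pre-authorized subnet. The subnet `192.168.200.0/28` and the function name `is_allowed_manager` are hypothetical, chosen only for the example; the actual access rules on the ``TotalPower`` and ``CalMux`` machines are not described in this thread.

```python
import ipaddress

# Hypothetical subnet reserved in advance for current and future Manager machines.
ALLOWED_MANAGER_NETWORK = ipaddress.ip_network("192.168.200.0/28")


def is_allowed_manager(address: str) -> bool:
    """Return True if `address` falls inside the pre-authorized range."""
    return ipaddress.ip_address(address) in ALLOWED_MANAGER_NETWORK
```

With a range like this authorized ahead of time, a new Manager allocated at any address inside it would not require reconfiguring the controlled machines.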

It is not clear to me how it is possible to replicate the manager without any information about this point. I think the procedure should be documented somewhere and, if this is not the place, we have to put a reference link to it here.

marco-buttu · 2020-05-05T15:45:39Z

doc/production.rst

+  ``discos-console`` and ``discos-storage`` machines (in case the DISCOS
+  control software is running on a distributed environment). This will allow
+  other services such as the Lustre service on the ``discos-storage`` machine
+  to point again to the correct IP address.


Is there a procedure to point to?

marco-buttu · 2020-05-05T15:46:44Z

doc/production.rst

+  control software is running on a distributed environment). This will allow
+  other services such as the Lustre service on the ``discos-storage`` machine
+  to point again to the correct IP address.
+- Perform the ssh key exchange procedure between the ``discos`` user of the


Does Mauro do all these things? :-D We need an example for him :-)

This is not a procedure that a generic observer can do. Performing the ssh key exchange requires knowing the password of both the discos and the root users.

I was joking; the point is that we have to write the documentation assuming that the reader is not a member of the discos team...

Fix #334, add some doc. on how to replace the Manager in case of failure

fd040d6

giuseppe-carboni added enhancement documentation labels May 5, 2020

giuseppe-carboni requested a review from marco-buttu May 5, 2020 14:42

giuseppe-carboni self-assigned this May 5, 2020

marco-buttu reviewed May 5, 2020


Update on duc file

193b99b

giuseppe-carboni wants to merge 2 commits into master from fix-issue-334
