diff options
author | Martin Schwenke <martin@meltin.net> | 2018-11-08 15:49:30 +1100 |
---|---|---|
committer | Amitay Isaacs <amitay@samba.org> | 2018-12-18 02:02:03 +0100 |
commit | da8aaf2aee2ff145f3001ae1fcf3626d6a9bc17d (patch) | |
tree | ee91796a32ff183071b8083f4a7824723baf8f6e /ctdb/server | |
parent | 9d1d5fa4ac3abfe464dda1c7adc4616927855b57 (diff) | |
download | samba-da8aaf2aee2ff145f3001ae1fcf3626d6a9bc17d.tar.gz |
ctdb-recoverd: Call an election when the recovery lock is lost
The lock may have been lost due to a failure in the underlying locking
mechanism. This could be due to quorum loss or similar. It is best
to call an election to confirm that this node should still be master.
At worst, the node will reelect itself, fail to take the lock and then
ban itself. This is a suitable outcome for a node that has been
partitioned from others in the cluster.
Signed-off-by: Martin Schwenke <martin@meltin.net>
Reviewed-by: Amitay Isaacs <amitay@gmail.com>
Diffstat (limited to 'ctdb/server')
-rw-r--r-- | ctdb/server/ctdb_recoverd.c | 13 |
1 files changed, 6 insertions, 7 deletions
diff --git a/ctdb/server/ctdb_recoverd.c b/ctdb/server/ctdb_recoverd.c index f000538bae2..578127a4514 100644 --- a/ctdb/server/ctdb_recoverd.c +++ b/ctdb/server/ctdb_recoverd.c @@ -915,20 +915,19 @@ static void take_reclock_handler(char status, s->locked = (status == '0') ; } -static bool ctdb_recovery_lock(struct ctdb_recoverd *rec); +static void force_election(struct ctdb_recoverd *rec, + uint32_t pnn, + struct ctdb_node_map_old *nodemap); static void lost_reclock_handler(void *private_data) { struct ctdb_recoverd *rec = talloc_get_type_abort( private_data, struct ctdb_recoverd); - DEBUG(DEBUG_ERR, - ("Recovery lock helper terminated unexpectedly - " - "trying to retake recovery lock\n")); + D_ERR("Recovery lock helper terminated, triggering an election\n"); TALLOC_FREE(rec->recovery_lock_handle); - if (! ctdb_recovery_lock(rec)) { - DEBUG(DEBUG_ERR, ("Failed to take recovery lock\n")); - } + + force_election(rec, ctdb_get_pnn(rec->ctdb), rec->nodemap); } static bool ctdb_recovery_lock(struct ctdb_recoverd *rec) |