Conversation

@jdanbrown
Contributor

In Cluster.connectionWatcher a KeeperState.Disconnected doesn't trigger forceShutdown, which is responsible for calling listener.shutdownWork and listener.onLeave. My guess as to why is that it's waiting for a KeeperState.Expired, which does trigger forceShutdown and is otherwise handled identically (modulo logging).
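
For concreteness, here's a minimal sketch of a connection watcher that treats Disconnected the same as Expired, written against the raw ZooKeeper Watcher API; the callback names (forceShutdown, onConnect) are illustrative, not Ordasity's actual members:

```scala
import org.apache.zookeeper.{WatchedEvent, Watcher}
import org.apache.zookeeper.Watcher.Event.KeeperState

// Sketch only: handler names are illustrative, not Ordasity's actual code.
class ConnectionWatcher(forceShutdown: () => Unit, onConnect: () => Unit) extends Watcher {
  override def process(event: WatchedEvent): Unit = event.getState match {
    case KeeperState.SyncConnected =>
      onConnect()
    case KeeperState.Disconnected | KeeperState.Expired =>
      // Shut down claimed work as soon as the session is locally considered
      // lost, instead of waiting for the server-side Expired notification.
      forceShutdown()
    case _ =>
      () // ignore other transitions (e.g. AuthFailed)
  }
}
```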

Based on my recent experience and the zk session state-transition docs, this allows the following behavior:

  1. Client partitions from zk cluster
  2. Client-side zk session timeout triggers, client receives KeeperState.Disconnected
  3. Client stays partitioned from the network for arbitrarily long, but ordasity continues running its previously-claimed work
  4. Client eventually rejoins network and regains route to zk cluster
  5. Client attempts to reconnect to the zk cluster, cluster says no, client receives KeeperState.Expired, ordasity finally shuts down work
  6. Client establishes new session, ordasity claims new work

This is harmful in my application since I want work ownership to be best-effort exclusive, i.e. nodes should minimize their overlap in work.

Is this a bug, or was ordasity intentionally designed to tolerate this kind of overlap when nodes partition? I can imagine that being a useful, or at least not harmful, behavior in some settings. If so, maybe I can extend this proposed change with a config option to choose between maximizing and minimizing oblivious work overlap?
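
As a sketch of what such an option might look like (PartitionPolicy and shutdownOnDisconnect are hypothetical names, not part of Ordasity's actual config):

```scala
import org.apache.zookeeper.Watcher.Event.KeeperState

// Hypothetical knob, not part of Ordasity's actual ClusterConfig.
case class PartitionPolicy(shutdownOnDisconnect: Boolean)

def handleStateChange(state: KeeperState, policy: PartitionPolicy)(forceShutdown: () => Unit): Unit =
  state match {
    case KeeperState.Expired =>
      forceShutdown()  // always shut down once the session is expired
    case KeeperState.Disconnected if policy.shutdownOnDisconnect =>
      forceShutdown()  // minimize overlap: shut down eagerly on partition
    case _ =>
      ()               // maximize availability: keep running claimed work
  }
```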

@jdanbrown
Contributor Author

Ok, this change appears to violate some important assumptions—see the two FIXME commits above.

I'd be interested to hear your thoughts on how this could be made to work.
