kernel-fxtec-pro1x/fs/dlm
David Teigland 98f176fb32 [DLM] don't accept replies to old recovery messages
We often abort a recovery after sending a status request to a remote node.
We want to ignore any potential status reply we get from the remote node.
If we get one of these unwanted replies, we've often moved on to the next
recovery message and incremented the message sequence counter, so the
reply will be ignored due to the seq number.  In some cases, we've not
moved on to the next message so the seq number of the reply we want to
ignore is still correct, causing the reply to be accepted.  The next
recovery message will then mistake this old reply for a new one.

To fix this, we add the flag RCOM_WAIT to indicate when we can accept a
new reply.  We clear this flag if we abort recovery while waiting for a
reply.  Before the flag is set again (to allow new replies) we know that
any old replies will be rejected due to their sequence number.  We also
initialize the recovery-message sequence number to a random value when a
lockspace is first created.  This makes it clear when messages are being
rejected from an old instance of a lockspace that has since been
recreated.
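
The two checks described above can be sketched in a small userspace model.
This is an illustration only, not the code from rcom.c: the struct, the
function names (lockspace_init, allow_reply, disallow_reply, accept_reply)
and the caller-supplied seed are all hypothetical, and the kernel's locking
is left out.  Clearing the flag after a reply is accepted is an extra
assumption of this sketch, not something the commit message states.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the per-lockspace recovery state. */
struct lockspace {
	uint64_t rcom_seq;  /* sequence number of the latest recovery message */
	bool rcom_wait;     /* models RCOM_WAIT: a reply may be accepted now */
};

/* The seed stands in for the random initial value chosen at creation. */
static void lockspace_init(struct lockspace *ls, uint64_t seed)
{
	ls->rcom_seq = seed;
	ls->rcom_wait = false;
}

/* Before sending a request: advance the sequence and open the window. */
static uint64_t allow_reply(struct lockspace *ls)
{
	ls->rcom_wait = true;
	return ++ls->rcom_seq;
}

/* On aborted recovery: close the window so late replies are dropped. */
static void disallow_reply(struct lockspace *ls)
{
	ls->rcom_wait = false;
}

/* A reply is accepted only if we are waiting AND its sequence matches. */
static bool accept_reply(struct lockspace *ls, uint64_t reply_seq)
{
	if (!ls->rcom_wait || reply_seq != ls->rcom_seq)
		return false;
	ls->rcom_wait = false;  /* assumption of this sketch: one reply per request */
	return true;
}
```

In this model, a reply that arrives after disallow_reply() is rejected by
the flag even though its sequence number still matches.  Once allow_reply()
runs for the next message, the same old reply is rejected by the sequence
check instead.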

Signed-off-by: David Teigland <teigland@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
2006-11-30 10:37:14 -05:00
File | Last commit | Date
ast.c | [DLM] down conversion clearing flags | 2006-08-23 16:07:31 -04:00
ast.h
config.c | [DLM] PATCH 2/3 dlm: lowcomms close | 2006-04-28 10:50:41 -04:00
config.h
debug_fs.c | [GFS2] inode_diet: Replace inode.u.generic_ip with inode.i_private (gfs) | 2006-09-28 08:32:24 -04:00
dir.c | [DLM] Update DLM to the latest patch level | 2006-01-20 08:47:07 +00:00
dir.h
dlm_internal.h | [DLM] don't accept replies to old recovery messages | 2006-11-30 10:37:14 -05:00
Kconfig | [DLM] Fix DLM config | 2006-11-30 10:35:41 -05:00
lock.c | [DLM] clear sbflags on lock master | 2006-11-30 10:35:27 -05:00
lock.h | [DLM] dump rsb and locks on assert | 2006-08-21 09:50:09 -04:00
lockspace.c | [DLM] don't accept replies to old recovery messages | 2006-11-30 10:37:14 -05:00
lockspace.h | [DLM] dlm: user locks | 2006-07-13 09:25:34 -04:00
lowcomms-sctp.c | [DLM] Add support for tcp communications | 2006-11-30 10:35:00 -05:00
lowcomms-tcp.c | [DLM] Add support for tcp communications | 2006-11-30 10:35:00 -05:00
lowcomms.h | [PATCH] dlm gfp_t annotations | 2006-10-09 14:19:08 -07:00
lvb_table.h
main.c | [DLM] dlm: user locks | 2006-07-13 09:25:34 -04:00
Makefile | [DLM] Add support for tcp communications | 2006-11-30 10:35:00 -05:00
member.c | [DLM] fix aborted recovery during node removal | 2006-11-30 10:35:13 -05:00
member.h
memory.c | [DLM] print bad length in assertion | 2006-08-09 09:44:54 -04:00
memory.h | [DLM] Remove range locks from the DLM | 2006-02-23 09:56:38 +00:00
midcomms.c | [DLM] Update DLM to the latest patch level | 2006-01-20 08:47:07 +00:00
midcomms.h
rcom.c | [DLM] don't accept replies to old recovery messages | 2006-11-30 10:37:14 -05:00
rcom.h
recover.c | [DLM] res_recover_locks_count not reset when recover_locks is aborted | 2006-11-30 10:35:03 -05:00
recover.h
recoverd.c | [DLM] fix add_requestqueue checking nodes list | 2006-11-30 10:37:00 -05:00
recoverd.h
requestqueue.c | [DLM] fix add_requestqueue checking nodes list | 2006-11-30 10:37:00 -05:00
requestqueue.h | [DLM] fix requestqueue race | 2006-11-30 10:35:10 -05:00
user.c | [DLM] force removal of user lockspace | 2006-08-31 12:15:37 -04:00
user.h | [DLM] dlm: user locks | 2006-07-13 09:25:34 -04:00
util.c | [DLM] Remove range locks from the DLM | 2006-02-23 09:56:38 +00:00
util.h