[MCOL-3945] load_brm will hang on dbroot1 failover - Jira

XML

Word

Printable

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Critical
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.2.6, 1.4.4
Component/s: ?
Labels:
None

Description

saveBRM on failover runs before the dbroot is exchanged. this could lead to saveBRM being run before the brm_saves_journal file exists on the new primary module on a OAM parent failure and could lead to load_brm hanging.

Reproduce by setting up multi-node glusterfs installation and perform large table import. After import completes kill PM1 and wait for PM2 to take over primary roll will see save_brm command run first then dbroot1 moved to PM2 and then load_brm called in logging.

Fix is to first move dbroot1 then run saveBRM this should allow load_brm to run successfully.

Attachments

Activity

People

Assignee:: Ben Thompson (Inactive)

Reporter:: Ben Thompson (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 2020-04-14 17:10

Updated:: 2023-10-26 13:16

Resolved:: 2020-06-22 15:56

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.