[MCOL-3841] 1.4.2 multi-node with gluster failing to start via postConfigure - gluster assign/unassign errors Created: 2020-02-27 Updated: 2023-10-25 Resolved: 2023-10-25 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | Columnstore Select Handler |
| Affects Version/s: | 1.4.2 |
| Fix Version/s: | Icebox |
| Type: | Bug | Priority: | Major |
| Reporter: | David Hill (Inactive) | Assignee: | Unassigned |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Amazon EC2 2UM/3PM gluster storage centos 8 |
||
| Description |
|
Setting up a 2um/3pm gluster system on Centos 8 and it failed to start via postConfigure. Setup 5 EC2 instances with 1.4.2 installed and post-install Installed gluster on the PMS following this centos 8 procedure https://computingforgeeks.com/setup-glusterfs-storage-with-heketi-on-centos-server/ sudo gluster --version sudo mcsadmin getso getsoftwareinfo Thu Feb 27 17:44:29 2020 SoftwareVersion = 1.4.2 kicked of postConfigure on pm1 sudo postConfigure This is the MariaDB ColumnStore System Configuration and Installation tool. IMPORTANT: This tool requires to run on the Performance Module #1 Prompting instructions: Press 'enter' to accept a value in (), if available or ===== Setup System Server Type Configuration ===== There are 2 options when configuring the System Server Type: single and multi 'single' - Single-Server install is used when there will only be 1 server configured 'multi' - Multi-Server install is used when you want to configure multiple servers now or Select the type of System Server install [1=single, 2=multi] (2) > ===== Setup System Module Type Configuration ===== There are 2 options when configuring the System Module Type: separate and combined 'separate' - User and Performance functionality on separate servers. 'combined' - User and Performance functionality on the same server Select the type of System Module Install [1=separate, 2=combined] (1) > Seperate Server Installation will be performed. NOTE: Local Query Feature allows the ability to query data from a single Performance Enable Local Query feature? [y,n] NOTE: The MariaDB ColumnStore Schema Sync feature will replicate all of the MariaDB ColumnStore Schema Sync feature is Enabled, do you want to leave enabled? [y,n] NOTE: MariaDB ColumnStore Replication Feature is enabled Enter System Name (columnstore-1) > ===== Setup Storage Configuration ===== ----- Setup Performance Module DBRoot Data Storage Mount Configuration ----- Columnstore supports the following storage options...
These options are available on this system: [1, 2, 3, 4] ===== Setup Memory Configuration ===== NOTE: Setting 'NumBlocksPct' to 70% ===== Setup the Module Configuration ===== ----- User Module Configuration ----- Enter number of User Modules [1,1024] (1) > 2
Enter Nic Interface #1 Host Name (unassigned) > um1
Enter Nic Interface #1 Host Name (unassigned) > um2 ----- Performance Module Configuration ----- Enter number of Performance Modules [1,1024] (1) > 3
Enter Nic Interface #1 Host Name (ip-172-30-0-250.ec2.internal) > pm1
Enter Nic Interface #1 Host Name (unassigned) > pm2
Enter Nic Interface #1 Host Name (unassigned) > pm3 Next step is to enter the password to access the other Servers. Enter password, hit 'enter' to default to using a ssh key, or 'exit' > ----- Performing Install on 'um1 / um1' ----- Install log file is located here: /tmp/columnstore_tmp_files/um1_binary_install.log ----- Performing Install on 'um2 / um2' ----- Install log file is located here: /tmp/columnstore_tmp_files/um2_binary_install.log ----- Performing Install on 'pm2 / pm2' ----- Install log file is located here: /tmp/columnstore_tmp_files/pm2_binary_install.log ----- Performing Install on 'pm3 / pm3' ----- Install log file is located here: /tmp/columnstore_tmp_files/pm3_binary_install.log ===== Configuring MariaDB ColumnStore Data Redundancy Functionality ===== ----- Setup Data Redundancy Copy Count Configuration ----- Setup the Number of Copies: This is the total number of copies of the data Enter Number of Copies [2-3] (2) > 3 ----- Setup Data Redundancy Network Configuration ----- 'existing' - This is specified when using previously configured network devices. (NIC Interface #1) 'dedicated' - This is specified when it is desired for Data Redundancy traffic to use Select the data redundancy network [1=existing, 2=dedicated] (1) > ----- Performing Data Redundancy Configuration ----- gluster peer probe 172.30.0.250 ----- Data Redundancy Configuration Complete ----- ===== Checking MariaDB ColumnStore System Logging Functionality ===== The MariaDB ColumnStore system logging is setup and working on local server MariaDB ColumnStore System Configuration and Installation is Completed ===== MariaDB ColumnStore System Startup ===== System Configuration is complete. ----- Starting MariaDB ColumnStore on local server ----- MariaDB ColumnStore successfully started MariaDB ColumnStore Database Platform Starting, please wait ............. TIMEOUT: ProcMon not responding to getSystemStatus FAILED IMPORTANT: There was a system startup failed, once issue has been resolved, rerun postConfigure MariaDB ColumnStore System failed to start, check log files in /var/log/mariadb/columnstore . /etc/profile.d/columnstoreAlias.sh Enter 'mariadb' to access the MariaDB ColumnStore SQL console NOTE: The MariaDB ColumnStore Alias Commands are in /etc/profile.d/columnstoreAlias [centos@ip-172-30-0-250 ~]$ sudo ma MariaDB ColumnStore Admin Console Active Alarm Counts: Critical = 0, Major = 0, Minor = 0, Warning = 0, Info = 0 Critical Active Alarms: mcsadmin> getsystemi System columnstore-1 System and Module statuses Component Status Last Status Change Module um1 MAN_INIT Thu Feb 27 17:10:41 2020 Active Parent OAM Performance Module is 'pm1' MariaDB ColumnStore Process statuses Process Module Status Last Status Change Process ID ProcessMonitor um2 ACTIVE Thu Feb 27 17:10:18 2020 2780 ProcessMonitor pm1 ACTIVE Thu Feb 27 17:09:34 2020 3230 ProcessMonitor pm2 ACTIVE Thu Feb 27 17:10:28 2020 2735 ProcessMonitor pm3 ACTIVE Thu Feb 27 17:10:30 2020 2736 Active Alarm Counts: Critical = 0, Major = 0, Minor = 0, Warning = 0, Info = 0 sudo cat crit.log ll /var/lib/columnstore/ |
| Comments |
| Comment by David Hill (Inactive) [ 2020-02-27 ] |
|
Feb 27 17:09:06 ip-172-30-0-250 ProcessMonitor[3230]: 06.243322 |0|0|0| I 18 CAL0000: *********Process Monitor Started********* It looked like ProcMgr had crashed during this time. I turned on the corefile setting and postConfigure NOTE IN THE SECOND RUN, IT DIDNT REUSE THE NUMBER OF COPIES OF 3 I ENTERED THE FIRST TIME AROUND. IT SEEMS THAT SHOULD HAVE BEEN THE DEFAULT ----- Setup Data Redundancy Copy Count Configuration ----- Setup the Number of Copies: This is the total number of copies of the data Enter Number of Copies [2-3] (2) > 3 ----- Starting MariaDB ColumnStore on local server ----- MariaDB ColumnStore successfully started MariaDB ColumnStore Database Platform Starting, please wait ....................................... FAILED IMPORTANT: There was a system startup failed, once issue has been resolved, rerun postConfigure MariaDB ColumnStore System failed to start, check log files in /var/log/mariadb/columnstore BUT NO CORE FILE THIS TIME.. tried shutdown and start and getting this error. Not sure why. I have sshkeys between pm1 and the other nodes [centos@ip-172-30-0-250 ~]$ sudo mcsadmin starts startSystem command, 'columnstore' service is down, sending command to System being started, please wait...ERROR: Connection refused Invalid Password when running 'columnstore start' on module um1, can retry by providing password as the second argument
Invalid Password when running 'columnstore start' on module pm2, can retry by providing password as the second argument
[centos@ip-172-30-0-250 ~]$ sudo mcsadmin getsystemn System Network Configuration Module Name Module Description NIC ID Host Name IP Address [centos@ip-172-30-0-250 ~]$ sudo -i |