Importing Cluster - Job Failed - Wrong node list
Hi guys,
I'm trying to import an existing Galera Cluster into ClusterControl, but the job is failing at the stage where it gets the list of nodes in the cluster.
The process is picking up the wrong node name/IP, as you can see in the logs:
- [12:27:48]: Adding existing MySQL cluster.
- [12:27:48]: 10.0.1.4: Checking ssh/sudo.
- [12:27:49]: 10.0.1.4: Access with ssh/sudo granted.
- [12:27:49]: 10.0.1.4:3306: Verifying the MySQL user/password.
- [12:40:20]: 10.0.1.4:3306: Getting node list from the MySQL server.
- [12:40:20]: /etc/profile.d/motd.sh: Checking wsrep_node_address.
- [12:40:20]: /etc/profile.d/motd.sh: Couldn't get wsrep_node_address status variable: ''
- [12:40:20]: Found node: '/etc/profile.d/motd.sh'
- [12:40:20]: Found in total 1 nodes.
- [12:40:20]: Checking that nodes are not in another cluster.
- [12:40:20]: /etc/profile.d/motd.sh: Checking ssh/sudo.
- [12:40:20]: /etc/profile.d/motd.sh: Libssh connect error: Failed to resolve hostname /etc/profile.d/motd.sh (Name or service not known).
This motd.sh file is an SSH welcome script.
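For reference, it's a small /etc/profile.d banner script, roughly like this (a simplified reconstruction, not the exact file; the command names are just illustrative):
#!/bin/bash
# /etc/profile.d/motd.sh - prints a welcome banner on login
# (simplified; the real script calls a couple of external tools by bare name)
figlet "$(hostname -s)"
echo "Welcome to $(hostname -f). Authorized access only."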
Logged in on the cluster node, I can't find that value in any status variable:
...
| wsrep_incoming_addresses | 10.0.1.4:3306 |
| wsrep_desync_count | 0 |
| wsrep_evs_delayed | |
| wsrep_evs_evict_list | |
| wsrep_evs_repl_latency | 0/0/0/0/0 |
| wsrep_evs_state | OPERATIONAL |
| wsrep_gcomm_uuid | 7ed839b7-d383-11e7-83d1-5f0879bdd20a |
| wsrep_cluster_conf_id | 1 |
| wsrep_cluster_size | 1 |
| wsrep_cluster_state_uuid | 7ed88466-d383-11e7-a904-6b68dd0dae89 |
| wsrep_cluster_status | Primary |
| wsrep_connected | ON |
| wsrep_local_bf_aborts | 0 |
| wsrep_local_index | 0 |
| wsrep_provider_name | Galera |
| wsrep_provider_vendor | Codership Oy <info@codership.com> |
| wsrep_provider_version | 3.21(r8678538) |
| wsrep_ready | ON |
+------------------------------+--------------------------------------+
...
My my.cnf file has:
wsrep_node_address=10.0.1.4
wsrep_node_name=percona-node-1511794714
My /etc/hosts file looks ok:
127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
If I remove the script from the server, everything works fine, but I lose my welcome message.
Any suggestions on how to debug this?
Thanks in advance,
Francisco Andrade
Official comment
Hi Francisco,
Thanks for reporting this. The OS user must be configured with a proper PATH environment variable, as expected for any root/sudo user. The following PATH is expected:
PATH=/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/home/user/.local/bin:/home/user/bin
Details at https://severalnines.com/docs/requirements.html#operating-system-user
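As a quick check (the user and IP below are just the ones from your post; adjust them to your environment), compare the PATH seen by a non-interactive SSH session with the PATH of a full login shell:
# PATH seen by a non-interactive SSH session (roughly what ClusterControl gets):
ssh user@10.0.1.4 'echo $PATH'
# PATH seen by a login shell on the same node, for comparison:
ssh user@10.0.1.4 'bash -lc "echo \$PATH"'
If the first one is missing the directories that motd.sh relies on, the script will fail exactly as it does in your job log.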
Regards,
Ashraf
I've just had some help debugging this and discovered that the problem is that the process executes the motd.sh script, which fails when ClusterControl runs it but does not fail when it is run manually.
It fails because the SSH session doesn't have the user's environment variables.
So the script tries to run a shell command and fails because it can't find the command.
When I add the full path to the command, the problem is solved.
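For anyone hitting the same thing, the change was essentially this (simplified; the command name is just an example):
# Before: breaks over non-interactive SSH because the command isn't on the minimal PATH
#figlet "$(hostname -s)"
# After: absolute path works both interactively and when ClusterControl connects
/usr/bin/figlet "$(hostname -s)"
# Alternative: skip the banner when no terminal is attached
# (the script is sourced by /etc/profile, so return is valid here)
#tty -s || return 0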
Thanks,
Francisco Andrade