Error in Job Log SND_SYSSTS
Comments
-
Hi,
errno = 3447 Error while connecting for error socket
It looks like your agent cannot communicate with the LPAR2RRD server.
Have you set the lpar2rrd server hostname/IP correctly?
As per the documentation:
1.
PING RMTSYS('LPAR2RRD-SERVER.SERVER.COM')
Verifying connection to host system LPAR2RRD-SERVER.SERVER.COM at address 192.168.1.2.
Connection verification statistics: 5 of 5 successful (100 %).
2.
CHGCMDDFT CMD(LPAR2RRD/RTV_SYSSTS) NEWDFT('IPADR(LPAR2RRD-SERVER.SERVER.COM)')
Example using an IP address instead of a hostname:
CHGCMDDFT CMD(LPAR2RRD/RTV_SYSSTS) NEWDFT('IPADR(''192.168.1.2'')')
3.
Is port 8162 open? agent --> lpar2rrd server:8162
4.
Is the lpar2rrd server running?
On the lpar2rrd server: ps -ef | grep lpar2rrd-daemon
5.
Is there anything related in the lpar2rrd daemon log?
On the lpar2rrd server:
tail logs/error.log-daemon
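For step 3, the port check can also be run from any other machine that has Python, to compare its behaviour with the IBM i agent. This is only a minimal sketch: the function name probe_tcp is made up for illustration, and the host and port in the usage comment are the example values from this thread. It separates a fast refusal from a timeout, which usually points to filtering or a routing problem.

```python
import socket


def probe_tcp(host, port, timeout=5.0):
    """Try one TCP connection and report how it ended.

    Returns "open", "refused", "timeout", or "error: <details>".
    probe_tcp is a hypothetical helper, not part of LPAR2RRD.
    """
    try:
        # create_connection resolves the name and performs the TCP handshake
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except socket.timeout:
        # No answer within the timeout: packets are dropped or never routed
        return "timeout"
    except ConnectionRefusedError:
        # The host answered, but nothing is listening on that port
        return "refused"
    except OSError as exc:
        # Name resolution failure, no route to host, and similar errors
        return f"error: {exc}"


# Example call (values taken from this thread):
# print(probe_tcp("192.168.1.2", 8162))
```

A result of "open" from another host while the agent still fails suggests the problem is on the path from the IBM i system, not on the lpar2rrd server.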
-
How can I restart or refresh the connection? I see the server IP in CLOSE_WAIT status, and the lpar2rrd daemon process is running.
tcp 0 0 0.0.0.0:8162 0.0.0.0:* LISTEN
tcp 0 0 172.19.149.175:8162 172.19.131.98:64073 CLOSE_WAIT
lpar2rrd 10383 1 0 2016 ? 00:24:00 /usr/bin/perl -w /home/lpar2rrd/lpar2rrd/bin/lpar2rrd-daemon.pl
root 27784 27406 0 14:17 pts/1 00:00:00 grep --color=auto lpar2rrd-daemon
-
kill 10383
su - lpar2rrd
cd lpar2rrd
./load.sh
load.sh starts the lpar2rrd-daemon process when it runs.
What LPAR2RRD server version do you use?
-
4.95
-
I killed 10383, ran ./load.sh, and restarted the agent on IBM i, but the error still exists.
Er: C_SNDSTS - connect() errno = 3447
lpar2rrd@lpar2rrd:~/lpar2rrd> ps -ef| grep lpar2rrd-daemon
lpar2rrd 32507 31442 0 15:17 pts/1 00:00:00 grep --color=auto lpar2rrd-daemon
Now I can't see any connection to port 8162.
-
tcp 0 0 0.0.0.0:8162 0.0.0.0:* LISTEN
tcp 0 0 172.19.149.175:8162 172.19.148.56:11102 ESTABLISHED
tcp 0 0 172.19.149.175:8162 172.16.12.5:51926 ESTABLISHED
tcp 0 0 172.19.149.175:8162 172.15.50.4:58023 ESTABLISHED
tcp 0 0 172.19.149.175:8162 172.15.50.30:23388 ESTABLISHED
-
This is OK, it is working fine; there can be 5 concurrent sessions.
1. What about your agent: has it ever worked, or is this a new installation?
2. Is there anything related in logs/error.log-daemon?
-
It was working until last week; it is not a new installation.
Last weekend I performed an IPL on the Power system.
-
Can you see any communication from the IBM i agent on the lpar2rrd server?
(change the IP and the interface "en1")
tcpdump -ni en1 host <IBM i OS agent IP>
Leave it running for at least 30 minutes.
Send us logs/error.log-daemon if there was any communication:
https://upload.lpar2rrd.com
Are you sure the lpar2rrd hostname is set correctly in the IBM i client?
-
How can I verify on the server the correct hostname of the IBM i agent?
-
Use this command to set it up while the agent is not running:
1. Hostname example
CHGCMDDFT CMD(LPAR2RRD/RTV_SYSSTS) NEWDFT('IPADR(LPAR2RRD-SERVER.SERVER.COM)')
2. IP example
CHGCMDDFT CMD(LPAR2RRD/RTV_SYSSTS) NEWDFT('IPADR(''192.168.1.2'')')
-
I ran this command and left it running for more than 30 minutes:
tcpdump -v -ni eth0 host <ip agent ibm i> > outfile.txt
There is no data in outfile.txt.
Afterwards I saw this in error.log-daemon:
lpar2rrd:/home/lpar2rrd/lpar2rrd/logs # more error.log-daemon
Use of uninitialized value $mins in multiplication (*) at /home/lpar2rrd/lpar2rrd/bin/lpar2rrd-daemon.pl line 4217.
Use of uninitialized value $secs in addition (+) at /home/lpar2rrd/lpar2rrd/bin/lpar2rrd-daemon.pl line 4217.
(the same two warnings repeat for the rest of the output)
What can I do?
-
If no packets were captured, the agent is not sending any data to the lpar2rrd server, which confirms that there is a connection error on the agent side.
(assuming eth0 is the right interface for communication with the agent)
Make sure the lpar2rrd hostname is OK:
ADDLIBLE LPAR2RRD
GO MENU --> option 50 --> SND_SYSSTS line--> type 5--> type 10 --> F10
You should then see the lpar2rrd server hostname (IP) and port (8162), as in the attached screenshot.
-
CALL PGM(C_SNDSTS) PARM('172.19.149.175' '8162' X'0001D740404040C9C4F0F0F0C340404040F1F7F24BF1F94BF1F4F94BF1F7F5' 'PH_DTAQRCV*LIBL' 'PH_DTAQSND*LIBL' 'PH_USRSPC *LIBL' '*UTF8' 1 'N' '/home/LPAR2RRD')
IF: C_SNDSTS - protocol 5.1 UTF8 conv
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
Er: C_SNDSTS - connect() errno = 3447
This is the correct server IP and the correct port.
-
Try this command: does it fail in 5 seconds, or does it time out in 10 minutes?
TELNET RMTSYS('172.19.149.175') PORT(8163) SSL(*NO) INZWAIT(5)
-
If the above command hangs (it times out in 10 minutes), that is OK; you can interrupt it with:
SysRq ➡ Enter ➡ 2. End previous request
-
Morning Pavel.
I ran the TELNET and it failed in 5 seconds.
Message . . . . : No response was received from the remote host system within the open wait time.
-
Oops, wrong port number in the example; this is the right one:
TELNET RMTSYS('172.19.149.175') PORT(8162) SSL(*NO) INZWAIT(5)
-
Same failure.
-
OK, then the network connection is not allowed.
That is what the initial error says; it is now confirmed with the system tool (telnet):
Er: C_SNDSTS - connect() errno = 3447
Sorry, we cannot help you more here; something is blocking that communication, that is all.
Try that telnet from another machine that works; you will see different behaviour.
-
From another LPAR with the same version on the same Power system it works successfully.
Can you help me determine the problem? I can provide logs or anything else that helps identify the issue.
-
Verify with your network/security department whether this communication is allowed:
<your IBM i host> --> 172.19.149.175:8162 (TCP)
-
I am validating this.
Is this correct on the server?
lpar2rrd:/home/lpar2rrd/lpar2rrd/logs # netstat -an |grep 8162
tcp 0 0 0.0.0.0:8162 0.0.0.0:* LISTEN
tcp 0 0 172.19.149.175:8162 172.19.131.6:65267 CLOSE_WAIT
-
Well, there was a connection from 172.19.131.6 which is now waiting for TCP close.
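CLOSE_WAIT means the remote side has already closed the connection and the local process has not closed its socket yet. To see at a glance what the daemon port is doing, the netstat output can be summarized by state. This is a minimal sketch, assuming Linux-style `netstat -an` lines in the layout pasted in this thread; the function name summarize_port_states is made up for illustration.

```python
from collections import Counter


def summarize_port_states(netstat_text, port=8162):
    """Count TCP states for sockets whose local address ends in :port.

    Assumes the column layout: proto recv-q send-q local foreign state.
    summarize_port_states is a hypothetical helper, not part of LPAR2RRD.
    """
    states = Counter()
    for line in netstat_text.splitlines():
        fields = line.split()
        # Skip headers, UDP lines, and anything without a state column
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue
        local, state = fields[3], fields[5]
        # Compare only the port part after the last colon
        if local.rsplit(":", 1)[-1] == str(port):
            states[state] += 1
    return dict(states)


sample = """\
tcp 0 0 0.0.0.0:8162 0.0.0.0:* LISTEN
tcp 0 0 172.19.149.175:8162 172.19.131.6:65267 CLOSE_WAIT
"""
print(summarize_port_states(sample))
```

One LISTEN entry shows the daemon is accepting connections; the sketch does not tell you whether packets from a given agent ever arrive.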
-
I checked with the security department and there is no block for the user or the IP address. At the network level the packet or request does not reach the default gateway; in other words, it does not leave the OS/400 system.
-
If you run telnet as described above, you should see at least one TCP SYN packet on the default gateway. This is not about our agent.
You might have a wrong route on IBM i, be using the wrong interface, or something on IBM i is filtering it. I am not an IBM i expert, so I cannot advise more.
As soon as telnet works, our agent will start working too.
Try to trace it somehow on IBM i ...