Clearing counter on SAN switches => peak in graph
Hello,
I found the discussion https://forum.xorux.com/discussion/83/clearing-crc-error-counter-on-switch-leads-to-huge-numbers-in-crc-error-graph but it does not reach a usable result.
Clearing the counters affects many of the charts, and repairing them is manual work.
I think it would be possible to check for a counter reset before writing data to the rrd file, which would resolve the problem.
To do that, it is necessary to know the expected maximum value and the previous value.
Regards,
Libor Vejpustek
Comments
-
Hello,
there is no simple way to identify a reset of the counter.
How would you do it when the counter might also overflow its maximum, i.e. the next value might be lower than the previous one?
BTW, this is not a problem if you use Brocade Network Advisor as the data source, because there we do not save data as counters.
-
If you know the last value, the previous value and the maximum value, you can detect a counter reset:
(max_value - last_value)/max_value > max_value/100 => counter reset => update the rrd with value U (undefined).
This eliminates the peak in an rrd file of type COUNTER.
A second variant is to use the DERIVE type in rrd.
A counter reset always means some data is missing.
It's possible to test these 2 alternatives in real usage and compare the resulting graphs with the current graph.
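A minimal sketch of this kind of check, written as a bash helper. It does not reproduce the inequality above literally. Instead it assumes a hypothetical threshold, max_step, for the largest increase considered plausible within one step: a drop in the raw value is accepted as a wrap only if the implied increase stays below that threshold, otherwise U is returned. The function name and all numbers are made up for illustration.

```shell
#!/bin/bash
# Hypothetical helper: decide what to pass to "rrdtool update" for a COUNTER.
# Arguments: previous raw value, current raw value, counter maximum,
# largest increase considered plausible within one step (an assumption).
counter_or_unknown() {
    local prev=$1 cur=$2 max=$3 max_step=$4
    if (( cur >= prev )); then
        echo "$cur"            # normal growth, store the value as is
        return
    fi
    # The value went down: the counter either wrapped or was cleared.
    # Increase implied by a wrap at max (max + 1 possible values).
    local wrapped=$(( max + 1 - prev + cur ))
    if (( wrapped <= max_step )); then
        echo "$cur"            # plausible wrap, leave it to rrdtool
    else
        echo "U"               # implausible jump, assume a reset
    fi
}

counter_or_unknown 62322 72322 4294967295 1000000        # prints 72322
counter_or_unknown 4294967000 500 4294967295 1000000     # prints 500
counter_or_unknown 72322 2322 4294967295 1000000         # prints U
```

The returned value would take the place of the raw counter in the update call, e.g. rrdtool update datafile.rrd N:$value. The threshold has to be chosen per port speed, since a counter that was cleared shortly before a natural wrap cannot be told apart by this test.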
Regards,
Libor Vejpustek
-
Well, the solution is to use another type, gauge or derive.
We do not use counter types anymore; for new projects we always translate the data to gauge before saving it in rrdtool (based on knowledge of the actual and previous value).
By far the biggest problem here is backward compatibility.
Switching to a different data type in rrdtool would mean losing the complete history.
If you use BNA as the data source for SAN data, we already use only gauge there and this problem cannot occur.
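For illustration only, a translation from two raw counter samples to a gauge value might look like the bash sketch below. It is not the product's actual code; the function name, the integer division and the choice to report every decrease as U are assumptions made for this example.

```shell
#!/bin/bash
# Hypothetical helper: turn two raw counter samples into a per-second rate
# that can be stored in a GAUGE data source.
# Arguments: previous raw value, current raw value, seconds between samples.
to_gauge() {
    local prev=$1 cur=$2 seconds=$3
    # A decrease means a reset or a wrap; this sketch reports both as unknown.
    if (( cur < prev || seconds <= 0 )); then
        echo "U"
        return
    fi
    echo $(( (cur - prev) / seconds ))   # integer rate per second
}

to_gauge 1000 7000 60      # prints 100
to_gauge 72322 2322 60     # prints U
```

Treating every decrease as unknown costs one sample at a genuine wrap, but it never produces a false peak.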
-
Updating with value U after a counter reset eliminates the peak problem with the COUNTER type.
Can we talk in Czech? It is easier for me.
Regards,
Libor Vejpustek
-
Try this script:
#!/bin/bash
# Write one sample to both data sources, then wait one step.
# $1 goes to the COUNTER data source, $2 to the DERIVE data source.
function zapis {
    echo "$(date) $1 # $2"
    rrdtool update datafile.rrd N:${1}:${2}
    sleep 60
}

cd /root/
mv datafile.rrd datafile_old.rrd
rrdtool create datafile.rrd --step 60 --start N DS:packets:COUNTER:120:0:1000000000 DS:packets2:DERIVE:120:0:1000000000 RRA:AVERAGE:0.5:1:60 RRA:AVERAGE:0.5:4:60 RRA:AVERAGE:0.5:24:60
#sleep 60
zapis 10 10
zapis 12322 12322
zapis 22322 22322
zapis 32322 32322
zapis 35322 42322
zapis 42322 52322
zapis 52322 62322
zapis 62322 72322
zapis 2322 2322
zapis 99333 99333
zapis 10 10
zapis 1000 1000
zapis 2000 2000
zapis 3000 3000
zapis 4000 4000
zapis 5000 5000
zapis 6000 6000
zapis 7000 7000
zapis 8000 8000
# This line eliminates the peak in the rrd file.
#zapis U 0
zapis 1000 1000
zapis 2000 2000
zapis 3000 3000
zapis U 0
zapis 1000 1000
zapis 2000 2000
zapis 3000 3000
zapis U 100
zapis 1000 1000
zapis 2000 2000
zapis 3000 3000
-
That looks nice.
We can certainly at least try it where we still have counters for historical reasons.
We will not get to it right now, though, as there are no resources, so I have at least put it on the todo list.
Thanks!
-
To try it out, it would be good to store the data in parallel into 2 files of the same COUNTER type and, once a false peak occurs, compare which file has the better results.
We tried deploying the monitoring on SAN switches, and after the counters were reset the peak showed up on one port but not on the other.
Maybe I could find a moment and test it myself; I would just have to dig through the source code.
Have a nice day.
Libor Vejpustek
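One way to start the comparison suggested above is to look at the highest stored value per data source in the text output of rrdtool fetch. The bash sketch below assumes that output format (a header line with the data source names, then lines of the form "timestamp: value value"); the helper name and the sample lines are invented for illustration and are not measurements.

```shell
#!/bin/bash
# Hypothetical helper: read "rrdtool fetch" text output on stdin and print
# the largest known value in one data column (1 = first data source).
# Prints U when the column holds no known value at all.
peak_of_column() {
    awk -v col="$1" '
        $1 ~ /^[0-9]+:$/ {               # data rows start with "timestamp:"
            v = $(col + 1)
            if (v ~ /nan/) next          # skip unknown samples (nan, -nan)
            if (!seen || v + 0 > max) { max = v + 0; seen = 1 }
        }
        END { if (seen) print max; else print "U" }
    '
}

# Made-up sample in the shape of "rrdtool fetch datafile.rrd AVERAGE" output.
sample='                 packets            packets2

1700000060: 1.0000000000e+02 1.0000000000e+02
1700000120: 7.1000000000e+07 -nan
1700000180: nan nan'

printf '%s\n' "$sample" | peak_of_column 1    # prints 71000000
printf '%s\n' "$sample" | peak_of_column 2    # prints 100
```

Against a real file the input would come from rrdtool fetch datafile.rrd AVERAGE piped into the helper, once per file or data source being compared.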