pb starting 0.18.1 on a RedHat Entreprise 3 clone box

ged

08-11-2004 19:44:59

Hi,

I'm using a Tao Linux box that I want to be the monitor and the monitored.

I installed the RPMs for 0.18.1 on RH9.

I've followed the intructions/tutorial but cannot get it to run successfully.

I first tried with the default snpmd.conf which was on the box. I would get
[quoted81f9f4dc9]NetMRG starting.
[Dev 0002] Starting device thread at 2004-11-08 161644.
[Dev 0002] SNMP Agent Restart.
[Dev 0002] Performing SNMP Recache.
[Dev 0002] MySQL Update Failed. (INSERT INTO snmp_disk_cache SET dev_id = 2, disk_index = No more variables left in this MIB View (It is past the end of the MIB tree), disk_device = NULL, disk_path = NULL) (You have an error in your SQL syntax. Check the manual that corresponds to your MySQL server version for the right syntax to use near 'more variables left in this MIB View (It is past the end of the)
[Dev 0002] Ending device thread at 2004-11-08 161644, after a duration of 0 seconds.
Thread Ended.[/quoted81f9f4dc9]

I've then regenerated a new snmp.conf file with snmpconf
[quoted81f9f4dc9]rocommunity public
pass .1.3.6.1.4.1.4413.4.1 /usr/bin/ucd5820stat
disk /apps 100000
syslocation somewhere
syscontact myemail
com2sec notConfigUser default public
group notConfigGroup v1 notConfigUser
group notConfigGroup v2c notConfigUser
view systemview included .1.3.6.1.2.1.1
view systemview included .1.3.6.1.2.1.25.1.1
access notConfigGroup "" any noauth exact systemview none none[/quoted81f9f4dc9]

When I run it manually, I get
[netmrg@svcpdweb netmrg]$ /usr/bin/netmrg_cron.sh
/usr/bin/netmrg_cron.sh line 31 12553 Aborted $netmrg >$output 2>$error
[netmrg@svcpdweb netmrg]$

lastrun.err is empty and lastrun.log shows
[quoted81f9f4dc9]NetMRG starting.
[Dev 0002] Starting device thread at 2004-11-08 164222.
[Dev 0002] SNMP Agent Restart.
[Dev 0002] Performing SNMP Recache.
[Dev 0002] Ending device thread at 2004-11-08 164222, after a duration of 0 seconds.
Thread Ended.[/quoted81f9f4dc9]


Suggestions ? ? ?

Thaks,

PS Maybe we could have a better error message than
[netmrg@svcpdweb netmrg]$ /usr/bin/netmrg_cron.sh
/usr/bin/netmrg_cron.sh line 24 /usr/bin/netmrg-gatherer syntax error operand expected (error token is "/usr/bin/netmrg-gatherer")
[netmrg@svcpdweb netmrg]$
When the lockfile is preventing the next run ?

balleman

08-11-2004 21:20:57

Try "view systemview included .1" in your snmpd.conf instead of the two view lines you have there. That might clear up the problem.

For the "Aborted" a backtrace would be helpful. There should be instructions regarding that elsewhere in the forum.

Yes... it is supposed to have a nicer lockfile error than 'operend missing' or whatever. We might try looking into that, but the next major version will be pushing the use of netmrg-gatherer as a daemon, which should make that moot.

ged

09-11-2004 14:10:50

Thanks,

installed the "view systemview ...", and restarted snmpd daemon.
still abort. ran a gdb backtrace

[quote3dfc56b8a2][netmrg@svcpdweb netmrg]$ rm lockfile
[netmrg@svcpdweb netmrg]$ gdb /usr/bin/netmrg-gatherer -sa
gdb unrecognized option `-sa'
Use `gdb --help' for a complete list of options.
[netmrg@svcpdweb netmrg]$ gdb /usr/bin/netmrg-gatherer
GNU gdb Red Hat Linux (6.1post-1.20040607.17rh)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...(no debugging symbols found)...Using host libthread_db library "/lib/tls/libthread_db.so.1".

(gdb) run -sa
Starting program /usr/bin/netmrg-gatherer -sa
(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...[Thread debugging using libthread_db enabled]
[New Thread -1224661568 (LWP 15336)]
(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...(no debugging symbols found)...[L 0032, C 0001] NetMRG starting.
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0064, C 0001] Creating Lockfile.
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0064, C 0257] Initializing SNMP library.
[L 0064, C 0129] Initializing RRDTOOL pipe.
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
Detaching after fork from child process 15339.
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking MySQL
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Unlocking MySQL
[L 0128, C 1024] Mysql Query Succeeded. (SELECT version FROM versioninfo WHERE module = 'Main')
[L 0128, C 1024] Mysql Query Succeeded. (SELECT id FROM devices WHERE disabled=0 ORDER BY id)
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking ActiveThreads
[L 0064, C 0002] [ACTIVE] Last 0, Now 0
[New Thread -1226761296 (LWP 15341)]
[L 0032, C 0004] [Dev 0002] Starting device thread at 2004-11-09 110448.
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Unlocking MySQL
[L 0064, C 0004] [Dev 0002] MySQL connection established.
[L 0128, C 1024] [Dev 0002] Mysql Query Succeeded. (SELECT name, ip, snmp_version, snmp_read_community, snmp_recache_method, snmp_uptime, snmp_ifnumber, snmp_port, snmp_timeout, snmp_retries, no_snmp_uptime_check FROM devices WHERE id=2)
[L 0064, C 0004] [Dev 0002] svcpdweb / <Field Omitted>
[L 0128, C 0256] [Dev 0002] Starting SNMP Session.
[L 0128, C 0256] [Dev 0002] SNMPv2c
[L 0128, C 0256] [Dev 0002] Port 161; Timeout 1000000; Retries 3.
[L 0128, C 0002] Locking SNMP
[L 0128, C 0002] Unlocking SNMP
[L 0128, C 0256] [Dev 0002] SNMP Query (<Field Omitted>, <Field Omitted>, 'system.sysUpTime.0')
[L 0064, C 0256] [Dev 0002] SNMP Uptime is 000002 (16636 centiseconds)
[L 0128, C 1024] [Dev 0002] MySQL Update Succeeded. (UPDATE devices SET snmp_uptime=16636 WHERE id=2)
[L 0128, C 0256] [Dev 0002] SNMP Query (<Field Omitted>, <Field Omitted>, 'interfaces.ifNumber.0')
[L 0064, C 0256] [Dev 0002] Number of Interfaces is 2
[L 0128, C 1024] [Dev 0002] Mysql Query Succeeded. (SELECT count(*) FROM snmp_interface_cache WHERE dev_id = 2)
[L 0128, C 1024] [Dev 0002] Mysql Query Succeeded. (SELECT id, type, name FROM sub_devices WHERE dev_id=2)
[L 0064, C 0008] [Dev 0002] [Sub 0004] Starting Subdevice.
[L 0128, C 1024] [Dev 0002] [Sub 0004] Mysql Query Succeeded. (SELECT name, value FROM sub_dev_variables WHERE type = 'static' AND sub_dev_id = 4)
[L 0128, C 1024] [Dev 0002] [Sub 0004] MySQL Update Succeeded. (DELETE FROM sub_dev_variables WHERE type = 'dynamic' AND sub_dev_id = 4)
[L 0128, C 1024] [Dev 0002] [Sub 0004] MySQL Update Succeeded. (INSERT DELAYED INTO sub_dev_variables SET type = 'dynamic', sub_dev_id = 4, name = 'subdev_name', value = 'System.')
[L 0128, C 1024] [Dev 0002] [Sub 0004] MySQL Update Succeeded. (INSERT DELAYED INTO sub_dev_variables SET type = 'dynamic', sub_dev_id = 4, name = 'snmp_read_community', value = 'public')
[L 0128, C 1024] [Dev 0002] [Sub 0004] MySQL Update Succeeded. (INSERT DELAYED INTO sub_dev_variables SET type = 'dynamic', sub_dev_id = 4, name = 'ip', value = 'svcpdweb.ca.atitech.com')
[L 0128, C 1024] [Dev 0002] [Sub 0004] MySQL Update Succeeded. (INSERT DELAYED INTO sub_dev_variables SET type = 'dynamic', sub_dev_id = 4, name = 'dev_name', value = 'svcpdweb')
[L 0128, C 1024] [Dev 0002] [Sub 0004] Mysql Query Succeeded. (SELECT monitors.data_type, data_types.rrd_type, monitors.min_val, monitors.max_val, monitors.tuned, monitors.test_type, monitors.test_id, monitors.test_params, monitors.last_val, monitors.id, NOW() - monitors.last_time FROM monitors LEFT JOIN data_types ON monitors.data_type=data_types.id WHERE sub_dev_id = 4)
[L 0064, C 0016] [Dev 0002] [Sub 0004] [Mon 0017] Starting Monitor.
[L 0128, C 1024] [Dev 0002] [Sub 0004] [Mon 0017] Mysql Query Succeeded. (SELECT oid, type, subitem FROM tests_snmp WHERE id = 7)
[L 0128, C 0256] [Dev 0002] [Sub 0004] [Mon 0017] SNMP Query (<Field Omitted>, <Field Omitted>, 'enterprises.ucdavis.systemStats.ssCpuRawSystem.0')
[L 0064, C 0016] [Dev 0002] [Sub 0004] [Mon 0017] Value 820412
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0128] [Dev 0002] [Sub 0004] [Mon 0017] RRD 'update /apps/backed_up/netmrg/rrd/mon_17.rrd N820412'
[L 0128, C 0002] Locking RRD
[L 0128, C 0002] Unlocking RRD
[L 0064, C 0016] [Dev 0002] [Sub 0004] [Mon 0017] Value is an integer.
OK u0.00 s0.00 r0.01
[L 0128, C 1024] [Dev 0002] [Sub 0004] [Mon 0017] Mysql Query Succeeded. (SELECT id, trigger_type, last_status, situation, last_triggered, name FROM events WHERE mon_id=17 AND trigger_type = 1)
[L 0128, C 1024] [Dev 0002] [Sub 0004] [Mon 0017] MySQL Update Succeeded. (UPDATE monitors SET tuned=1, last_val=820412, delta_val=13, delta_time=UNIX_TIMESTAMP(NOW())-UNIX_TIMESTAMP(last_time), last_time=NOW(), status=0 WHERE id=17)
[L 0064, C 0016] [Dev 0002] [Sub 0004] [Mon 0018] Starting Monitor.
[L 0128, C 1024] [Dev 0002] [Sub 0004] [Mon 0018] Mysql Query Succeeded. (SELECT oid, type, subitem FROM tests_snmp WHERE id = 5)
[L 0128, C 0256] [Dev 0002] [Sub 0004] [Mon 0018] SNMP Query (<Field Omitted>, <Field Omitted>, 'enterprises.ucdavis.systemStats.ssCpuRawUser.0')
[L 0064, C 0016] [Dev 0002] [Sub 0004] [Mon 0018] Value 8424443
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0128, C 0128] [Dev 0002] [Sub 0004] [Mon 0018] RRD 'update /apps/backed_up/netmrg/rrd/mon_18.rrd N8424443'
[L 0128, C 0002] Locking RRD
[L 0128, C 0002] Unlocking RRD
OK u0.00 s0.00 r0.01

[...]

[L 0128, C 0128] [Dev 0002] [Sub 0004] [Mon 0023] RRD 'update /apps/backed_up/netmrg/rrd/mon_23.rrd N114980'
[L 0128, C 0002] Locking RRD
[L 0128, C 0002] Unlocking RRD
[L 0064, C 0016] [Dev 0002] [Sub 0004] [Mon 0023] Value is an integer.
OK u0.00 s0.01 r0.05
[L 0128, C 1024] [Dev 0002] [Sub 0004] [Mon 0023] Mysql Query Succeeded. (SELECT id, trigger_type, last_status, situation, last_triggered, name FROM events WHERE mon_id=23 AND trigger_type = 1)
[L 0128, C 1024] [Dev 0002] [Sub 0004] [Mon 0023] MySQL Update Succeeded. (UPDATE monitors SET tuned=1, last_val=114980, delta_val=-7256, delta_time=UNIX_TIMESTAMP(NOW())-UNIX_TIMESTAMP(last_time), last_time=NOW(), status=0 WHERE id=23)
[L 0128, C 1024] [Dev 0002] [Sub 0004] [Mon 0023] MySQL Update Succeeded. (UPDATE sub_devices SET status=0 WHERE id=4)
[L 0128, C 1024] [Dev 0002] MySQL Update Succeeded. (UPDATE devices SET status=0, snmp_avoided=0 WHERE id=2)
[L 0128, C 0256] [Dev 0002] Cleaning up SNMP Session.
[L 0064, C 0004] [Dev 0002] Trimming device event log.
[L 0128, C 1024] [Dev 0002] Mysql Query Succeeded. (SELECT id FROM log WHERE dev_id=2 ORDER BY id)
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0032, C 0004] [Dev 0002] Ending device thread at 2004-11-09 110448, after a duration of 0 seconds.
[L 0128, C 0002] Locking ActiveThreads
[L 0128, C 0002] Unlocking ActiveThreads
[L 0128, C 0002] Unlocking ActiveThreads
[L 0064, C 0001] Closed MySQL connection.
[L 0064, C 0001] Runtime 0
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings
[L 0032, C 0002] Thread Ended.
[L 0064, C 0129] Closed RRDTOOL pipe.
[L 0064, C 0257] Cleaned up SNMP.
[L 0128, C 0002] Locking Settings
[L 0128, C 0002] Unlocking Settings

Program received signal SIGABRT, Aborted.
[Switching to Thread -1226761296 (zombie)]
0xb7124cdf in raise () from /lib/tls/libc.so.6
(gdb) bt
#0 0xb7124cdf in raise () from /lib/tls/libc.so.6
#1 0xb71264e5 in abort () from /lib/tls/libc.so.6
#2 0xb7310097 in __cxa_call_unexpected () from /usr/lib/libstdc++.so.5
#3 0xb73100e4 in stdterminate () from /usr/lib/libstdc++.so.5
#4 0xb730ff0c in __gxx_personality_v0 () from /usr/lib/libstdc++.so.5
#5 0xb72395be in _Unwind_RaiseException () from /lib/libgcc_s.so.1
#6 0xb7239657 in _Unwind_ForcedUnwind () from /lib/libgcc_s.so.1
#7 0xb727d2a4 in _Unwind_ForcedUnwind () from /lib/tls/libpthread.so.0
#8 0xb727b4c3 in __pthread_unwind () from /lib/tls/libpthread.so.0
#9 0xb7277ae2 in pthread_exit () from /lib/tls/libpthread.so.0
#10 0x08068bd0 in child ()
#11 0xb7276dec in start_thread () from /lib/tls/libpthread.so.0
#12 0xb71d919a in clone () from /lib/tls/libc.so.6
(gdb) The program is running. Exit anyway? (y or n) y
[netmrg@svcpdweb netmrg]$
[/quote3dfc56b8a2]

ged

09-11-2004 15:13:09

I've fixed that crash by
- removing the 0.18.1 RPM package
- downloading the 0.18.2 sources and compiling/installing those.

Now I do see graphs (empty graphs ( but graphs nonetheless).

silfreed

09-11-2004 15:43:12

Interesting... we'll have to setup a CentOS or WBEL box and see what we can find out. Sorry for the problems; hopefully we can get it worked out.

-Doug