lists.netmrg.net lists.netmrg.net
NetMRG discussion and support
 
FAQ :: Search :: Memberlist :: Usergroups :: Register
Profile :: Log in to check your private messages :: Log in

Lots of crashes in Solaris

 
Post new topic   Reply to topic    lists.netmrg.net Forum Index -> Development
Author Message
cmsmith
Helper


Joined: 15 Feb 2005
Posts: 14

PostPosted: Tue May 17, 2005 7:17 pm    Post subject: Lots of crashes in Solaris Reply with quote

Running on Solaris/SPARC, getting frequent crashes, that *usually* do not happen twice in a row - meaning, I run it once, it crashes, change nothing, run again, run successfully.

Here is some GDB backtraces from the core-dump, anywhere you <...snip...> I've removed data to protect the ... innocent?

Let me know if this is helpful or not and if you need more info.

Code:

(gdb) core-file core-20050517123503
Core was generated by `/u01/app/netmrg/bin/netmrg-gatherer -a -C /u01/app/netmrg/etc/netmrg.xml'.
Program terminated with signal 10, Bus error.
Reading symbols from /usr/local/lib/libnetsnmp.so.5...done.
Loaded symbols for /usr/local/lib/libnetsnmp.so.5
Reading symbols from /usr/local/ssl/lib/libcrypto.so.0.9.7...done.
Loaded symbols for /usr/local/ssl/lib/libcrypto.so.0.9.7
Reading symbols from /usr/lib/librt.so.1...done.
Loaded symbols for /usr/lib/librt.so.1
Reading symbols from /u01/app/mysql/lib/mysql/libmysqlclient.so.12...done.
Loaded symbols for /u01/app/mysql/lib/mysql/libmysqlclient.so.12
Reading symbols from /usr/local/lib/libxml2.so.2...done.
Loaded symbols for /usr/local/lib/libxml2.so.2
Reading symbols from /usr/local/lib/libstdc++.so.6...done.
Loaded symbols for /usr/local/lib/libstdc++.so.6
Reading symbols from /usr/lib/libpthread.so.1...done.
Loaded symbols for /usr/lib/libpthread.so.1
Reading symbols from /usr/local/lib/libz.so...done.
Loaded symbols for /usr/local/lib/libz.so
Reading symbols from /usr/lib/libdl.so.1...done.
Loaded symbols for /usr/lib/libdl.so.1
Reading symbols from /usr/lib/libintl.so.1...warning: Lowest section in /usr/lib/libintl.so.1 is .hash at 00000074
done.
Loaded symbols for /usr/lib/libintl.so.1
Reading symbols from /usr/lib/libkstat.so.1...done.
Loaded symbols for /usr/lib/libkstat.so.1
Reading symbols from /usr/lib/libm.so.1...done.
Loaded symbols for /usr/lib/libm.so.1
Reading symbols from /usr/local/lib/libgcc_s.so.1...done.
Loaded symbols for /usr/local/lib/libgcc_s.so.1
Reading symbols from /usr/lib/libc.so.1...done.
Loaded symbols for /usr/lib/libc.so.1
Reading symbols from /usr/lib/libsocket.so.1...done.
Loaded symbols for /usr/lib/libsocket.so.1
Reading symbols from /usr/lib/libnsl.so.1...done.
Loaded symbols for /usr/lib/libnsl.so.1
Reading symbols from /usr/lib/libaio.so.1...done.
Loaded symbols for /usr/lib/libaio.so.1
Reading symbols from /usr/lib/libcrypt_i.so.1...done.
Loaded symbols for /usr/lib/libcrypt_i.so.1
Reading symbols from /usr/lib/libgen.so.1...done.
Loaded symbols for /usr/lib/libgen.so.1
Reading symbols from /usr/local/lib/libiconv.so.2...done.
Loaded symbols for /usr/local/lib/libiconv.so.2
Reading symbols from /usr/lib/libmp.so.2...done.
Loaded symbols for /usr/lib/libmp.so.2
Reading symbols from /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1...done.
Loaded symbols for /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1
Reading symbols from /usr/lib/libthread.so.1...done.
Loaded symbols for /usr/lib/libthread.so.1
Reading symbols from /usr/lib/nss_files.so.1...done.
Loaded symbols for /usr/lib/nss_files.so.1
#0  std::string::append(std::string const&) (this=0xfe509618, __str=@0x27bf08) at basic_string.h:265
265     basic_string.h: No such file or directory.
        in basic_string.h
(gdb) bt
#0  std::string::append(std::string const&) (this=0xfe509618, __str=@0x27bf08) at basic_string.h:265
#1  0x0003b55c in std::basic_string<char, std::char_traits<char>, std::allocator<char> > std::operator+<char, std::char_traits<char>, std::allocator<char> >(std::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) (__lhs=@0xfe509610, __rhs=@0x27bf08)
    at bits/basic_string.h:1985
#2  0x00015608 in process_sub_device(DeviceInfo, st_mysql*) (info=
              {device_id = 440796, subdevice_id = 4, monitor_id = 440796, event_id = -17491376, response_id = 2819380, status = 2672430, mysql = 0x21, snmp_sess_p = 0x7efefeff, snmp_avoid = 2819132, snmp_recache = 65280, snmp_ifnumber = 4266694256, snmp_uptime = 1901175863750292, subdevice_type = 2224364, test_type = 2325260, test_id = -28273016, name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xff248b80 "@"}}, ip = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x6bb44 "INSE"}}, subdevice_name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfe509708 ""}}, test_params = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x40df8 "] "}}, curr_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xf3333333 <Address 0xf3333333 out of bounds>}}, last_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x4 <Address 0x4 out of bounds>}}, delta_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfe509780 ""}}, rate_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfe5096d0 ""}}, delta_time = 404480, snmp_version = 0, snmp_read_community = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x29c758 "<snip>"}}, snmp_timeout = 4266694496, snmp_retries = 4294967295, snmp_port = 65104, parameters = {<_List_base<ValuePair,std::allocator<ValuePair> >> = {_M_impl = {<allocator<std::_List_node<ValuePair> >> = {<new_allocator<std::_List_node<ValuePair> >> = {<No data fields>}, <No data fields>}, _M_node = {_M_next = 0xfe509a98, _M_prev = 0xfe509768}}}, <No data fields>}}, mysql=0x27bf08) at bits/stl_list.h:134


Code:

(gdb) core-file core-20050517114004
Core was generated by `/u01/app/netmrg/bin/netmrg-gatherer -a -C /u01/app/netmrg/etc/netmrg.xml'.
Program terminated with signal 10, Bus error.
Loaded symbols for /usr/local/lib/libnetsnmp.so.5
Loaded symbols for /usr/local/ssl/lib/libcrypto.so.0.9.7
Loaded symbols for /usr/lib/librt.so.1
Reading symbols from /u01/app/mysql/lib/mysql/libmysqlclient.so.12...done.
Loaded symbols for /u01/app/mysql/lib/mysql/libmysqlclient.so.12
Loaded symbols for /usr/local/lib/libxml2.so.2
Loaded symbols for /usr/local/lib/libstdc++.so.6
Loaded symbols for /usr/lib/libpthread.so.1
Loaded symbols for /usr/local/lib/libz.so
Loaded symbols for /usr/lib/libdl.so.1
Loaded symbols for /usr/lib/libintl.so.1
Loaded symbols for /usr/lib/libkstat.so.1
Loaded symbols for /usr/lib/libm.so.1
Loaded symbols for /usr/local/lib/libgcc_s.so.1
Loaded symbols for /usr/lib/libc.so.1
Loaded symbols for /usr/lib/libsocket.so.1
Loaded symbols for /usr/lib/libnsl.so.1
Loaded symbols for /usr/lib/libaio.so.1
Loaded symbols for /usr/lib/libcrypt_i.so.1
Loaded symbols for /usr/lib/libgen.so.1
Loaded symbols for /usr/local/lib/libiconv.so.2
Loaded symbols for /usr/lib/libmp.so.2
Loaded symbols for /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1
Loaded symbols for /usr/lib/libthread.so.1
Loaded symbols for /usr/lib/nss_files.so.1
#0  basic_string (this=0x2c9400, __str=@0x232306) at basic_string.h:182
182     in basic_string.h
(gdb) bt
#0  basic_string (this=0x2c9400, __str=@0x232306) at basic_string.h:182
#1  0x0003c134 in DeviceInfo (this=0xfe4076d0, _ctor_arg=@0xfe40788c) at devices.cpp:28
#2  0x00017060 in process_sub_devices(DeviceInfo, st_mysql*) (info=
              {device_id = 5, subdevice_id = 410, monitor_id = -1, event_id = -1, response_id = -1, status = 0, mysql = 0xfe407a98, snmp_sess_p = 0x27f510, snmp_avoid = 0, snmp_recache = 0, snmp_ifnumber = 8, snmp_uptime = 137181796, subdevice_type = 2, test_type = -1, test_id = -1, name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x23464c "<...snip...>"}}, ip = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x22550c "<...snip...>"}}, subdevice_name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x2c9e04 "<...snip...>"}}, test_params = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x62f0c ""}}, curr_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27f2d4 "U"}}, last_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27f274 "U"}}, delta_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27fa1c "U"}}, rate_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27fa34 "U"}}, delta_time = 0, snmp_version = 2, snmp_read_community = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x2306a4 "<...snip...>"}}, snmp_timeout = 1000000, snmp_retries = 3, snmp_port = 161, parameters = {<_List_base<ValuePair,std::allocator<ValuePair> >> = {_M_impl = {<allocator<std::_List_node<ValuePair> >> = {<new_allocator<std::_List_node<ValuePair> >> = {<No data fields>}, <No data fields>}, _M_node = {_M_next = 0xfe407754, _M_prev = 0xfe407754}}}, <No data fields>}}, mysql=0xfe40788c) at devices.cpp:175


Code:

(gdb) core-file core-20050517104504
Core was generated by `/u01/app/netmrg/bin/netmrg-gatherer -a -C /u01/app/netmrg/etc/netmrg.xml'.
Program terminated with signal 10, Bus error.
Loaded symbols for /usr/local/lib/libnetsnmp.so.5
Loaded symbols for /usr/local/ssl/lib/libcrypto.so.0.9.7
Loaded symbols for /usr/lib/librt.so.1
Reading symbols from /u01/app/mysql/lib/mysql/libmysqlclient.so.12...done.
Loaded symbols for /u01/app/mysql/lib/mysql/libmysqlclient.so.12
Loaded symbols for /usr/local/lib/libxml2.so.2
Loaded symbols for /usr/local/lib/libstdc++.so.6
Loaded symbols for /usr/lib/libpthread.so.1
Loaded symbols for /usr/local/lib/libz.so
Loaded symbols for /usr/lib/libdl.so.1
Loaded symbols for /usr/lib/libintl.so.1
Loaded symbols for /usr/lib/libkstat.so.1
Loaded symbols for /usr/lib/libm.so.1
Loaded symbols for /usr/local/lib/libgcc_s.so.1
Loaded symbols for /usr/lib/libc.so.1
Loaded symbols for /usr/lib/libsocket.so.1
Loaded symbols for /usr/lib/libnsl.so.1
Loaded symbols for /usr/lib/libaio.so.1
Loaded symbols for /usr/lib/libcrypt_i.so.1
Loaded symbols for /usr/lib/libgen.so.1
Loaded symbols for /usr/local/lib/libiconv.so.2
Loaded symbols for /usr/lib/libmp.so.2
Loaded symbols for /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1
Loaded symbols for /usr/lib/libthread.so.1
Loaded symbols for /usr/lib/nss_files.so.1
#0  basic_string (this=0x2d7fa8, __str=@0x21f52e) at basic_string.h:182
182     in basic_string.h
(gdb) bt
#0  basic_string (this=0x2d7fa8, __str=@0x21f52e) at basic_string.h:182
#1  0x0003c134 in DeviceInfo (this=0xfe5096d0, _ctor_arg=@0xfe50988c) at devices.cpp:28
#2  0x00017060 in process_sub_devices(DeviceInfo, st_mysql*) (info=
              {device_id = 4, subdevice_id = 175, monitor_id = -1, event_id = -1, response_id = -1, status = 0, mysql = 0xfe509a98, snmp_sess_p = 0x27f208, snmp_avoid = 0, snmp_recache = 0, snmp_ifnumber = 100, snmp_uptime = 979107209, subdevice_type = 2, test_type = -1, test_id = -1, name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x229fc4 "<...snip...>"}}, ip = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x229de4 "<snip>"}}, subdevice_name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x186294 "<...snip...>"}}, test_params = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x62f0c ""}}, curr_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27d894 "U"}}, last_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27e28c "U"}}, delta_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27bd14 "U"}}, rate_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x27c164 "U"}}, delta_time = 0, snmp_version = 2, snmp_read_community = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x21f6cc "<...snip...>"}}, snmp_timeout = 1000000, snmp_retries = 3, snmp_port = 161, parameters = {<_List_base<ValuePair,std::allocator<ValuePair> >> = {_M_impl = {<allocator<std::_List_node<ValuePair> >> = {<new_allocator<std::_List_node<ValuePair> >> = {<No data fields>}, <No data fields>}, _M_node = {_M_next = 0x2bb768, _M_prev = 0x2d7fd0}}}, <No data fields>}}, mysql=0xfe50988c) at devices.cpp:175


Code:

(gdb) core-file core-20050517092026
Core was generated by `/u01/app/netmrg/bin/netmrg-gatherer -a -C /u01/app/netmrg/etc/netmrg.xml'.
Program terminated with signal 10, Bus error.
Loaded symbols for /usr/local/lib/libnetsnmp.so.5
Loaded symbols for /usr/local/ssl/lib/libcrypto.so.0.9.7
Loaded symbols for /usr/lib/librt.so.1
Reading symbols from /u01/app/mysql/lib/mysql/libmysqlclient.so.12...done.
Loaded symbols for /u01/app/mysql/lib/mysql/libmysqlclient.so.12
Loaded symbols for /usr/local/lib/libxml2.so.2
Loaded symbols for /usr/local/lib/libstdc++.so.6
Loaded symbols for /usr/lib/libpthread.so.1
Loaded symbols for /usr/local/lib/libz.so
Loaded symbols for /usr/lib/libdl.so.1
Loaded symbols for /usr/lib/libintl.so.1
Loaded symbols for /usr/lib/libkstat.so.1
Loaded symbols for /usr/lib/libm.so.1
Loaded symbols for /usr/local/lib/libgcc_s.so.1
Loaded symbols for /usr/lib/libc.so.1
Loaded symbols for /usr/lib/libsocket.so.1
Loaded symbols for /usr/lib/libnsl.so.1
Loaded symbols for /usr/lib/libaio.so.1
Loaded symbols for /usr/lib/libcrypt_i.so.1
Loaded symbols for /usr/lib/libgen.so.1
Loaded symbols for /usr/local/lib/libiconv.so.2
Loaded symbols for /usr/lib/libmp.so.2
Loaded symbols for /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1
Loaded symbols for /usr/lib/libthread.so.1
Loaded symbols for /usr/lib/nss_files.so.1
#0  std::basic_string<char, std::char_traits<char>, std::allocator<char> > std::operator+<char, std::char_traits<char>, std::allocator<char> >(char const*, std::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)
    (__lhs=0x3f170 "netmrg_", __rhs=@0x27f4b8) at bits/basic_string.h:265
265           { return &((reinterpret_cast<_Rep*> (_M_data()))[-1]); }
(gdb) bt
#0  std::basic_string<char, std::char_traits<char>, std::allocator<char> > std::operator+<char, std::char_traits<char>, std::allocator<char> >(char const*, std::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)
    (__lhs=0x3f170 "netmrg_", __rhs=@0x27f4b8) at bits/basic_string.h:265
#1  0x00023004 in remove_params_from_env(DeviceInfo&) (info=@0xfd501374) at bits/stl_list.h:134
#2  0x00024ed0 in process_script_monitor(DeviceInfo, st_mysql*) (info=
              {device_id = 2620280, subdevice_id = 2620304, monitor_id = 65280, event_id = -45083720, response_id = 90164, status = 2921876, mysql = 0x2906a8, snmp_sess_p = 0x22c4f4, snmp_avoid = 2620724, snmp_recache = 0, snmp_ifnumber = 0, snmp_uptime = 19, subdevice_type = 4080218931, test_type = 4, test_id = 29, name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x0}}, ip = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfd501420 ""}}, subdevice_name = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x0}}, test_params = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfd501438 ""}}, curr_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfd5014e0 ""}}, last_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x0}}, delta_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfd5014c8 ""}}, rate_val = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0xfd5014cc ""}}, delta_time = -193625543839580160, snmp_version = 0, snmp_read_community = {static npos = 4294967295, _M_dataplus = {<allocator<char>> = {<new_allocator<char>> = {<No data fields>}, <No data fields>}, _M_p = 0x6bbb8 ""}}, snmp_timeout = 404480, snmp_retries = 4249884272, snmp_port = 1, parameters = {<_List_base<ValuePair,std::allocator<ValuePair> >> = {_M_impl = {<allocator<std::_List_node<ValuePair> >> = {<new_allocator<std::_List_node<ValuePair> >> = {<No data fields>}, <No data fields>}, _M_node = {_M_next = 0xfd5015f8, _M_prev = 0x6f}}}, <No data fields>}}, mysql=0x62c00) at monitors.cpp:241
Back to top
balleman
Site Admin


Joined: 20 Jan 2003
Posts: 282

PostPosted: Wed May 18, 2005 2:23 am    Post subject: Reply with quote

What version of gcc are you using?
Back to top
cmsmith
Helper


Joined: 15 Feb 2005
Posts: 14

PostPosted: Wed May 18, 2005 11:57 am    Post subject: GCC Version Reply with quote

3.4.2
Back to top
balleman
Site Admin


Joined: 20 Jan 2003
Posts: 282

PostPosted: Wed May 18, 2005 12:13 pm    Post subject: Reply with quote

It certainly looks like a thread-safeness issue.

Which version of Solaris are you using? Are all of the patches installed?
Back to top
cmsmith
Helper


Joined: 15 Feb 2005
Posts: 14

PostPosted: Wed May 18, 2005 12:16 pm    Post subject: Solaris Version Reply with quote

Solaris 8, - kernel 5.8 Generic_117350-13 - what other patches are you concerned with?
Back to top
balleman
Site Admin


Joined: 20 Jan 2003
Posts: 282

PostPosted: Wed May 18, 2005 12:28 pm    Post subject: Reply with quote

I'm not sure which specific patches might be applicable. Doing some googling, it looked like there were patches related to threading. I have access to a sandbox machine running Solaris 8, so I might try to reproduce your problem.

How many simultaneous threads are you running? How many devices are you monitoring? How often do the crashes occur?

Thanks, and hopefully we'll get this figured out.
Back to top
cmsmith
Helper


Joined: 15 Feb 2005
Posts: 14

PostPosted: Wed May 18, 2005 12:35 pm    Post subject: libthread? Reply with quote

Would this be libthread related? I'm at 108993-42, current is 108993-45. Patching that.

I'm currently running 10 threads and the frequency is somewhat erratic. I run netmrg every 5 minutes, and I have 40 core files yesterday, 48 the day before. Not all of those exhibit the same gdb output as described previously, but most of them do... I've got 47 rows in my "devices" table, and 265 in the "sub_devices" table.

Here is a listing of yesterday's cores. I've modified the cron script to watch for cores, save the core, then re-run if it found one.

Code:

-rw-------    1 netmrg   netmrg    3843508 May 17 00:00 core-20050517000002
-rw-------    1 netmrg   netmrg    4454036 May 17 00:10 core-20050517001002
-rw-------    1 netmrg   netmrg    3791276 May 17 00:15 core-20050517001503
-rw-------    1 netmrg   netmrg    6127920 May 17 01:15 core-20050517011515
-rw-------    1 netmrg   netmrg    6458444 May 17 01:45 core-20050517014531
-rw-------    1 netmrg   netmrg    6305940 May 17 02:10 core-20050517021028
-rw-------    1 netmrg   netmrg    3851700 May 17 02:20 core-20050517022002
-rw-------    1 netmrg   netmrg    3843508 May 17 03:45 core-20050517034502
-rw-------    1 netmrg   netmrg    6123096 May 17 04:10 core-20050517041009
-rw-------    1 netmrg   netmrg    5438936 May 17 04:10 core-20050517041013
-rw-------    1 netmrg   netmrg    6242960 May 17 04:55 core-20050517045508
-rw-------    1 netmrg   netmrg    5910992 May 17 05:40 core-20050517054005
-rw-------    1 netmrg   netmrg    6183028 May 17 05:45 core-20050517054510
-rw-------    1 netmrg   netmrg    6275696 May 17 06:00 core-20050517060030
-rw-------    1 netmrg   netmrg    4923012 May 17 06:00 core-20050517060034
-rw-------    1 netmrg   netmrg    6388544 May 17 06:05 core-20050517060509
-rw-------    1 netmrg   netmrg    9396784 May 17 08:10 core-20050517081015
-rw-------    1 netmrg   netmrg    3807660 May 17 08:15 core-20050517081503
-rw-------    1 netmrg   netmrg    5725624 May 17 08:45 core-20050517084505
-rw-------    1 netmrg   netmrg    6242960 May 17 08:45 core-20050517084515
-rw-------    1 netmrg   netmrg    6771944 May 17 09:20 core-20050517092026
-rw-------    1 netmrg   netmrg    5056928 May 17 10:45 core-20050517104504
-rw-------    1 netmrg   netmrg    5185156 May 17 11:40 core-20050517114004
-rw-------    1 netmrg   netmrg    3818408 May 17 11:55 core-20050517115503
-rw-------    1 netmrg   netmrg    4761984 May 17 12:35 core-20050517123503
-rw-------    1 netmrg   netmrg    3859892 May 17 14:20 core-20050517142002
-rw-------    1 netmrg   netmrg    5484152 May 17 14:20 core-20050517142006
-rw-------    1 netmrg   netmrg    8166820 May 17 14:20 core-20050517142040
-rw-------    1 netmrg   netmrg    3897772 May 17 16:10 core-20050517161003
-rw-------    1 netmrg   netmrg    5977704 May 17 16:55 core-20050517165538
-rw-------    1 netmrg   netmrg    5991020 May 17 17:00 core-20050517170018
-rw-------    1 netmrg   netmrg    6149672 May 17 17:40 core-20050517174016
-rw-------    1 netmrg   netmrg    6264980 May 17 18:25 core-20050517182509
-rw-------    1 netmrg   netmrg    3843508 May 17 19:05 core-20050517190502
-rw-------    1 netmrg   netmrg    6710184 May 17 20:20 core-20050517202003
-rw-------    1 netmrg   netmrg    6354076 May 17 20:35 core-20050517203536
-rw-------    1 netmrg   netmrg    6401132 May 17 21:00 core-20050517210012
-rw-------    1 netmrg   netmrg    5602840 May 17 21:10 core-20050517211004
-rw-------    1 netmrg   netmrg    8311740 May 17 23:10 core-20050517231004
-rw-------    1 netmrg   netmrg    3922348 May 17 23:10 core-20050517231006
-rw-------    1 netmrg   netmrg    6104188 May 17 23:30 core-20050517233009
Back to top
cmsmith
Helper


Joined: 15 Feb 2005
Posts: 14

PostPosted: Sun Jun 19, 2005 12:52 am    Post subject: Still dumping... Reply with quote

Still getting core dumps after upgrading libthread. Anyone else running this on Solaris 8?
Back to top
Display posts from previous:   
Post new topic   Reply to topic    lists.netmrg.net Forum Index -> Development All times are GMT
Page 1 of 1

 



smartBlue Style © 2002 Smartor
Powered by phpBB © 2001, 2002 phpBB Group