Hi All,
I have an OI 151a system that has been flawless for almost a year now. Recently it's been hard locking without leaving anything in the logs (so far as I can tell). There have been a few changes made to the system since the issue has started happening:
- Some ZFS tuning (zfs_vdev_max_pending = 4 and zfs_write_limit_override = 805306368).
- An install of a Brocade NIC software package which has since been removed (both software and the NIC itself).
- I tried to install the LSI MSM for more control over the LSI card, but it needed the Solaris SMA and a bunch of SNMP packages which OI doesn't appear to have. So, that was also uninstalled. However, two of the packages are still there (sassnmp and sasirsnmp) as they won't uninstall because the script looks for the SMA service (which doesn't exist).
- Added an Acard 9010 as a log device.
- Added an an SSD identical to the existing boot drive. Mirrored, copied the bootloader, etc.
I'm not seeing any drive errors or anything in the logs for that matter. The logs look normal right up to the point it hangs. When it does hang the system becomes completely unresponsive. The console is also unresponsive and just shows logging from the regularly scheduled napp-it commands. It stops responding to pings too. The only way to get it back is a hard reset.
Thoughts? Is there any more logging I can enable?
Thanks!!
Riley
I have an OI 151a system that has been flawless for almost a year now. Recently it's been hard locking without leaving anything in the logs (so far as I can tell). There have been a few changes made to the system since the issue has started happening:
- Some ZFS tuning (zfs_vdev_max_pending = 4 and zfs_write_limit_override = 805306368).
- An install of a Brocade NIC software package which has since been removed (both software and the NIC itself).
- I tried to install the LSI MSM for more control over the LSI card, but it needed the Solaris SMA and a bunch of SNMP packages which OI doesn't appear to have. So, that was also uninstalled. However, two of the packages are still there (sassnmp and sasirsnmp) as they won't uninstall because the script looks for the SMA service (which doesn't exist).
- Added an Acard 9010 as a log device.
- Added an an SSD identical to the existing boot drive. Mirrored, copied the bootloader, etc.
I'm not seeing any drive errors or anything in the logs for that matter. The logs look normal right up to the point it hangs. When it does hang the system becomes completely unresponsive. The console is also unresponsive and just shows logging from the regularly scheduled napp-it commands. It stops responding to pings too. The only way to get it back is a hard reset.
Thoughts? Is there any more logging I can enable?
Thanks!!
Riley