Tune SCOM alert
We have been frequently receiving alerts regarding the Mellanox ConnectX-4 Lx Virtual Ethernet Adapter being disabled and re-enabled within a short time span. The alert is triggered each time the adapter is disabled and then enabled, causing unnecessary noise in the system.
Specifically, the current alert triggers too quickly, and I would like to adjust the alert configuration to wait for a longer period before triggering. Ideally, the system should monitor the NIC adapter’s status and only generate an alert if the adapter remains disabled for a longer period of time. If it is re-enabled within the adjusted time frame, the alert should not be triggered.
I have checked the rules and overrides in the System Center Operations Manager (SCOM) but couldn’t find any option to modify the time interval for triggering the alert. Below is a sample of the alert details:
Alert Information:
Mellanox ConnectX-4 Lx Virtual Ethernet Adapter device is successfully stopped.
SCOM Data:
AlertId: b4456b37-5b12-4b56-9e1e-779ec6cccd6c
RuleId: 2f840cdd-71b6-05b4-da8e-9cb1a5b124d4
MonitorName: Microsoft.Windows.Server.10.0.Storage.StorageSpacesDirect.BadBlock
MonitorDisplayName: SCOM S2D: The Device has a Bad Block
Management Pack: Storage Spaces Direct Management Pack:1.0.47.4
MonitoringObjectDisplayName: ABCCCC.Domain.com
MonitoringObjectFullName: Microsoft.Windows.Computer
.Domain.com
We have been frequently receiving alerts regarding the Mellanox ConnectX-4 Lx Virtual Ethernet Adapter being disabled and re-enabled within a short time span. The alert is triggered each time the adapter is disabled and then enabled, causing unnecessary noise in the system.Specifically, the current alert triggers too quickly, and I would like to adjust the alert configuration to wait for a longer period before triggering. Ideally, the system should monitor the NIC adapter’s status and only generate an alert if the adapter remains disabled for a longer period of time. If it is re-enabled within the adjusted time frame, the alert should not be triggered.I have checked the rules and overrides in the System Center Operations Manager (SCOM) but couldn’t find any option to modify the time interval for triggering the alert. Below is a sample of the alert details:Alert Information:Mellanox ConnectX-4 Lx Virtual Ethernet Adapter device is successfully stopped.SCOM Data:AlertId: b4456b37-5b12-4b56-9e1e-779ec6cccd6cRuleId: 2f840cdd-71b6-05b4-da8e-9cb1a5b124d4MonitorName: Microsoft.Windows.Server.10.0.Storage.StorageSpacesDirect.BadBlockMonitorDisplayName: SCOM S2D: The Device has a Bad BlockManagement Pack: Storage Spaces Direct Management Pack:1.0.47.4MonitoringObjectDisplayName: ABCCCC.Domain.comMonitoringObjectFullName: Microsoft.Windows.Computer .Domain.com Read More