Documentation for Server & Application Monitor

Orion Server 2020.2.6 - Main Polling Engine

This SAM application monitor template assesses the status of Windows services on the Main Polling Engine, which is usually the Orion server.

The Legacy Orion Server 2020.2 - Main Polling Engine template was replaced by this template in SAM 2020.2.6.

Prerequisites

WMI access to the target server.

Credentials

Windows Administrator on the target server.

Component monitors

To learn about component monitor types, see Windows Service Monitor and Performance Counter Monitor.

Service: NetBackup Client

Service: NetBackup Legacy Client

Service: NetBackup Legacy Network

NetBackup Client TCP Port

Multistreamed Backups TCP port

NetBackup Client Error and Warning Events

Service: Distributed Transaction Coordinator

SolarWinds Administration Service

Returns the CPU and memory usage of the SolarWinds Administration Service that:

  • Manages installed Orion Platform products and upgrades.
  • Supports the installation of Additional Polling Engines (APEs), Additional Web Servers, and High Availability (HA) backups.
  • Controls the Orion Service Manager that you can use to stop, start, and restart Orion services and websites, including dependencies.

Service: Security Accounts Manager

SolarWinds Alerting Service V2

Returns the CPU and memory usage of the SolarWinds Alerting Service V2 that:

  • Evaluates alert conditions,
  • Triggers alerts,
  • Runs alert actions.

    By default, this monitor is disabled.

Service: Remote Registry

SolarWinds Collector Service

Returns the CPU and memory usage of the SolarWinds Collector Service that handles data synchronization between polling engines and the Orion database.

Service: Server

SolarWinds Cortex

Returns the CPU and memory usage of the SolarWinds Cortex service that supports polling for PerfStack and other Orion Platform products that collect data.

Service: Plug and Play

SolarWinds High Availability

Returns the CPU and memory usage of the SolarWinds High Availability (HA) service, which monitors the Orion Platform's health and mediates switchover of responsibilities between active and backup Orion instances inside an HA pool.

Total Available Memory (MB)

Page File Usage

SolarWinds Information Service V3

Returns the CPU and memory usage of the SolarWinds Information Service V3. Websites use this service to talk to the database. This service is also responsible for how the polling engines talk to each other.

Disk Queue Length

SolarWinds Job Engine v2

Returns the CPU and memory usage of the SolarWinds Job Engine v2 service. This service is used to perform recurring work. This service creates various Job Engine Worker processes for scalability and robustness. The job engine writes information about each job to its database.

Virtual Memory

SolarWinds Log Analyzer for Orion Polling Service

Returns the CPU and memory usage of the SolarWinds Log Manager for Orion Polling Service. This service is responsible for logging events in log files.

Processor Time

SolarWinds Log Analyzer for Orion Syslog Service

Returns the CPU and memory usage of the SolarWinds Log Manager for Orion Syslog Service. This service is responsible for logging events in log files.

Pages/sec

SolarWinds Log Analyzer for Orion Trap Service

Returns the CPU and memory usage of the SolarWinds Log Manager for Orion Trap Service. This service is responsible for logging events in log files.

Logical Disk: Avg. Disk Queue Length

SolarWinds Orion Module Engine

Returns the CPU and memory usage of the SolarWinds Orion Module Engine service. This service is used to talk to the database.

Physical Disk: Avg. Disk Queue Length

Process Monitor - SWJobEngineWorker2.exe

Returns the number of Job Engine worker processes and their CPU and memory usage. A value of 10 or lower is acceptable. If the returned value is 100 or greater, there may be problems with jobs hanging.

Process Monitor - SWJobEngineWorker2x64

Returns the number of Job Engine worker processes and their CPU and memory usage. A value of 10 or lower is acceptable. If the returned value is 100 or greater, there may be problems with jobs hanging.

SolarWinds Recommendations

Virtualization Manager (VMAN) recommendations focus on the optimization of resource allocation based on performance metrics and storage capacity. Recommendations calculate trends and risks based on enabled strategies, providing plans of action to consider and apply to resolve immediate issues or preemptively prevent issues from occurring.

SolarWinds Website (HTTP)

Tests a web server's ability to accept incoming sessions and transmit the requested page. The component monitor can optionally search the delivered page for specific text strings and pass or fail the test based on that search. By default, it monitors TCP port 80.

SolarWinds Website (HTTPS)

Tests a web server's ability to accept incoming sessions and transmit the requested page. The component monitor can optionally search the delivered page for specific text strings and pass or fail the test based on that search. By default, it monitors TCP port 443.
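The pass/fail behavior described for the HTTP and HTTPS monitors can be illustrated with a minimal Python sketch. This is not how SAM implements the component monitor; the function names `page_passes` and `check_site`, the 10-second timeout, and the UTF-8 decoding are all assumptions made for illustration.

```python
from typing import Optional
from urllib.error import URLError
from urllib.request import urlopen


def page_passes(body: str, required_text: Optional[str] = None) -> bool:
    """Pass when no search string is configured, or when it occurs in the page."""
    return required_text is None or required_text in body


def check_site(url: str, required_text: Optional[str] = None,
               timeout: float = 10.0) -> bool:
    """Fetch the page, then apply the optional text search.

    Any connection or HTTP error counts as a failed test (assumed behavior).
    """
    try:
        with urlopen(url, timeout=timeout) as response:
            body = response.read().decode("utf-8", errors="replace")
    except (URLError, OSError):
        return False
    return page_passes(body, required_text)
```

For example, `check_site("http://localhost:80/", "Orion")` would fetch the page on TCP port 80 and pass only if the delivered page contains the string "Orion"; the URL and search string here are placeholders.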

Job Engine v2: Jobs Lost

Returns the number of lost jobs. This value should be zero at all times.

Job Engine v2: Jobs Queued

Returns the number of jobs waiting for execution due to insufficient resources. This value should be zero at all times.

Job Engine v2: Jobs Running

Returns the number of jobs currently running.

Job Engine v2: Worker Processes

Returns the number of worker processes used. A value of 10 or lower is acceptable. If the returned value is 100 or greater, there may be problems with jobs hanging.
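The guidance for worker process counts can be expressed as a small Python sketch. The two thresholds come from the description above; the function name and the label for counts between 11 and 99 are assumptions, because the description does not say how to treat that range.

```python
def classify_worker_count(count: int) -> str:
    """Map a worker process count to the guidance given for this monitor."""
    if count < 0:
        raise ValueError("process count cannot be negative")
    if count <= 10:
        return "acceptable"
    if count >= 100:
        return "possible hanging jobs"
    # Assumption: the documentation gives no guidance for 11 through 99.
    return "no guidance"
```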

Job Scheduler v2: Average Execution Delay

Returns the average delay, in milliseconds, between the time when the job is supposed to be executed and the time that it actually is executed. This value should be less than 100,000.

Job Scheduler v2: Results Notified Error

Returns the number of errors that occurred when sending the results back. This value should be zero at all times.

MSMQ Folder Size

Returns the MSMQ folder size. The returned value should be less than 800 MB. The maximum size is 1 GB. If the 1 GB limit is reached, polling stops working correctly.
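As a rough way to reason about these limits outside of SAM, the following Python sketch totals the size of a folder and compares it with the 800 MB and 1 GB figures. The function names are hypothetical, the folder path must be supplied by the caller, and the sketch assumes binary units (1 MB = 1024 × 1024 bytes), which the description does not specify.

```python
import os

# Assumption: binary units; the documentation does not say which it uses.
WARN_BYTES = 800 * 1024 ** 2   # 800 MB recommended ceiling
LIMIT_BYTES = 1024 ** 3        # 1 GB maximum size


def folder_size_bytes(path: str) -> int:
    """Sum the sizes of all files under path, skipping unreadable files."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
            except OSError:
                continue  # file vanished or is locked; ignore it
    return total


def msmq_size_status(size_bytes: int) -> str:
    """Compare a folder size with the 800 MB and 1 GB figures."""
    if size_bytes >= LIMIT_BYTES:
        return "at limit"
    if size_bytes >= WARN_BYTES:
        return "above 800 MB"
    return "ok"
```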

The use of Message Queuing (MSMQ) technology was deprecated in Orion Platform 2020.2.6.

To increase the MSMQ size, open Computer Management > Features > Message Queuing. Right-click and change MSMQ Messaging 1 GB Limit to 1.5 GB. See this SolarWinds Success Center article for details: Microsoft Message Queue Fills Directory with Orphaned Files.

MSMQ Messages in Queue

Returns the total number of Message Queuing messages that currently reside in the selected queue. When the Data Processor receives more results into MSMQ than it can process and pass to the Standard Poller, MSMQ continues growing. The size of MSMQ should be near 0 most of the time. Some spikes may appear, but the Data Processor must be able to clean up the MSMQ quickly; otherwise, it may not be able to handle database blackouts or maintenance. (Standard Poller performance is significantly affected by DB performance.)

The use of MSMQ was deprecated in Orion Platform 2020.2.6.

Before using this counter, set the correct instance beginning with: <HOSTNAME>\private$\solarwinds\collector\processingqueue
where <HOSTNAME> is the hostname (without < >) of the target server.
For example: APMhost

By default, the instance is set to: <HOSTNAME>\private$\solarwinds\collector\processingqueue\solarwinds.node.hardwarehealth.wmi
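The instance naming rule can be sketched in Python as follows. The helper name `msmq_instance` and its optional `queue` argument are hypothetical; only the path segments and the default queue name are taken from the text above.

```python
# Fixed part of the instance name, as given in the documentation.
QUEUE_PATH = r"\private$\solarwinds\collector\processingqueue"


def msmq_instance(hostname: str, queue: str = "") -> str:
    """Build a counter instance name for the given host.

    Angle brackets around the hostname are removed, since the instance
    must contain the bare hostname.
    """
    host = hostname.strip().strip("<>")
    if not host:
        raise ValueError("hostname must not be empty")
    base = host + QUEUE_PATH
    return base + "\\" + queue if queue else base
```

With the example hostname APMhost, `msmq_instance("APMhost", "solarwinds.node.hardwarehealth.wmi")` produces the default instance shown above with `<HOSTNAME>` replaced by APMhost.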

To find all available instances, run the PerfMon utility and search for the “Messages in Queue” counter in the “MSMQ Queue” category.

This monitor is disabled by default. Enable through the component monitor settings.

Collector queues Folder Size

Collector messages in Queue

Perfmon DPPL Avg. Time to Process Item

Returns the time needed to process one item. If this number is 1, it means you are able to process one item per second. 0.01 means 100 items per second. The returned value should be as low as possible.
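The relationship between this value and throughput is a simple reciprocal, which the following Python sketch makes explicit. The function name is hypothetical, and the sketch assumes the counter reports seconds per item, as the examples above imply.

```python
def items_per_second(avg_seconds_per_item: float) -> float:
    """Convert an average processing time per item into items per second."""
    if avg_seconds_per_item <= 0:
        raise ValueError("average time must be positive")
    return 1.0 / avg_seconds_per_item
```

A returned value of 1 corresponds to one item per second, and 0.01 corresponds to about 100 items per second, which is why lower values are better.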

Note: This monitor is disabled by default.

Perfmon DPPL Waiting Items

Returns items in the queue pulled from the message queue but waiting for other results to be processed. This should be less than 40. If this number is holding at or above 40, this may indicate issues concerning DB response time, performance issues, or many down elements.

Note: This monitor is disabled by default.

RabbitMQ Folder Size

Returns the Orion RabbitMQ folder size. If the folder is growing, either RabbitMQ is writing undelivered messages to disk or the machine is under memory pressure.

Note: This monitor is disabled by default.

RabbitMQ Service Monitor

Returns information about the RabbitMQ services running on a node with the Windows operating system.

SWIS PubSub Messages Queued

Returns the total number of Message Queuing messages that currently reside in the SWIS PubSub queue. When a publisher sends more messages than subscribers are able to process, or if there are any message delivery issues, the RabbitMQ queue continues growing. The size of the queue should be near 0 almost all of the time. Some spikes may appear, but SWIS needs to be able to clean up the queue quickly.

Note: This monitor is disabled by default.

TCP Port Usage Count

The number of TCP ports in use.