Documentation for Server & Application Monitor

Orion Server 2020.2 - Main Polling Engine

This template assesses the status of Windows services on the Main Polling Engine (which is usually the Orion server) in Orion Platform 2020.2.

Prerequisites

WMI access to the target server.

Credentials

Windows Administrator on the target server.

Component monitors

For details about component monitor types, see Windows Service Monitor and Performance Counter Monitor.

SolarWinds Administration Service

The SolarWinds Administration Service manages installed Orion Platform products and upgrades. It also supports the installation of Additional Polling Engines (APEs), Additional Web Servers (AWS), and High Availability (HA) backups. This service controls the Orion Service Manager, accessible via the Orion Web Console, where Orion administrators can stop, start, and restart Orion services and websites, including dependencies.

SolarWinds Alerting Service V2

Returns the CPU and memory usage of the SolarWinds Alerting Service V2. This service is responsible for evaluating alert conditions, triggering alerts and running alert actions.

By default, this monitor is disabled.

SolarWinds Collector Service

Returns the CPU and memory usage of the SolarWinds Collector service that handles data synchronization between the polling engine and the Orion database.

SolarWinds Cortex

Returns the CPU and memory usage of the SolarWinds Cortex service that supports polling for PerfStack and other Orion Platform products that collect data.

SolarWinds High Availability

Returns the CPU and memory usage of the SolarWinds High Availability (HA) service, which monitors the Orion Platform’s health and mediates the switchover of responsibilities between active and backup Orion instances in an HA pool.

SolarWinds Information Service

Returns the CPU and memory usage of the SolarWinds Information service. This service is used by websites to talk to the database. This service is also responsible for how the polling engines talk to each other.

By default, this monitor is disabled.

SolarWinds Information Service V3

Returns the CPU and memory usage of the SolarWinds Information service V3. This service is used by websites to talk to the database. This service is also responsible for how the polling engines talk to each other.

SolarWinds Job Engine v2

Returns the CPU and memory usage of the SolarWinds Job Engine v2 service. This service is used to perform recurring work. This service creates various Job Engine Worker processes for scalability and robustness. The job engine writes information about each job to its database.

SolarWinds Log Analyzer for Orion Polling Service

Returns the CPU and memory usage of the SolarWinds Log Manager for Orion Polling Service. This service is responsible for logging events in log files.

SolarWinds Log Analyzer for Orion Syslog Service

Returns the CPU and memory usage of the SolarWinds Log Manager for Orion Syslog Service. This service is responsible for logging events in log files.

SolarWinds Log Analyzer for Orion Trap Service

Returns the CPU and memory usage of the SolarWinds Log Manager for Orion Trap Service. This service is responsible for logging events in log files.

SolarWinds Orion Module Engine

Returns the CPU and memory usage of the SolarWinds Orion Module Engine service. This service is used to talk to the database.

Process Monitor - SWJobEngineWorker2.exe

Returns the number of Job Engine worker processes and their CPU and memory usage. A value of 10 or lower is acceptable. If the returned value is 100 or greater, there may be problems with jobs hanging.

Process Monitor - SWJobEngineWorker2x64

Returns the number of Job Engine worker processes and their CPU and memory usage. A value of 10 or lower is acceptable. If the returned value is 100 or greater, there may be problems with jobs hanging.
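The thresholds above leave the range between 11 and 99 unclassified. The following is a minimal sketch of how a returned worker count might be interpreted. The function name and the "watch" label for the middle range are assumptions made for illustration, not part of the template.

```python
def classify_worker_count(count: int) -> str:
    """Classify a Job Engine worker process count using the thresholds in the text."""
    if count < 0:
        raise ValueError("count must be non-negative")
    if count <= 10:
        return "ok"        # 10 or lower is acceptable
    if count >= 100:
        return "critical"  # 100 or greater may indicate hanging jobs
    return "watch"         # assumption: the text does not classify 11-99


for sample in (4, 37, 120):
    print(sample, classify_worker_count(sample))
```

The same classification applies to the Job Engine v2: Worker Processes counter, which uses identical thresholds.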

SolarWinds Recommendations

Virtualization Manager (VMAN) recommendations focus on the optimization of resource allocation based on performance metrics and storage capacity. Recommendations calculate trends and risks based on enabled strategies, providing plans of action to consider and apply to resolve immediate issues or preemptively prevent issues from occurring.

SolarWinds Website (HTTP)

Tests a web server's ability to accept incoming sessions and transmit the requested page. The component monitor can optionally search the delivered page for specific text strings and pass or fail the test based on that search. By default, it monitors TCP port 80.

SolarWinds Website (HTTPS)

Tests a web server's ability to accept incoming sessions and transmit the requested page. The component monitor can optionally search the delivered page for specific text strings and pass or fail the test based on that search. By default, it monitors TCP port 443.

Job Engine v2: Jobs Lost

Returns the number of lost jobs. This value should be zero at all times.

Job Engine v2: Jobs Queued

Returns the number of jobs waiting for execution due to insufficient resources. This value should be zero at all times.

Job Engine v2: Jobs Running

Returns the number of jobs currently running.

Job Engine v2: Worker Processes

Returns the number of worker processes used. A value of 10 or lower is acceptable. If the returned value is 100 or greater, there may be problems with jobs hanging.

Job Scheduler v2: Average Execution Delay

Returns the average delay, in milliseconds, between the time when the job is supposed to be executed and the time that it actually is executed. This value should be less than 100,000.
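Because the value is reported in milliseconds, the 100,000 limit corresponds to 100 seconds. A small sketch of that check, with hypothetical function and constant names:

```python
MAX_AVG_DELAY_MS = 100_000  # limit from the text, equal to 100 seconds


def ms_to_seconds(delay_ms: float) -> float:
    """Convert a delay in milliseconds to seconds."""
    return delay_ms / 1000.0


def delay_within_limit(avg_delay_ms: float) -> bool:
    """Return True when the average execution delay is below the limit."""
    return avg_delay_ms < MAX_AVG_DELAY_MS


print(ms_to_seconds(MAX_AVG_DELAY_MS))  # limit expressed in seconds
print(delay_within_limit(2_500))        # a 2.5-second delay is within the limit
```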

Job Scheduler v2: Results Notified Error

Returns the number of errors that occurred when sending the results back. This value should be zero at all times.

MSMQ Folder Size

Returns the MSMQ folder size. The returned value should be less than 800 MB. By default, the maximum MSMQ size is 1 GB. If the 1 GB limit is reached, polling stops working correctly.

To increase the MSMQ size, open Computer Management > Features > Message Queuing. Right-click it and change MSMQ Messaging 1 GB Limit to 1.5 GB. See this SolarWinds Success Center article for additional information: Microsoft Message Queue Fills Directory with Orphaned Files.
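To make the 800 MB and 1 GB figures concrete, here is a rough sketch of a folder-size check. It is not how the component monitor itself is implemented. The folder path is passed in by the caller because the storage location is not stated here, and treating 1 MB as 1024 × 1024 bytes is an assumption.

```python
import os

WARN_BYTES = 800 * 1024 * 1024     # 800 MB recommended ceiling
LIMIT_BYTES = 1024 * 1024 * 1024   # 1 GB MSMQ limit


def folder_size_bytes(path: str) -> int:
    """Sum the sizes of all files under path, including subfolders."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
            except OSError:
                pass  # file removed or locked while scanning; skip it
    return total


def classify_msmq_size(size_bytes: int) -> str:
    """Map a folder size to the thresholds described in the text."""
    if size_bytes >= LIMIT_BYTES:
        return "limit reached"
    if size_bytes >= WARN_BYTES:
        return "warning"
    return "ok"
```

A caller would pass the MSMQ storage folder of the target server to `folder_size_bytes` and feed the result to `classify_msmq_size`.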

MSMQ Messages in Queue

Returns the total number of Message Queuing messages that currently reside in the selected queue. When the Data Processor receives more results into MSMQ than it can process and pass to the Standard Poller, the queue keeps growing. The size of the queue should be near 0 most of the time. Some spikes may appear, but the Data Processor must be able to clear the queue quickly; otherwise it may not be able to handle database blackouts or maintenance. (Standard Poller performance is significantly affected by database performance.)

Before using this counter, set the correct instance, which begins with: <HOSTNAME>\private$\solarwinds\collector\processingqueue
where <HOSTNAME> is the hostname of the target server, entered without the angle brackets (< >).
For example: APMhost

By default, the instance is set to: <HOSTNAME>\private$\solarwinds\collector\processingqueue\solarwinds.node.hardwarehealth.wmi

To find all available instances, run the PerfMon utility and search for the “Messages in Queue” counter in the “MSMQ Queue” category.

This monitor is disabled by default. Enable it in the component monitor settings.
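The instance naming described above can be sketched as a small helper that substitutes the hostname into the documented prefix. The function name and the optional `suffix` parameter are hypothetical; the prefix and the default queue name are taken from the text.

```python
QUEUE_PREFIX = r"private$\solarwinds\collector\processingqueue"


def msmq_instance(hostname: str, suffix: str = "") -> str:
    """Build a counter instance name for the given host.

    Angle brackets around the hostname are dropped, since the
    placeholder is written as <HOSTNAME> in the documentation.
    """
    host = hostname.strip().strip("<>")
    if not host:
        raise ValueError("hostname must not be empty")
    instance = host + "\\" + QUEUE_PREFIX
    if suffix:
        instance += "\\" + suffix  # optional queue name below the prefix
    return instance


print(msmq_instance("APMhost", "solarwinds.node.hardwarehealth.wmi"))
# APMhost\private$\solarwinds\collector\processingqueue\solarwinds.node.hardwarehealth.wmi
```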

Perfmon DPPL Avg. Time to Process Item

Returns the time needed to process one item. If this number is 1, it means you are able to process one item per second. 0.01 means 100 items per second. The returned value should be as low as possible.

Note: This monitor is disabled by default.
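The relationship between the average processing time and throughput is a simple reciprocal. A brief sketch, assuming the returned value is expressed in seconds per item as the examples above imply (the function name is illustrative):

```python
def items_per_second(avg_seconds_per_item: float) -> float:
    """Convert average time per item (seconds) to items processed per second."""
    if avg_seconds_per_item <= 0:
        raise ValueError("average time must be positive")
    return 1.0 / avg_seconds_per_item


for avg in (1.0, 0.01):
    print(f"{avg} s per item is about {items_per_second(avg):.0f} items per second")
```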

Perfmon DPPL Waiting Items

Returns the number of items that have been pulled from the message queue but are waiting for other results to be processed. This value should be less than 40. If the value holds at or above 40, it may indicate slow database response times, performance issues, or many down elements.

Note: This monitor is disabled by default.
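A single reading of 40 or more is less telling than a value that stays there. One possible way to express "holding at or above 40" is to require several consecutive samples over the threshold. The window of five samples below is an arbitrary assumption, as is the function name.

```python
from typing import Sequence


def is_holding_high(samples: Sequence[float], threshold: float = 40, window: int = 5) -> bool:
    """Return True when the most recent `window` samples are all at or above the threshold."""
    if window <= 0:
        raise ValueError("window must be positive")
    if len(samples) < window:
        return False  # not enough history to call it sustained
    return all(value >= threshold for value in samples[-window:])


recent = [12, 45, 41, 40, 50, 42]
print(is_holding_high(recent))  # the last five samples are all 40 or higher
```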

RabbitMQ Service Monitor

Returns information about the RabbitMQ services running on a node with the Windows operating system.

RabbitMQ Folder Size

Returns the Orion RabbitMQ folder size. If the folder is growing, either RabbitMQ is writing undelivered messages to disk or the machine is under memory pressure.

Note: This monitor is disabled by default.

SWIS PubSub Messages Queued

Returns the total number of messages that currently reside in the SWIS PubSub queue. When a publisher sends more messages than subscribers are able to process, or if there are any message delivery issues, the RabbitMQ queue keeps growing. The size of the queue should be near 0 almost all of the time. Some spikes may appear, but SWIS needs to be able to clear the queue quickly.

Note: This monitor is disabled by default.

TCP Port Usage Count

Returns the number of TCP ports in use.