Documentation forServer & Application Monitor

Microsoft Azure Batch Account

This Microsoft Azure Batch Account template contains performance and statistics counters for monitoring Azure batch accounts in SAM.

For an overview of configuring Azure application monitoring templates, watch SolarWinds Lab #82 (Part 2).

Prerequisites

  • PowerShell module (Azure, AzureRM) is installed before using this template.

    To install the Azure module, execute these PowerShell commands:

    • Install-Module -Name Azure
    • Install-Module -Name AzureRM
  • To connect with the Azure account, you'll need the SubscriptionID, ApplicationID, TenantID, and Secret Key.
    Any Azure App (with its name and ID) with minimum role of Read Only.
  • Batch Account name for which metrics are being calculated,
  • PowerShell 5.0 or later.

Credentials

  • Login credential to access the Azure Portal, passed as script arguments per prerequisites (<SubscriptionID>, <TenantID>, <ApplicationID>, value=<Secret Key>, <Batch Account Name>).
  • Optional parameters can be passed to script argument:
    • value=<Time Interval> - during which data must be fetched, in minutes. The default is 10 minutes.
    • Number of retries as the second last argument; the default is 3.
    • WaitTime as the last argument; the default is 0.5 seconds

Notes

Component monitors

Dedicated number of cores in the batch account

This component monitors the total number of dedicated cores in the batch account.

Unit: Count

Number of nodes created

This component monitors the total number of dedicated nodes in the batch account.

Unit: Count

Number of idle nodes

This component monitors the number of idle nodes.

Unit: Count

Number of nodes leaving the Pool

This component monitors the number of nodes leaving the Pool.

Unit: Count

Number of offline nodes

This component monitors the number of offline nodes.

Unit: Count

Total number of pools that have been created

This component monitors the total number of pools that have been created.

Unit: Count

Total number of pool deletes that have completed

This component monitors the total number of pool deletes that have completed.

Unit: Count

Total number of pool deletes that have started

This component monitors the total number of pool deletes that have started.

Unit: Count

Total number of pool resizes that have completed

This component monitors the total number of pool resizes that have completed.

Unit: Count

Total number of pool resizes that have started

This component monitors the total number of pool resizes that have started.

Unit: Count

Count of rebooting nodes

This component monitors the number of rebooting nodes.

Unit: Count

Count of reimaging nodes

This component monitors the total number of reimaging nodes.

Unit: Count.

Count of running nodes

This component monitors the number of running nodes.

Unit: Count

Count of nodes where the Start Task has failed

This component monitors the number of nodes where the Start Task has failed.

Unit: Count

Count of nodes starting

This component monitors total number of nodes that are starting.

Unit: Count

Total count of tasks that have completed

This component monitors the total number of tasks that have completed.

Unit: Count

Total count of tasks that have completed in a failed state

This component monitors the total number of tasks that have completed in a failed state.

Unit: Count

Total count of tasks that have started

This component monitors the total number of tasks that have started.

Unit: Count

Average count of nodes in the batch account

This component monitors the average number of nodes.

Average count takes following into consideration: Offline Node Count, Idle Node Count, Running Node Count, and Unusable Node Count.

Unit: Count

Count of unusable nodes

This component monitors the number of unusable nodes.

Unit: Count

Count of nodes waiting for the Start Task to complete

This component monitors the number of nodes waiting for the Start Task to complete.

Unit: Count

Average number of cores in the batch account

This component monitors the average number of dedicated cores in the batch account.

Unit: Count