Microsoft Azure Batch Account

This Microsoft Azure Batch Account template contains performance and statistics counters for monitoring Azure batch accounts in SAM.

Prerequisites

  • PowerShell 5.0 or later is installed on the system where the template will run.
  • PowerShell module (Azure, AzureRM) is installed on the system where the template will run.

    To install the Azure module, execute these PowerShell commands:

    • Install-Module -Name Azure
    • Install-Module -Name AzureRM
  • To connect with the Azure account, the following parameters are required:
    • subscriptionID
    • ApplicationID
    • TenantID
    • Secret Key
    • Application Name: Any Azure App (with name and ID) with Contributor or Reader role in Azure role-based access control (RBAC).
  • Batch Account name for which metrics are being calculated.

Credentials

  • Login credentials for the Azure Portal, passed as script arguments per prerequisites.
    For example, <SubscriptionID>, <TenantID>, <ApplicationID>, value=<Secret Key>, <Batch Account Name>).
  • Optional parameters can be passed to script argument:
    • value=<Time Interval> - during which data must be fetched, in minutes. The default is 10 minutes.
    • Number of retries as the second last argument; the default is 3.
    • WaitTime as the last argument; the default is 0.5 seconds

Notes

Component Monitors

Dedicated number of cores in the batch account

This component monitors the total number of dedicated cores in the batch account.

Unit: Count

Number of nodes created

This component monitors the total number of dedicated nodes in the batch account.

Unit: Count

Number of idle nodes

This component monitors the number of idle nodes.

Unit: Count

Number of nodes leaving the Pool

This component monitors the number of nodes leaving the Pool.

Unit: Count

Number of offline nodes

This component monitors the number of offline nodes.

Unit: Count

Total number of pools that have been created

This component monitors the total number of pools that have been created.

Unit: Count

Total number of pool deletes that have completed

This component monitors the total number of pool deletes that have completed.

Unit: Count

Total number of pool deletes that have started

This component monitors the total number of pool deletes that have started.

Unit: Count

Total number of pool resizes that have completed

This component monitors the total number of pool resizes that have completed.

Unit: Count

Total number of pool resizes that have started

This component monitors the total number of pool resizes that have started.

Unit: Count

Count of rebooting nodes

This component monitors the number of rebooting nodes.

Unit: Count

Count of reimaging nodes

This component monitors the total number of reimaging nodes.

Unit: Count.

Count of running nodes

This component monitors the number of running nodes.

Unit: Count

Count of nodes where the Start Task has failed

This component monitors the number of nodes where the Start Task has failed.

Unit: Count

Count of nodes starting

This component monitors total number of nodes that are starting.

Unit: Count

Total count of tasks that have completed

This component monitors the total number of tasks that have completed.

Unit: Count

Total count of tasks that have completed in a failed state

This component monitors the total number of tasks that have completed in a failed state.

Unit: Count

Total count of tasks that have started

This component monitors the total number of tasks that have started.

Unit: Count

Average count of nodes in the batch account

This component monitors the average number of nodes.

Average count takes following into consideration: Offline Node Count, Idle Node Count, Running Node Count, and Unusable Node Count.

Unit: Count

Count of unusable nodes

This component monitors the number of unusable nodes.

Unit: Count

Count of nodes waiting for the Start Task to complete

This component monitors the number of nodes waiting for the Start Task to complete.

Unit: Count

Average number of cores in the batch account

This component monitors the average number of dedicated cores in the batch account.

Unit: Count