You are here
Cisco ACI Monitoring
Cisco Application Centric Infrastructure (ACI) is Cisco's software-defined networking (SDN) offering, it allows application requirements to define the network. This architecture simplifies, optimizes, and accelerates the entire application deployment life cycle.
What You Can Monitor
This Opspack will provide a selection of Host Templates, ranging from the top level Fabric Host Template to more lower level monitoring via Host Templates for Pods and Nodes, as well as a Custom Query Host Template, allowing you to monitor all aspects of your Cisco Application Centric Infrastructure.
Host Templates
The following Host Templates are currently provided by this Opspack. Click the name of each Host Template to be taken to the relevant information page, including a full Service Check description and usage instructions.
Network - Cisco ACI - APIC
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - APIC - CPU Usage | The live CPU usage for a given APIC | apic_cpu_usage=70,90 | % |
ACI - APIC - Disk Usage | The live disk usage for a given APIC | apic_disk_usage=70,90 | % |
ACI - APIC - Faults | A summary of all faults for a given APIC | apic_node_critical_faults=,0 apic_node_major_faults=,0 apic_node_minor_faults=0, apic_node_warning_faults=0, apic_node_info_faults=, apic_node_cleared_faults=, |
N/A |
ACI - APIC - Health State | The health state for a given APIC | N/A | N/A |
ACI - APIC - Memory Free | The live memory available for a given APIC | N/A | B |
ACI - APIC - Temperature | The live temperature for a given APIC | N/A | N/A |
Network - Cisco ACI - APIC Cluster
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - APIC Cluster - Average CPU Usage | The average CPU usage across all APIC nodes | cluster_cpu_usage=70,90 | % |
ACI - APIC Cluster - Total Memory Free | The total amount of memory available across all APIC nodes | N/A | B |
ACI - APIC Cluster - Health State | The health state across all APIC nodes | N/A | N/A |
Network - Cisco ACI - Application Profile
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Application Profile - Faults | A summary of all faults for a given application profile | application_profile_critical_faults=,0 application_profile_major_faults=,0 application_profile_minor_faults=0, application_profile_warning_faults=0, application_profile_info_faults=, application_profile_cleared_faults=, |
N/A |
ACI - Application Profile - Health Score | The health score for a given application profile | application_profile_health_score=90:,80: | N/A |
Network - Cisco ACI - Bridge Domain
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Bridge Domain - Faults | A summary of all faults for a given bridge domain | bridge_domain_critical_faults=,0 bridge_domain_major_faults=,0 bridge_domain_minor_faults=0, bridge_domain_warning_faults=0, bridge_domain_info_faults=, bridge_domain_cleared_faults=, |
N/A |
ACI - Bridge Domain - Health Score | The health score for a given bridge domain | bridge_domain_health_score=90:,80: | N/A |
Network - Cisco ACI - Custom
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Custom - Query | Monitor a specific object and field from the object store browser | N/A | N/A |
Network - Cisco ACI - Endpoint Group
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Endpoint Group - Faults | A summary of all faults for a given endpoint group | endpoint_group_critical_faults=,0 endpoint_group_major_faults=,0 endpoint_group_minor_faults=0, endpoint_group_warning_faults=0, endpoint_group_info_faults=, endpoint_group_cleared_faults=, |
N/A |
ACI - Endpoint Group - Health Score | The health score for a given endpoint group | endpoint_group_health_score=90:,80: | N/A |
Network - Cisco ACI - Fabric
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Fabric - Faults | A summary of all faults across the fabric. | fabric_critical_faults=,0 fabric_major_faults=,0 fabric_minor_faults=0, fabric_warning_faults=0, fabric_info_faults=, fabric_cleared_faults=, |
N/A |
ACI - Fabric - Health Score | The overall health score for the fabric. | fabric_health_score=95:,80: | N/A |
Network - Cisco ACI - Leaf Node
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Leaf Node - Fan Summary | A summary of the fans' operational states for a given leaf node | N/A | N/A |
ACI - Leaf Node - Faults | A summary of all faults for a given leaf node | leaf_node_critical_faults=,0 leaf_node_major_faults=,0 leaf_node_minor_faults=0, leaf_node_warning_faults=0, leaf_node_info_faults=, leaf_node_cleared_faults=, |
N/A |
ACI - Leaf Node - Health Score | The health score for a given leaf node | leaf_node_health_score=95:,80: | N/A |
ACI - Leaf Node - PSU Summary | A summary of all power supplies for a given leaf node | N/A | N/A |
ACI - Leaf Node - CPU Usage | The CPU usage for a given leaf node | user_avg_cpu=70,90 kernel_avg_cpu=70,90 idle_avg_cpu=30:,10: |
% |
ACI - Leaf Node - Memory Usage | The memory usage for a given leaf node | leaf_node_memory_usage=70,90 | % |
Network - Cisco ACI - Pod
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Pod - Faults | A summary of all faults for a given pod | pod_critical_faults=,0 pod_major_faults=,0 pod_minor_faults=,0 pod_warning_faults=,0 pod_info_faults=, pod_cleared_faults=, |
N/A |
ACI - Pod - Health Score | The health score for a given pod | pod_health_score=95:,80: | N/A |
ACI - Pod - Leaf Summary | A summary of the leaf nodes assigned to a given pod | N/A | N/A |
ACI - Pod - Spine Summary | A summary of the spine nodes assigned to a given pod | N/A | N/A |
Network - Cisco ACI - Spine Node
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Spine Node - Fan Summary | A summary of the fans' operational states for a given spine node | N/A | N/A |
ACI - Spine Node - Faults | A summary of all faults for a given spine node | spine_node_critical_faults=,0 spine_node_major_faults=,0 spine_node_minor_faults=0, spine_node_warning_faults=0, spine_node_info_faults=, spine_node_cleared_faults=, |
N/A |
ACI - Spine Node - Health Score | The health score for a given spine node | spine_node_health_score=95:,80: | N/A |
ACI - Spine Node - PSU Summary | A summary of all power supplies for a given spine node | N/A | N/A |
ACI - Spine Node - CPU Usage | The CPU usage for a given spine node | user_avg_cpu=70,90 kernel_avg_cpu=70,90 idle_avg_cpu=30:,10: |
% |
ACI - Spine Node - Memory Usage | The memory usage for a given spine node | spine_node_memory_usage=70,90 | % |
Network - Cisco ACI - Tenant
add_circleService Check Name | Description | Default Thresholds (Warning, Critical) | UOM |
---|---|---|---|
ACI - Tenant - Faults | A summary of all faults for a given tenant | tenant_critical_faults=,0 tenant_major_faults=,0 tenant_minor_faults=0, tenant_warning_faults=0, tenant_info_faults=, tenant_cleared_faults=, |
N/A |
ACI - Tenant - Health Score | The health score for a given tenant | tenant_health_score=90:,80: | N/A |
Cisco ACI Monitoring Prerequisites
Cisco APIC 4.1, but alternative versions may be compatible.
Cisco ACI Monitoring Setup
Pods, Spine Nodes, Leaf Nodes and APIC Nodes
To retrieve the IDs of the objects you wish to monitor, log in to the Cisco ACI dashboard. The Pod ID and APIC ID are highlighted below. To retrieve the Node ID, right click the column of the node you want to monitor and select "Open in Object Store Browser".
Copy the ID of the object you wish to monitor to the relevant host variable, as defined within each host template.
Tenants, Application Profiles, Endpoint Groups, Bridge Domains
To retrieve the name of the Tenant you wish to monitor, navigate to the system dashboard and copy the Tenant name from the table to the relevant host variable.
To retrieve the Application Profile name or Endpoint Group name, double-click on the Tenant to browse the child objects. You can then copy the name of the object you want to monitor to the relevant host variable.
To retrieve the Bridge Domain, double-click on the Tenant to browse the child objects. You can then copy the name of the Bridge Domain you want to monitor to the relevant host variable from the Networking tab.
Custom Queries
To retrieve a specific metric or field from the object store browser that is not included in the above Host Templates, copy the DN and field name from the object store browser into the relevant custom query host variables. For example, to create a custom query for the 'delayed heartbeat' field of node-104, use the following highlighted variables: