|
Measures reported by AWSMSKCPUDetTest
Excess traffic to AWS MSK clusters can impose a prohibitive load on the AWS MSK server, choking the CPU. To proactively avoid such bottlenecks, you need to continuously monitor the CPU utilization of the clusters in the target AWS MSK server. This is where the AWSMSKCPUDetTest helps. This test tracks the CPU usage of each cluster in the target server over time and alerts you to potential CPU contention, so that sudden spikes in CPU usage can be captured promptly.
Outputs of the test: One set of results for each cluster running in the target AWS MSK server.
Descriptor: Cluster
The measures made by this test are as follows:
| Measurement |
Description |
Measurement Unit |
Interpretation |
| cpuCreditBalance |
Indicates the CPU credit balance on the brokers. |
Number |
Once a burstable instance is started, it begins consuming the 30 initial CPU credits that are provisioned to it. Meanwhile, the instance also earns CPU credits at a fixed rate that is determined by the instance type. The number of CPU credits that a CPU can earn per hour is based on its baseline performance, i.e., the amount of CPU capacity that is continuously provisioned to a burstable instance. For example, a 25% baseline performance for instance A indicates that the CPU credits that a CPU of the instance earns per hour can keep that CPU running at 25% utilization for an hour, or at 100% utilization for 15 minutes (60 × 25%). One CPU credit corresponds to one CPU running at 100% utilization for one minute, so at this baseline each CPU earns 15 CPU credits per hour. Therefore, if instance A has two CPUs, it earns 30 CPU credits per hour.
If the CPU credits so earned exceed the credits consumed, the net credits are accrued as the CPU credit balance. This is the value reported by the cpuCreditBalance measure. A high value is desired for this measure, as a high CPU credit balance for a burstable instance means that CPU resources are guaranteed to that instance for a maximum of 24 hours.
|
| cpuIdle |
Indicates the percentage of time that the CPU spent in an idle state. |
Percent |
If the CPU wait time measure is abnormally high, compare the value of this measure with that of the Swap wait measure to determine where the CPU spent most of its time: waiting for swapping, in the idle state, or waiting for an I/O operation.
|
| cpuSystem |
Indicates the percentage of CPU time spent in kernel space. |
Percent |
|
| cpuUser |
Indicates the percentage of CPU time spent in user space. |
Percent |
|
| cpuCreditUsage |
Indicates the CPU credit usage on the instances. |
Number |
If CPU usage is sustained above the baseline level of 20%, the instance can exhaust its CPU credit balance, which can have a negative impact on cluster performance. Monitor the value of this measure and take corrective action when alerted.
|
|
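The credit arithmetic described for the cpuCreditBalance and cpuCreditUsage measures can be illustrated with a short Python sketch. It is not part of the test itself: the function names are hypothetical, and the model is deliberately simplified. It assumes that one CPU credit equals one CPU at 100% utilization for one minute and that utilization is constant, and it ignores initial credits, any cap on the accrued balance, and other instance-specific behavior.

```python
# Simplified model of burstable-instance CPU credits (illustrative only).
# Assumption: one CPU credit = one CPU running at 100% utilization for one minute.
MINUTES_PER_HOUR = 60


def credits_earned_per_hour(baseline_pct, cpus):
    """Credits earned per hour at the given per-CPU baseline performance."""
    return MINUTES_PER_HOUR * (baseline_pct / 100.0) * cpus


def credits_spent_per_hour(utilization_pct, cpus):
    """Credits consumed per hour at a constant per-CPU utilization."""
    return MINUTES_PER_HOUR * (utilization_pct / 100.0) * cpus


def hours_until_depleted(balance, baseline_pct, utilization_pct, cpus):
    """Hours until the credit balance reaches zero at constant utilization.

    Returns None when utilization is at or below the baseline, because the
    balance does not shrink in that case.
    """
    earned = credits_earned_per_hour(baseline_pct, cpus)
    spent = credits_spent_per_hour(utilization_pct, cpus)
    net_drain = spent - earned
    if net_drain <= 0:
        return None
    return balance / net_drain


if __name__ == "__main__":
    # The example from the table: 25% baseline, one CPU, then two CPUs.
    print(credits_earned_per_hour(25, 1))  # 15.0
    print(credits_earned_per_hour(25, 2))  # 30.0
    # A balance of 30 credits, two CPUs held at 50% against a 25% baseline.
    print(hours_until_depleted(30, 25, 50, 2))  # 1.0
```

Under these assumptions, a steadily falling cpuCreditBalance together with a cpuCreditUsage above the earn rate suggests that the brokers are running above their baseline, and the last function gives a rough estimate of how long that can continue.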