Hi there
I’d like to hear if anyone encountered similar problem as we are seeing after upgrading to AWS Amazon Linux 2 AMIs with kernel 5.10 (and also Amazon Linux 2023 - kernel 6.1).
We are running an Aerospike Community Edition cluster version 6.3 on i4i instances. After upgrading to AMIs with kernel 5.10 the the local SSD device utilization jumped significantly even for moderate workloads. It’s topping 100% where the old instances report 15%. This has been triggering some alerts on our side but in reality there doesn’t seem to be any negative performance impact.
I ran bunch of tests with the act
tool and while the observed behavior was the same there was no impact on performance.
Can someone shed some more light on why this is happening or if there’s anything we need to tweak to get the utilization numbers report some realistic values?
We partition each device with four partitions and use device
namespace configuration:
# lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
nvme1n1 259:0 0 1.7T 0 disk
├─nvme1n1p1 259:4 0 436.6G 0 part
├─nvme1n1p2 259:5 0 436.6G 0 part
├─nvme1n1p3 259:6 0 436.6G 0 part
└─nvme1n1p4 259:7 0 436.6G 0 part
nvme0n1 259:1 0 20G 0 disk
├─nvme0n1p1 259:2 0 20G 0 part /
└─nvme0n1p128 259:3 0 1M 0 part
storage-engine device {
device /dev/nvme1n1p1
device /dev/nvme1n1p2
device /dev/nvme1n1p3
device /dev/nvme1n1p4
write-block-size 1024K
max-write-cache 1024M
}
# iostat -zxmty 3
Linux 5.10.210-201.852.amzn2.x86_64 (ip-10-30-101-32.ec2.internal) 03/20/2024 _x86_64_ (4 CPU)
03/20/2024 10:45:14 AM
avg-cpu: %user %nice %system %iowait %steal %idle
1.59 0.00 0.26 18.09 0.00 80.05
Device: rrqm/s wrqm/s r/s w/s rMB/s wMB/s avgrq-sz avgqu-sz await r_await w_await svctm %util
nvme1n1 0.00 0.00 20235.33 235.33 58.71 29.42 8.82 3.34 0.16 0.16 0.08 0.05 100.00
03/20/2024 10:45:17 AM
avg-cpu: %user %nice %system %iowait %steal %idle
1.67 0.00 0.53 18.05 0.00 79.75
Device: rrqm/s wrqm/s r/s w/s rMB/s wMB/s avgrq-sz avgqu-sz await r_await w_await svctm %util
nvme1n1 0.00 0.00 20235.00 235.33 58.71 29.42 8.82 3.34 0.16 0.16 0.08 0.05 100.00
03/20/2024 10:45:20 AM
avg-cpu: %user %nice %system %iowait %steal %idle
1.59 0.00 0.35 18.08 0.00 79.98
Device: rrqm/s wrqm/s r/s w/s rMB/s wMB/s avgrq-sz avgqu-sz await r_await w_await svctm %util
nvme1n1 0.00 0.00 20235.67 235.33 58.71 29.42 8.82 3.34 0.16 0.16 0.08 0.05 100.00
nvme0n1 0.00 0.00 0.00 2.00 0.00 0.01 14.33 0.00 0.83 0.00 0.83 0.67 0.13
Thanks, Zbynek