As well-known, direct 4k random read performance is the worst-case read performance test for the file system area.
The below screenshot shows the following fio workload's result on different environments.
[global]
filename=/dev/sda
direct=1
time_based
bssplit=4K
[front]
rw=randread
ioengine=pvsync
iodepth=32
numjobs=4
runtime=60
startdelay=0

- host1,host2 can show almost the same IOPS.
- without the cpu-pinning result of giantvm is crazy-lower than the host's result.
- furthermore, giantvm with pinned remote-CPU (giantvm4-7) is also lower than local CPU pinned one.
In these simple experiments,
we can find 2 ways for enhancement.
First, we need to increase the total throughput of reading and check the bottleneck during remote-CPU file I/O work.
Any opinions?
As well-known, direct 4k random read performance is the worst-case read performance test for the file system area.
The below screenshot shows the following fio workload's result on different environments.
In these simple experiments,
we can find 2 ways for enhancement.
First, we need to increase the total throughput of reading and check the bottleneck during remote-CPU file I/O work.
Any opinions?