Bluefield Network Latency vs Host Network Latency

Hi all,
I have a question about the data path,
I have setup Bluefield 2 in the embedded mode,
In another server, I tried to ping the host and Bluefield2.
I was expecting getting lower delay on Bluefield2, because the packets should go to Bluefield 2 first then Host os. but looks like the network latency of Bluefield is 2 times more

Any idea why this is the case? Or ping is not a good metric to measure the network latency ?

— Bluefield2 ping statistics —
33 packets transmitted, 33 received, 0% packet loss, time 32732ms
rtt min/avg/max/mdev = 0.232/0.296/1.318/0.182 ms

— Host os ping statistics —
34 packets transmitted, 34 received, 0% packet loss, time 33781ms
rtt min/avg/max/mdev = 0.113/0.125/0.300/0.035 ms

Update:
In my previous experiment I was pinging ens2f0 on the host which is the intel nic of the host, when I ping the physical interface ens2f0 instead, the delay on bluefield2 is 50 ms lower.

ens10f0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet xx.xx.226.xx netmask 255.255.254.0 broadcast
inet6 fe80::6a05:caff:fee1:da1c prefixlen 64 scopeid 0x20
ether 68:05:ca:e1:da:1c txqueuelen 1000 (Ethernet)
RX packets 4146194 bytes 353873668 (353.8 MB)
RX errors 0 dropped 178 overruns 0 frame 0
TX packets 6721556 bytes 10118944545 (10.1 GB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
device memory 0xa3a00000-a3afffff

ens2f0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet xx.xx.231.xx netmask 255.255.255.0 broadcast
inet6 fe80::bace:f6ff:fea8:82fa prefixlen 64 scopeid 0x20
ether b8:ce:f6:a8:82:fa txqueuelen 1000 (Ethernet)
RX packets 1632295 bytes 1448764119 (1.4 GB)
RX errors 0 dropped 0 overruns 0 frame 0
TX packets 2312763 bytes 1786897169 (1.7 GB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0

ens2f1: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet xx.xx.231.xx netmask 255.255.255.0 broadcast
inet6 fe80::bace:f6ff:fea8:82fb prefixlen 64 scopeid 0x20
ether b8:ce:f6:a8:82:fb txqueuelen 1000 (Ethernet)
RX packets 1036823 bytes 1010709940 (1.0 GB)
RX errors 0 dropped 0 overruns 0 frame 0
TX packets 60222 bytes 7743265 (7.7 MB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0