云网络性能测试流程

有几个云上的小伙伴想测测VPC网络性能,于是写了一些dpdk代码在阿里云上做了一个实验,也适用于其它云.
安装相关的库

云网络性能测试流程

使用root登录,更新一下源

#备份原有的配置文件 
 mkdir /etc/yum.repos.d/bak 
 mv /etc/yum.repos.d/*.repo /etc/yum.repos.d/bak/ 
#使用阿里云的源覆盖 
wget -O /etc/yum.repos.d/CentOS-Base.repo http://mirrors.aliyun.com/repo/Centos-8.repo 
yum install -y https://mirrors.aliyun.com/epel/epel-release-latest-8.noarch.rpm 
sed -i ‘s|^#baseurl=https://download.fedoraproject.org/pub|baseurl=https://mirrors.aliyun.com|‘ /etc/yum.repos.d/epel* 
sed -i ‘s|^metalink|#metalink|‘ /etc/yum.repos.d/epel* 
sudo dnf config-manager --set-enabled PowerTools 
 
yum makecache 
yum update 
 
yum groupinstall "Development tools" 
yum install gcc-gfortran kernel-modules-extra tcl tk tcsh terminator tmux kernel-rpm-macros elfutils-libelf-devel libnl3-devel meson createrepo numactl-devel 
pip3 install pyelftools 
启用iommu
sudo vi /etc/default/grub 
 
//在 GRUB_CMDLINE_LINUX 行添加"intel_iommu=on iommu=pt"  
//保存退出 

然后更新grub并重启系统

sudo grub2-mkconfig -o /boot/grub2/grub.cfg 
sudo grub2-mkconfig -o /boot/efi/EFI/centos/grub.cfg 
sudo reboot 
安装DPDK

CentOS上需要添加/usr/local路径, 主要是LD_LIBRARY_PATH PATH 和 PKG_CONFIG_PATH 以及sudo的path

sudo vi /etc/ld.so.conf.d/dpdk.conf 
 
>>添加如下path 
/usr/local/lib64 
>>退出 
 
sudo ldconfig 
 
vim ~/.bashrc 
>>添加如下path 
 
export PATH=/usr/local/bin:$PATH 
export PKG_CONFIG_PATH=/usr/local/lib64/pkgconfig:${PKG_CONFIG_PATH} 
 
 
保存后source 
source ~/.bashrc 
sudo vim  /etc/sudoers 
 
>>将secure_path添加/usr/local/bin 
Defaults    secure_path = /sbin:/bin:/usr/sbin:/usr/bin:/usr/local/bin 

然后解压dpdk,并编译安装

wget http://fast.dpdk.org/rel/dpdk-21.05.tar.xz 
tar xf dpdk-21.05.tar.xz 
 
cd dpdk-21.05 
meson build -D examples=all  
 
cd build 
ninja 
sudo ninja install 
sudo ldconfig 

设置Hugepage和bind接口

dpdk-hugepages.py --setup 4G 
 modprobe vfio-pci 
 dpdk-devbind.py -s 
 
Network devices using kernel driver 
=================================== 
0000:00:05.0 ‘Virtio network device 1000‘ if=eth0 drv=virtio-pci unused=vfio-pci *Active* 
0000:00:06.0 ‘Virtio network device 1000‘ if=eth1 drv=virtio-pci unused=vfio-pci *Active* 

注意虚拟机环境需要noniommu_mode

ifconfig eth1 down 
echo 1 > /sys/module/vfio/parameters/enable_unsafe_noiommu_mode 
dpdk-devbind.py -b vfio-pci 0000:00:06.0 

验证

dpdk-devbind.py -s 
 
Network devices using DPDK-compatible driver 
============================================ 
0000:00:06.0 ‘Virtio network device 1000‘ drv=vfio-pci unused= 
 
Network devices using kernel driver 
=================================== 
0000:00:05.0 ‘Virtio network device 1000‘ if=eth0 drv=virtio-pci unused=vfio-pci *Active* 
检查接口支持情况

下载代码

cd ~ 
wget https://github.com/zartbot/learn_dpdk/archive/refs/heads/main.zip 
unzip main.zip 
 cd learn_dpdk-main/ 

编译

cd 01_port_init/devinfo/ 
make clean;make 

检查接口支持情况

./build/devinfo 
 
EAL: Detected 24 lcore(s) 
EAL: Detected 1 NUMA nodes 
EAL: Detected shared linkage of DPDK 
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket 
EAL: Selected IOVA mode ‘PA‘ 
EAL: No available 1048576 kB hugepages reported 
EAL: VFIO support initialized 
EAL:   Invalid NUMA socket, default to 0 
EAL: Probe PCI driver: net_virtio (1af4:1000) device: 0000:00:05.0 (socket 0) 
eth_virtio_pci_init(): Failed to init PCI device 
 
EAL: Requested device 0000:00:05.0 cannot be used 
EAL:   Invalid NUMA socket, default to 0 
EAL: Probe PCI driver: net_virtio (1af4:1000) device: 0000:00:06.0 (socket 0) 
EAL: Using IOMMU type 8 (No-IOMMU) 
TELEMETRY: No legacy callbacks, legacy socket not created 
 
 
 
***************************************** 
number of available port: 1 
========================================= 
port: 0         Driver:net_virtio 
Link down 
MAC address: 00:16:3E:25:3F:0A 
PCIe:0000:00:06.0 
Max RX Queue:   12      Desc:   65535 
Max TX Queue:   12      Desc:   65535 
Offload Capability: 
  DEV_RX_OFFLOAD_VLAN_STRIP 
  DEV_RX_OFFLOAD_UDP_CKSUM 
  DEV_RX_OFFLOAD_TCP_CKSUM 
  DEV_RX_OFFLOAD_TCP_LRO 
  DEV_RX_OFFLOAD_JUMBO_FRAME 
----------------------------------------- 
  DEV_TX_OFFLOAD_VLAN_INSERT 
  DEV_TX_OFFLOAD_UDP_CKSUM 
  DEV_TX_OFFLOAD_TCP_CKSUM 
  DEV_TX_OFFLOAD_TCP_TSO 
  DEV_TX_OFFLOAD_MULTI_SEGS 
========================================= 
测速
cd ~/learn_dpdk-main/02_send_recv/traffic_gen/ 

修改send_pkt.c的源目的地址,注意目的MAC在阿里云上要为eeff.ffff.ffff

//init mac 
struct rte_ether_addr s_addr = {{0x00, 0x16, 0x3e, 0x25, 0x0b, 0xe3}}; 
struct rte_ether_addr d_addr = {{0xee, 0xff, 0xff, 0xff, 0xff, 0xff}}; 
 
//init IP header 
rte_be32_t s_ip_addr = string_to_ip("10.66.1.220"); 
rte_be32_t d_ip_addr = string_to_ip("10.66.1.219"); 

由于接口支持有限,修改 common.h

#define NUM_RX_QUEUE 1 
#define NUM_TX_QUEUE 1 
 
static const struct rte_eth_conf port_conf_default = { 
    .rxmode = { 
        .max_rx_pkt_len = RTE_ETHER_MAX_LEN, 
    .mq_mode = ETH_MQ_RX_NONE, 
    }, 
    .txmode = { 
        .mq_mode = ETH_MQ_TX_NONE, 
    } 
}; 

修改portinit.c 关闭RX-CHECKSUM OFFLOAD, 注释掉下面这段:

if (dev_info.rx_offload_capa & DEV_RX_OFFLOAD_CHECKSUM) 
 { 
     printf("port[%u] support RX cheksum offload.\n", port); 
     port_conf.rxmode.offloads |= DEV_RX_OFFLOAD_CHECKSUM; 
 } 

最后测速大概3.3Mpps左右,接近官方售卖时的4Mpps

[root@iZuf64vmgrtj12kczyslhdZ traffic_gen]# ./build/run 
EAL: Detected 24 lcore(s) 
EAL: Detected 1 NUMA nodes 
EAL: Detected shared linkage of DPDK 
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket 
EAL: Selected IOVA mode ‘PA‘ 
EAL: No available 1048576 kB hugepages reported 
EAL: VFIO support initialized 
EAL:   Invalid NUMA socket, default to 0 
EAL: Probe PCI driver: net_virtio (1af4:1000) device: 0000:00:05.0 (socket 0) 
eth_virtio_pci_init(): Failed to init PCI device 
 
EAL: Requested device 0000:00:05.0 cannot be used 
EAL:   Invalid NUMA socket, default to 0 
EAL: Probe PCI driver: net_virtio (1af4:1000) device: 0000:00:06.0 (socket 0) 
EAL: Using IOMMU type 8 (No-IOMMU) 
TELEMETRY: No legacy callbacks, legacy socket not created 
 
 
initializing port 0... 
port[0] support TX UDP checksum offload. 
port[0] support TX TCP checksum offload. 
Port[0] MAC: 00:16:3e:25:0b:e3 
Core 1 doing RX dequeue. 
Core 2 doing packet enqueue. 
 
RX-Queue[0] PPS: 3280464 
RX-Queue[0] PPS: 3277792 
RX-Queue[0] PPS: 3303116 
RX-Queue[0] PPS: 3307443 
RX-Queue[0] PPS: 3296451 
RX-Queue[0] PPS: 3294396 
RX-Queue[0] PPS: 3297737 
RX-Queue[0] PPS: 3290069 
RX-Queue[0] PPS: 3279720 
RX-Queue[0] PPS: 3285987 
RX-Queue[0] PPS: 3279424 

然后把common.h 中收发都改为4个线程

#define NUM_RX_QUEUE 1 
#define NUM_TX_QUEUE 1 

测试结果和官方售卖的4Mpps一致了。

RX-Queue[0] PPS: 578918 
RX-Queue[1] PPS: 866823 
RX-Queue[2] PPS: 2288950 
RX-Queue[3] PPS: 865335 

CPU Info

[root@iZuf64vmgrtj12kczyslhdZ traffic_gen]# cat /proc/cpuinfo | grep Xeon 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 
model name      : Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz 

本文地址:https://www.linuxprobe.com/cloud-test.html

云网络性能测试流程

上一篇:leetcode 1572. 矩阵对角线元素的和


下一篇:九、监控网络连接状态