Loading…
Cephalocon 2020 has ended
March 4 - 5, 2020 | South Korea
View More Details & Registration
Thursday, March 5 • 11:55 - 12:00
NUMA Awared Ceph Deployment - luorixin, Huawei

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
As the cpu NUMA nodes and cpu sockets increasing. The Ceph daemons deployed in different sockets have different performance.
There are four NUMA nodes in ARM server(based on KunPeng 920) with 2 cpu sockets.Each socket has direct pcie slots connecting
Network Devices and Storage Devices. Under the circumstance, there is Non-Uniform IO Access
between cpu and IO device. We needs a multi-NUMA affinity solution for Ceph.In Ceph, there are MDS,RGW,OSD which needing to be binded to NUMA node, especially for OSDs. We use 3 servers, every server have 2 cpu socket, 1 NVME SSD and 2 NICs for each socket. On each server,we divide OSDs into 4 groups according to NUMA nodes. OSDs in the same socket share the SSD and NICs.We set SSD's and NIC's irq affinity to the local socket. To avoid the memory access cross sockets which is caused by network's RX/TX. We set different IP for different socket NIC and use policy route to isolate network traffic cross sockets. Finally we let the OSDs using all of local socket resource(memroy,ssd,nic). Our experiment show lower latency and throughput improvement.

Speakers
RL

Rixin Luo

Senior software engineer, Huawei
I am a software enginer from Huawei. I have worked for years on enabling and optiminzing Ceph performance on aarch64.


Thursday March 5, 2020 11:55 - 12:00 JST
Baekdu 2