ChanServ changed the topic of #openvswitch to: Open vSwitch, a Linux Foundation Collaborative Project || FAQ: http://docs.openvswitch.org/en/latest/faq/ || OVN meeting Thurs 9:15 am US Pacific || Use ovs-discuss@openvswitch.org for questions if you don't get an answer here. || Channel logs can be found at https://libera.irclog.whitequark.org/openvswitch
ChmEarl has quit [Quit: Leaving]
vlrpl has joined #openvswitch
vlrpl has quit [Quit: Leaving]
kuraudo has joined #openvswitch
mj2 has joined #openvswitch
mj2 has quit [Ping timeout: 268 seconds]
mj2 has joined #openvswitch
tpires has joined #openvswitch
<tpires> Hi all, I have an OVN Central cluster where the leader of the NB started to use 100% of CPU load most of the time.
<tpires> The database size is around 460MB and I already tried to use 24.03 but no luck, same behavior.
<tpires> While in 100% CPU the read and write of the NB cluster is impacted. Doing a debugging when there is this increase of CPU load, I can see jsonrpc reply to a member of the cluster.
<tpires> Do you know anything that can cause such behavior or other steps that I can run?
<felixhuettner> is there maybe some client that regularly dumps/loads the whole database?
<tpires> Hi Felix, we were suspecting it and we setup up an ovn-fake-multinode cluster and imported this database there and the behavior still the same. The jsonrpc reply that I mentioned is to a member of the cluster and the size of the reply is about 460MB, I think could be it that is increasing the load but I'm not sure what could be done to fix it.
<tpires> that is the coverage output
<tpires> ovs-appctl -t /var/run/ovn/ovnnb_db.ctl coverage/show
<tpires> Event coverage, avg rate over last: 5 seconds, last minute, last hour, hash=6087dcfb:
<tpires> hmap_pathological 5.8/sec 3.667/sec 3.6136/sec total: 499941
<tpires> raft_entry_serialize 0.0/sec 0.000/sec 0.0000/sec total: 58
<tpires> hmap_expand 79731.0/sec 53154.400/sec 52378.7333/sec total: 7245581653
<tpires> hmap_reserve 0.0/sec 0.000/sec 0.0000/sec total: 36
<tpires> lockfile_lock 0.0/sec 0.000/sec 0.0000/sec total: 1
<tpires> poll_create_node 4.4/sec 4.683/sec 4.5703/sec total: 3477126
<tpires> poll_zero_timeout 0.6/sec 0.133/sec 0.1447/sec total: 102305
<tpires> seq_change 0.8/sec 0.433/sec 0.4336/sec total: 365523
<tpires> pstream_open 0.0/sec 0.000/sec 0.0000/sec total: 4
<tpires> stream_open 0.0/sec 0.000/sec 0.0000/sec total: 3
<tpires> unixctl_received 0.0/sec 0.000/sec 0.0000/sec total: 5
<tpires> unixctl_replied 0.0/sec 0.000/sec 0.0000/sec total: 5
<tpires> util_xalloc 3428152.4/sec 2285430.383/sec 1059025.8833/sec total: 311602019027
<tpires> 100 events never hit
donhw has quit [Read error: Connection reset by peer]
donhw has joined #openvswitch
mj2 has quit [Quit: WeeChat 4.1.1]
otherwiseguy has quit [Ping timeout: 276 seconds]
donhw has quit [Read error: Connection reset by peer]
donhw has joined #openvswitch
kuraudo has quit [Remote host closed the connection]
kuraudo has joined #openvswitch
otherwiseguy has joined #openvswitch
ChmEarl has joined #openvswitch
stephen87 has quit [Ping timeout: 245 seconds]
kuraudo has quit [Remote host closed the connection]
stephen87 has joined #openvswitch
otherwiseguy has quit [Ping timeout: 268 seconds]
tpires has quit [Remote host closed the connection]
donhw has quit [Read error: Connection reset by peer]
donhw has joined #openvswitch
tpires has joined #openvswitch
hamburgler has joined #openvswitch
otherwiseguy has joined #openvswitch
GNUmoon has quit [Remote host closed the connection]
GNUmoon has joined #openvswitch
hamburgler has quit [Remote host closed the connection]