#openvswitch on 2025-02-20 — irc logs at libera.irclog.whitequark.org

2024-04-04 19:52 ChanServ changed the topic of #openvswitch to: Open vSwitch, a Linux Foundation Collaborative Project || FAQ: http://docs.openvswitch.org/en/latest/faq/ || OVN meeting Thurs 9:15 am US Pacific || Use ovs-discuss@openvswitch.org for questions if you don't get an answer here. || Channel logs can be found at https://libera.irclog.whitequark.org/openvswitch

01:37 ChmEarl has quit [Quit: Leaving]

02:02 donhw has quit [Read error: Connection reset by peer]

02:07 donhw has joined #openvswitch

03:23 otherwiseguy has quit [Ping timeout: 252 seconds]

03:23 otherwiseguy has joined #openvswitch

07:22 GNUmoon has quit [Remote host closed the connection]

07:23 GNUmoon has joined #openvswitch

07:38 kuraudo has joined #openvswitch

07:47 froyo has joined #openvswitch

08:11 kuraudo has quit [Quit: kuraudo]

08:11 kuraudo has joined #openvswitch

08:21 donhw has quit [Read error: Connection reset by peer]

08:26 donhw has joined #openvswitch

08:56 elvira has joined #openvswitch

09:04 kuraudo has quit [Remote host closed the connection]

09:04 kuraudo has joined #openvswitch

11:45 imaximets has quit [Remote host closed the connection]

11:48 imaximets has joined #openvswitch

11:49 otherwiseguy has quit [Ping timeout: 260 seconds]

11:50 otherwiseguy has joined #openvswitch

12:03 BlackDex has quit [Quit: ByeBye]

12:05 tpires has joined #openvswitch

14:19 imaximets has quit [Changing host]

14:19 imaximets has joined #openvswitch

14:43 dceara has joined #openvswitch

15:07 elvira has quit [Ping timeout: 248 seconds]

16:16 froyo has quit [Ping timeout: 244 seconds]

16:17 froyo has joined #openvswitch

16:30 otherwiseguy has quit [Read error: Connection reset by peer]

16:47 otherwiseguy has joined #openvswitch

17:12 zhouhan has joined #openvswitch

17:13 mkalcok has joined #openvswitch

17:14 mmichelson has joined #openvswitch

17:14 amusil has joined #openvswitch

17:14 mj2 has joined #openvswitch

17:14 <mj2> hi !!!

17:15 <mmichelson> Hi everybody. It's time for the weekly OVN developers' meeting.

17:15 <mkalcok> hello \o

17:15 <_lore_> hi all

17:15 <mmichelson> My update this week is pretty quick.

17:15 <mmichelson> Last Friday I branch ovn25.03 upstream. Thanks again to everyone who contributed either with code or with reviews.

17:15 <mmichelson> s/branch/branched/

17:15 <felixhuettner> o/

17:16 <mmichelson> This week, I've alternated between doing code reviews and making progress on composable services.

17:16 <mmichelson> Currently I'm reviewing _lore_'s MAC binding probe patch.

17:16 <mmichelson> I should have a review posted later this afternoon.

17:17 <mmichelson> That's all from me.

17:17 <mmichelson> Who's next?

17:17 <felixhuettner> i can continue

17:17 <mmichelson> go ahead felixhuettner

17:17 <felixhuettner> i mostly worked on the incremental support for learned routes. That was quite interesting to build

17:18 <felixhuettner> also i am working with some collegues on some performance improvement to ovn-controller in case of large southbound updates

17:18 <felixhuettner> as we regularly see updates >400kb which is larger than the default receives of jsonrpc, so we need multiple iterations of ovn-controller for one message

17:18 <felixhuettner> i guess there might be a first version on the ML sometime next week

17:19 <felixhuettner> I also wanted to ask if there are any regular ovn-heater runs as we are interested in running some otherwise

17:20 <felixhuettner> especially to test performance based on the changes we observe in our environment

17:20 <mmichelson> felixhuettner, we (Red Hat) do weekly ovn-heater runs on some of our machines each weekend.

17:20 <felixhuettner> is there any fancy tooling around that, that you can share

17:20 <felixhuettner> or is it mostly just what the regular repo provides?

17:21 <dceara> felixhuettner, it's mostly what the regular repo provides, with custom deployment yaml files (to match our actual machines).

17:21 <felixhuettner> ok, thanks a lot. Then we will probably try to build something similar on our side.

17:21 <felixhuettner> Thanks a lot, thats it from me

17:21 <imaximets> felixhuettner, btw, we do idl batching northd. And there are maybe other use cases for it to be implemented in ovn-controller.

17:21 <dceara> felixhuettner, we also run the existing https://github.com/ovn-org/ovn-heater/tree/main/test-scenarios with both ipv4 and ipv6

17:22 <imaximets> s/batching northd/batching in northd/

17:22 <felixhuettner> imaximets: thats what we found too and the current idea is to generalize it a little and then use it

17:22 <felixhuettner> dceara: thanks a lot

17:23 <imaximets> felixhuettner, ack. Another use case: https://mail.openvswitch.org/pipermail/ovs-dev/2025-February/421160.html

17:24 <felixhuettner> yep that might fit as well

17:24 <felixhuettner> we actually observed sb connection timeouts, because we have too many incoming messages to process the echo in time :)

17:25 <imaximets> Uff, OK. :)

17:26 <felixhuettner> its like 10 updates a second

17:26 <felixhuettner> and a recompute currently eats away around 25 seconds

17:26 <felixhuettner> so we have high chances to recompute again

17:26 <felixhuettner> at least on some chassis

17:26 <zhouhan> felixhuettner: for the timeout problem, usually users set the probe interval from DB server side as large as >100s

17:26 <mmichelson> 25 seconds for a recompute? Ouch.

17:27 <felixhuettner> we have 60s

17:27 <imaximets> felixhuettner, that's exactly the northd behavior batching was targeting.

17:27 <felixhuettner> but honestly i dont like that at all :)

17:27 <imaximets> ovn-controller is really slow in recomputes in my experience. :(

17:28 <felixhuettner> seems to be in logical_flow_output, but we are also looking to improve that

17:28 <imaximets> Especially with many ACLs with conjunctions.

17:28 <dceara> imaximets, well, to be pedantic ovn-northd's batching was initially designed to avoid continuous cpu usage on streams of NB changes. But it works for the cases you're discussing here too I guess.

17:28 <felixhuettner> but we also have 1.6 mio flows on that chassis, so some time i guess is normal :)

17:28 <imaximets> dceara, not really.

17:28 <dceara> imaximets, ah, i was thinking of the backoff now, never mind.

17:29 <imaximets> This ^ :)

17:30 <zhouhan> imaximets: which IDL batching were you talking about? Sorry I don't recall anything

17:30 <imaximets> 703949bd8b9a ("northd: Accumulate more database updates before processing.")

17:32 <mmichelson> felixhuettner, did you have anything else to add for your update?

17:32 <felixhuettner> nope that it

17:32 <mmichelson> OK thanks felixhuettner, I'm looking forward to seeing the patch(es) when you have them available.

17:32 <zhouhan> imaximets: thanks, I see. (I reviewed it :) )

17:32 <felixhuettner> thanks

17:33 <mmichelson> Before I ask for the next person, I need to note that I have a hard cutoff in ~15 minutes so if we're still going then, I'll have to leave and pass control off to someone else.

17:33 <mmichelson> (I have to take my son to the dentist)

17:33 <mmichelson> Who's going next?

17:33 <_lore_> can I go next? quite fast.

17:33 <mmichelson> _lore_, go ahead.

17:34 <_lore_> as mmichelson said, I posted a series to enable re-arping destination before they are expiring

17:34 <_lore_> I posted v1 upstream and I added a new ovn test locally, I will post it as soon as I have some feedbacks on v1

17:34 <_lore_> that's all from my side

17:35 <zhouhan> imaximets: but felixhuettner's problem here was different. He says the message was larger than a single jsonrpc, and the concern was multiple iterations of ovn-controller for a single message?

17:35 <zhouhan> _lore_, sorry for interrupting, please continue

17:36 <_lore_> no worries ;)

17:36 <_lore_> I was done

17:36 <felixhuettner> zhouhan: we actually have both. Too many messages and too large messages :)

17:36 <imaximets> zhouhan, yeah, the problem is a bit different, but solution may be similar and we also have a case with a lot of very small updates reported on the list. So, maybe we can cover both issues at once.

17:36 <zhouhan> felixhuettner: ok

17:37 <felixhuettner> and we also have patches for both, so hopefully that is then gone

17:37 <mmichelson> OK, who wants to give the next update?

17:37 <mj2> i can

17:37 <zhouhan> imaximets: for message larger than a jsonrpc, I was under the impression that ovn-controller shouldn't do anything in the iteration because the inc-engine will find nothing changed in the input

17:38 <mmichelson> mj2, go ahead.

17:39 <mj2> so im still working on the multinode test between various microovn tasks, I noticed that it was sliently failing though it was reporting passing, so I have had to spend a while figuring out why this is and how to fix it

17:40 <mj2> I think this effort is nearing a conclusion but I will admit its taking longer than I would like

17:40 <mj2> the multinode test being the bgp unnumbered with external bgp deamon

17:40 <mj2> that is all

17:40 <mmichelson> Thanks mj2

17:40 <mmichelson> Who's next?

17:40 <imaximets> May I?

17:41 <mmichelson> go for it imaximets

17:41 <imaximets> zhouhan, for your comment, it seems like ovn-controller does a lot of work unconditionally outside of inc-proc engine, just when it wakes up. But I didn't measure that myself, so can't add any details.

17:42 <imaximets> From my side, I released OVS 3.5 and sent a patch to move OVN to v3.5.0 submodule, which is applied now.

17:43 <imaximets> Spent some time thinking on how to make address-set processing in ovn-controller faster, but have no good ideas so far.

17:43 <imaximets> Will spend some more time on that next week.

17:43 <imaximets> That's all from me.

17:43 <mmichelson> Thanks imaximets

17:43 <mmichelson> Who wants to go next?

17:43 <dceara> I can go next if that's ok.

17:44 <mmichelson> dceara, it's perfectly ok

17:44 <zhouhan> imaximets: the work outside of inc-proc engine is not trivial but still relatively small, shouldn't be in seconds (25s mentioned by felixhuettner), which he observed was in flow-output node which is part of inc-proc engine.

17:45 <dceara> I reviewed and applied some patches. Out of these I liked that it seemed easier than other times for northd i-p to be implemented: https://patchwork.ozlabs.org/project/ovn/list/?series=444957&state=*

17:46 <mmichelson> (sorry, I have to head out)

17:46 <dceara> For that series I was actually thinking of also applying it to branch-25.03 after we accept it on main but I'll start that discussion on-list.

17:47 <dceara> Related to I-P I played a tiny bit and hacked something that would generate a graphical visualization of the I-P graphs (northd and ovn-controller). It seems to have some potential, I'll refine it and maybe post it at some point in the future.

17:47 <zhouhan> This is cool

17:47 <mkalcok> dceara: that sounds amazing.

17:48 <dceara> I also realized we do quite some unnecessary parsing for LB/NAT IPs that are advertised when dynamic routing is enabled so I started on a patch that improves that but it's not yet ready.

17:48 <dceara> That's it on my side, planning to do more reviews next week.

17:49 <imaximets> Thanks, dceara!

17:50 <dceara> imaximets, will you moderate the remainder of the meeting?

17:50 <imaximets> zhouhan, I'm not sure how 25 seconds related to outside-engine processing, my best guess is that there is a smal amount of other changes outside of the one that got stuck, but we need to ask felixhuettner .

17:50 <imaximets> dceara, sure.

17:50 <imaximets> Who wants to go next?

17:50 <amusil> I can quickly

17:50 <imaximets> amusil, go ahead!

17:50 <felixhuettner> lets maybe continue the discussion after the others are done :)

17:51 <amusil> I have posted the ct-commit-all optimiztion/fix that was discussed before release

17:51 <amusil> I have also posted fix to have action name in the pinctrl dbg and some optimization for AS lflow processing

17:52 <amusil> That's about it, thanks

17:52 <amusil> Oh zhouhan would be nice if you could take a look at ct-commit-all patch

17:52 <zhouhan> amusil: am following up with Alin on the HW offload test for the commit-all change. It seems the HW offload still doesn't work, and we are still debugging it.

17:53 zhouhan has quit [Quit: Client closed]

17:53 <imaximets> OK. Thanks, amusil!

17:53 zhouhan has joined #openvswitch

17:53 <zhouhan> sorry I was disconnected.

17:54 <zhouhan> amusil: did you see my last message?

17:54 <imaximets> I saw it. You're still debugging. :_

17:54 <imaximets> s/_/)/

17:54 <zhouhan> That's it from me

17:55 <amusil> Yeah I saw it too, hmm strange let's see what is wrong then

17:55 <imaximets> OK. Who wants to go next?

17:55 <mkalcok> I can drop a quick update

17:56 <imaximets> mkalcok, sure.

17:56 <mkalcok> I wanna say huge thank you to everyone that helped out with and reviewd NAT/LB route advertisement series. dceara felixhuettner amusil

17:56 <mkalcok> This week I was catching up mostly on our downstream stuff, and I took a look at Felix's incremental processing of learned routes.

17:56 <mkalcok> Though that has been already mostly acked by dceara.

17:56 <mkalcok> I'll keep my eye out on the NAT/LB parsing improvement for review.

17:56 <mkalcok> that's it from me.

17:56 <dceara> mkalcok, the more reviews the better :)

17:57 <imaximets> Thanks, mkalcok !

17:57 <imaximets> Anyone else want to give an update?

17:58 <imaximets> If not, felixhuettner you wanted to clarify some things for zhouhan regarding ovn-controller and inc-engine?

17:58 <felixhuettner> yep, i can add some more details

17:58 <dceara> Sorry, I need to drop early, thanks everyone, bye!

17:58 <mkalcok> thanks all o/

17:58 mkalcok has quit [Quit: leaving]

17:59 <felixhuettner> so we see multiple things in combination here. On the one hand a large amount of small messages, maybe at 10/sec

17:59 <felixhuettner> additionally sometimes larger messages at a maybe 1/sec rate

17:59 <felixhuettner> in addition to that the logical_flow_output recomputes come from changes to non_vif_data

17:59 <felixhuettner> we assume that is related to bfd sessions going up and down to compute nodes

17:59 <felixhuettner> at least that correlates quite well to the logs

18:00 <felixhuettner> and in combination that gets quite ugly :)

18:00 <felixhuettner> we have seen this mostly resulting in traffic outages when live-migrating a VM

18:01 <imaximets> Ah, OK. So, you hit a recompute on non-vif-data changes. During recompute you accumulate a lot of idl updates and then it takes forever to process them in batches of 50. Is that right?

18:01 <felixhuettner> as then there was a high chance that we where stuck in a recompute and maybe after that have such a long queue of messages that we cant handle them all in one increment

18:01 <felixhuettner> yep

18:01 <imaximets> Ack, makes sense.

18:01 <imaximets> zhouhan, ^

18:02 <felixhuettner> probably making that non_vif_data change incremental would also help a lot, but we need to look into that further

18:02 <imaximets> Thanks, felixhuettner, for the explanation.

18:02 <imaximets> Yep.

18:02 <felixhuettner> thanks for all the suggestions

18:02 <imaximets> In general, it would be good for recomputes to not take so long too. :)

18:03 <felixhuettner> yep, but at least perf did not give us a nice signal this time

18:03 <felixhuettner> there is just a lot of small things happening there that take a lot of time in sum

18:03 <imaximets> Ack.

18:03 <imaximets> OK, I guess we can call it a meeting.

18:04 <felixhuettner> yep, sounds good

18:04 <imaximets> Thanks, everyone! See you next week.

18:04 <felixhuettner> thanks a lot

18:04 <imaximets> Bye!

18:04 <felixhuettner> bye

18:04 <amusil> Thanks, bye

18:04 amusil has quit [Quit: Client closed]

18:06 kuraudo has quit [Remote host closed the connection]

18:28 dceara has quit [Ping timeout: 248 seconds]

18:28 zhouhan has quit [Quit: Client closed]

18:28 mj2 has quit [Ping timeout: 268 seconds]

19:48 ChmEarl has joined #openvswitch

19:55 dceara has joined #openvswitch

20:11 dceara has quit [Quit: Leaving]

20:44 mmichelson has quit [Quit: Leaving]

21:12 froyo has quit [Ping timeout: 268 seconds]