#riscv on 2023-03-21 — irc logs at libera.irclog.whitequark.org

2021-08-01 01:31 sorear changed the topic of #riscv to: RISC-V instruction set architecture | https://riscv.org | Logs: https://libera.irclog.whitequark.org/riscv

00:20 Trifton has joined #riscv

00:26 balrog has quit [Read error: Connection reset by peer]

00:29 balrog has joined #riscv

00:31 gdd has quit [Ping timeout: 265 seconds]

00:31 gdd has joined #riscv

01:08 rurtty has quit [Quit: Leaving]

01:09 wingsorc has quit [Remote host closed the connection]

01:10 wingsorc has joined #riscv

01:14 wingsorc has quit [Remote host closed the connection]

01:15 wingsorc has joined #riscv

01:31 jacklsw has joined #riscv

02:28 ___nick___ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

02:30 hrberg has quit [Ping timeout: 250 seconds]

02:30 ___nick___ has joined #riscv

02:32 hrberg has joined #riscv

02:33 ___nick___ has quit [Client Quit]

02:34 vagrantc has quit [Quit: leaving]

02:35 ___nick___ has joined #riscv

02:55 Stat_headcrabed has joined #riscv

03:22 Dyskos has quit [Ping timeout: 250 seconds]

03:31 motherfsck has joined #riscv

03:42 wiagn has joined #riscv

03:43 Stat_headcrabed has quit [Ping timeout: 255 seconds]

03:43 wiagn is now known as Stat_headcrabed

03:47 motherfsck has quit [Ping timeout: 265 seconds]

03:49 BootLayer has joined #riscv

03:58 motherfsck has joined #riscv

04:00 pabs3 has quit [Quit: Don't rest until all the world is paved in moss and greenery.]

04:02 pabs3 has joined #riscv

04:24 matoro has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

04:26 matoro has joined #riscv

04:31 wiagn has joined #riscv

04:33 Stat_headcrabed has quit [Ping timeout: 265 seconds]

04:33 wiagn is now known as Stat_headcrabed

05:13 wiagn has joined #riscv

05:15 Stat_headcrabed has quit [Ping timeout: 265 seconds]

05:15 wiagn is now known as Stat_headcrabed

05:16 billchenchina- has quit [Ping timeout: 246 seconds]

05:50 motherfsck has quit [Ping timeout: 260 seconds]

05:50 BootLayer has quit [Quit: Leaving]

06:07 wiagn has joined #riscv

06:10 Stat_headcrabed has quit [Ping timeout: 252 seconds]

06:10 wiagn is now known as Stat_headcrabed

06:13 mahk has joined #riscv

06:14 junaid_ has joined #riscv

06:15 junaid_ has quit [Remote host closed the connection]

06:15 mahk has quit [Changing host]

06:15 mahk has joined #riscv

06:16 Gravis has quit [Ping timeout: 276 seconds]

06:16 Gravis has joined #riscv

06:45 wiagn has joined #riscv

06:47 Stat_headcrabed has quit [Ping timeout: 255 seconds]

06:47 wiagn is now known as Stat_headcrabed

07:00 wiagn has joined #riscv

07:02 Stat_headcrabed has quit [Ping timeout: 265 seconds]

07:02 wiagn is now known as Stat_headcrabed

07:11 MaxGanzII has joined #riscv

07:18 TheEldest_ has joined #riscv

07:20 TheEldest has quit [Ping timeout: 265 seconds]

08:23 JanC_ has joined #riscv

08:23 JanC has quit [Killed (lithium.libera.chat (Nickname regained by services))]

08:23 JanC_ is now known as JanC

08:26 wiagn has joined #riscv

08:27 Stat_headcrabed has quit [Ping timeout: 276 seconds]

08:27 wiagn is now known as Stat_headcrabed

08:27 jobol has joined #riscv

08:28 Stat_headcrabed has quit [Client Quit]

08:35 ldevulder has joined #riscv

08:38 <bjdooks> conchuod: should we do anything to advance the CMO dma-coherent memory issue?

08:38 sh1r4s3 has quit [Ping timeout: 276 seconds]

08:38 <conchuod> Ping arnd ;)

08:39 sh1r4s3 has joined #riscv

08:39 <conchuod> He was planning to submit some cross arch refactor which became a prereq for Prabhakars stuff

08:40 <arnd> yes, I need to get back to that, I wonder if there is any way to split it up

08:41 <arnd> the first step I had planned was to go through https://docs.google.com/spreadsheets/d/1qDuMqB6TnRTj_CgUwgIIm_RJ6EZO76qohpTJUMQjUEo/edit#gid=0 and ensure everything follows the same rules

08:41 <arnd> followed by moving the then common logic into shared code

08:42 <bjdooks> yeah, i was going to send something for that, a couple of the flushes i think are meant to be invalidate in riscv

08:42 <bjdooks> arnd: my issue is the dma-allocator mapping, if you use dma_alloc and expect the CMO to work, it still maps the memory as uncached on riscv

08:42 <bjdooks> which makes the CMO code irrelevant

08:43 <arnd> which dma_alloc variant?

08:43 <bjdooks> any of them

08:43 <arnd> dma_alloc_coherent() must map things as uncached if the DMA is non-coherent

08:43 <arnd> if DMA is fully coherent, then you need no CMO

08:44 <arnd> so I think that bit is correct

08:44 <bjdooks> no

08:44 <bjdooks> so i've faked a non-coherent system, done extensive testing, if you have a device marked noncohereht and do a dma-alloc on it, it does the CMOs but the page is marked as uncached.

08:45 <bjdooks> the issue is is SVPBMT and ZICBOM exist, SVPBMT gets picked first for both ioremap() and dma-alloc

08:46 <bjdooks> where you want SVPBMT for ioreamp() and ZICBOM for dma-alloc

08:46 <arnd> help me out with the acronyms: what does SVPBMT mean, and how is it differnet from ZICBOM?

08:46 <bjdooks> ZICBOM is the cahce management like clean/flush/inval

08:46 <bjdooks> SVPBMT allows memory pages to be marked uncached/weakly-ordered

08:49 <bjdooks> https://patchwork.kernel.org/project/linux-riscv/patch/20230307205834.1426289-3-ben.dooks@codethink.co.uk/

08:50 <arnd> still confused. so ioremap() should generally require uncached/strongly-ordered, not uncached/weakly-ordered in order to guarantee ordering between individual mmio register accesses, but that is probably covered by the barriers in readl()/writel(), so that's fine.

08:51 <arnd> the riscv_page_dmacoherent() function in the patch looks correct to me (assuming only ZICBOM is supported, not custom cache management operations)

08:53 <arnd> at least as far as I can tell, this does the same as any other architecture: if the DMA is fully coherent, pgprot_dmacoherent() is regular cached mapping, and the CMOs are all nops

08:53 <conchuod> arnd: are your changes there a strict requirement for Prabhakars series adding CMO stuff for Renesas/Andes stuff?

08:53 <arnd> but if you have ZICBOM, then pgprot_dmacoherent() returns some variant of uncached memory and CMOs do the appropriate flushes

08:53 <bjdooks> but if it isn't, then you get uncached pages

08:53 <bjdooks> totally unached is stupid, then you don't need any ops as it is uncached

08:54 <bjdooks> totally unached is stupid, then you don't need any cmo ops as it is uncached

08:54 <arnd> bjdooks: I think you mix up the streaming mapping (dma_map_*) with the coherent mapping (dma_alloc_*)

08:54 <arnd> CMO is only used for streaming mappings, but cannot work for coherent mapping

08:55 <arnd> doing a flush on an uncached page should result in a CPU fault

08:55 <arnd> (not sure if it does on zicbom, but it does on some other ones)

08:55 <bjdooks> arnd: those instructions do not fault

08:55 <arnd> ok

08:56 <arnd> not a big deal, just makes it a little harder to find bugs in drivers that get the interface wrong

08:56 <bjdooks> I still think riscv is doing it wrong

08:57 <arnd> the typical example is a network driver using dma_alloc_coherent() to create a buffer for its descriptors that is uncached, and dma_map_sg() for the SKBs

08:58 <arnd> In the descriptors, you need the individual accesses to be strictly ordered (first the data pointer, then the valid flag), which you cannot enforce on cached memory

08:58 <bjdooks> so i'm fairly sure in that case, riscv with both svpbmt and zicbom will provide an uncached area and then do CMO ops, which seems strange when uncached should be more than sufficient there,

08:59 <arnd> for the descriptor access, the fictional network driver in my example only does a dma_wmb() between the address write and the flag write.

08:59 <arnd> if dma_wmb() turns into a CMO, that is indeed a bug

09:01 <arnd> I only see

09:01 <arnd> include/asm-generic/barrier.h:#define dma_wmb() wmb()

09:01 <arnd> arch/riscv/include/asm/barrier.h:#define wmb() RISCV_FENCE(ow,ow)

09:01 <arnd> that's not a CMO, right?

09:01 <bjdooks> no

09:01 * bjdooks is now confused

09:02 <arnd> bjdooks: do you have a particular driver that you were looking at, or just the architecture code?

09:02 <conchuod> bjdooks: oh, I think I got what you meant in your original message wrong. I didn't realise you meant your recent patches, saying "ping arnd" was for allowing CMO stuff from functions.

09:03 <bjdooks> so I've been using a test driver i wrote for testing, that does dma_alloc() with differetn attributes and then uses dma_sync_single_for_cpu and dma_sync_single_for_device

09:04 <arnd> ah, that makes sense. So you are using a broken testcase ;-)

09:04 <arnd> dma_sync_single_for_cpu() is only defined on memory you got from dma_map_*()

09:08 <conchuod> arnd: are your cross-arch changes a strict requirement for Prabhakars series adding CMO stuff for Renesas/Andes?

09:08 prabhakarlad has joined #riscv

09:10 <arnd> conchuod: as far as I'm concerned, the strict requirement for new CMOs is that we come up with a sensible definition of what each dma operation should do

09:11 <arnd> both the current riscv definition for ZICBOM and the version that prabhakarlad was adding are common across other architectures, but they are fundamentally at odds with one another, so the bit I'm interested in is making them do the same thing first

09:11 <conchuod> Okay, that makes sense.

09:13 zjason` is now known as zjason

09:13 <arnd> I think the most controversial bit is the question about DMA_BIDIRECTIONAL: powerpc started the flush/flush semantics a long time ago, and this has made it into parisc, microblaze and now riscv over time

09:13 mahk has quit [Ping timeout: 268 seconds]

09:14 <arnd> the idea was to deal with a partially shared cache line at the beginning of the mapping, where one part of it is used by the CPU and another part is used by a device

09:14 <arnd> having a flush in dma_map_*() here makes sense, as this means the device will see the data that was written by the CPU and the CPU doesn't lose any of its own data

09:15 mahk has joined #riscv

09:15 <arnd> but in dma_unmap_*() there is absolutely no way to preserve both the data from the device and the CPU, if they concurrently write into the same cacheline

09:16 <arnd> invalidate loses any new data written by the CPU, and flush loses data written by the device

09:16 BootLayer has joined #riscv

09:19 <conchuod> That sounds like a topic for the Wills and Christophs of the world :)

09:20 <arnd> absolutely. There are a number of easier changes to make where I hope we can easily agree

09:21 <arnd> such as powerpc always doing the same thing for map and unmap, agaict that is just a historic artifact and changing it just makes it more efficient

09:23 <conchuod> I won't hold my horses for this to be resolved soon so! I did like the idea of removing the ability to decide what op is called for what, if there's gonna end up being several methods for doing this on riscv, that approach sounds ideal.

09:27 Sos has joined #riscv

09:28 <bjdooks> ok, in the case of a non-coherent device, then zicbom isn't going to cut it with dma_alloc as there's no way to sync the data or make it uncached

09:43 Sos has quit [Quit: Leaving]

09:46 <bjdooks> ok, so one of the tests is dma_alloc_noncoherent and that if i read it correctly should require the dma-sync calls

09:50 <bjdooks> ^arnd ?

09:51 pecastro has joined #riscv

09:52 <arnd> bjdooks: correct, though note that dma_alloc_noncoherent() is rarely used, it pretty much only exists for old MIPS and Itanium workstations from 25 years ago that had custom requirements

09:53 <arnd> it's not even mentioned in Documentation/core-api/dma-api-howto.rst

09:55 <jrtc27> why would anyone try to support a cache line being shared between dma and something else...

09:55 <jrtc27> that's just broken by design

09:56 <jrtc27> unless you have an architecture that guarantees partial writebacks

09:56 <arnd> jrtc27: yes, that was pretty much my point. I think we have a couple of device drivers that did this in violation of the interface, and they worked fine on machines with coherent caches but caused bugs on certain machines

09:56 <arnd> and then we had architecture maintainers trying to work around this without fully understanding the problem

09:57 <jrtc27> perhaps you want a sanitiser mode where the cmo implementation zeroes out the partial ends...

09:57 <jrtc27> (or some other junk pattenr)

09:57 <arnd> right, I had already considered adding a WARN_ONCE(unaligned address or size)\

10:02 bauruine has joined #riscv

10:12 mahk has quit [Changing host]

10:12 mahk has joined #riscv

10:18 sh1r4s3 has quit [Ping timeout: 264 seconds]

10:28 <bjdooks> https://www.pinterest.de/pin/672936369316516141/ <= when somene mentions new laptop

10:37 ldevulder has quit [Remote host closed the connection]

10:46 jacklsw has quit [Ping timeout: 265 seconds]

11:13 sh1r4s3 has joined #riscv

11:18 wingsorc has quit [Ping timeout: 246 seconds]

11:18 <arnd> jrtc27: I wonder if KASAN could do even better here: mark whole cache line as invalid in dma_sync_*_for_device(..., DMA_FROM_DEVICE) but mark only the actual data as valid in dma_sync_*_for_cpu(..., DMA_FROM_DEVICE)

11:19 <arnd> DMA_TO_DEVICE on partial cache lines is not harmful because there is no data corruption as long as the device only reads

11:19 sh1r4s3 has quit [Remote host closed the connection]

11:20 sh1r4s3 has joined #riscv

11:26 joev has quit [Ping timeout: 255 seconds]

11:26 joev has joined #riscv

11:29 Andre_Z has joined #riscv

11:30 sh1r4s3 has quit [Read error: Connection reset by peer]

11:31 sh1r4s3 has joined #riscv

11:36 joev has quit [Ping timeout: 255 seconds]

11:37 joev has joined #riscv

11:43 joev has quit [Ping timeout: 250 seconds]

11:43 joev has joined #riscv

11:58 joev has quit [Ping timeout: 250 seconds]

11:59 joev has joined #riscv

11:59 billchenchina- has joined #riscv

11:59 billchenchina- has quit [Remote host closed the connection]

12:00 billchenchina has joined #riscv

12:00 cwebber has joined #riscv

12:05 rneese has joined #riscv

12:11 Tenkawa has joined #riscv

12:20 Andre_Z has quit [Quit: Leaving.]

12:34 jmdaemon has quit [Ping timeout: 265 seconds]

12:48 ldevulder has joined #riscv

12:50 MaxGanzII has quit [Remote host closed the connection]

12:51 MaxGanzII has joined #riscv

12:53 billchenchina- has joined #riscv

12:56 billchenchina has quit [Ping timeout: 265 seconds]

12:56 elastic_dog has quit [Killed (zinc.libera.chat (Nickname regained by services))]

12:56 elastic_dog has joined #riscv

13:32 rurtty has joined #riscv

13:47 sh1r4s3_ has joined #riscv

13:47 sh1r4s3 has quit [Read error: Connection reset by peer]

13:51 Andre_Z has joined #riscv

14:04 motherfsck has joined #riscv

14:08 Andre_Z has quit [Ping timeout: 265 seconds]

14:30 MaxGanzII has quit [Remote host closed the connection]

14:31 jacklsw has joined #riscv

14:46 elastic_dog has quit [Remote host closed the connection]

14:47 elastic_dog has joined #riscv

15:00 Andre_Z has joined #riscv

15:00 lagash has quit [Quit: ZNC - https://znc.in]

15:02 lagash has joined #riscv

15:08 rneese has quit []

15:09 <arnd> geertu: I'm trying to make sense of the m68k arch_sync_dma_for_device() function, which has operations called 'push' and 'clear' instead of the normal 'clean'/'invalidate'/'flush'.

15:09 <arnd> is this a write-through or write-back cache?

15:11 <geertu> arnd: '020/ '040/'060 is write-back

15:11 <geertu> arnd: '020/'030 is write-through, '040/'060 is write-back

15:13 <arnd> ok, so 'push' is 'clean' (on WB) plus 'invalidate' on all, while 'clear' is just 'invalidate', right?

15:13 <geertu> arnd: yes, cfr. the documented semantics in arch/m68k/mm/memory.c

15:18 <arnd> geertu: got it, so this uses the regular writeback semantics, except that it does an extra invalidate in sync_dma_for_device(..., DMA_TO_DEVICE), where others just do a 'clean', and no 'invalidate'.

15:20 <arnd> I'm still unsure what semantics we actually want on write-through caches. I think what you do here (all operations in *_for_device, just skip the clean when that is a nop) would be the easiest, but it's not what other architectures do today

15:21 <arnd> on sparc32, xtensa and writethrough variants of armv4, the invalidate happens in _for_cpu() rather than for_device(), and I'm not sure whether there are any important tradeoffs

15:22 Noisytoot has quit [Read error: Connection reset by peer]

15:22 <geertu> Doing it in for_device() avoids ever pushing out the data twice, corrupting memory if the DMA wrote something in between

15:23 <geertu> BTW, I don't like "clean"

15:24 <geertu> Your Google Docs document also uses "flush", which is ambiguous.

15:24 <arnd> is 'wback' better?

15:25 <geertu> IIRC, "push" and "invalidate" are the non-ambiguous terms?

15:25 <arnd> I don't think anyone else uses 'push', so that would be more confusing

15:26 lagash has quit [Quit: ZNC - https://znc.in]

15:26 <arnd> 'wbinv' instead of 'flush' would be less ambiguous but smells very x86

15:26 <geertu> That's write-back + invalidate?

15:26 <arnd> right

15:27 <geertu> wback is unambihuous, too.

15:27 Noisytoot has joined #riscv

15:28 <geertu> "flush" is typically used in sayings like "yeah, you have to flush the cache to avoid corruption", but doesn't clarify what exactly needs to be done (push/wback? invalidate? Both?)

15:29 <arnd> I'll stick with the wback/inval/flush naming for the moment, hopefully that's clear enough. clean/inval/flush is the terminology from arch/arm, so I started with that, but that is a bit ambigous as both 'clean' and 'flush'

15:30 <arnd> have been used with multiple meanings

15:30 lagash has joined #riscv

15:33 <geertu> Exactly.

15:34 <geertu> What terminology does the buffer cache use?

15:39 <geertu> OK, that one is not write-through

15:39 Andre_Z has quit [Ping timeout: 276 seconds]

15:49 Tenkawa has quit [Ping timeout: 250 seconds]

15:50 Tenkawa has joined #riscv

15:59 vagrantc has joined #riscv

16:04 <dh`> the last thing I needed names for those on I used wb/wbinv/inv

16:23 motherfsck has quit [Ping timeout: 276 seconds]

16:28 <geertu> dh`: These are unambiguous, too.

16:34 pecastro has quit [Ping timeout: 264 seconds]

16:34 Andre_Z has joined #riscv

16:36 billchenchina has joined #riscv

16:38 MaxGanzII has joined #riscv

16:39 billchenchina- has quit [Ping timeout: 256 seconds]

16:58 lagash has quit [Quit: ZNC - https://znc.in]

16:59 lagash has joined #riscv

17:04 Perflosopher has joined #riscv

17:04 <geist> i really find the arm clean and invalidate to be about as unambiguous as it gets

17:09 jacklsw has quit [Read error: Connection reset by peer]

17:17 <Esmil> I'm guessing wback means write what is in cache to ram and inval means forget what is in the cache and read from ram. But what is flush then?

17:19 <geist> yah i think you need like 2 of the 3 terms at the same time, because wback/clean/flush are kinda ambiguous with each other

17:19 <geist> flush does tend to get codified into various apis as 'synchronize i and d cache' annoyingly

17:25 <Esmil> ah, sorry. arnd said earlier that flush is just wback + inval

17:25 prabhakarlad has quit [Quit: Client closed]

17:31 <geist> yah it depends on what api, they're all used differently

17:31 <geist> the flush one i'm thinking about is iirc a builtin in gcc/llvm for 'synchronize i & d' that's called flush

17:31 <geist> and thus is somewhat codified everywhere, across a bunch of OSes

17:31 sh1r4s3 has joined #riscv

17:32 <geist> though hmm, now i see it as __builtin___clear_cache

17:32 sh1r4s3_ has quit [Ping timeout: 255 seconds]

18:15 sh1r4s3 has quit [Ping timeout: 246 seconds]

18:21 jmdaemon has joined #riscv

18:24 jobol has quit [Quit: Leaving]

18:43 motherfsck has joined #riscv

18:59 pecastro has joined #riscv

19:16 <bjdooks> ok, now i've moved to using kzalloc() and dma_map it seems the kernel is possibly using bounce buffers, which sort of defats the idea of trying to use cmo ops... dma_alloc_noncoherent does however work

19:19 vagrantc has quit [Quit: leaving]

19:20 sh1r4s3 has joined #riscv

19:28 prabhakarlad has joined #riscv

19:44 KombuchaKip has quit [Quit: Leaving.]

20:01 BootLayer has quit [Quit: Leaving]

20:03 billchenchina has quit [Ping timeout: 248 seconds]

20:22 jmdaemon has quit [Ping timeout: 268 seconds]

20:34 lagash has quit [Quit: ZNC - https://znc.in]

20:37 lagash has joined #riscv

21:04 jmdaemon has joined #riscv

21:04 ___nick___ has quit [Ping timeout: 265 seconds]

21:20 ntwk has quit [Ping timeout: 248 seconds]

21:24 MaxGanzII_ has joined #riscv

21:27 MaxGanzII has quit [Ping timeout: 255 seconds]

21:34 ntwk has joined #riscv

21:41 Andre_Z has quit [Ping timeout: 268 seconds]

21:42 vineetg762 has joined #riscv

21:42 <palmer> bjdooks: IIRC we've got bounce buffers enabled by default as some systems need them (SiFive's ethernet, for example). Not sure what you're running on...

21:43 KombuchaKip has joined #riscv

21:43 <bjdooks> The sifive_u qemu and an internal FPGA farm

21:44 vineetg762 has quit [Client Quit]

21:51 <palmer> I guess the FPGAs are up to you to decide, but I think the sifive_u would end up emulating the same ethernet addressing related issues as in the Unleashed and thus have bounce buffers

21:53 ldevulder has quit [Ping timeout: 256 seconds]

21:56 bauruine has quit [Remote host closed the connection]

22:25 pedja has quit [Quit: Leaving]

22:26 MaxGanzII_ has quit [Quit: Leaving]

23:03 jmdaemon has quit [Ping timeout: 264 seconds]

23:18 ntwk has quit [Ping timeout: 255 seconds]

23:21 jmdaemon has joined #riscv

23:26 wingsorc has joined #riscv