ChanServ changed the topic of #armlinux to: ARM kernel talk [Upstream kernel, find your vendor forums for questions about their kernels] | https://libera.irclog.whitequark.org/armlinux
qubuepe24_ has quit [Remote host closed the connection]
lain6141 has quit [Remote host closed the connection]
lain6141 has joined #armlinux
MilkMann has joined #armlinux
mraynal has quit [Read error: Connection reset by peer]
mraynal has joined #armlinux
<MilkMann>
hey everyone, I'm working on optimizing arm-based systems running linux for real-time processing tasks, specifically targeting a heterogeneous architecture with arm cortex-a72 cores and a dedicated mali-g71 gpu. the goal is to implement efficient inter-process communication (ipc) between the cpu and gpu for machine learning inference, leveraging
<MilkMann>
arm's big.little architecture. the bottleneck seems to be in the cpu-gpu synchronization, especially when offloading computational tasks to the gpu using opencl, as i’m experiencing significant overhead from memory transfers between the cpu and gpu.
<MilkMann>
i’ve tried fine-tuning the `cgroups` settings for cpu affinity and the `dma-buf` framework for zero-copy memory management, but the system still exhibits latency spikes when switching between cpu and gpu-heavy workloads. i'm also considering using `streaming simd extensions` (sse) to optimize the cpu’s performance, but i’m concerned that it
<MilkMann>
could interfere with the gpu memory bandwidth, especially given the limited number of memory channels in the arm system.
<MilkMann>
additionally, I’m exploring the possibility of utilizing arm's hardware virtualization extensions (like kvm) for isolating gpu tasks in a separate vm for better resource management. however, I'm encountering kernel panics when attempting to pass-through the gpu to a guest os using vfio, possibly due to mismatched irq routing or pcie device
<MilkMann>
assignment issues.
<MilkMann>
does anyone have experience optimizing cpu-gpu communication in arm linux environments? or any advice on handling real-time memory transfers and managing virtualized gpus in this architecture? any tips for reducing latency, improving resource contention, or better utilizing arm’s architecture for these use cases would be greatly appreciated
heat has quit [Ping timeout: 248 seconds]
MilkMann has quit [Quit: Client closed]
qubuepe24 has joined #armlinux
qubuepe24 has quit [Remote host closed the connection]
qubuepe24 has joined #armlinux
qubuepe24 has quit [Remote host closed the connection]
mvaittin has joined #armlinux
qubuepe24 has joined #armlinux
qubuepe24 has quit [Remote host closed the connection]
System_Error has joined #armlinux
monstr has joined #armlinux
headless has joined #armlinux
amitk_ has joined #armlinux
amitk has quit [Ping timeout: 245 seconds]
gclement has joined #armlinux
cbeznea has joined #armlinux
<linusw__>
MilkMann: that's a tall order, but start by talking to the DRM developers. They can be found in #dri-devel on the OFTC network.
headless has quit [Quit: Konversation terminated!]
Perflosopher2 has joined #armlinux
Perflosopher has quit [Ping timeout: 260 seconds]
Perflosopher2 is now known as Perflosopher
frieder has joined #armlinux
apritzel has joined #armlinux
nsaenz has joined #armlinux
nsaenz has quit [Remote host closed the connection]
nsaenz has joined #armlinux
_rgallaispou has joined #armlinux
System_Error has quit [Ping timeout: 260 seconds]
sszy has joined #armlinux
System_Error has joined #armlinux
mripard has quit [Quit: WeeChat 4.4.2]
jclsn has joined #armlinux
monstr_ has joined #armlinux
jclsn has quit [Quit: WeeChat 4.4.3]
jclsn has joined #armlinux
mripard has joined #armlinux
<socksins1>
that guy sure is busy, having both realtime arm linux and netbsd systems to optimize :D (he asked a similar yet distinctly different question over there)
TheCoffeMaker has quit [Remote host closed the connection]
TheCoffeMaker has joined #armlinux
<phh>
it's weird to jump on virtualisation, cgroups or anything, before realizing that "memory transfers" are a no-op
<phh>
(if anyone knows /any/ product with different memory bus for Cortex-A cpu vs Mali gpu I'm curious)
XV8 has joined #armlinux
mvaittin has quit [Remote host closed the connection]