IRC Logs of #dri-devel on irc.freenode.net for 2025-01-27

08:09 MrCooper: DemiMarie mareko: FWIW, xf86-video-intel doesn't (always) enable DRI3 by default either
10:27 phasta: Does anyone here know how amdgpu pairs up with its cards? I looked at amdgpu_drv.c's section with the PCI IDs, and it seems they haven't added new IDs in years… do they have a catch-all ID or something like that?
11:49 MrCooper: phasta: indeed, it does
11:51 MrCooper: and it checks the individual IP blocks of the device to determine whether or not it can actually drive it
11:59 phasta: MrCooper, do you have a pointer to the code?
11:59 phasta: And, in this context, what is an IP-Block?
12:10 pq: Intellectual Property block, a.k.a hardware block
12:49 robmur01: daniels: FWIW i.MX6S/SX still exist ;)
13:00 kobboi: I have a gnome-shell > mutter > mesa crash in my VM. Backtrace at https://bpa.st/KUQA. Should I log this with mesa or with one of its users?
13:01 kobboi: (it makes gnome-shell/gdm crash at startup, falling back to X instead of wayland)
13:35 MrCooper: phasta: not offhand, agd5f ?
13:36 MrCooper: kobboi: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12467
13:36 phasta: I think it's in amdgpu_drv.c line 2139 following where they set up a catch-all and then do IP magic
13:36 phasta: doesn't like IP magic
13:42 MrCooper: not sure what you mean by "IP magic"; this compares the HW configuration to what the driver can actually support, would you prefer it to roll a dice or something? ;)
13:48 kobboi: MrCooper: awesome, I had encountered this same coredump already some weeks ago (although the side effect was different at the time) but there was no bug about it yet and I did not have the time to do anything with it (other than downgrade mesa). So I should have checked again for an existing bug. Thanks!
13:50 phasta: MrCooper, just register the actual PCI device IDs. This way it becomes totally obvious what hardware the driver is actually supposed to support
13:51 MrCooper: phasta: you're assuming a 1:1 mapping between PCI IDs and HW configurations, which doesn't exist for newer AMD GPUs
14:07 DemiMarie: MrCooper: isn't that deprecated in favor of modesetting?
14:17 Ermine: so... all new cards have the same id?
14:19 Ermine: all new amd cards that is
14:34 knurd: Lo! Is it possible to install mesa's vulkan drivers in a separate directory (say /usr/lib64/dri-freeworld) and then make the Linux install use those by default (e.g. instead of the ones from the distro) using just drop-in files?
14:34 Mis012[m]: magic is the opposite of having self-describing hw, obviously the benefit of ugly magic where the kernel has to have everything hardcoded is that you don't need to rely on dumps from people with the hw to get the full picture
14:34 knurd: Neither touching the distro's driver/icd files nor using environment variables like VK_DRIVER_FILES is an option for what I want to do: provide mesa-vulkan-driver add-on packages for fedora
14:35 knurd: that contain vulkan drivers supporting hw video acceleration for patent encumbered codecs.
14:35 knurd: I did a few experiments (https://fosstodon.org/@kernellogger/113899675657764802 ) and was able to install drivers in parallel without any problems and use the one I built with VK_DRIVER_FILES
14:35 knurd: But I did not manage to make the system use it by default; to my untrained eyes it looks like libVkLayer_MESA_device_select.so reorders things and prefers the distro's vulkan driver.
15:06 MrCooper: Ermine: not all the same, not all unique ones though
15:36 alyssa: unrolling a loop fixes a CTS test with lavapipe
15:36 alyssa: this does not spark joy (:
15:37 zmike: 🤕
15:38 alyssa: umm... can someone with an x86 machine try a branch for me real quick?
15:39 alyssa: alyssa/mesa:gallivm-brokenness with lavapipe cts test dEQP-VK.ray_tracing_pipeline.watertightness.0.65536
15:39 alyssa: it's RT pipelines which means the diff is... large..
15:39 zmike: you won't believe this
15:39 zmike: but I don't have you in my remotes
15:40 zmike: are we even developer friends?
15:40 zmike: oh no it was just on one machine
15:43 zmike: alyssa: it passes before/after your branch for me
15:44 alyssa: extreme clown moment
15:45 alyssa: so this is breaking with my branch.. only on arm64 then..? *melts*
15:46 alyssa: konstantin: passes with your loop limiter change cherry-picked
15:46 alyssa: can i go bed to sleep now
16:20 stsquad: when it comes to device memory allocated for native-context is that TTM or the driver that does it? I want to try an experiment to see if improving alignment helps
16:40 agd5f: phasta, amdgpu claims all devices with VID=0x1002 and VGA and Display PCI classes. Then we read the IP discovery table from the device to determine if a device is supported or not. See amdgpu_discovery.c
16:42 jenatali: mareko: Re: glthread: I think the WGL frontend doesn't have glthread hooked up yet. I don't think there's a reason not to do it, it just hasn't been a priority yet. So we either need to hook that up or else just make sure we don't break non-glthread
16:42 jenatali: If Linux is trending towards glthread then we should probably enable it for WGL just so we don't get divergence that makes hard-to-investigate bugs
16:42 agd5f: phasta, specifically amdgpu_discovery_set_ip_blocks() for how we match ip blocks
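A minimal sketch of the two-step matching agd5f describes above: claim by vendor ID and PCI display class, then decide from the IP blocks the device reports. This is a toy model, not the amdgpu code. The struct, the function names, the IP block IDs and the rule "every reported block must be known" are all assumptions made for illustration; only VID 0x1002 and the VGA/Display classes come from the log.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Toy model only: NOT the kernel's pci_device_id or any amdgpu structure. */
#define VENDOR_AMD          0x1002u /* VID quoted by agd5f */
#define CLASS_DISPLAY_VGA   0x0300u /* PCI display controller, VGA-compatible */
#define CLASS_DISPLAY_OTHER 0x0380u /* PCI display controller, other */

struct toy_device {
    uint16_t vendor;
    uint16_t device;           /* deliberately never consulted below */
    uint16_t class_code;       /* (base class << 8) | subclass */
    const uint16_t *ip_blocks; /* hypothetical IDs read from a discovery table */
    size_t n_ip_blocks;
};

/* Hypothetical set of IP block IDs this pretend driver knows how to drive. */
static const uint16_t known_ip_blocks[] = { 0x0a01, 0x0b02, 0x0c03 };

/* Step 1: catch-all claim by vendor and PCI class, ignoring the device ID. */
bool toy_claims(const struct toy_device *dev)
{
    return dev->vendor == VENDOR_AMD &&
           (dev->class_code == CLASS_DISPLAY_VGA ||
            dev->class_code == CLASS_DISPLAY_OTHER);
}

/* Helper: is this block ID in the driver's table? */
bool toy_block_known(uint16_t id)
{
    for (size_t i = 0; i < sizeof(known_ip_blocks) / sizeof(known_ip_blocks[0]); i++)
        if (known_ip_blocks[i] == id)
            return true;
    return false;
}

/* Step 2: drive the device only if it reports blocks and all of them are known. */
bool toy_can_drive(const struct toy_device *dev)
{
    if (!toy_claims(dev) || dev->n_ip_blocks == 0)
        return false;
    for (size_t i = 0; i < dev->n_ip_blocks; i++)
        if (!toy_block_known(dev->ip_blocks[i]))
            return false;
    return true;
}
```

In this model two devices with different device IDs but the same block list get the same answer, which is the point MrCooper makes about there being no 1:1 mapping between PCI IDs and HW configurations.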
16:51 alyssa: pendingchaos: metadata validation has significantly regressed vtn_bindgen2 performance in debug builds of mesa (== dev build times), profiling shows vtn_bindgen2 is entirely nir_validate bound and reverting the metadata/dominance validation commits helps a ton
16:51 alyssa: is there any low hanging optimization fruit in nir_validate for the new metadata stuff?
16:52 alyssa: should we necessarily be validating everything after every pass? IDK the answers to these
17:13 guludo: jani: uh-oh... Looks like dim rebuild-tip -d doesn't work as I expected (it appears I need to use -d before rebuild-tip).
17:14 jani: guludo: all dim parameters go right after 'dim'
17:15 guludo: I realized that now... Sorry.
17:15 guludo: ah, thankfully it appears rebuild-tip doesn't use the local drm-intel-next.
17:16 guludo: so I guess, hopefully, no harm was done? I just ended up pushing a new drm-tip with the same trees as the previous one?
17:19 guludo: because I did not use the local branch positional arg... I guess I was lucky
17:26 jani: yeah, dim is a bit dim
17:37 guludo: now I used dim -d rebuild-tip drm-intel-next and build-tested locally; then used dim push.
17:39 jani: I've never used dim -d myself. if the patch applied to drm-tip during CI and to an individual branch while merging, it's extremely unlikely to cause problems
17:39 jani: unless there's a long time between the two
17:41 vsyrjala: i usually do 'dim -d rebuild-tip + build test' after resolving a conflict
17:47 jani: I avoid applying patches that conflict ;)
17:48 pendingchaos: I'm not aware of any low hanging fruit
17:48 pendingchaos: and I think validating after each pass is generally best, since a later pass could hide the issue before the next validation
17:56 demarchi: jani: when applying a big patch series, I don't want to be caught by surprise, **after pushing**, that a merge failed
17:57 demarchi: jani: so I usually do "dim -d rebuild-tip drm-xe-next"
17:57 jani: demarchi: yeah I get that too
17:57 demarchi: so it uses my local drm-xe-next branch just to check if it will be able to merge everything
17:59 demarchi: guludo: I wouldn't say lucky... I added a check for that on purpose ;)
18:53 alyssa: pixelcluster: do we have any idea for how scratch/stack works in NIR with multiple real functions?
18:53 alyssa: it feels like what NIR calls scratch really should be stack, and nir->scratch_size should be per-function_impl?
18:54 guludo: demarchi: thanks :-)
18:54 alyssa: (and then things should "just work" with real functions)
18:54 alyssa: (provided the backend maintains the sp appropriately across calls)
18:55 alyssa: I'm hitting this with vtn_bindgen2 but that's just a symptom tbh
18:56 alyssa: (and then I guess nir_inline_functions would need to handle sp bumps and -- and then scratch_size breaks -- oh ffs)
19:00 karolherbst: yeah... it's a big topic and yeah.... a lot of things to figure out :)
19:13 rodrigovivi: sima: https://lore.kernel.org/dri-devel/20241017075725.207384-1-giedriuswork@gmail.com/
19:14 rodrigovivi: this one requires ack from you or dave since it touches include/drm right? or ack from drm-misc maintainers is enough?
19:15 alyssa: karolherbst: definitely the current model of shader global vars for scratch is not what we want
19:15 alyssa: although I don't know exactly what is
19:28 karolherbst: yeah...
19:29 karolherbst: from a "everything is inlined" perspective it works well enough, but once we move away from that......
19:30 karolherbst: I still have to figure out CL global vars in the global address space and believe me, that's a mess on an entirely different level 🙃 I honestly don't know what they were thinking with that one...
19:52 sima: rodrigovivi, drm-misc is enough but ack
19:52 rodrigovivi: thank you
20:11 yrlf: quick DMA-BUF question: if I create a DMA-BUF with gbm_bo_create, the underlying DMA-BUF will be kept alive as long as I have a valid fd to it, right?
20:12 yrlf: i.e. I can gbm_bo_destroy() it as soon as I have another fd to it and I have effectively "exported" it out of libGBM
20:13 emersion: yeah
20:13 emersion: GEM handles also keep a refcount
20:13 yrlf: great, thanks
20:13 yrlf: currently trying to clean up the mess I have in wl-mirror, finally trying to get the DMA-BUF details right
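A minimal sketch of the lifetime rule yrlf and emersion agree on above: the buffer survives gbm_bo_destroy() as long as an exported fd still refers to it. This is a toy reference-counting model, not libgbm or kernel code. All `toy_*` names are hypothetical, and collapsing the bo and the fd into one counter is a simplifying assumption.

```c
#include <assert.h>
#include <stdbool.h>

/* Toy model: only the reference-counting idea, NOT libgbm or the kernel. */
struct toy_dmabuf {
    int refs;   /* current holders: the bo and any exported fds */
    bool freed; /* set once the last reference is dropped */
};

/* Drop one reference; the buffer is released when none remain. */
void toy_put(struct toy_dmabuf *buf)
{
    assert(buf->refs > 0);
    if (--buf->refs == 0)
        buf->freed = true;
}

/* Stands in for gbm_bo_create(): the bo holds the first reference. */
void toy_bo_create(struct toy_dmabuf *buf)
{
    buf->refs = 1;
    buf->freed = false;
}

/* Stands in for exporting an fd (e.g. via gbm_bo_get_fd()): the fd holds its own reference. */
void toy_export_fd(struct toy_dmabuf *buf)
{
    assert(!buf->freed);
    buf->refs++;
}

/* Stands in for gbm_bo_destroy(): only the bo's reference goes away. */
void toy_bo_destroy(struct toy_dmabuf *buf)
{
    toy_put(buf);
}

/* Stands in for close(fd) on the exported fd. */
void toy_close_fd(struct toy_dmabuf *buf)
{
    toy_put(buf);
}
```

Calling toy_bo_create(), toy_export_fd() and then toy_bo_destroy() leaves one reference and the buffer alive; it is only marked freed after toy_close_fd().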
21:08 mareko: alyssa: scratch is only for spilling, though if you have function calls, you need it to work like stack
21:08 alyssa: mareko: I'm talking about nir's load/store_scratch intrinsics
21:09 mareko: that's just for spilling
21:09 alyssa: nope :(
21:09 mareko: and indirect indexing emulation
21:09 alyssa: ..and function temps in OpenCL (`private`)
21:11 mareko: isn't that just indirect indexing
21:16 austriancoder: daniels: if you have time to talk about what's needed to land !3418 - feel free to ping me or leave a comment in the MR
23:40 karolherbst: alyssa: sooooooo CL global vars are global to the _program_ not individual kernels, so it shares the content across kernel invocations and everything + it supports initializers, so you need to initialize it after compilation, not on a queue. worse, the initializer can depend on the address of the global var + any deref chain based on it
23:40 karolherbst: which means... specconstantops can be derefs
23:40 karolherbst: which means it's not a constant
23:40 karolherbst: which means.. I need to be able to spill spec constant op chains to an initialization kernel 🙃
23:40 karolherbst: it's horrible
23:41 karolherbst: though...
23:41 karolherbst: an alternative approach would be, that I reserve an address and pass it into spirv_to_nir, but then I'd have to do deref chain resolving in spirv_to_nir and not in lower_io 🙃
23:42 karolherbst: this feature is just cursed
23:42 jenatali: karolherbst: The spec says initialization can happen on first-enqueue of any kernel from the program
23:42 karolherbst: and the only benefit is, that you don't have to bind a buffer containing that global state...
23:42 karolherbst: jenatali: sure, but what if you have 5 enqueues on 5 threads
23:42 karolherbst: on different queues
23:42 jenatali: Serialize 'em
23:42 karolherbst: well...
23:43 karolherbst: I'd prefer not to
23:43 karolherbst: but yeah.. I could
23:43 karolherbst: doesn't really change much because that's not the hard problem
23:43 jenatali: Heh sure
23:43 karolherbst: the hard problem is the specconstantop situation
23:44 jenatali: Yeah that's gnarly
23:44 karolherbst: I liked my idea with passing in the address, but then later I noticed: wait... there are deref chains on top
23:44 karolherbst: I have a prototype for it, but I hate it
23:44 karolherbst: (spilling the constant op chains to an init kernel that is)
23:45 karolherbst: it's just sad that 1. generic_address_space CTS testing _requires_ this feature
23:45 karolherbst: and 2. generic_address_space is required by adaptivecpp
23:45 karolherbst: it's all pain
23:47 jenatali: Yeah. CL was a mistake
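A minimal sketch of the "Serialize 'em" suggestion from the exchange above: every enqueue takes a per-program lock and the first one to get it runs the global-variable initialization, so concurrent enqueues from different threads and queues cannot initialize twice. This is an assumption-laden toy, not Rusticl or any real OpenCL runtime. The struct and the `toy_*` functions are hypothetical, and it does not address the specconstantop problem karolherbst calls the hard part.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-program state; not taken from any real runtime. */
struct toy_program {
    pthread_mutex_t lock;
    bool globals_ready; /* have program-scope globals been initialized? */
    int init_runs;      /* counts how often the initializer actually ran */
};

void toy_program_setup(struct toy_program *p)
{
    pthread_mutex_init(&p->lock, NULL);
    p->globals_ready = false;
    p->init_runs = 0;
}

/* Called at the start of every enqueue of any kernel from the program. */
void toy_enqueue(struct toy_program *p)
{
    pthread_mutex_lock(&p->lock);
    if (!p->globals_ready) {
        /* Stand-in for running the initialization kernel exactly once. */
        p->init_runs++;
        p->globals_ready = true;
    }
    pthread_mutex_unlock(&p->lock);
    /* The actual kernel launch would follow here, outside the lock. */
}

void toy_program_teardown(struct toy_program *p)
{
    pthread_mutex_destroy(&p->lock);
}
```

Because the flag is only read and written under the lock, five enqueues on five threads would still run the initializer once; the cost is that every enqueue pays for a lock acquisition, which may be part of why karolherbst would "prefer not to".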