mesa/src/amd at 104f21c27b0c8b2dce1ba5e6a3b3c05e77067e1f - mesa

Files

Georg Lehmann 6d2190300a radv/nir/lower_cmat: tightly pack 8bit gfx11 acc matrix

Invalid for now, but used by vkd3d-proton, where the use case is to convert
a result matrix to lower precision, followed by a store.

For 16bit accumulation matrices, GFX11 only uses 16bits per 32bit register.
RADV's coop matrix code pads the unused space with undefs and uses a vector
with twice as many elements as the matrix length. Extending that to 8bit by
leaving 24 bits unused is unnecessary as these matrices as there
is no hw unit that requires it. And in wave32, it would also result in
vectors larger than NIR's limit.
So tightly pack 8bit matrices without any undef padding.

Reviewed-by: Timur Kristóf <timur.kristof@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34382>

2025-04-24 06:37:44 +00:00

addrlib

amd/addrlib: remove the DCC page fault workaround

2025-04-01 03:23:22 -04:00

ci/piglit: Use structured tagging for Piglit

2025-04-17 09:22:39 +00:00

common

ac/nir: init blake3 for cs blit shader

2025-04-23 07:59:10 +00:00

compiler

aco: use v_perm_b32 for byte swaps within a VGPR on gfx10

2025-04-23 18:23:18 +00:00