Alyssa Rosenzweig
3f795a2b8d
nir/divergence_analysis: handle more AGX
...
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com >
Reviewed-by: Marek Olšák <maraeo@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36265 >
2025-07-23 14:15:57 +00:00
Alyssa Rosenzweig
ebc18de6f5
nir/opt_vectorize_io: allow i/o semantics w/o component
...
load_uvs_index_agx always implicitly uses component 0.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com >
Reviewed-by: Marek Olšák <maraeo@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36265 >
2025-07-23 14:15:57 +00:00
Alyssa Rosenzweig
75f25b35f8
nir: handle frag_coord_z/w intrinsics
...
so we can gather_info later.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com >
Reviewed-by: Marek Olšák <maraeo@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36265 >
2025-07-23 14:15:57 +00:00
Alyssa Rosenzweig
8716012b21
glsl,nir: factor out nir_opt_varyings_bulk
...
Correctly/optimally using nir_opt_varyings directly is pretty tricky. For GL, we
have all the right logic in the GLSL linker. for VK, we don't want to duplicate
this dance in every driver. Wrap it all up in a nir_opt_varyings_bulk helper
that operates on an entire pipeline of nir_shader's, following the GLSL linker's
logic. This is suitable for Vulkan drivers.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com >
Reviewed-by: Marek Olšák <maraeo@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36265 >
2025-07-23 14:15:57 +00:00
Alyssa Rosenzweig
bc64ea2815
vulkan: fix shader linking with common pipelines
...
Despite appearances, the current vk_pipeline implementation fails to link any
shaders, unless GPL is used or the link_geom_stages option is set (which no
drivers do). Notably monolithic pipelines don't get linked.
This patch attempts to fix our linking issues. Monolithic pipelines now get
linked, GPL optimized pipelines do too. GPL fast link is still not linked.
Geometry stages are now always linked because - despite the option - I think all
hardware wants this. Apps love writing random dead varyings for literally no
reason, which isn't free even on NVIDIA. This removes the option, effectively
setting it for all drivers, which in retrospect is the right decision.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com >
Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36265 >
2025-07-23 14:15:57 +00:00
Georg Lehmann
e8ebc40f22
pvr/rogue: replace NIR_PASS_V with NIR_PASS(_, ...)
...
Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36312 >
2025-07-23 14:00:51 +00:00
Georg Lehmann
83945e2247
pvr/rogue: return progress in rogue_nir_pfo
...
Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36312 >
2025-07-23 14:00:51 +00:00
Konstantin Seurer
cabcd7e9ea
lavapipe/rt: Do not use vk_acceleration_structure::size
...
vkd3d-proton sets this to the size of the backing memory which means we
will overwrite random buffer contents when performing acceleration
structure copies.
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
536f5d3496
lavapipe/rt: Set push_constant_size
...
This is necessary for push constants to be emitted.
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
a8cc143044
lavapipe/rt: Fix watertightness for real this time
...
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
9313a16e24
util: Fix sparse tile size when dimensions=1
...
The return value is blocks and not bytes.
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
45d48ecebf
gallivm: Implement arrayed non-arrayed descriptor compatibility
...
Sampling with layer!=0 from a non arrayed descriptor should return 0 and
sampling without an explicit array layer from and arrayed descriptor
should sample the first layer.
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
bc56ec7ce0
gallivm: Implement txs with divergent explicit lod
...
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
6d3c10577d
lavapipe: Set image_array for input attachment loads
...
They have the layer as an array component.
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
e323b01703
lavapipe/ci: Add context to some vkd3d-proton test fails
...
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
c9c1110149
lavapipe: Adjust imageGranularity for block formats
...
The cts tests are wrong.
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
70c4f2f91f
gallium/util: Fix an assert in util_resource_copy_region
...
The assert can fail when copying from uncompressed to compressed with
dst_level!=0 which is valid.
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Konstantin Seurer
f1fdd26482
gallivm: Silence a warning
...
Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36197 >
2025-07-23 13:29:18 +00:00
Christoph Pillmayer
1c23f18ea8
panvk: Advertise VK_EXT_mutable_descriptor_type on v9+
...
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36002 >
2025-07-23 12:54:24 +00:00
Christoph Pillmayer
ec02137c86
panvk: Support DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT
...
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36002 >
2025-07-23 12:54:24 +00:00
Christoph Pillmayer
b5a91e537a
panvk: Support VK_DESCRIPTOR_TYPE_MUTABLE_EXT on v9+
...
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36002 >
2025-07-23 12:54:24 +00:00
Georg Lehmann
fcc9203550
nir/opt_remove_phis: skip unreachable phis
...
block->imm_dom is NULL for unreachable phis, so the dominance checks would crash.
These blocks should be removed by nir_opt_dead_cf, so don't bother optimizing
them here.
Fixes: 60776f87c3 ("nir/opt_remove_phis: rematerialize constants")
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev >
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35935 >
2025-07-23 12:24:31 +00:00
Jose Maria Casanova Crespo
f0b3a4fcaf
v3dv: limit V3D_TFU_READAHEAD to buffers/images with USAGE_TRANSFER_SRC flag
...
We avoid adding unconditionally the 64-bytes padding to all usages
of the vulkan memory allocations. The readahead padding is only added
for buffers/images with USAGE_TRANSFER_SRC_BIT usage enabled as this
is enough for having a full vk-cts without reported MMU TFU errors.
vk-cts doesn't exercise the added image memory requirements codepath to
handle this readahead. This is because the required 64-bytes image
alignments for images with flag VK_IMAGE_USAGE_TRANSFER_SRC_BIT.
But the alignment didn't cover when the image is already aligned to
64-bytes at the end of the memory page.
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36159 >
2025-07-23 10:44:02 +00:00
Jose Maria Casanova Crespo
4e033ffb27
v3d: Add V3D_TFU_READAHEAD padding for allocated resources
...
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36159 >
2025-07-23 10:44:02 +00:00
Jose Maria Casanova Crespo
310aa198f4
v3dv: Move V3D_TFU_READAHEAD_SIZE to src/broadcom/common
...
We will use it in v3d.
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36159 >
2025-07-23 10:44:01 +00:00
Rhys Perry
7ed9fdf85b
nir/search: check variable requirements even if it's already seen
...
Even if it's already seen, the variable might have some unchecked
requirements.
Signed-off-by: Rhys Perry <pendingchaos02@gmail.com >
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12320
Backport-to: 25.1
Reviewed-by: Georg Lehmann <dadschoorse@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32837 >
2025-07-23 09:43:33 +00:00
Rhys Perry
f45026751f
nir/cf: have nir_remove_after_cf_node remove phis at the start too
...
Signed-off-by: Rhys Perry <pendingchaos02@gmail.com >
Backport-to: 25.1
Reviewed-by: Georg Lehmann <dadschoorse@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35975 >
2025-07-23 09:06:36 +00:00
Christoph Pillmayer
9c104fa026
panvk: Make most end work instrumentation synchronous
...
Most of it is tracking stuff that is ending synchronously anyways.
For example, in emit_barrier_insert_waits, cs_sync64_wait is sync and
therefore there is no need to defer the timestamp write on any SBs.
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com >
Reviewed-by: Olivia Lee <olivia.lee@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36161 >
2025-07-23 08:28:23 +00:00
Christoph Pillmayer
8255af9f54
panvk: Make ts in panvk_instr_begin_work synchronous
...
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com >
Reviewed-by: Olivia Lee <olivia.lee@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36161 >
2025-07-23 08:28:23 +00:00
Christoph Pillmayer
737156b4df
panvk: Make panvk_utrace_record_ts wait mask configurable
...
This is so that in subsequent commits the wait mask can be:
- set by the caller
- not provided at all -> synchronous ts write
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com >
Reviewed-by: Olivia Lee <olivia.lee@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36161 >
2025-07-23 08:28:22 +00:00
Christoph Pillmayer
7358f0e045
panvk: hide utrace behind more generic interface
...
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com >
Reviewed-by: Olivia Lee <olivia.lee@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36161 >
2025-07-23 08:28:21 +00:00
Erik Faye-Lund
0ed1a10b73
Revert "upanfrost: make 128-bit opt-in with driconf on v4"
...
This is no longer needed, thanks to the previous commit.
This reverts commit 23a32b948b .
Backport-to: 25.1
Reviewed-by: Eric R. Smith <eric.smith@collabora.com >
Acked-by: Marek Olšák <marek.olsak@amd.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36156 >
2025-07-23 07:37:23 +00:00
Erik Faye-Lund
0178a4bd9d
Revert "lima: make fp16 render-targets opt-in with driconf"
...
This is no longer needed, due to the previous commit.
This reverts commit 1617778c38 .
Backport-to: 25.1
Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com >
Acked-by: Marek Olšák <marek.olsak@amd.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36156 >
2025-07-23 07:37:23 +00:00
Erik Faye-Lund
c7bc454dbd
mesa/st: do not check single-sampled for max_samples
...
According to screen.rst, sample_count of 1 is the same as a sample_count
of 0, neither of which are multi-sampling. We only want to include
formats that support multi-sampling when calculating max samples,
otherwise we can't support the combination of floating-point textures
and multi-sampling on DX9-class GPUs.
...There's one special-case, though; FakeMSAA, which is implemented as
sample_count = 1. In that case, we actually need to check for a single
sample. So let's check for that first, to figure out what the actual
min value is. Ugh, this is hairy.
This brings back GL_EXT_framebuffer_multisample and
GL_EXT_framebuffer_multisample_blit_scaled on R300, and should get
Crocus back to GL 3.x as it was before.
Fixes: f56443ac ("st/mesa: search for smallest supported sample-count")
Acked-by: Marek Olšák <marek.olsak@amd.com >
Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36156 >
2025-07-23 07:37:23 +00:00
Erik Faye-Lund
7970ac6234
r300/ci: update expected failures
...
This isn't related to the changes in this MR, but was discovered while
testing it. Let's update these while we're at it.
Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36156 >
2025-07-23 07:37:23 +00:00
Bohan Yu
778b8600d5
gallium: Fix LLVMpipe function parameter of Vector type call load mismatch
...
When using vector parameter type in Rusticl with LLVMpipe backend, the
generated LLVM IR function definition part uses an Array of Vector
(correct type) LLVM type, but the caller generates a Vector of Vector
(invalid type). This mismatch causes three problems:
1. Type check fails in debug build;
2. The dumped LLVM BC cannot be disassembled due to Invalid Type;
3. Potential faulty JIT code.
This bugfix construct an Array of Vector for NIR function call by a new
`lp_build_gather_array` function. Special thanks to Zhichao Guan
<vbcpascal@outlook.com > for the joint effort in this bugfix, and
Jose Fonseca for sggestions.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36287 >
2025-07-23 07:14:57 +00:00
Georg Lehmann
b572bd9b91
svga: use NIR_PASS(_, ...) for gl_nir_lower_images
...
Reviewed-by: Emma Anholt <emma@anholt.net >
Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291 >
2025-07-23 06:47:58 +00:00
Georg Lehmann
d02b1893f1
llvmpipe: use NIR_PASS(_, ...) for nir_lower_fragcolor
...
Reviewed-by: Emma Anholt <emma@anholt.net >
Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291 >
2025-07-23 06:47:58 +00:00
Georg Lehmann
3bc691f116
broadcom/compiler: use NIR_PASS for nir_schedule
...
This should work now that the pass returns progress and invalidates metadata.
Reviewed-by: Emma Anholt <emma@anholt.net >
Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291 >
2025-07-23 06:47:58 +00:00
Georg Lehmann
e1be7213e4
nir/schedule: return progress and fix metadata
...
Tracking if we changed the shader is somewhat complicated, so assume
we always do.
Reviewed-by: Emma Anholt <emma@anholt.net >
Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291 >
2025-07-23 06:47:58 +00:00
Georg Lehmann
ee0f2ed241
gallivm: use NIR_PASS(_, ...) instead of NIR_PASS_V
...
Reviewed-by: Emma Anholt <emma@anholt.net >
Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291 >
2025-07-23 06:47:58 +00:00
Georg Lehmann
a43a26ad2b
gallium/draw: use NIR_PASS(_, ...) instead of NIR_PASS_V
...
Reviewed-by: Emma Anholt <emma@anholt.net >
Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291 >
2025-07-23 06:47:58 +00:00
Georg Lehmann
18be5603ed
lavapipe: use NIR_PASS(_, ...) instead of NIR_PASS_V
...
Reviewed-by: Emma Anholt <emma@anholt.net >
Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291 >
2025-07-23 06:47:58 +00:00
Eric Engestrom
544d0c03e2
radv/ci: lower timeouts for vkd3d jobs
...
No timeout was set, so the project-wide default of 1h was applied.
radv-raven-vkd3d had to be excluded as it caused lava_job_submitter.py
to fail (without any error message).
All the jobs touched here take between 2 and 4 minutes to run total, and
between 40 and 100 sec of test time.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36305 >
2025-07-23 06:24:30 +00:00
Eric Engestrom
9e692a176f
radv/ci: lower timeouts for newly added gfx1201 jobs
...
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36305 >
2025-07-23 06:24:30 +00:00
Timothy Arceri
971af9cf98
util: add workaround for Interstellar Rift
...
Without this shaders fail to compile due to use of the texture2D
function.
Cc: mesa-stable
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36281 >
2025-07-22 23:58:00 +00:00
Gurchetan Singh
5740c65b6a
gfxstream: vulkan-mapper: special case Nvidia
...
Also, VIRTGPU_KUMQUAT_HANDLE_TYPE_MEM_DMABUF is 0x2, not 0x1
Fixes create device failures in end2end tests.
Reviewed-by: Jason Macnak <jmacnak@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36295 >
2025-07-22 23:44:28 +00:00
Alyssa Rosenzweig
803e61837e
agx: reassociate ALU
...
GL:
total instrs in shared programs: 2881862 -> 2801415 (-2.79%)
instrs in affected programs: 2264277 -> 2183830 (-3.55%)
total alu in shared programs: 2362306 -> 2281986 (-3.40%)
alu in affected programs: 1882190 -> 1801870 (-4.27%)
total fscib in shared programs: 2359848 -> 2279314 (-3.41%)
fscib in affected programs: 1891013 -> 1810479 (-4.26%)
total ic in shared programs: 661722 -> 661702 (<.01%)
ic in affected programs: 1304 -> 1284 (-1.53%)
total gprs in shared programs: 899341 -> 900319 (0.11%)
gprs in affected programs: 48696 -> 49674 (2.01%)
total uniforms in shared programs: 2069880 -> 2064570 (-0.26%)
uniforms in affected programs: 426411 -> 421101 (-1.25%)
total threads in shared programs: 27802432 -> 27802624 (<.01%)
threads in affected programs: 5568 -> 5760 (3.45%)
total preamble in shared programs: 1202295 -> 1222360 (1.67%)
preamble in affected programs: 452890 -> 472955 (4.43%)
VK:
Totals:
MaxWaves: 53077184 -> 53075712 (-0.00%); split: +0.05%, -0.05%
Instrs: 23845634 -> 23561020 (-1.19%); split: -1.22%, +0.02%
CodeSize: 170339242 -> 168601666 (-1.02%); split: -1.04%, +0.02%
Spills: 65594 -> 65784 (+0.29%); split: -1.43%, +1.72%
Fills: 43190 -> 43178 (-0.03%); split: -2.21%, +2.18%
Scratch: 404208 -> 403474 (-0.18%); split: -0.27%, +0.08%
ALU: 18566800 -> 18288141 (-1.50%); split: -1.52%, +0.02%
FSCIB: 18511881 -> 18230860 (-1.52%); split: -1.54%, +0.02%
IC: 5260462 -> 5259748 (-0.01%); split: -0.02%, +0.00%
GPRs: 3831837 -> 3838887 (+0.18%); split: -0.25%, +0.43%
Uniforms: 10453510 -> 10443173 (-0.10%); split: -0.29%, +0.19%
Preamble instrs: 10409287 -> 10496713 (+0.84%); split: -0.10%, +0.94%
Totals from 32343 (59.87% of 54019) affected shaders:
MaxWaves: 31027072 -> 31025600 (-0.00%); split: +0.08%, -0.08%
Instrs: 19806186 -> 19521572 (-1.44%); split: -1.46%, +0.03%
CodeSize: 141121024 -> 139383448 (-1.23%); split: -1.25%, +0.02%
Spills: 65252 -> 65442 (+0.29%); split: -1.44%, +1.73%
Fills: 42745 -> 42733 (-0.03%); split: -2.23%, +2.20%
Scratch: 403096 -> 402362 (-0.18%); split: -0.27%, +0.08%
ALU: 15544339 -> 15265680 (-1.79%); split: -1.82%, +0.03%
FSCIB: 15491754 -> 15210733 (-1.81%); split: -1.84%, +0.03%
IC: 4817376 -> 4816662 (-0.01%); split: -0.02%, +0.01%
GPRs: 2735551 -> 2742601 (+0.26%); split: -0.35%, +0.61%
Uniforms: 7717506 -> 7707169 (-0.13%); split: -0.39%, +0.25%
Preamble instrs: 7713698 -> 7801124 (+1.13%); split: -0.14%, +1.27%
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Acked-by: Adam Jackson <ajax@redhat.com >
Reviewed-by: Mel Henning <mhenning@darkrefraction.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36147 >
2025-07-22 23:17:01 +00:00
Alyssa Rosenzweig
7a786a9c7a
agx: run more opt passes
...
preparing for reassoc
GL mostly noise, Vulkan:
Totals from 32853 (60.82% of 54019) affected shaders:
MaxWaves: 31747776 -> 31758272 (+0.03%); split: +0.04%, -0.01%
Instrs: 18017616 -> 18016663 (-0.01%); split: -0.11%, +0.11%
CodeSize: 128159164 -> 128249442 (+0.07%); split: -0.13%, +0.20%
Spills: 63634 -> 62658 (-1.53%); split: -1.83%, +0.30%
Fills: 42547 -> 41669 (-2.06%); split: -2.51%, +0.44%
Scratch: 341914 -> 341748 (-0.05%); split: -0.09%, +0.04%
ALU: 13999432 -> 13998308 (-0.01%); split: -0.13%, +0.12%
FSCIB: 13979325 -> 13978584 (-0.01%); split: -0.13%, +0.12%
IC: 3953418 -> 3957996 (+0.12%); split: -0.03%, +0.14%
GPRs: 2621294 -> 2619432 (-0.07%); split: -0.13%, +0.06%
Uniforms: 7118591 -> 7040633 (-1.10%); split: -1.91%, +0.82%
Preamble instrs: 6800746 -> 6571058 (-3.38%); split: -3.76%, +0.39%
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Acked-by: Adam Jackson <ajax@redhat.com >
Reviewed-by: Mel Henning <mhenning@darkrefraction.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36147 >
2025-07-22 23:17:01 +00:00
Alyssa Rosenzweig
ecc51d9b9b
agx: make sure denorm flushing really happens
...
Backport-to: 25.1
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io >
Reviewed-by: Mel Henning <mhenning@darkrefraction.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36147 >
2025-07-22 23:17:01 +00:00