broadcom/compiler: enable perquad with uses_wide_subgroup_intrinsics

This fixes a number of regressions in Vulkan subgroups tests in CTS.

Fixes: 97f5721bfc ('broadcom/compiler: needs_quad_helper_invocation enable PER_QUAD TMU access')
cc: mesa-stable

Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28797>
This commit is contained in:
Iago Toral Quiroga
2024-04-18 09:35:24 +02:00
committed by Marge Bot
parent fbd6da9bcd
commit 1070c9b0e7
+4 -3
View File
@@ -656,9 +656,10 @@ ntq_emit_tmu_general(struct v3d_compile *c, nir_intrinsic_instr *instr,
*/
uint32_t perquad =
is_load && !vir_in_nonuniform_control_flow(c) &&
c->s->info.stage == MESA_SHADER_FRAGMENT &&
c->s->info.fs.needs_quad_helper_invocations &&
!c->emitted_discard ?
((c->s->info.stage == MESA_SHADER_FRAGMENT &&
c->s->info.fs.needs_quad_helper_invocations &&
!c->emitted_discard) ||
c->s->info.uses_wide_subgroup_intrinsics) ?
GENERAL_TMU_LOOKUP_PER_QUAD :
GENERAL_TMU_LOOKUP_PER_PIXEL;
config = 0xffffff00 | tmu_op << 3 | perquad;