aco/optimizer: use new helpers for v_add_u32 opts
Foz-DB Navi48: Totals from 1554 (1.89% of 82419) affected shaders: Instrs: 5154325 -> 5151499 (-0.05%); split: -0.08%, +0.02% CodeSize: 27310012 -> 27318708 (+0.03%); split: -0.01%, +0.05% VGPRs: 97236 -> 97200 (-0.04%); split: -0.05%, +0.01% Latency: 34121873 -> 34120894 (-0.00%); split: -0.02%, +0.01% InvThroughput: 6735276 -> 6730418 (-0.07%); split: -0.08%, +0.01% VClause: 130106 -> 130090 (-0.01%); split: -0.05%, +0.04% SClause: 90439 -> 90449 (+0.01%); split: -0.00%, +0.01% Copies: 382920 -> 382401 (-0.14%); split: -0.18%, +0.05% Branches: 130089 -> 130091 (+0.00%) PreSGPRs: 67745 -> 67743 (-0.00%); split: -0.01%, +0.00% PreVGPRs: 72710 -> 72674 (-0.05%) VALU: 2941866 -> 2938129 (-0.13%); split: -0.13%, +0.00% SALU: 651032 -> 651779 (+0.11%); split: -0.02%, +0.14% VOPD: 2446 -> 2393 (-2.17%); split: +0.70%, -2.86% Foz-DB Navi21: Totals from 1534 (1.86% of 82387) affected shaders: MaxWaves: 32481 -> 32479 (-0.01%) Instrs: 4732755 -> 4730039 (-0.06%); split: -0.06%, +0.00% CodeSize: 25305728 -> 25313148 (+0.03%); split: -0.00%, +0.03% VGPRs: 84424 -> 84448 (+0.03%) SpillVGPRs: 2420 -> 2419 (-0.04%) Scratch: 180224 -> 179200 (-0.57%) Latency: 36843383 -> 36846269 (+0.01%); split: -0.01%, +0.02% InvThroughput: 9252495 -> 9238142 (-0.16%); split: -0.17%, +0.02% VClause: 146629 -> 146671 (+0.03%); split: -0.02%, +0.05% SClause: 94502 -> 94512 (+0.01%); split: -0.00%, +0.01% Copies: 403672 -> 403592 (-0.02%); split: -0.09%, +0.07% Branches: 141145 -> 141137 (-0.01%) PreSGPRs: 70003 -> 70001 (-0.00%); split: -0.01%, +0.00% PreVGPRs: 70835 -> 70800 (-0.05%) VALU: 3114513 -> 3111338 (-0.10%); split: -0.10%, +0.00% SALU: 651177 -> 651925 (+0.11%); split: -0.02%, +0.13% VMEM: 271263 -> 271261 (-0.00%) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38530>
This commit is contained in:
@@ -363,9 +363,9 @@ BEGIN_TEST(optimize.bcnt)
|
||||
bcnt = bld.vop3(aco_opcode::v_bcnt_u32_b32, bld.def(v1), Operand(inputs[0]), Operand::zero());
|
||||
writeout(2, bld.vadd32(bld.def(v1), bcnt, Operand::c32(42u)));
|
||||
|
||||
//! v1: %bnct3 = v_bcnt_u32_b32 %b, 0
|
||||
//~gfx8! v1: %bnct3 = v_bcnt_u32_b32 %b, 0
|
||||
//~gfx8! v1: %res3, s2: %_ = v_add_co_u32 %bcnt3, %a
|
||||
//~gfx(9|10)! v1: %res3 = v_add_u32 %bcnt3, %a
|
||||
//~gfx(9|10)! v1: %res3 = v_bcnt_u32_b32 %b, %a
|
||||
//! p_unit_test 3, %res3
|
||||
bcnt = bld.vop3(aco_opcode::v_bcnt_u32_b32, bld.def(v1), Operand(inputs[1]), Operand::zero());
|
||||
writeout(3, bld.vadd32(bld.def(v1), bcnt, Operand(inputs[0])));
|
||||
|
||||
Reference in New Issue
Block a user