AlexIndustrial/mesa

Author	SHA1	Message	Date
Daniel Schürmann	56ec814b56	nir/algebraic: add some more unop + bcsel optimizations Totals from affected shaders: (VEGA) SGPRS: 284392 -> 284400 (0.00 %) VGPRS: 261080 -> 261076 (-0.00 %) Spilled SGPRs: 105 -> 105 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 24698596 -> 24277788 (-1.70 %) bytes LDS: 196 -> 196 (0.00 %) blocks Max Waves: 10101 -> 10105 (0.04 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	2fca183910	nir/algebraic: add optimizations for fsign/isign This just reverts fsign/isign lowering. Totals from affected shaders: SGPRS: 257496 -> 256672 (-0.32 %) VGPRS: 181800 -> 178864 (-1.61 %) Spilled SGPRs: 105 -> 105 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 11355852 -> 11141840 (-1.88 %) bytes LDS: 3789 -> 3789 (0.00 %) blocks Max Waves: 30453 -> 30951 (1.64 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	8e1b75b330	nir/algebraic: optimize iand/ior of (n)eq zero Found in some Detroit: Become Human shaders. Totals from affected shaders: SGPRS: 700256 -> 700256 (0.00 %) VGPRS: 507208 -> 507212 (0.00 %) Spilled SGPRs: 142531 -> 142531 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 76404616 -> 76301768 (-0.13 %) bytes LDS: 43 -> 43 (0.00 %) blocks Max Waves: 21438 -> 21438 (0.00 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	e4281dbecc	nir: also move b2i in case of nir_move_copies Booleans are often more efficient with register usage. This also allows to move comparisons further. Totals from affected shaders: (VEGA) SGPRS: 451608 -> 450320 (-0.29 %) VGPRS: 351448 -> 351256 (-0.05 %) Spilled SGPRs: 105 -> 105 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 1008 -> 1008 (0.00 %) dwords per thread Code Size: 26555596 -> 26551080 (-0.02 %) bytes LDS: 10323 -> 10323 (0.00 %) blocks Max Waves: 42850 -> 42934 (0.20 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	de0ebaf09d	nir/algebraic: optimize bcsel(a, 0, 1) to b2i This avoids combination with other bcsel operations, and as b2i is often a no-op (when used for iadd and such), the resulting pattern is preferable. Totals from affected shaders: (VEGA) SGPRS: 598448 -> 598448 (0.00 %) VGPRS: 457940 -> 457352 (-0.13 %) Spilled SGPRs: 127154 -> 127154 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 64836352 -> 64802728 (-0.05 %) bytes LDS: 781 -> 781 (0.00 %) blocks Max Waves: 22931 -> 22931 (0.00 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Icecream95	e764192f40	pan/mdg: Use the blend RT for blend shader framebuffer fetches Fixes piglit test fbo-drawbuffers-blend-add when fixed-function blending is disabled in panfrost_get_blend_for_context. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5892>	2020-07-20 14:15:49 +00:00
Icecream95	3ec252a3b2	panfrost: 8x MRT support Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5892>	2020-07-20 14:15:49 +00:00
Icecream95	978f963ea4	panfrost: Use more tilebuffer sizes This will be needed for 8x MRT with 128-bit framebuffer formats. Adds support for 256-bit, 1024-bit, and 2048-bit tilebuffer allocations, depending on the amount of data required. v2: Squash commits (Alyssa) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5892>	2020-07-20 14:15:49 +00:00
Icecream95	c1d3d39e97	panfrost: Fake RGTC support For most GPUs RGTC is disabled, so it needs to be emulated, using the fake_rgtc option of u_transfer_helper. Passes the rgtc-teximage tests in piglit. v2: Update docs/features.txt (Alyssa) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5975>	2020-07-20 09:54:49 -04:00
Rhys Perry	fac813dc61	spirv: don't split memory barriers If the SPIR-V had a shared+image memory barrier, we would emit two NIR barriers: a shared barrier and an image barrier. Unlike a single barrier, two barriers allows transformations such as: intrinsic image_deref_store (ssa_27, ssa_33, ssa_34, ssa_32, ssa_25) (1) intrinsic memory_barrier_shared () () intrinsic memory_barrier_image () () intrinsic store_shared (ssa_35, ssa_24) (0, 1, 4, 0) -> intrinsic memory_barrier_shared () () intrinsic store_shared (ssa_35, ssa_24) (0, 1, 4, 0) intrinsic image_deref_store (ssa_27, ssa_33, ssa_34, ssa_32, ssa_25) (1) intrinsic memory_barrier_image () () This commit fixes two dEQP-VK.memory_model.* CTS tests with ACO. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5951>	2020-07-20 12:05:16 +00:00
Samuel Pitoiset	28c227c7ca	radv/winsys: always allow GTT placements on APUs When the VRAM size is small and the preferred heap only VRAM, the kernel tries to always honor the requested heap and it does a ton of evictions which is a disaster for performance. On APUs, VRAM and GTT have similar performance, so allow the kernel to choose the best placement (GTT or VRAM) itself. This gives a huge performance boost with Doom Eternal on RAVEN. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5665>	2020-07-20 11:41:07 +00:00
Samuel Pitoiset	d1bba2eee7	radv: disable CPU caching for IBS to reduce fetch latency AMDGPU_GEM_CREATE_CPU_GTT_USWC should be faster when CPU reads are unexpected (because they aren't cached). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5959>	2020-07-20 11:23:19 +00:00
Pierre-Eric Pelloux-Prayer	d2a3ca289f	radeonsi: adjust epitch for PIPE_FORMAT_R8G8_R8B8_UNORM This fix si_compute_copy_image for yuyv image (so using PIPE_FORMAT_R8G8_R8B8_UNORM). With this change, the following gst pipeline produce the expected results for various image sizes (with or without AMD_DEBUG=nodma): gst-launch-1.0 filesrc location=input.jpg ! jpegparse ! vaapijpegdec ! filesink location=output.yuv Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5841>	2020-07-20 10:32:44 +00:00
Pierre-Eric Pelloux-Prayer	87ecfdfbf0	ac/surface: adapt surf_size when modifying surf_pitch Otherwise we might get VM_L2_PROTECTION_FAULT_STATUS errors. Fixes: `8275dc1ed5` ("ac/surface: fix epitch when modifying surf_pitch") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5841>	2020-07-20 10:32:44 +00:00
Gert Wollny	1fa36c1d3d	d600/sfn: write stream outputs to correct mem ring Fixes: arb_gpu_shader5-xfb-streams Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	21d296a481	r600/sfn: Make the pin_to_channel generic Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	3ea847e6d1	r600/sfn: Only use sample mask if the according shader key is set This fixes all the piglits from arb_sample_shading "samplemask * *" with the nir backend. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	c91979c634	r600: Add shader key item to identify when the sample mask should be used The sample mask must be applied when more then one sample is available or multisamplig is not enabled, so add a shader key to track this. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	05df4bfbca	r600/sfn: Fix default z swizzle for GDS instructions Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	2779aa360e	r600/sfn: Fix Ring output swizzle masks Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	c18b1c6df5	r600/sfn: Add a forced output swizzle for depth write This makes sure no components are written that shouldn't be written. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	d31ef0b7a4	r600/sfn: correct handling of loading vec4 with fetching constants Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	aca99e6fc9	r600/sfn: Add option to get a temp value for a specific channel Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	258618815b	r600/sfn: emit texture instructions in one block Setting the offset must happen in the same CF like using it, so don't emit ALU instruction between the tex instructions. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	deccf02246	r600/sfn: Pipe through requesting a register at a given channel Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	55cc712991	r600/sfn: lower rotate ALU ops Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Dave Airlie	41c7bb6ec0	llvmpipe: add framebuffer fetching support (v1.1) v1.1: Merge two if blocks (Roland) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5914>	2020-07-20 15:14:09 +10:00
Dave Airlie	0e0b6d477b	llvmpipe/cs: respect render condition Running complete CTS turned up a missing cond render. Fixes KHR-GL45.compute_shader.conditional-dispatching Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5944>	2020-07-19 15:07:26 +10:00
Rob Clark	912ad09112	freedreno/ir3/ra: fix array conflicts for split/merged Properly handle the difference between split and merged register file when determining where arrays can fit without conflicting with other arrays or pre-colored instructions. 1) if not mergedregs, only consider other things with same precision as potentially conflicting 2) if mergedregs, calculate everything in therms of half-regs and convert back to fullregs in the end Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:21:09 -07:00
Rob Clark	b1465c382b	freedreno/ir3/ra: assign vreg names to all array elements We shouldn't divide-by-two for half-reg arrays. We set the proper node interference class, based on `arr->half`. Fixes a RA fail with 16b arrays: src/freedreno/ir3/ir3_ra.c:633: name_to_array: Assertion `!"invalid array name"' failed. Caused by use/def iterators returning `arr->length` vreg namess, but only assigning the array half that many names. Also, since we are assigning unique vreg names to each array element, there is no need to try and convert from half-reg to it's conflicting full reg when pre-coloring the array elements. Getting us closer to having half-arrays work sanely with split-register-file (a5xx and earlier). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:21:09 -07:00
Rob Clark	6317f7d574	freedreno/ir3/ra: debug msgs tweak Print out the assigned vreg names earlier. Also print the few special nodes. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Rob Clark	c2d94aa365	freedreno/ir3: fix half-reg array stores Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Rob Clark	5be171b888	freedreno/ir3: set array precision on creation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Rob Clark	0472ca2aa5	freedreno/ir3/parser: half-precision relative regs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Rob Clark	79b0651c24	freedreno: whitespace fix Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:11:35 -07:00
Rob Clark	835201dd76	freedreno: small comment re-word Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:11:35 -07:00
Mike Blumenkrantz	2b343238a1	zink: free all ntv allocations after creating shader module these are all fairly large sources of leaks Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5887>	2020-07-18 07:51:37 +00:00
Mike Blumenkrantz	adc4f3896a	zink: free pipeline cache during program destroy more leaks Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5887>	2020-07-18 07:51:37 +00:00
Mike Blumenkrantz	1ff2d195b0	zink: destroy descriptor pools on context destroy this is a big leak Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5887>	2020-07-18 07:51:37 +00:00
Mike Blumenkrantz	7116decfce	zink: destroy gfx program when a shader is freed there's no sense in having these objects sitting around when they can never be used again requires adding a zink_context* pointer to each program in order to prune the hash table entry Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5887>	2020-07-18 07:51:37 +00:00
Mauro Rossi	9e9c8e2f79	android: panfrost/encoder: add libmesa_nir static dependency Fixes the following build error: In file included from external/mesa/src/panfrost/encoder/pan_blit.c:34: In file included from external/mesa/src/panfrost/encoder/../midgard/midgard_compile.h:27: external/mesa/src/compiler/nir/nir.h:52:10: fatal error: 'nir_opcodes.h' file not found ^~~~~~~~~~~~~~~ 1 error generated. Fixes: `293f251871` ("panfrost: Use Midgard-specific reloads") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5961>	2020-07-18 08:28:27 +02:00
Icecream95	0ef168d513	panfrost: Fix calls to panfrost_flush_batches_accessing_bo The function now takes a bool flush_readers instead of an access type, but some calls were not updated. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5962>	2020-07-18 01:36:59 +00:00
Icecream95	858cc13eb2	panfrost: Make panfrost_bo_wait take a wait_readers bool panfrost_bo_wait is often used after panfrost_flush_batches_accessing_bo, so make them take similar arguments for consistency. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5962>	2020-07-18 01:36:59 +00:00
Eric Anholt	5b38048347	freedreno/ir3: Add unit tests for derivatives disasm. Since I was going back to look at fine derivs again, add some tests of instruction encoding. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5699>	2020-07-18 00:43:44 +00:00
Eric Anholt	3d7d5d220b	freedreno/ir3: Fix duplicated fine derivatives instructions. legalize_block() can get run multiple times, which I didn't notice when adding fine derivs support. Other instruction clones change things such that the legalization won't trigger again, but that didn't apply to the DS.PP legalization. To keep someone else from tripping over this, split the one-shot legalization out of the iterative sync flag application. Fixes failures in dEQP-VK.glsl.derivate.dfdxfine.* Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3198 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5699>	2020-07-18 00:43:44 +00:00
Bas Nieuwenhuizen	862d85a63f	amd/addrlib: Clean up unused colorFlags argument Cleanup. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5865>	2020-07-18 00:28:35 +00:00
Bas Nieuwenhuizen	a37aeb128d	amd/common: Cache intra-tile addresses for retile map. However complicated DCC addressing is it is still based on tiles. If we have the intra-tile offsets + tile dimensions we can expand that to the full image ourselves. Behavior around ~1080p on a 2500U: old: 30-60 ms on every miss new: 5 ms initally (miss in the tile cache) <0.5 ms afterwards The most common case is that the tile cache only contains data for 2 tiles, which for Raven/Renoir/Navi14 will be 4 KiB each, so the size increase is fairly modest. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5865>	2020-07-18 00:28:35 +00:00
Rhys Perry	f302ef3853	aco: use s_waitcnt_depctr to mitigate VMEMtoScalarWriteHazard Apparently this is potentially faster than v_nop: https://reviews.llvm.org/D83872 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5923>	2020-07-18 00:14:12 +00:00
Rhys Perry	bcf94bb933	aco: properly recognize that s_waitcnt mitigates VMEMtoScalarWriteHazard fossil-db (Navi): Totals from 555 (0.41% of 135946) affected shaders: CodeSize: 1005716 -> 1003400 (-0.23%) Instrs: 195326 -> 194744 (-0.30%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5923>	2020-07-18 00:14:12 +00:00
Alyssa Rosenzweig	610af7e1ac	panfrost: Enable FP16 by default I see no reason to hide this. The small hit in cycle count is offset in practice by the increase in thread count. So let's ship it and get some testing. If this regresses a workload: 1. Open an issue on the tracker and attach an apitrace. 2. In the meantime set PAN_MESA_DEBUG=nofp16 to override. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5960>	2020-07-17 23:37:13 +00:00

1 2 3 4 5 ...

116468 Commits