AlexIndustrial/mesa

Author	SHA1	Message	Date
Georg Lehmann	6cd78281f6	aco: deduplicate Format definition Reviewed-by: Daniel Schürmann <daniel@schuermann.dev Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25943>	2023-11-06 23:16:38 +00:00
Georg Lehmann	6e0bf33a89	aco: deduplicate instr_class definition Reviewed-by: Daniel Schürmann <daniel@schuermann.dev Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25943>	2023-11-06 23:16:38 +00:00
Georg Lehmann	bdd81c6be7	aco: namespace aco_opcode Reviewed-by: Daniel Schürmann <daniel@schuermann.dev Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25943>	2023-11-06 23:16:38 +00:00
Georg Lehmann	1b9a3b7466	aco: stop using cstdint We use stdint.h everywhere else. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25943>	2023-11-06 23:16:38 +00:00
Georg Lehmann	04956d54ce	aco: force uniform result for LDS load with uniform address if it can be non uniform Because a LDS load is 2 separate loads on gfx10+ with wave64, a different wave can write LDS in between and cause a non uniform result. Use v_readfirst_lane instead of p_as_uniform because it cannot be copy propagated. Fixes a OpenCL CTS test with zink+rusticl. Totals from 136 (0.17% of 78196) affected shaders: MaxWaves: 3236 -> 3244 (+0.25%) Instrs: 130069 -> 131221 (+0.89%) CodeSize: 698048 -> 703436 (+0.77%) VGPRs: 5464 -> 5440 (-0.44%) SpillSGPRs: 94 -> 96 (+2.13%) Latency: 5361017 -> 5363781 (+0.05%); split: -0.00%, +0.05% InvThroughput: 883010 -> 884100 (+0.12%) SClause: 3822 -> 3821 (-0.03%); split: -0.05%, +0.03% Copies: 14220 -> 14314 (+0.66%); split: -0.01%, +0.68% Branches: 4549 -> 4551 (+0.04%) PreSGPRs: 4934 -> 4940 (+0.12%) PreVGPRs: 4666 -> 4655 (-0.24%) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25973>	2023-11-06 22:43:33 +00:00
Georg Lehmann	ab87831ae8	aco, radv: vectorize f2f16 if rounding mode is rtz Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25952>	2023-11-06 21:05:34 +00:00
Connor Abbott	55f3f952aa	vk/graphics_state, tu: Rewrite renderpass flags handling Before this, the render pass code or the driver combined the pipeline create flags and the implicit flags from the render pass, but the pipeline create flags will need to be sanitized when they are dynamic state, so we need to do it in vk_graphics_state where we know that information. We also weren't combining pipeline flags correctly when linking, which on turnip was being hidden by the lack of sanitizing for driver-provided flags. We can't combine them correctly if they're part of the render pass state, so they need to be pulled out into the overall pipeline state. For drivers using emulated renderpasses or tracking feedback loop information themselves, this won't make a difference, but we have to adapt turnip to not pass pipeline flags. This also means that we can drop all handling of feedback_loop_input_only in turnip and just set it in the runtime. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25436>	2023-11-06 14:33:51 +00:00
Connor Abbott	e6f5d7222c	vk,lvp,tu,radv,anv: Add common vk_*_pipeline_create_flags() helper And replace the various homegrown or copy-pasted helpers in drivers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25436>	2023-11-06 14:33:51 +00:00
Samuel Pitoiset	790fabd38e	radv: advertise VK_EXT_device_fault This is only exposed if the kernel supports GPUVM fault query. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25875>	2023-11-06 10:35:05 +00:00
Samuel Pitoiset	8097becc7f	radv: add initial VK_EXT_device_fault support This implementation only returns VM faults information for now, but vendor crash dumps will be adder later. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25875>	2023-11-06 10:35:05 +00:00
Marek Olšák	ca1d37e1db	radeonsi: adjust setting PA_SC_EDGERULE once more based on PAL. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26055>	2023-11-05 14:06:52 -05:00
Marek Olšák	44eaf50a34	ac/surface/tests: cosmetic changes Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26055>	2023-11-05 12:39:42 -05:00
Marek Olšák	dfcc7f83a4	ac/surface: cosmetic changes Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26055>	2023-11-05 12:39:42 -05:00
Marek Olšák	355242f055	ac/gpu_info: adjust attribute ring size for gfx11 these are better numbers Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26055>	2023-11-05 12:36:52 -05:00
Marek Olšák	bd57630885	ac: add missing gfx11.5 bits Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26055>	2023-11-05 12:34:55 -05:00
Jesse Natalie	228329f4da	vulkan: Consolidate common ICD methods Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25998>	2023-11-03 20:01:14 +00:00
Chia-I Wu	227300345e	radv: stop using vk_render_pass_state::render_pass vk_render_pass_state::pipeline_flags is derived from vk_get_pipeline_rendering_flags and has the info we need. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10074 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26000>	2023-11-03 17:23:30 +00:00
Eric Engestrom	47398c65ee	ci/radeonsi: add another flake Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26020>	2023-11-03 09:20:32 +00:00
Samuel Pitoiset	fc9bab73a9	radv/ci: document one more flake test This test consistently fails on some GPUs (already documented) but on some others it's a flake. It's a known issue that should be fixed soon in RADV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26019>	2023-11-03 09:16:54 +01:00
Konstantin Seurer	11282598e6	radv: Add radv_nir_lower_hit_attrib_derefs_tests Tests hit attrib lowering for various variable/type configurations. Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24271>	2023-11-02 15:48:36 +00:00
Konstantin Seurer	f51227d253	radv/clang-format: Do not indent C++ modifiers Turns class asd { private: }; into class asd { private: }; Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24271>	2023-11-02 15:48:36 +00:00
Konstantin Seurer	ba8d3afa56	radv/nir: Handle boolean hit attribs Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24271>	2023-11-02 15:48:36 +00:00
Konstantin Seurer	3a69424e09	radv/nir: Add radv_nir_lower_hit_attrib_derefs Move out the pass so it can be unit tested. Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24271>	2023-11-02 15:48:36 +00:00
Konstantin Seurer	b7c582e5c7	radv: Add RADV_MAX_HIT_ATTRIB_DWORDS Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24271>	2023-11-02 15:48:36 +00:00
Rhys Perry	0a418561da	radv: skip radv_remove_varyings for mesh shaders Fixes compilation of a Talos Principle 2 shader. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Fixes: `9fa9782c17` ("radv: stop compiling a noop FS when the application doesn't provide a FS") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25659>	2023-11-02 12:44:43 +00:00
Rhys Perry	ed12be533e	radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements nir_lower_io_arrays_to_elements does not support array derefs of vectors, even when nir_deref_instr_is_known_out_of_bounds is fixed. They can occur with mesh shaders. Found by inspection. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25659>	2023-11-02 12:44:43 +00:00
Chia-I Wu	796cba9bda	radv: fix vkCmdCopyImage2 for emulated etc2/astc When the image copy is between size-compatible formats with different block sizes, we need to fix up the extent. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25984>	2023-11-01 20:02:14 +00:00
Rhys Perry	b18f0dec41	aco: collect Pre-Sched SGPRs/VGPRs before spilling The usage after spilling is usually either the same as before or the maximum. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25559>	2023-11-01 19:41:30 +00:00
Rhys Perry	d200916ca2	aco: add VALU/SALU/VMEM/SMEM statistics This lets us measure optimizations without interference of waitcnt instructions. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25559>	2023-11-01 19:41:30 +00:00
Rhys Perry	7de34ad3ef	radv: use NIR_LOOP_PASS helpers A somewhat random collection of fossils: N Min Max Median Avg Stddev x 6 16.59 16.61 16.605 16.603333 0.0081649658 + 6 15.99 16 16 15.998333 0.0040824829 Difference at 95.0% confidence -0.605 +/- 0.00830327 -3.64385% +/- 0.0485573% (Student's t, pooled s = 0.00645497) I'm not sure if nir_opt_if and nir_opt_loop_unroll are actually idempotent or not. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24197>	2023-11-01 14:16:37 +00:00
Timur Kristóf	a19e46f5d0	radv: Implement workaround for unaligned buffer/image copies. When the pitch or slice pitch isn't properly aligned, the SDMA HW is unable to copy between tiled images and buffers. To work around this, we process the image chunk by chunk, copying the data to a temporary buffer which uses supported pitches, and then copy it to the intended destination. The implementation assumes that at least one pixel row of the image will fit into the temporary buffer, and will try to copy as many rows at once as possible. Sadly, this still results in a lot of packets being generated for large images. A possibe future improvement is to copy the image slice by slice when only the slice pitch is misaligned. However, that is out of scope for this commit. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25831>	2023-11-01 13:21:01 +00:00
Timur Kristóf	ec0605ff72	radv: Add temporary BO for transfer queues. Some copy operations are poorly supported by the SDMA hardware, meaning that the built-in packets don't support them, so we will need to work around that by copying to and from a temporary BO. The size of the temporary buffer was chosen so that it can fit at least one full pixel row of the largest possible image. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25831>	2023-11-01 13:21:01 +00:00
Timur Kristóf	8156c923ee	radv: Implement buffer/image copies on transfer queues. Previously, RADV only had a simple implementation of image to buffer copies using the SDMA for the PRIME copy. This commit replaces that with a full-featured implementation that includes buffer to image and image to buffer copies and removes the assumptions that the PRIME copy had, as well as adds new helper functions which will be shared with other copy functions in upcoming commits. Unaligned buffer/image copies require a workaround, which will be implemented by a future commit. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25831>	2023-11-01 13:21:01 +00:00
Timur Kristóf	ed21f1c962	radv: Expose radv_get_dcc_max_uncompressed_block_size function. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25831>	2023-11-01 13:21:01 +00:00
Timur Kristóf	848f2f2b99	radv: Remove always false tmz variables from SDMA functions. We can re-add them later as-needed. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25831>	2023-11-01 13:21:00 +00:00
Samuel Pitoiset	17daa08dff	radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE This register seems needed to enable compute shader shader invocations on GFX7. On GFX8+ it's working fine without emitting this register but I think it doesn't hurt. This fixes dEQP-VK.query_pool.statistics_query.*_cq on GFX7. Fixes: `a9945216ba` ("radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25957>	2023-11-01 12:46:17 +00:00
Samuel Pitoiset	9a0a77cb53	radv: fix compute shader invocations query on compute queue on GFX6 Looks like GFX6 always writes the number of compute shader invocations at offset 0 when used on compute queue. This fixes dEQP-VK.query_pool.statistics_query.*_cq on GFX6. Fixes: `a9945216ba` ("radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25957>	2023-11-01 12:46:17 +00:00
Samuel Pitoiset	46dc02354a	radv: adjust binning settings to improve performance on GFX9 This partially reverts `74ab940156` which was a fix for random GPU hangs with binning on GFX10+. Though, according to RadeonSI, only GFX10+ is affected and this reduced perf on GFX9 chips. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25951>	2023-11-01 12:24:45 +00:00
Samuel Pitoiset	e4a1bc70dd	radv: bind the non-dynamic graphics state from the pipeline unconditionally The following sequence is valid (although weird) but many other drivers (including RADV) were broken: - bind pipeline with some static state - set state command for that static state (to a bad value) - bind the same pipeline again - draw Fixes new dEQP-VK.dynamic_state.*.double_static_bind. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25954>	2023-11-01 09:44:13 +00:00
Samuel Pitoiset	024dab650e	radv/ci: enable RADV_DEBUG=nomeshshader for vkcts-navi31-valve To make VKCTS on NAVI31 stable until the task shader issue is resolved. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25969>	2023-11-01 08:50:06 +01:00
Samuel Pitoiset	a97160cad8	radv: add RADV_DEBUG=nomeshshader This option will be used to disable VK_EXT_mesh_shader in Mesa CI for GFX11 because running task shader tests in parallel can hang the GPU. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10051 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25969>	2023-11-01 08:49:35 +01:00
Samuel Pitoiset	15f92c3b2c	radv/ci: update list of expected failures/flakes for NAVI31 Let's clean the flakes list, it might be needed to re-add some of them but we will track those. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25969>	2023-11-01 08:49:35 +01:00
Bas Nieuwenhuizen	da7e6f303b	radv: Add some initial graphics DGC preprocessing support. Just the bits that obviously need no adjustment in the DGC preprocessing code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25835>	2023-11-01 00:06:10 +00:00
Bas Nieuwenhuizen	c4fb827441	radv: Add compute DGC preprocessing support. This should reduce the overhead due to reduced syncs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25835>	2023-11-01 00:06:10 +00:00
Bas Nieuwenhuizen	108227a84e	radv: Add DGC preprocessing barrier support. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25835>	2023-11-01 00:06:09 +00:00
Samuel Pitoiset	0477346c0b	aco: remove dead code in nir_intrinsic_xfb_counter_{add,sub}_amd This code path is only used by GFX11 now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25903>	2023-10-30 08:51:51 +00:00
Samuel Pitoiset	d390cd7c5d	ac/nir: remove dead code in nir_intrinsic_xfb_counter_{add,sub}_amd This code path is only used by GFX11 now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25903>	2023-10-30 08:51:51 +00:00
Samuel Pitoiset	5176f75e0d	radv: remove unnecessary VS_PARTIAL_FLUSH for NGG streamout This used to be a PS_PARTIAL_FLUSH to fix synchronization issues with NGG streamout on RDNA2 but it's no longer needed for RDNA3. It's already synchronized in CmdEndTransformFeedbackEXT(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25903>	2023-10-30 08:51:51 +00:00
Samuel Pitoiset	eb47e07782	radv: remove NGG streamout support for RDNA1-2 This was useful for experimenting it on RDNA2 and during RNDA3 bringup, but now the support is rock solid on RDNA3 and it's useless to keep the RADV_PERFTEST=ngg_streamout option. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25903>	2023-10-30 08:51:51 +00:00
Samuel Pitoiset	7beddd4f5c	radv: use the GPUVM fault protection status helper To print more useful information when a fault happens. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25855>	2023-10-30 08:10:23 +00:00

1 2 3 4 5 ...

13429 Commits