AlexIndustrial/mesa

Author	SHA1	Message	Date
Samuel Pitoiset	b9237bdc6b	radv: reset more DB registers when emitting a null ds target PAL does that. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23209>	2023-06-13 07:52:43 +02:00
Samuel Pitoiset	42dbfad01d	radv: add a helper for emitting a null depth/stencil target Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23209>	2023-06-13 07:52:43 +02:00
Qiang Yu	b4403d8985	radeonsi: enable aco support for compute shader Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23540>	2023-06-13 03:41:03 +00:00
Qiang Yu	df4f84f806	radeonsi: fix crash when AMD_DEBUG=cs,initnir Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23540>	2023-06-13 03:41:02 +00:00
Qiang Yu	5f52f8a6ba	ac/llvm,radeonsi: lower nir_load_user_data_amd in abi Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23540>	2023-06-13 03:41:02 +00:00
Qiang Yu	0a7014328f	radeonsi: add scratch_offset arg for aco cs Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23540>	2023-06-13 03:41:02 +00:00
Timothy Arceri	a337a0c807	st/glsl: move linking code to the same st file Since they call one another this makes it easier to see what is going on without looking in multiple files. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23541>	2023-06-13 02:25:54 +00:00
Jesse Natalie	92dcaf7deb	dxil: Remove custom SSBO lowering Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:37 +00:00
Jesse Natalie	16aeaad73e	microsoft/compiler: Don't over-align raw buffer load/store intrinsics DXC doesn't generate these for raw loads/stores, only structured, and old WARP had bugs with this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:37 +00:00
Jesse Natalie	38617dc726	microsoft/compiler: Don't lower bit sizes for movs Otherwise we run into problems by putting this optimization loop before I/O lowering, where there might still be 8-bit values that haven't been lowered to 16 or 32. Once that's done, any remaining movs or vec ops will have higher bit sizes already. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	ecfbc16f61	dxil: Delete load_ubo_dxil intrinsic Instead of splitting unaligned UBO loads while still using derefs, and then lowering load_ubo to load_ubo_dxil in lower_loads_stores_to_dxil, use lower_mem_access_bit_sizes and lower_ubo_vec4 to handle load size and alignment restrictions while converting to load_ubo_vec4 instead, which has the same semantics as load_ubo_dxil. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3842 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	42877c8b63	dxil: Don't generate load_ubo_dxil directly Just use load_ubo and let it get lowered appropriately later on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	6a5ed9e2e9	microsoft/compiler: Support load_ubo_vec4 Add support for 16-bit UBO loads, delete handling of byte-addressed UBO loads (which I think was never used anyway) and add handling for the component const index to optimize out unneeded extractResults. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	f960b37986	spirv2dxil: Don't lower shared/temp to explicit I/O Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	f121d8fe12	microsoft/compiler: Un-lower shared/scratch to derefs Derefs have index-based access semantics, which means we don't need custom intrinsics to encode an index instead of a byte offset. Remove the "masked" store intrinsics and just emit the pair of atomics directly. This massively reduces duplication between scratch, shared, and constant, while also moving more things into nir so more optimizations can be done. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	95bfee6a85	microsoft/compiler: Use mem_constant instead of shader_temp for consts We still use shader_temp as a temporary variable mode to differentiate which variables have simple deref patterns vs ones that need to be lowered to ssbo, but then we put it back to mem_constant when we're done to restore sanity. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	f9b0382faf	microsoft/compiler: Emit const accesses as load_deref There's a few changes in here that are very inter-related. First, we stop lowering load_deref on shader_temp to load_ptr_dxil, and just leave it as load_deref. In order for that to work, we need the derefs to be in a shape that's acceptable to DXIL, so the only current producer of shader_temp loads (the CLC frontend) needs to run some lowering passes on them first. The DXIL backend is augmented to just write out deref indices while walking a deref chain, which will get combined in the load op into a GEP instruction. For non-mesh/raytracing shaders, these are required to be single-level scalar arrays, but the complexity here is preparation for when we don't need to do that anymore. Additionally, the const lookups are changed from using a hash table to just putting an index on the variable. All of this together is enough to enable the authored-forever-ago test which uses indirect array access into a const packed struct. The load_ptr_dxil handling didn't deal with packed structs / unaligned accesses, but now that we're in a logical address space with derefs instead of physical, there's no alignment to deal with anymore and the fact that it's packed goes out the window. This removes one custom DXIL intrinsic. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	572e02a3b7	microsoft/compiler: Add some more lowering passes for derefs DXIL requires GEP chains to point to a global variable that's a flat array of primitive types. If we're converting deref chains to GEP chains, we're effectively in a logical address space, which means we can do things like change sizes of variables, since we know they won't alias with anything else. If they could alias, we'd be lowering them to an explicit I/O op instead. That means we can start disabling some of the low-bit-size lowering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	d40c64c4c3	microsoft/compiler: Improvements to constant -> shader_temp pass used for CL Now that we try harder for memcpys, we can use nir's complex usage helper. We also can just mark the vars instead of using a hash map, since location doesn't mean anything for constant vars. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	13e5d51f8e	microsoft/compiler: Support vec/struct const vals Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	33ce7c4b90	microsoft/clc: Fix progress reporting for some lowering Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	e9b2bb244b	microsoft/clc: Try harder to optimize memcpys before lowering them For the case of memset, the SPIR-V translator produces a copy from a byte array of 0s. If we wait to lower memcpys until after types are sized, we can potentially turn those 0s into SSA zeros and remove the entire constant array. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	fba82797d7	nir: Optimize unpacking 16 bit values that were originally packed I was seeing u2u64 still in my final shader after pack/unpack were lowered, which sounds to me like some other optimizations are missing for detecting the post-lowering pack/unpack patterns, but let's at least add some patterns for the simple cases. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	663d957480	nir: Fix constant expression for unpack_64_4x16 Cc: Mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	c70d94a889	nir_lower_mem_access_bit_sizes: Support unaligned stores via a pair of atomics Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8282 Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	082eba6165	nir_lower_mem_access_bit_sizes: Move options into a struct Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	4217353e2d	nir_lower_mem_access_bit_sizes: Add a bit_size input to the callback We'd like to use this callback to adjust loads and stores from things that are unsupported to things that are supported, but if the input is already supported, we'd prefer not to change it. Rather than making up a bit size that'd work and doing a bunch of pack/unpack bit math, only return a different bit size if the input one doesn't work for us (i.e. can't load enough memory or just an unsupported size entirely). Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	e77fe70b1e	nir_lower_ubo_vec4: Delete an invalid assert This pass handles 16-component 8-bit loads, 8-component 16-bit loads, and 2-component 64-bit loads. The number of components for the fallback case doesn't need to be 4. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	bb311ce370	nir: Allow atomics as non-complex uses for var-splitting passes The var splitting pass can rearrange the variables as long as their position in memory doesn't matter. For block-arranged variables, or things like memcpys or casts, the layout matters, but atomics don't imply anything about the layout of the overall variable, so don't treat them as "complex" for this use case. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	cf9ea94958	nir_split_struct_vars: Support more modes and constant initializers Idiomatic DXIL has constants contained within global variables rather than a big blob of data. Doing this allows us to have 16-bit and 64-bit data as well, where normally bitcasts would be disallowed on variable GEP chains. Unfortunately, DXIL validation requires SOA to be turned into AOS, which means we need to split structs. We want to be able to run this on nir_var_mem_constant variables which have constant initializers, so add a bit of logic to handle that case, and relax the mode validation. There's nothing special about the modes it was set up to handle. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	c0e41e9b3e	vtn: Set is_null_constant Note that pointers are not considered to be nir null constants, since a null pointer value might not be 0s. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	4edfb67fd4	nir: Add is_null_constant to nir_constant Indicates that the values contained within are 0s, regardless of type. Enables some optimizations. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Jesse Natalie	009d2de88f	nir_opt_constant_folding: Fix nir_deref_path leak Cc: Mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173>	2023-06-13 00:43:36 +00:00
Dylan Baker	ce07aabab1	meson: Key whether to build batch decoder on expat Instead of on Android. Which allows an end user to turn off expat without breaking or disabling Intel support. I've additionally refactored to separate expat and xmlconfig a bit more in the root meson.build This does make expat a hard dependency for building Intel tools, despite the fact that only aubinator actually requires it. This simplifies the build for the common case, and in the event that someone wants to build the Intel tools and doesn't have libexpat, they can fall back to the meson wrap for expat instead. fixes: `75276deebc` closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8791 Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23605>	2023-06-12 23:07:00 +00:00
Jesse Natalie	b717a43826	dzn: Don't support VK R4G4B4A4_UNORM_PACK16 unless we have B4G4R4A4 Fixes: `a4ce095bad` ("dzn: Use A4B4G4R4 instead of B4G4R4A4 when available") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23602>	2023-06-12 22:25:19 +00:00
Emma Anholt	1dd1147408	mapi: Delete execmem support code. No longer used now that we don't dynamically generate dispatch stubs. Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23451>	2023-06-12 21:37:37 +00:00
Emma Anholt	34808de737	mapi: Drop the unused_functions table. Since we don't support loading an older driver with newer loader any more, we don't need to bother tracking entrypoints that Mesa no longer supports. Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23451>	2023-06-12 21:37:37 +00:00
Emma Anholt	a4b2825228	mesa: Drop the aliases from the remap table. Mesa core doesn't need to have mapi sanity check that our aliases all map to the same offset. That's a build-time decision. Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23451>	2023-06-12 21:37:37 +00:00
Emma Anholt	e0213a6953	mapi: Clean up mapi_stub struct. We no longer use the address field, and the name is always a size_t offset in the string pool (never a dynamic strduped name). Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23451>	2023-06-12 21:37:37 +00:00
Emma Anholt	29397f2e00	mesa: Drop the function parameter spec from the remap table. Since we don't generate dynamic dispatch stubs any more, we don't need this data. Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23451>	2023-06-12 21:37:37 +00:00
Emma Anholt	398a8d43dc	mapi: Delete dynamic stub generation. Since Mesa drivers are now version-locked to the loader, that means that we never need to support a newer hardware driver than the loader, and thus don't need to generate dynamic dispatch stubs. This is great news, given that we don't test those paths, and it involved delightful features like arrays of hex for code to be pasted into executable memory. More code removal will follow, this is the first cut of "don't generate, and DCE generation code". Fixes: #9158 Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23451>	2023-06-12 21:37:37 +00:00
Emma Anholt	3033252966	mapi: clang-format _glapi_add_dispatch(). The formatting was so broken I couldn't follow what was going on. Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23451>	2023-06-12 21:37:37 +00:00
Alyssa Rosenzweig	5c1d614256	nir: Add interleave_agx instruction While this is a generic bit twiddling ALU instruction, it's especially useful for address calculations, since the architecture's tiled textures use Morton coding within the tiles. This will be used when lowering image_texel_address on AGX, as part of the image atomics implementation. I don't know if there's any other neat uses I could detect with opt_algebraic, this doesn't seem like an operation a shader would open-code... Maybe useful for BVH building or something... Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23513>	2023-06-12 20:09:53 +00:00
Alyssa Rosenzweig	176c3a2ab7	agx: Use common nir_steal_tex_src Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23513>	2023-06-12 20:09:53 +00:00
Alyssa Rosenzweig	d1b94a11bd	nir/lower_tex: Use nir_steal_tex_src The find-remove-use pattern is quite natural for texture lowering :) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23513>	2023-06-12 20:09:53 +00:00
Alyssa Rosenzweig	36e779e4a9	nir/builder: Add steal_tex_src helper I have this in the AGX compiler but I want to use it in more places. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23513>	2023-06-12 20:09:53 +00:00
Georg Lehmann	bbda9f7390	aco: validate ir for prologs and after lower_to_hw_instr Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23507>	2023-06-12 19:43:17 +00:00
Georg Lehmann	2028df8757	aco: don't validate p_constaddr_addlo/p_resumeaddr_addlo operands These can have two literals so validation would fail. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23507>	2023-06-12 19:43:17 +00:00
Georg Lehmann	b9854a9097	aco: move cfg validation to its own function Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23507>	2023-06-12 19:43:17 +00:00
Georg Lehmann	e5df6ee605	aco: make validation work without SSA temps Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23507>	2023-06-12 19:43:17 +00:00
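
Note on 5c1d614256 (nir: Add interleave_agx instruction): the message says the instruction helps with address calculations because tiled textures use Morton coding within the tiles. As a rough illustration of Morton coding only, the sketch below interleaves the bits of two 16-bit coordinates into one 32-bit value. The function name `morton_interleave_16` is hypothetical, and the choice of which operand occupies the even bit positions is an assumption made for this sketch; it is not taken from the Mesa source.

```c
#include <stdint.h>

/* Hypothetical helper, not Mesa's implementation: interleave the bits of two
 * 16-bit values into a single 32-bit Morton code.
 * Assumption for this sketch: x fills the even bits, y fills the odd bits. */
static uint32_t
morton_interleave_16(uint16_t x, uint16_t y)
{
   uint32_t result = 0;

   for (unsigned bit = 0; bit < 16; bit++) {
      /* Bit i of x moves to bit 2*i of the result. */
      result |= ((uint32_t)x & (1u << bit)) << bit;
      /* Bit i of y moves to bit 2*i + 1 of the result. */
      result |= ((uint32_t)y & (1u << bit)) << (bit + 1);
   }

   return result;
}
```

Under that assumption, texels that are neighbours in both x and y within a tile end up at nearby offsets, which is the usual reason for laying tiles out in Morton order.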

159808 Commits