AlexIndustrial/mesa

Author	SHA1	Message	Date
Nanley Chery	71d52a4d85	iris: Add a barrier to iris_mcs_partial_resolve Partial resolves read from the MCS and write to the MSAA surface. Add a texture barrier to prepare for the reads. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4179 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22545>	2023-05-11 23:41:16 +00:00
Nanley Chery	a1ed41dec7	intel/isl: Bump the MCS halign value for BDW+ Select a horizontal alignment value that matches the main MSAA surface. We need a valid horizontal alignment to perform MCS ambiguates. The halign value doesn't actually affect test behavior, but it is validated by isl_surf_fill_state. We currently have an invalid halign for gfx125. This patch fixes that. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22545>	2023-05-11 23:41:16 +00:00
Asahi Lina	0a398b0ef9	ail: Add MSAA tests This tests the following matrix: - Format: RGBA8Unorm, RGBA16Unorm, RGBA32Float - Samples: 2 or 4 - Layers: 1 or 2 - Width: Interesting values 1..4097 - Height: Interesting values 1..4097 Compression is based on the dimensions (that is, everything that can be compressed is). This test compares both the total texture size and the compression metadata offset. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Alyssa Rosenzweig	e918509284	ail: Handle larger block sizes We need to support up to 16 bytes/sample * 4 samples/pixel = 64 bytes/pixel for multisampling to work with formats like RGBA32F. Fixes dEQP-GLES3.functional.fbo.msaa.4_samples.rgba32f Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	f545a2b948	asahi: Use ail_can_compress() in agx_compression_allowed() This moves the compression size threshold logic into ail, where it belongs. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	59a6c5b357	ail: Implement multisampling for compression meta calculation For multisampled textures, the decision about whether to compress or not is based on the effective width and height in samples, not pixels. Introduce ail_can_compress() to encode this logic in ail, so the driver can use it to decide whether to compress or not before the full layout is determined. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	94c9115aa0	asahi: Make bo->writer_syncobj atomic BOs can be written from several contexts, so writing to this member is racy. We only care about this for the purposes of exporting BOs after a submission (and if the app is racing writers/submissions at that point all bets are off), so just keeping track of the last written value is sufficient. Switch to atomic operations to eliminate the race, and drop the assert in the batch cleanup path that no longer holds when the BO might have been written to from another context. Fixes: asahi/mesa#20 Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	dc1a18b0ed	asahi: Lazily initialize batch state on first draw We track buffers written by batches, but this gets messy when we end up with an empty batch that is never submitted, since then it might have taken over writer state from a prior already submitted batch (for its framebuffers). Instead of trying to track two tiers of resource writers, let's just defer initializing batch state until we know we have a draw (or compute launch, or clear). This means that if a batch is marked as a writer for a buffer, we know it will not be an empty batch. This should be a small performance win for empty batches (no need to emit initial VDM state or run the writer code), but more impontantly it eliminates the empty batch writer state revert corner case. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	f8b055eb96	asahi: Partially identify some missing index list stuff Still unclear what the extra 2 blocks do, but at least we know the size/order now. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	64a595291e	asahi: Add some more system registers Core and opfifo stuff from the compute helper blob, vm_slot because it was the only one changing when I poked around yesterday and it hit me what it was ^^ Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	9608e57524	asahi: Fix check for sprite coord mode in agx_bind_rasterizer_state We need to set ctx->rast = so after comparing them. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	e92ff4f809	asahi: Add missing stdbool include to lib/hexdump.h Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:47 +00:00
Asahi Lina	2e377190f5	asahi: Disable tilebuffer write masking optimization This seems to flake some dEQPs due to some kind of race/UB (which doesn't even always cause the dEQPs to fail due to leeway in the image comparison, since the problem is usually just a few pixels, but it's there). I spent a bunch of time trying other flags/things, and almost everything changed the bad pixel pattern randomly but nothing fixed it. Let's revisit this one later, since it looks like a pretty deep rabbit hole. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:47 +00:00
Asahi Lina	6f57f952fc	asahi: Make framebuffer texture barriers a no-op Framebuffer fetch is coherent, so there is no need for barriers here. This avoids pointless flushing if an app calls glBlendBarrier(). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:47 +00:00
Asahi Lina	69740fb82b	asahi: Implement create_fence_fd and fence_server_sync Apparently we were still missing some fence stuff, and it started crashing Firefox in apitrace? I'm not sure why we never noticed this before, but it's trivial enough. Cargo culted from Panfrost. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:47 +00:00
Asahi Lina	86d41cb7bd	asahi: Implement memory_barrier Cargo culted from panfrost. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:47 +00:00
Matt Turner	435a607909	intel: Disable shader cache when executing intel_clc during the build With the shader cache enabled, intel_clc attempts to write to ~/.cache. Many distributions' build systems limit file-system access, and will kill the process thus causing the build to fail. Fixes: `639665053f` ("anv/grl: Build OpenCL kernels") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22968>	2023-05-11 23:00:01 +00:00
Chia-I Wu	6aee7848bb	radv: improve externalMemoryFeatures for android ahb VK_EXTERNAL_MEMORY_FEATURE_DEDICATED_ONLY_BIT should always be set, as required by the spec. VK_EXTERNAL_MEMORY_FEATURE_EXPORTABLE_BIT should be set when radv_ahb_format_for_vk_format knowns the format. That is, radv_create_ahb_memory should at least know how to call AHardwareBuffer_allocate. VK_EXTERNAL_MEMORY_FEATURE_IMPORTABLE_BIT is always set. We can't know if gralloc can allocate the format/flags/usage combo or not (gralloc might use a private format for the combo). Fixed dEQP-VK.api.external.memory.android_hardware_buffer.image_formats.*. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:03 +00:00
Chia-I Wu	eaf1776586	anv,hasvk: android ahb is not always exportable anv_ahb_format_for_vk_format needs to know the format at least. There is no guarantee that AHardwareBuffer_allocate will succeed, but we are reluctant to check with AHardwareBuffer_isSupported which may test-allocate internally and is expensive. v2: add anv_ahb_format_for_vk_format to anv_android_stubs.c Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:03 +00:00
Chia-I Wu	47b37651f8	vulkan: add vk_image_format_to_ahb_format There should be no functional change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:03 +00:00
Chia-I Wu	380180516c	anv,hasvk,radv: do not fall back to AHARDWAREBUFFER_FORMAT_BLOB When allocating a VkDeviceMemory exportable as AHB, it seems incorrect to fall back to AHARDWAREBUFFER_FORMAT_BLOB when the image has no known AHB format. We should fail the allocation instead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:03 +00:00
Chia-I Wu	50e703f347	vulkan: add vk_ahb_format_to_image_format There should be no functional change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Chia-I Wu	2bbe0462e8	vulkan: define inline stubs when android api level < 26 This allows us to reduce ANDROID #ifdef's. v2: always include vk_android.h in radv_formats.c Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Chia-I Wu	f81dce9bcc	vulkan: rename vk_image::ahardware_buffer_format Rename it to ahb_format. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Chia-I Wu	5561abcb2c	vulkan: make sure vk_image_view::format is never UNDEFINED Remove redundant override in anv and hasvk as well. Fixed android.graphics.cts.BasicVulkanGpuTest#testBasicBufferImportAndRenderingExternalFormat for radv. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Chia-I Wu	df8ec99c81	vulkan: make sure vk_image::format is never UNDEFINED vk_image::android_external_format is only used for sanity check and is removed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Chia-I Wu	0a4c92b646	hasvk: Use the common vk_ycbcr_conversion object Based on commit `30a91d333d` ("anv: Use the common vk_ycbcr_conversion object"). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Chia-I Wu	cb6d655f53	hasvk/android: Use VkFormat for externalFormat Same as commit `18feb32df0` ("anv/android: Use VkFormat for externalFormat"), but for hasvk. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Chia-I Wu	6039f2a22f	hasvk: Refactor Android externalFormat handling in CreateYcbcrConversion Same as commit `9fc046a87d` ("anv: Refactor Android externalFormat handling in CreateYcbcrConversion"), but for hasvk. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22619>	2023-05-11 22:18:02 +00:00
Jesse Natalie	bafa5efcfc	dzn: Enable KHR_shader_integer_dot_product Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22952>	2023-05-11 21:56:31 +00:00
Jesse Natalie	a6ea08c542	microsoft/compiler: Enable packed dot product intrinsics for SM6.4+ Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22952>	2023-05-11 21:56:31 +00:00
Jesse Natalie	217bbdc4fd	microsoft/compiler: Take inputs from callers before providing nir options The base nir options were assuming all bit sizes were supported at shader model 6.2. Multiple callers were then changing properties based on actual support. Standardize behavior by providing the majority of things that can impact nir options when getting them. Some callers (e.g. meta blit shaders or libclc) don't bother, because they are known to have contents that are unaffected by these options. Other callers might munge more properties afterwards, but this minimizes that. Note that lower_helper_invocation was incorrectly being turned off for SM6.6+ by some callers, despite load_helper_invocation being unimplemented by the backend. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22952>	2023-05-11 21:56:31 +00:00
Jesse Natalie	f2945409b3	dzn: Enable 64-bit ints and floats Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22952>	2023-05-11 21:56:31 +00:00
Jesse Natalie	9dc009e7ae	d3d12: Convert from D3D shader model to Mesa shader model earlier Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22952>	2023-05-11 21:56:31 +00:00
Jesse Natalie	7cdbf4f065	spirv2dxil: Support int64 and doubles Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22952>	2023-05-11 21:56:31 +00:00
Alyssa Rosenzweig	95d93b24f6	zink: Always set a blend state for shader-db If we're compiling shaders in shader-db, with shader-db's ./run and ZINK_DEBUG=shaderdb, we won't get much state set on the graphics pipeline, since shader-db doesn't actually do any rendering. For a driver like RADV, that is almost ok... Since we use dynamic vertex input, we don't need to make up any state for vertex inputs; since we use dynamic rendering, we don't need to make up any render attachments. All of that being said, we do need to make up a blend state to ensure that the Vulkan driver doesn't optimize away all of store_derefs in the fragment shader (and in turn, optimize the entire fragment shader away, if there are no image/SSBO writes.) So set the obvious blend state, fixing fragment shaders in shader-db with zink + radv. I don't know why other people would want to use Zink with shader-db, but for me it's an easy way to test ACO, at least until radeonsi gains aco support. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22948>	2023-05-11 21:29:47 +00:00
Caio Oliveira	d3bdddcf2a	spirv: Use NIR_PASS for spirv2nir --optimize This allows us to use NIR_DEBUG=print to see each step. Also use an OPT macro to make code slightly more readable. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22764>	2023-05-11 19:53:17 +00:00
Caio Oliveira	f4c4832689	spirv: Do more on spirv2nir --optimize Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22764>	2023-05-11 19:53:16 +00:00
Lionel Landwerlin	c61eea2ff3	intel/mi_builder: fixup tests for newer kernel uAPI Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22966>	2023-05-11 19:15:06 +00:00
José Roberto de Souza	4d4b0dfdb8	anv: Set memory types supported by Xe KMD Due the lack of APIs to set mmap modes, Xe KMD can't support the same memory types as i915. So here adding a i915 and Xe function to set memory types supported by each KMD. Iris function iris_xe_bo_flags_to_mmap_mode() has a table with all the mmaps modes of each type of placement. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22906>	2023-05-11 18:28:11 +00:00
Leo Liu	ffbbf23ef8	radeonsi: Use vcn version instead of CHIP family for VCNs Decouple it from CHIP family, based on HW query infomation. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22904>	2023-05-11 18:01:10 +00:00
Leo Liu	09e59553ec	amd: Add vcn ip version info And make it support for kernel w/wo ip_discovery. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22904>	2023-05-11 18:01:10 +00:00
Leo Liu	82a064020c	radeonsi: Remove redundant vcn_decode from info Use the number of queue instead. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22904>	2023-05-11 18:01:10 +00:00
MouriNaruto	90c3fd0c83	dzn: Fix segmentation fault when Direct3D 12 user mode driver from at least one of GPUs is not available. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22961>	2023-05-11 15:58:51 +00:00
Alyssa Rosenzweig	5a80bf2eb0	agx: Optimize multiplies We have an imad instruction and our iadd has a small immediate shift on the second source. Together, these allow expressing lots of integer multiplies more efficiently. Add some rules to optimize these now that the backend compiler can ingest the optimized forms. Half-register changes are from load_const scheduling changing in some vertex shaders. total instructions in shared programs: 1539092 -> 1537949 (-0.07%) instructions in affected programs: 167896 -> 166753 (-0.68%) total bytes in shared programs: 10543012 -> 10533866 (-0.09%) bytes in affected programs: 1218068 -> 1208922 (-0.75%) total halfregs in shared programs: 483180 -> 483448 (0.06%) halfregs in affected programs: 1942 -> 2210 (13.80%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:23 -04:00
Alyssa Rosenzweig	c2793a304d	agx: Fix packing of imsub instructions The negate for imad is on the third source (a * b - c), not the second source. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:23 -04:00
Alyssa Rosenzweig	8289fa253b	agx: Handle imadshl_agx, imsubshl_agx Same hardware instructions as iadd/isub/imad/imsub, just with the extra input represented in NIR as required. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:23 -04:00
Alyssa Rosenzweig	18e19882fa	nir: Model AGX-specific multiply-shift-add Models `(a * b) + (c << d)` in general, as implemented in various forms on AGX. This will be fused with backend NIR opt algebraic rules, both for the literal pattern as well as to strength reduce certain multiplications, e.g. replacing a * 5 with `a + (a << 2)` expressed as imadshl_agx(a, 1, a, 2). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:09 -04:00
Alyssa Rosenzweig	3df4ae3334	agx: Use nir_alu_src_as_uint Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:04 -04:00
Alyssa Rosenzweig	445e2f1620	pan/bi: Use nir_alu_src_as_uint Fixes some theoretical issues with swizzle handling. Unsure if this could cause actual end-to-end miscompiles. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:04 -04:00

1 2 3 4 5 ...

171026 Commits