AlexIndustrial/mesa

Author	SHA1	Message	Date
Oleksii Bozhenko	d5d8bb1dbb	brw: fix saturate propagation region overlap range Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/commit/947c828d5cbffe9640ac63103a6223112eeff27f Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7691 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Oleksii Bozhenko <oleksii.bozhenko@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20206>	2022-12-09 00:39:05 +00:00
Tapani Pälli	bc4b7de0d0	intel/fs: implement Wa_14017989577 The first instruction of any kernel should have non-zero emask. This restriction needs to be obeyed to avoid GPU hangs. Patch adds a function to insert dummy mov as first instruction to make sure this requirement is fulfilled. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20194>	2022-12-08 23:58:32 +00:00
Kenneth Graunke	bafbe7c23a	intel/compiler: Set NoMask on cr0 access for float controls mode This is trying to clear a bit in the control register. However, it's executing with whatever channel mask happens to be active. Typically this is the one at the start of the program, so at least some channels will be active. Typically the first channel will be active due to packed dispatch, but that's not always guaranteed. Without NoMask, the float controls writes may randomly not happen. Recent GPUs also seem to have a hang issue when the first instruction in the shader doesn't have any active channels. Having an instruction with NoMask at the start of the program works around the issue. See HSD bug 14017989577. In our case, the float controls preamble was breaking that restriction every time, causing us to run into this problem frequently. Thanks to Tapani Pälli for finding this hang issue, and Francisco Jerez and Lionel Landwerlin for helping pinpoint this issue during review of a workaround patch in !20194. Fixes GPU hangs in Elder Scrolls Online, Witcher 3, and likely more. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7639 Fixes: `9da56ffc52` ("i965/fs: add emit_shader_float_controls_execution_mode() and aux functions") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20214>	2022-12-08 09:54:09 +00:00
Väinö Mäkelä	4035853523	hasvk: Report correct multisampling limits on gfx7 Some limits reported by hasvk were too high, which caused CTS tests to fail. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19393>	2022-12-08 00:16:44 +00:00
Otavio Pontes	2e775b8bdb	anv/hasvk: Clamping Scissor Rect values in a valid range On cmd_buffer_emit_scissor(), if VkViewport height or width are set to a value lower than 1.0, y_max or x_max can be attributed negative values, causing an overflow. That leads to ScissorRectangleYMax or ScissorRectangleXMax to be set to values on an unsupported range. Clamping x_max and y_max in the valid range solves the problem. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7471 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20200>	2022-12-07 12:19:42 +00:00
Lionel Landwerlin	e25e17dd0c	intel/fs: clamp per vertex input accesses to patchControlPoints In a tesselation control shader where an input array is accessed using the index gl_InvocationID, we can end up accessing elements beyond the number of input vertices specified in the shader key. This happens because of the lowering in nir_lower_indirect_derefs(). This lowering will affect compact variables which happens in this case : in gl_PerVertex { vec4 gl_Position; float gl_ClipDistance[1]; } gl_in[gl_MaxPatchVertices]; The lowered code produced by NIR is somewhat ineffecient (implements a binary seach) : if (gl_InvocationID < 16) { if (gl_InvocationID < 8) { if (gl_InvocationID < 4) { vec4 vals = load_at_offset(0); value = bcsel(vals, gl_InvocationID); } else { vec4 vals = load_at_offset(4); value = bcsel(vals, gl_InvocationID - 4); } } else { if (gl_InvocationID < 12) { vec4 vals = load_at_offset(8); value = bcsel(vals, gl_InvocationID - 8); } else { vec4 vals = load_at_offset(12); value = bcsel(vals, gl_InvocationID - 12); } } } else { if (gl_InvocationID < 24) { ... } else { ... } } By default the gl_MaxPatchVertices must be set at 32 items and that's what the lowering code will use to divide the access into chunks of 4. But when running with 3 input vertices, this means we'll pull one more item than what was delivered in the shader payload. This triggers issues further down the register scheduling where the g5UD (register for the 4th item) is overwritten by a previous SEND, leading the URB read to use an invalid handle. This pass clamps any access load_per_vertex_input intrinsic vertex indice to (input_vertices - 1). Fixes issues with tests like : dEQP-VK.clipping.user_defined.clip_distance.vert_tess.* Also fixes a hang with zink/anv on : KHR-GL46.draw_elements_base_vertex_tests.AEP_shader_stages v2: Don't replace source register v3: Implement in NIR v4: Clamp per vertex array sizes in NIR (Jason) v5: Move the clamping on the intel compiler Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9749>	2022-12-07 08:16:03 +00:00
Marcin Ślusarz	7809f76fe8	intel/compiler/mesh: align payload size to the size of vec4 This reduces the number of instructions in task shaders when payload size is not aligned to vec4 and payload_in_shared WA is enabled, because nir_lower_task_shader will not need to handle the unaligned size case. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20080>	2022-12-06 16:31:11 +00:00
Nanley Chery	ea4de4ad3d	anv: Don't ambiguate for undefined layouts on TGL+ For Tiger Lake and onward, we generally don't need to ambiguate the CCS before accessing it. This is safe for two reasons: - Tiger Lake and onward treat all CCS values as legal. - We enable compression on all writable image layouts. The CCS will receive all writes and will therefore always be valid. When dealing with modifiers, we continue to allow ambiguates in some instances. Before this patch, I found ~19.5k ambiguates in Wolfenstein: Youngblood's Riverside benchmark (note that this includes manually entering the benchmark and exiting the app). With this patch, the number of ambiguates goes down to zero. Improves performance of Fallout 4 at 1080p/High settings on Arc A380 by around 22%. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Nanley Chery	5c84b31891	anv: Move aux vars up in transition_color_buffer I'd like to reuse one of them for an assert. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Nanley Chery	822687f4c0	intel/dev: Add a has_illegal_ccs_values flag Whether or not CCS can be used without initialization depends on the platform: - On gfx7-8, each CCS element is 1-bit and encodes "fast-cleared" or "pass-through". So, those platforms have no illegal values. - On gfx9-11, each CCS element is 2-bits and some bit combinations are invalid. - On gfx12+, each CCS element is 4-bits but they have no truly illegal values. Unused encodings are interpreted as "pass-through". Refer to the "MCS/CCS Buffers for Render Target(s)" sections of the PRMs for more info. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Nanley Chery	d307655e52	anv: Use specific flush reasons for CCS operations When INTEL_DEBUG=pc is set and a CCS operation is being performed, the driver reports that flushes are happing before and after the operation. It also reports that the operation is a fast clear, but that's not always the case. We could be resolving for example. Reporting the specific operation can help avoid confusion. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Lionel Landwerlin	d4cd33630a	intel: add missing restriction on fragment simd dispatch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7755 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169>	2022-12-06 00:37:50 +02:00
Lionel Landwerlin	b9403b1c47	intel: factor out dispatch PS enabling logic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169>	2022-12-06 00:37:47 +02:00
Sviatoslav Peleshko	77ecf9149c	anv: Defer flushing PIPE_CONTROL bits forbidden in CCS while in GPGPU mode Fixes: `313aeee8` ("anv: Use pending pipe control mechanism in flush_pipeline_select() ") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7816 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20124>	2022-12-03 00:10:32 +00:00
Lionel Landwerlin	b7b91ae51e	anv: enable VK_KHR_ray_tracing_maintenance1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	d844fa4def	anv: implement new queries for VK_KHR_ray_tracing_maintenance1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	4d05be49c2	anv: implement vkCmdTraceRaysIndirect2KHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	675c5bd4cc	anv: refactor ray tracing dispatch Preparing for vkCmdTraceRaysIndirect2KHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	df38426072	intel/rt/nir: add support for RayCullMaskKHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	6202a2c6b4	intel/rt/nir: enable the trampoline shader to load the indirect ray shader bsr Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	af3f7948d1	anv: correctly predicate ray tracing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7479fe6ae0` ("anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	7d7c32de4c	anv/genxml: make gen_rt more like other genxml files The main goal is to be able to generate genX_bits.h for those structures so we can get generated field offsets. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	8baacba4d6	hasvk: remove coarse pixel checks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	2d150f3ecd	hasvk: Drop more DG2 code v2: remove unused devinfo (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	d0fea83d7b	hasvk: Rip out local memory support Things could probably be simplified further but this at least gets rid of most of the dead code and the dead flags and fields. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	4256d2cbc2	hasvk: Rip out scratch surfaces These are a DG2+ thing Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	eea49c7d32	hasvk: Drop SKL+ features Most of these have already had all the code removeed. We just need to remove the feature bits and queries. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	b71ac720a8	hasvk: Drop support for atomic_int64 and atomic_float2 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	49201fe8c1	hasvk: Drop bindless image support Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	7b700369b1	hasvk: Drop A64 descriptor set support It's only used by task/mesh and ray-tracing. Also drop a couple remaining ray query things and a task/mesh we left behind. v2: Fix incorrect use of nir_load_desc_set_address_intel (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	85cfa21e04	hasvk: Drop remnants of ray queries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	e490434479	hasvk: Drop CCS_E support Oh, for the days of Broadwell and earlier where compression was called fast-clear. That was a simpler time. The birds sang in the trees, the oceans weren't brown from oil spills, and Intel surface compression was actually comprehendable by humans. To help the reviewer, keep the following in mind: 1. CCS_E is SKL+ 2. Implicit CCS is TGL+ 3. The AUX TT (AKA aux map) is TGL+ 4. HIZ+CCS, stencil CCS, and CCS for storage images are all TGL+ 4. CCS_D surfaces only ever get full resolves and MCS surfaces only ever get partial resolves Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	5f1dbd80b3	hasvk: Rip out primitive replication Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	7f97cd04c9	hasvk: Rip out remaining traces of CPS/FSR Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	90aab6e9a5	hasvk/gpu_memcpy: Rip out SKL+ Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:16 +00:00
Jason Ekstrand	6d80ce1283	hasvk/state: Rip out SKL+ v2: Fix incorrectly removed l3cr.SLMEnable setting (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:16 +00:00
Jason Ekstrand	ce57cc4397	hasvk/blorp: Rip out SKL+ Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:16 +00:00
Jason Ekstrand	cc68b7cd94	hasvk/pipeline: Rip out SKL+ v2: Fix incorrect DispatchMode removal (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:16 +00:00
Jason Ekstrand	91090e39af	hasvk/cmd_buffer: Rip out SKL+ support Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:16 +00:00
Lionel Landwerlin	0626b68c88	isl: don't report I915_FORMAT_MOD_Y_TILED_CCS on Gfx8 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:16 +00:00
Lionel Landwerlin	a855bdbf47	intel/nir/rt: switch to workgroup_id_zero_base RT don't use a base workgroup id so no reason of using workgroup_id. Additionally the lowering introduced in `b4dd3df227` requires something provides base_workgroup_id which we don't have for RT as it's not needed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b4dd3df227` ("intel/nir: Set has_base_workgroup_id for lower_compute_system_values") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7812 Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20115>	2022-12-02 05:25:22 +00:00
Jordan Justen	686ada78cd	intel/dev: Add (disabled) device info for MTL Reworks: * Jordan: INTEL_PLATFORM_MTL_M/INTEL_PLATFORM_MTL_P * Lionel: .has_coarse_pixel_primitive_and_cb * Jordan: .has_mesh_shading & .has_ray_tracing * Paulo: .has_64bit_float * José: .has_integer_dword_mul (BSpec: 47431) * Jordan: Comment pci device ids for now similar to DG2: * `70a4e64685` ("intel: Add disabled device ids for DG2") * `ad565f6b70` ("intel/dev: Enable first set of DG2 PCI IDs") Ref: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/include/drm/i915_pciids.h?h=v6.0-rc4#n736 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19658>	2022-12-01 16:22:47 +00:00
Marcin Ślusarz	db0e6f9a07	intel/compiler: user payload starts after TUE header & its padding All data written by the user are offset by TUE header size. Without this patch we copy the correct amount of user data, but both "from" and "to" offsets are wrong. Fixes: `37e78803d7` ("intel/compiler: use nir_lower_task_shader pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409>	2022-12-01 11:19:47 +00:00
Marcin Ślusarz	7aaafaa8ae	intel/compiler: adjust [store\|load]_task_payload.base too Base also needs to be converted from bytes to words. Fixes: `c36ae42e4c` ("intel/compiler: Use nir_var_mem_task_payload") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409>	2022-12-01 11:19:47 +00:00
Jason Ekstrand	216e5d6e10	hasvk: Drop anv_nir_add_base_work_group_id() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Jason Ekstrand	2806968af8	anv: Drop anv_nir_add_base_work_group_id() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Jason Ekstrand	b4dd3df227	intel/nir: Set has_base_workgroup_id for lower_compute_system_values This option didn't exist half a decade ago when I first implemented base workgroup support in ANV. It's cleaner to just have split system values like all the other zero_base+base things do. We currently only do this for COMPUTE and not KERNEL because it lets us avoid changing intel_clc for now. We can add KERNEL later if needed. We also don't do this lowering for task/mesh. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Jason Ekstrand	19ad2629d0	hasvk: Implement lower_base_workgroup_id Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Jason Ekstrand	3c09571f67	anv: Implement lower_base_workgroup_id Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Jason Ekstrand	7d2e3f660c	intel/fs: Support load_workgroup_id_zero_base Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00

1 2 3 4 5 ...

8749 Commits