AlexIndustrial/mesa

Author	SHA1	Message	Date
Ian Romanick	c5d731ac5c	intel/stub: Implement I915_PARAM_HAS_USERPTR_PROBE Just say no for now. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:57 -08:00
Ian Romanick	832db9d900	intel/stub: Implement DRM_I915_QUERY_MEMORY_REGIONS Borrowed from sim-drm. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:55 -08:00
Ian Romanick	4c429b6be6	intel/stub: Implement DRM_I915_QUERY_ENGINE_INFO Borrowed from sim-drm. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:53 -08:00
Ian Romanick	12d35892e0	intel/stub: Suppress warnings about DRM_I915_QUERY_PERF_CONFIG There's not a useful way to implement this, so just silence the warning to cleanup shader-db runs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:48 -08:00
Ian Romanick	ff74d5dd1b	intel/compiler: Don't store "scalar stage" bits on Gfx8 or Gfx9 Since `1d71b1a311`, only Gfx7 and earlier have any vec4 stages ever. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14128>	2021-12-08 14:59:32 -08:00
Ian Romanick	4563261ad1	intel/compiler: Don't predicate a WHILE if there is a CONT Previously a predicated BREAK that appeared immediately before the WHILE would get merged into the WHILE. This doesn't work if other flow control (e.g., a CONT) can transfer directly to the WHILE. On Intel platforms, this fixes the CTS test dEQP-VK.graphicsfuzz.stable-binarysearch-tree-nested-if-and-conditional. No shader-db changes on any Intel platform. When this commit was first created (over a month before it is going to land), there were some regressions that were prevented by other commits in MR !13095. That does not appear to be the case now, so I don't know what changed. Basically, the treatment of discard as a combination of demote and terminate causes additional continues in some loops, and those continues trigger this bug. The other commits from that MR prevent those continues from being generated in the first place. All Intel platforms had similar fossil-db results. (Ice Lake shown) Instructions in all programs: 144419989 -> 144419995 (+0.0%) SENDs in all programs: 6947332 -> 6947332 (+0.0%) Loops in all programs: 38277 -> 38277 (+0.0%) Spills in all programs: 204075 -> 204075 (+0.0%) Fills in all programs: 319480 -> 319480 (+0.0%) A few shaders in Doom 2016 were hurt by one instruction each. It seems likely that these shaders would have experienced at least some mis-rendering. Closes: #4213 Fixes: `d13bcdb3a9` ("i965/fs: Extend predicated break pass to predicate WHILE.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14128>	2021-12-08 14:56:32 -08:00
Dave Airlie	d051854cca	treewide: drop mtypes/macros includes from main These aren't required in lots of places, so remove them. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14127>	2021-12-08 22:14:45 +00:00
Francisco Jerez	de55fd358f	intel/fs/xehp: Teach SWSB pass about the exec pipeline of FS_OPCODE_PACK_HALF_2x16_SPLIT. This virtual instruction is translated into multiple half float physical instructions, even though its destination is typically of integer type, which prevents the software scoreboard pass from inferring the correct execution pipeline for the virtual instruction on XeHP+ platforms. Teach the SWSB lowering pass about this inconsistency between the IR and physical instruction types. Fixes among other tests: dEQP-GLES31.functional.shaders.builtin_functions.pack_unpack.packhalf2x16_compute Fixes: `d4537770bb` ("intel/fs: Add helper functions inferring sync and exec pipeline of an instruction.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5685 Reported-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14002>	2021-12-08 02:47:11 +00:00
Dave Airlie	34804e1266	intel/crocus: push main/macros.h out to the users Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Dave Airlie	9105cf1955	intel/compiler: drop shader_info.h from compiler header include it explicitly in the correct places Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Dave Airlie	9265d1d62d	brw/compiler: drop mtypes.h from compiler This adds a bunch of other headers in, and adds mtypes.h to iris for perf query object. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Dave Airlie	3f35b5fdc9	anv: include futex.h explicitly in allocator. This file needs futexes so make an explicit include, so it doesn't come via the compiler Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Nanley Chery	0733266706	intel/isl: Drop extra devinfo checks for CCS support These checks are done in isl_format_supports_ccs_*. Since isl_surf_supports_ccs calls these functions, it doesn't need to check them itself. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14082>	2021-12-07 23:31:23 +00:00
Nanley Chery	0e075227d8	intel/isl: Restore CCS_E support for YUYV and UYVY These formats are used when creating surfaces with the I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS modifier. Makes iris pass the out-of-tree piglit test, ext_image_dma_buf_import-intel-modifiers. Fixes: `1433fe7860` ("intel/isl: Unify fmt checks in isl_surf_supports_ccs") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14082>	2021-12-07 23:31:23 +00:00
Dave Airlie	55b396e743	mesa/crocus/iris/blorp: drop minify macro in favour of u_minify This macro is duplicated, clean it up. Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14103>	2021-12-07 19:04:01 +00:00
Dave Airlie	9bb375b0be	intel/compiler: drop glsl options from brw_compiler Only the nir options are used now, since i965 was dropped, the glsl options come from the state tracker Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14102>	2021-12-07 08:52:36 +00:00
Kenneth Graunke	2d7c25fb9d	isl: Move some genxml surface state helpers into an include file On XeHP, the XY_BLOCK_COPY_BLT command has a number of fields that describe the layout of the surface, much like SURFACE_STATE does. Several of them are encoded in such a similar manner that we really would like to reuse the isl helpers for emitting those. This commit moves them into a new isl_genX_helpers.h file which I can include from the BLORP code. (The alternative would be to add XY_BLOCK_COPY_BLT filling commands to isl, but that...seems more like a BLORP feature.) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14094>	2021-12-06 17:23:56 -08:00
Lionel Landwerlin	d44478483c	genxml: protect _length defines in genX_bits.h Those defines exist in the packing headers too and some parts of the code (like mi_builder.h) include both. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13954>	2021-12-06 08:02:59 +00:00
Lionel Landwerlin	e9b58116ea	genxml: fix compilation with P/I defines Those names are a bit too common and sometimes clash with variables. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13954>	2021-12-06 08:02:59 +00:00
Lionel Landwerlin	365903ebbb	intel/debug: reclaim 7 unused bits from classic driver Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14060>	2021-12-06 09:44:04 +02:00
Lionel Landwerlin	7661237a31	intel/nir: preserve access value when duping intrinsic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `6339aba775` ("intel/compiler: Lower SSBO and shared loads/stores in NIR") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Marcin Ślusarz	bd2c11dfa8	intel/compiler: Load draw_id from XP0 in Task/Mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Marcin Ślusarz	b717872e08	intel/compiler: Get mesh_global_addr from the Inline Parameter for Task/Mesh Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Marcin Ślusarz	28e0c63a4c	intel/compiler: extract brw_nir_load_global_const out of rt code Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	1f438eb033	intel/compiler: Implement Mesh Output Use the same URB access helpers that were added for Task Output. The Arrayed I/O (per-primitive and per-vertex) is handled by applying the pitch from the MUE layout into the NIR intrinsics and including the non-arrayed offset on top of it. After that, the index src can be used directly for lowering. Because we keep around the non-arrayed offset AND the pitch is aligned, we can identify cases where the access is indirect but guaranteed to be aligned, and dispatch a single message. Added a TODO to explore that later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	70ace2bbcd	intel/compiler: Implement Task Output and Mesh Input Implement the output written by the task workgroup and available to all the mesh workgroups dispatched from that task. We currently ignore any layout annotations (since they are not really testable) and produce a (packed) layout ourselves. The URB messages are only SIMD8, so for larger SIMDs, the functions will produce multiple messages. Making this lowering here instead of the generic lower_simd_width() since it is not just a matter of zip/unzip, e.g. the offset must be adjusted. Indirect writes/reads are implemented by handling one component at a time and using the PER_SLOT variant of the messages. Note that VK_NV_mesh_shader allows reading outputs, so add support for that as well. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	171bdd2ec6	intel/compiler: Lower Task/Mesh local_invocation_{id,index} The Invocation index is provided by the payload, so we can skip the usual math done to get to it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	db23c41537	intel/compiler: Add backend compiler basics for Task/Mesh Task/Mesh stages are CS-like stages, and include many builtins (e.g. workgroup ID/index) and intrinsics (e.g. workgroup memory primitives) originally present only in CS. This commit adds two new stages (task and mesh) that 'inherit' from CS by embedding a brw_cs_prog_data in their own prog_data structure, so that CS functionality can be easily reused. They also currently use the same helpers to select the SIMD variant to use -- that was recently added for CS. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	827cf65a26	intel/compiler: Export brw_nir_lower_simd Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	09dd05a219	intel/compiler: Make MUE available when setting up FS URB access Allows to assert its existence for per-primitive variables and will later be useful to implement the "more than 16 attributes" case for Mesh. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	79e5e353e4	intel/compiler: Add structs to hold TUE/MUE Used to specify the layout of 'Task URB Entry' and 'Mesh URB Entry'. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	fcc1ccf541	intel/compiler: Don't lower Mesh/Task I/O to temporaries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	18e1c9c542	intel/compiler: Don't stage Task/Mesh outputs in registers Since the outputs are shared among the whole workgroup, these can't be staged in registers as they will not be always visible for all the invocations (to read/flush). If they ever need to be staged, we should use SLM for that. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	be89ea3231	intel/compiler: Handle per-primitive inputs in FS In the Fragment Shader, regular inputs are laid out in the thread payload with one dword per half-GRF, which leaves room for the two delta dwords needed for interpolation. Per-primitive inputs are laid out before the regular inputs, and since there's no need to have delta information, they are packed. So each half-GRF will be fully filled with 4 dwords of input. When num_per_primitive_inputs is zero (the default case), behavior should be the same as before. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	7938c38778	intel/compiler: Properly lower WorkgroupId for Task/Mesh Task/Mesh currently only support a single dimension (both in NV API and HW), so make Y and Z be zero. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	76f55d7556	intel: Add INTEL_DEBUG=task,mesh Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Dylan Baker	cdde031ac2	classic/i965: Remove driver Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Tapani Pälli	d44d2e823f	anv: allow VK_IMAGE_LAYOUT_UNDEFINED as final layout From VK_KHR_synchronization2: "Image memory barriers that do not perform an image layout transition can be specified by setting oldLayout equal to newLayout. E.g. the old and new layout can both be set to VK_IMAGE_LAYOUT_UNDEFINED, without discarding data in the image." v2: make assert more readable (Lionel Landwerlin) Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14008>	2021-12-03 09:10:32 +00:00
Jordan Justen	0634cb741b	intel: Add intel_gem_create_context_engines Engine-based contexts operate somewhat differently when executing batches. Previously, we would specify a bitmask value such as I915_EXEC_RENDER to run the batch on the render ring. With engine-based contexts, this instead becomes an array of "engines", and when the context is created we specify the class and instance of the engine. Each index in the array has a separate hardware-context. Previously we had to create separate kernel level contexts to create multiple hardware contexts, but now a single kernel context can own multiple hardware contexts. Another forward looking advantage to using the engine-based contexts is that the kernel does not plan to add new supported I915_EXEC_FOO masks, whereas they instead plan to add new I915_ENGINE_CLASS_FOO engine classes. Therefore some rings may only be usable with an engine-based class. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:38 -08:00
Jordan Justen	9a9042a904	intel: Add intel_gem_count_engines Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:31 -08:00
Nanley Chery	1433fe7860	intel/isl: Unify fmt checks in isl_surf_supports_ccs On TGL+, require that the surface format supports CCS_E in order to support CCS. This aligns with the ISL code that pads the primary surface for CCS on this platform. Pre-TGL, require support for either CCS_D or CCS_E. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12398>	2021-12-01 20:36:38 +00:00
Nanley Chery	18cd0a5409	anv: Drop code from get_blorp_surf_for_anv_buffer The code to handle ASTC surfaces hasn't been needed since commit `dd92179a72` ("anv: Canonicalize buffer formats for image/buffer copies"). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Nanley Chery	355f318843	anv: Allow transfer-only linear ASTC images Some apps depend on this to run. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2397 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Nanley Chery	bdf8b36c4c	anv: Require transfer features for transfer usages In order for an image to support the transfer usage, require that its format can be used for blits or copies. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Nanley Chery	caa998ca8f	intel/isl: Allow creating non-Y-tiled ASTC surfaces The sampler can only decode ASTC surfaces that are Y-tiled. ISL has been asserting this restriction at surface creation time. However, some drivers want to create a surface that is only used for copying compressed data. And during the copy, the surface won't have a compressed format. To enable this behavior, we choose to move the tiling assertion to the moment a surface state is created for the sampler. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Kenneth Graunke	574c5d1540	blorp: Disallow multisampling for BLORP compute blits and copies. We don't support typed image writes for multisampling, so we can't handle multisampled destinations. We also usually handle MSAA by running the fragment shader per-sample, which we aren't accounting for in our compute shaders, so we can't handle MSAA sources either. We could do both of these things if we really wanted to, but we don't. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Kenneth Graunke	f0744ebef2	blorp: Assert that BLORP_BATCH_PREDICATE_ENABLE isn't set for compute We don't support this, so make sure it isn't happening. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Kenneth Graunke	5dc36e5e93	blorp: Don't try to use the 3D stencil write hardware for compute When we're doing a stencil blit via a fragment shader, we can avoid W-tiling shenanigans by using the stencil write hardware on Skylake and later. Of course, the compute engine doesn't have stencil fragment writes, so it can't do that. Just fall back to the detiling shenanigans. Caught by Piglit's arb_copy_image-formats when forcing iris to use BLOCS for resource_copy_region on Icelake. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Kenneth Graunke	d832209a78	blorp: Fix compute-blits for rectangles not aligned to the workgroup When dispatching compute shaders to do a blit, our destination rectangle may not line up perfectly with the workgroup size. For example, we may round the left x0 coordinate down to a multiple of the workgroup width, and the right x1 coordinate up to the next multiple of the workgroup width. Similarly for y0/y1 and workgroup height. This means that we may dispatch additional invocations which should not actually do any blitting. We need to set key->uses_kill to bounds check and drop those. Caught by Piglit's arb_copy_image-simple when forcing iris to perform resource_copy_region via BLOCS and running with INTEL_DEBUG=norbc on Icelake. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Lionel Landwerlin	87888c0b3f	anv: fix execbuf syncobjs/syncobj_values array leak Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `36ea90a361` ("anv: Convert to the common sync and submit framework") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13945>	2021-11-24 17:29:46 +00:00
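
The rounding described in commit d832209a78 (destination rectangle expanded to workgroup multiples, with the extra invocations dropped by a bounds check) might look roughly like the sketch below. This is an illustration only, not the BLORP code: `struct rect`, `align_to_workgroup`, and `invocation_in_bounds` are hypothetical names, and the sketch assumes x1/y1 are exclusive upper bounds.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical rectangle type; x1/y1 are assumed to be exclusive. */
struct rect {
   uint32_t x0, y0, x1, y1;
};

static uint32_t
round_down(uint32_t v, uint32_t a)
{
   return v - (v % a);
}

static uint32_t
round_up(uint32_t v, uint32_t a)
{
   return round_down(v + a - 1, a);
}

/* Expand the destination rectangle so that each edge falls on a multiple
 * of the workgroup size. The expanded rectangle is what gets dispatched.
 */
static struct rect
align_to_workgroup(struct rect dst, uint32_t wg_w, uint32_t wg_h)
{
   assert(wg_w > 0 && wg_h > 0);

   struct rect r = {
      .x0 = round_down(dst.x0, wg_w),
      .y0 = round_down(dst.y0, wg_h),
      .x1 = round_up(dst.x1, wg_w),
      .y1 = round_up(dst.y1, wg_h),
   };
   return r;
}

/* Bounds check an invocation would perform before writing: invocations
 * that land in the padding outside the original rectangle do nothing.
 */
static bool
invocation_in_bounds(struct rect dst, uint32_t x, uint32_t y)
{
   return x >= dst.x0 && x < dst.x1 && y >= dst.y0 && y < dst.y1;
}
```

With a hypothetical 8x4 workgroup, a destination of (3,5)-(21,9) would be dispatched as (0,4)-(24,12), and only the invocations that pass `invocation_in_bounds` against the original rectangle would blit.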

10573 Commits