AlexIndustrial/mesa

Author	SHA1	Message	Date
Mike Blumenkrantz	00a4dc57ce	zink: defer acquire semaphore destruction these have noticeable overhead, so handle them in the submit thread Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18364>	2022-09-19 19:49:11 +00:00
Mike Blumenkrantz	513fcb7936	zink: fix/relax resolve geometry check there's no requirement in the spec that the geometry for resolves must match, only that the geometry must be positive (i.e., no flipped extents) this avoids major perf issues for scaled resolves cc: mesa-stable Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18364>	2022-09-19 19:49:11 +00:00
Kuixi Ren	9c5edda3ca	radeonsi/vcn: Add ability to encode with ltr reads flags field from CurrPic struct in pps for VA_PICTURE_H264_LONG_TERM_REFERENCE. If found, Curr_pic.frame_idx wil be used for the long term reference index In get_picture_storage, check if current frame is ltr, and whether its ref frame is ltr. In radeon_enc_slice_header, adds the ref_pic_list_modification_flag_l0 and long_term_reference_flag for ltr v2: fix code formatting issues Reviewed-by: Ruijing Dong ruijing.dong@amd.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18219>	2022-09-19 18:22:38 +00:00
Alyssa Rosenzweig	bf8c08a0df	pan/bi: Implement unpack_64_2x32 This duplicates the lowering from nir_lower_packing. However, nir_lower_packing also lowers a pile of other instructions that we do implement natively, and this is easier than adding a bunch of knobs to nir_lower_packing to get just what we need. Fixes test-printf address_space_4. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	e9b69c2f79	pan/bi: Stub out scoped_barrier Implement like other workgroup barriers. No subgroup barriers yet, but that doesn't seem needed yet. Fixes test_basic.async_copy_global_to_local and a pile of other OpenCL tests. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	bd8c9442f9	pan/bi: Fix 1D array indexing on Valhall Array index always goes in the fourth 16-bit component on Valhall. I'm unsure whether that should also apply to Bifrost. `f256ec2a88` ("pan/bi: Fix 1DArray image coordinate retrieval") says that it should be in the third component on Bifrost, but I can't remember why that would be the case. Fixes OpenCL test image_streams.write.1darray on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	76d6bb4822	pan/bi: Use .auto for image stores Works around LLVM/SPIR-V stupidity. In effect this means we always use typeless image stores, which is good enough for both CL and GL. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	8b6611f4bf	pan/bi: Call nir_lower_64bit_phis Fixes test_basic.local_kernel_scope Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	1b03a04239	pan/bi: Scalarize phis before the opt loop Scalarizing phis results in vector constructions (nir_op_vec) of the same size as the phi, so a wide phi (>128-bit) will result in a wide vector op that the backend can't handle. These wide vector ops can always be copypropped away, but that relies on running NIR copy/prop after scalarizing phis, which was not always happening before. By scalarizing phis before the opt loop instead of after, we guarantee that copyprop and DCE run to completion and we get appropriately lowered code in the backend. Fixes parts of integer_ops.integer_divideAssign with longs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	55837efe14	pan/bi: Lower fisnormal Fixes test_bruteforce.isnormal. We don't implement fisnormal in the backend, but actually lower_bool_to_bitsize was failing earlier since there's no fisnormal32 to lower to either. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	ddcf4b1c7e	pan/bi: Lower <32-bit bit_count While we have a POPCOUNT.i32 instruction, we do not have v2i16/v4i8 variants. The code generated by lower_to_bitsize doesn't seem any better than what we could do ourselves, so let's use that. While we're at it, give bitfield_reverse the same treatment as we have only BITREV.i32. I don't think we can get <32-bit bitfield_reverse in either GL or CL, but that seems likely to change in the future. (It looks to be valid SPIR-V, at least.) Fixes integer_ops.popcount. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	bb0606f0ba	pan/bi: Handle swizzles in unpack_64_2x32_split_{x,y} No known fixes but this would still be wrong for OpenCL. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	f9a01af4f3	pan/bi: Allow selecting from an 8-bit vec8 The word offset is already handled by the above code, there's no need to restrict the further restrict the swizzle. This pattern can come up with OpenCL. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	65961848b1	pan/bi: Remove bogus assert for pack_32_2x16 The following IR is valid NIR: vec1 16 ssa_0 = ... vec1 32 ssa_1 = pack_32_2x16 ssa_0.xx In this case, pack_32_2x16 takes in a two component vector, but the source itself ssa_0 has only a single component. This is fine due to the shuffle, but will fail the assert. Remove the assert and all is well. Fixes test_relational.shuffle_copy. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	5689a932e8	pan/bi: Lower f2i8, f2u8 These need a simple two-instruction lowering regardless of the size of float involved. Fixes integer_ops.integer_divideAssign Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	82b912f844	pan/bi: Lower 8-bit min/max to bcsel+comparison We don't have an 8-bit CSEL, so this is the best we can do. It's easier to write the lowering as an algebraic rule since we don't need to do anything clever. Fixes integer_ops.integer_clamp. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	4ee56ecd9c	pan/va: Add 8-bit integer max assembler case This needs to be lowered to a two instruction sequence because there is no CSEL.v4s8. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	31a5eb6165	pan/bi: Add HADD.v4s8.rhadd packing test cases To confirm the XML is right. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	decc24b18b	pan/va: Pack .rhadd bit Fixes integer_ops.integer_rhadd. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	42a474daac	pan/bi: Handle uhadd, urhadd opcodes Fixes integer_ops.integer_hadd. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	c717c28d87	pan/va: Fix v4s8 form of R2 opcodes The XML had a typo which was copypasted (incorrectly) into various instructions. Fixes a pile of integer_ops subtests. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	48ba7f8627	pan/va: Pack IADD.sat bit Fixes 32-bit portion of integer_ops integer_add_sat. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	77fcb4b291	pan/bi: Strip negate when lowering swizzles When we lower swizzles, we move source modifiers (except for the swizzle) after the swizzle operation. In particular, we change the order of composition for negates and abs. However, copying the source will copy the modifiers unless we specifically strip the extra modifiers. That's harmless in practice on Bifrost, which doesn't check for extraneous modifiers, but is incorrect IR and trips an assertion in the Valhall packing code. Fixes test_relations.relational_bitselect. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	377bf3a5a4	pan/bi: Lower swizzles for 8-bit shifts Fixes integers_ops.integer_ctz Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	2e1b02e6a3	pan/bi: Test some 8-bit swizzle lowering Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	d76c48103f	pan/bi: Lower some 8-bit swizzles Fixes the 8-bit portion of OpenCL's integer_ops.integer_clz test case. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	d471b386c1	pan/bi: Unit test swizzle lowering We're about to extend this pass to support 8-bit swizzles. That will be a nontrivial change, so let's get some testing for what's already in the pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	1370c27728	pan/va: Fix missing swizzle on CLZ.v2u16 Fixes 16-bit portion of integer_clz. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	bdab1f9ce9	panfrost: Assume launch_grid parameters always change This is only a theoretical bug fix because, for now, we always reemit everything. But this aligns launch_grid with draw_vbo and makes the intention explicit, both seem like good things to me. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	b261a18550	panfrost: Honour flush-to-zero controls on Valhall Fixes math_bruteforce.atan2 and contractions tests. For OpenCL, we want to flush fp32 and preserve fp16, applying to both inputs and outputs so F16_TO_F32 acts as preserve, which implements CL spec text: > Denormalized numbers for the half data type which may be generated when converting a float to a half using vstore_half and converting a half to a float using vload_half cannot be flushed to zero Note that our libclc builds flush denorms and rusticl does not advertise denorms so we're expected to flush to zero. rusticl correctly sets the desired float controls, we just have to match to the hardware requirements. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	9333428ca2	panfrost: Advertise PIPE_CAP_INT64 nir_lower_int64 should be able to chew through everything anyway. Fixes compilers.feature_macro (with LLVM 15). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	b27589b5d4	panfrost: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS Bump to 2048, the minimum maximum for image support in the full profile of OpenCL. The relevant hardware limit is 65536 so we have plenty of clearance. Fixes api.get_image1d_array_info. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	ff29ff5fad	panfrost: Upload default sampler for txf In NIR, txf does not take a sampler. However, in the hardware it does take a sampler. If there is no sampler bound and we use txf, the hardware will read back all-0's due to bounds checking. As a workaround, bind a trivial sampler and use that. As-is this workaround is Valhall specific, making use of an extra resource table. I'm punting on generalizing back to Bifrost until I can discuss the issue in more depth with Jason and Karol and figure out the right fix. Fixes api.image_properties_query. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	6d180c84fb	panfrost: Allow compiling MESA_SHADER_KERNEL Required for Rusticl. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	185b3e2d7e	panfrost: Default pipe->clear_texture impl For rusticl. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Jason Ekstrand	8f4af4d700	nir/load_libclc: Don't add generic variants that already exist At some point in the future, adding generic variants to libclc will hopefully no longer be needed. At that point, we don't want the NIR code adding duplicates. Check if the generic version already exists and, if it does, don't re-add it. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18675>	2022-09-19 16:52:17 +00:00
Jason Ekstrand	2aa9eb497d	nir: Add a helper for finding a function by name Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18675>	2022-09-19 16:52:17 +00:00
Jason Ekstrand	0a06abbb91	spirv: Don't use libclc for wait_group_events v2: Drop old code (Karol) Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18675>	2022-09-19 16:52:17 +00:00
Vinson Lee	093b19b09a	egl/dri2: Fix missing return with dri2_egl_error_unlock. Fix defect reported by Coverity Scan. Double unlock (LOCK) double_unlock: dri2_egl_error_unlock unlocks dri2_dpy->lock while it is unlocked. Fixes: `f1efe037df` ("egl/dri2: Add display lock") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18655>	2022-09-19 16:24:08 +00:00
Alyssa Rosenzweig	a1faab0b90	agx: Convert and clamp array indices in NIR ..Rather than at backend IR translation time. This is considerably simpler because we can use the txs lowering instead of special casing array sizes. Unfortunately it generates worse code, but that gap should close once nir_opt_preamble is wired in. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18652>	2022-09-19 16:14:24 +00:00
Alyssa Rosenzweig	1304f4578d	panfrost: Adapt emit_shared_memory for indirect dispatch Indirect dispatch does not actually require any dynamic memory allocation, even with shared memory. We just need to set wls_instances to some (mostly arbitrary) value, statically allocate memory based on that, and let the hardware throttle workgroups to fit if needed. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18661>	2022-09-19 15:18:40 +00:00
Alyssa Rosenzweig	79b66a28cd	rusticl: Build Panfrost We want OpenCL, too! Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18663>	2022-09-19 14:50:09 +00:00
James Park	b7d4897df9	meson,amd: Remove Windows libelf wrap Functionality isn't worth the maintenance cost. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18478>	2022-09-19 12:51:12 +00:00
Illia Polishchuk	74658b01d2	driconf/Intel: Add lower_depth_range_rate option workaround for Homerun Clash misrendering issue Intel has different Z interpolation float point rounding than other mesa gpus For example gl_Position.z = 0.0 will be interpolated to gl_FragCoord.z = 0.5 for all gpus gl_FragCoord = -0.00000001 will be interpolated to gl_FragCoord.z = 0.4999999702 for Intel and rounded to gl_FragCoord.z = 0.5 for other gpus Games with LEQUAL depth func will fail depth test on Intel and will pass it on other gpus in such case This workaround lowers translated depth range and several gl_FragCoord.z coords with extra small difference will be translated to the same UINT16\UINT24\UINT32 value of an integer depth buffer Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7199 Signed-off-by: Illia Polishchuk <illia.a.polishchuk@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18412>	2022-09-19 10:08:48 +00:00
Marcin Ślusarz	dedd8affd8	anv: fix emission of primitive replication packet for mesh stage anv_pipeline_get_last_vue_prog_data (used by emit_3dstate_primitive_replication) doesn't work for mesh stage. Fixes: `ae57628dd5` ("anv: Drop anv_pipeline::use_primitive_replication") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18495>	2022-09-19 09:44:00 +00:00
Dave Airlie	9452e5e03a	lavapipe: fix 3d depth stencil image clearing. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18665>	2022-09-19 17:26:57 +10:00
Mike Blumenkrantz	73797c2f46	zink: use screen interfaces for pipeline barriers Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18628>	2022-09-19 01:42:28 +00:00
Mike Blumenkrantz	8c4aaa154a	zink: add screen interfaces for pipeline barriers this will enable direct calling of the right function without the overhead of having conditionals in the barrier functions themselves eventually, the '2' variants will be widely enough deployed that this can be deleted Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18628>	2022-09-19 01:42:28 +00:00
Mike Blumenkrantz	5a78fe4445	zink: add functions for using '2' variants of pipeline barriers Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18628>	2022-09-19 01:42:28 +00:00
Mike Blumenkrantz	9b0b8cad60	zink: add have_vulkan13 to device info Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18628>	2022-09-19 01:42:28 +00:00

1 2 3 4 5 ...

159993 Commits