AlexIndustrial/mesa

Author	SHA1	Message	Date
Bas Nieuwenhuizen	688f5e456a	radv: Disable depth clamping even without EXT_depth_range_unrestricted. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-20 23:24:31 +00:00
Bas Nieuwenhuizen	9f7e0523ce	radv: Implement VK_EXT_depth_clip_enable. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-20 23:24:31 +00:00
Bas Nieuwenhuizen	1ef2855692	radv: Handle clip+cull distances more generally as compact arrays. Needed for https://gitlab.freedesktop.org/mesa/mesa/merge_requests/248 . That MR keeps the clip and cull arrays split. So we have to handle - compact arrays with location_frac != 0 - VARYING_SLOT_CLIP_DIST1 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-20 22:49:52 +00:00
Bas Nieuwenhuizen	572854e706	radv: Clean up a bunch of compiler warnings. Random unused vars. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-02-20 03:21:09 +01:00
Bas Nieuwenhuizen	7631feaa00	radv: Sync ETC2 whitelisted devices. Fixes: `4bb6c49375` "radv: Allow ETC2 on RAVEN and VEGA10 instead of all GFX9." Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-02-20 02:55:41 +01:00
Kenneth Graunke	ba7519ca36	radeonsi: Go back to using llvm.pow intrinsic for nir_op_fpow ARB_vertex_program and ARB_fragment_program define 0^0 = 1 (while GLSL leaves it undefined). Performing fpow lowering in NIR would break this behavior, preventing us from using prog_to_nir. According to llvm/lib/Target/AMDGPU/SIInstructions.td, POW_common expands to <V_LOG_F32_e32, V_EXP_F32_e32, V_MUL_LEGACY_F32_e32>, which presumably does a zero-wins multiply. Lowering in NIR results in a non-legacy multiply, where: pow(0, 0) = 2^(log2(0) * 0) = 2^(-INF * 0) = 2^(-NaN) = -NaN which isn't the desired result. This reverts: - commit `d6b7539206` (ac/nir: remove emission of nir_op_fpow) - commit `22430224fe` (radeonsi/nir: enable lowering of fpow) and prevents a regression in gl-1.0-spot-light with AMD_DEBUG=nir after enabling prog_to_nir in st/mesa later in this series. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-02-19 15:56:19 -08:00
Rhys Perry	238730daef	ac/nir: implement half-float nir_op_ldexp Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:04:46 +00:00
Rhys Perry	6971e8d342	ac/nir: implement half-float nir_op_frsq v2: don't use ac_get_onef() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:04:41 +00:00
Rhys Perry	2038aec22a	ac/nir: implement half-float nir_op_frcp v2: don't use ac_get_onef() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:04:35 +00:00
Rhys Perry	4261edc067	ac/nir: make ac_build_fdiv support 16-bit floats v2: don't use ac_get_onef() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:04:29 +00:00
Rhys Perry	6790b3a8db	ac/nir: make ac_build_isign work on all bit sizes v2: don't use ac_get_zero(), ac_get_one() and ac_int_of_size() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:04:20 +00:00
Rhys Perry	bbbfdef683	ac/nir: make ac_build_clamp work on all bit sizes v2: don't use ac_get_zerof() and ac_get_onef() v3: rename "intr" to "name" Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:03:58 +00:00
Rhys Perry	7e5004e30a	ac/nir: fix 64-bit nir_op_f2f16_rtz Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:03:44 +00:00
Rhys Perry	c4ea20c0a0	ac/nir: implement 8-bit nir_load_const_instr Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:03:33 +00:00
Rhys Perry	0ca550e01a	radv: ensure export arguments are always float So that the signature is correct and consistent, the inputs to a export intrinsic should always be 32-bit floats. This and the previous commit fixes a large amount crashes from dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.input_output_int_* tests Fixes: `b722b29f10` ('radv: add support for 16bit input/output') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:03:22 +00:00
Rhys Perry	64065aa504	radv: bitcast 16-bit outputs to integers 16-bit outputs are stored as 16-bit floats in the outputs array, so they have to be bitcast. Fixes: `b722b29f10` ('radv: add support for 16bit input/output') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-19 11:03:18 +00:00
Samuel Pitoiset	47616810ed	radv: fix writing the alpha channel of MRT0 when alpha coverage is enabled This version is better and safer. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-18 18:06:07 +01:00
Samuel Pitoiset	32ab7a59bb	radv: remove unused variable in gather_push_constant_info() Trivial. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-18 13:30:16 +01:00
Samuel Pitoiset	0d8f096293	radv: write the alpha channel of MRT0 when alpha coverage is enabled Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109597 Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-18 12:14:22 +01:00
Samuel Pitoiset	2cf5433b99	ac: use new LLVM 8 intrinsic when loading 16-bit values Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-18 12:14:20 +01:00
Samuel Pitoiset	f0223143a8	ac: add ac_build_llvm8_tbuffer_load() helper It uses the new LLVM intrinsics. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-18 12:14:17 +01:00
Samuel Pitoiset	52bdb043af	radv: fix invalid element type when filling vertex input default values The elements added into a vector should have the same type as the first one, otherwise this hits an assertion in LLVM. Fixes: `4b3549c084` ("radv: reduce the number of loaded channels for vertex input fetches") reported-by: Philip Rebohle <philip.rebohle@tu-dortmund.de> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-16 15:33:18 +01:00
Bas Nieuwenhuizen	4b03a19a0b	radv: Use correct num formats to detect whether we should be use 1.0 or 1. normalized and scaled formats also return floats. Fixes: `4b3549c084` ("radv: reduce the number of loaded channels for vertex input fetches") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-15 20:24:16 +00:00
Samuel Pitoiset	227df98fa6	radv: fix radv_fixup_vertex_input_fetches() We should check that num_channels is 4, otherwise that breaks the world. Sorry for the short breakage. Fixes: `4b3549c084` ("radv: reduce the number of loaded channels for vertex input fetches") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-14 09:44:35 +01:00
Samuel Pitoiset	4b3549c084	radv: reduce the number of loaded channels for vertex input fetches It's unnecessary to load more channels than the vertex attribute format. The remaining channels are filled with 0 for y and z, and 1 for w. 29077 shaders in 15096 tests Totals: SGPRS: 1321605 -> 1318869 (-0.21 %) VGPRS: 935236 -> 932252 (-0.32 %) Spilled SGPRs: 24860 -> 24776 (-0.34 %) Code Size: 49832348 -> 49819464 (-0.03 %) bytes Max Waves: 242101 -> 242611 (0.21 %) Totals from affected shaders: SGPRS: 93675 -> 90939 (-2.92 %) VGPRS: 58016 -> 55032 (-5.14 %) Spilled SGPRs: 172 -> 88 (-48.84 %) Code Size: 2862740 -> 2849856 (-0.45 %) bytes Max Waves: 15474 -> 15984 (3.30 %) This mostly helps Croteam games (Talos/Sam2017). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-14 09:10:56 +01:00
Samuel Pitoiset	210aec3612	radv: store vertex attribute formats as pipeline keys The formats will be used for reducing the number of loaded channels. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-14 09:10:09 +01:00
Samuel Pitoiset	45382baef6	radv: use MAX_{VBS,VERTEX_ATTRIBS} when defining max vertex input limits Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-14 09:09:51 +01:00
Samuel Pitoiset	2154fac6f3	ac: make use of ac_build_expand_to_vec4() in visit_image_store() And make ac_build_expand() a static function. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-14 09:09:48 +01:00
Samuel Pitoiset	334da034d8	radv: always export gl_SampleMask when the fragment shader uses it For some reasons, this breaks trees rendering in Project Cars. Fixes: `85010585cd` ("radv: only enable gl_SampleMask if MSAA is enabled too") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109401 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-13 23:01:30 +01:00
Samuel Pitoiset	5e18000d1b	radv/winsys: fix BO list creation when RADV_DEBUG=allbos is set Fixes: `50fd253bd6` ("radv/winsys: Add priority handling during submit.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-13 20:51:40 +01:00
Samuel Pitoiset	1b8983c25b	radv: fix using LOAD_CONTEXT_REG with old GFX ME firmwares on GFX8 This fixes a critical issue. Cc: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109575 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-12 17:39:30 +01:00
Samuel Pitoiset	bd1186572f	radv: add support for push constants inlining when possible This removes some scalar loads from shaders, but it increases the number of SET_SH_REG packets. This is currently basic but it could be improved if needed. Inlining dynamic offsets might also help. Original idea from Dave Airlie. 29077 shaders in 15096 tests Totals: SGPRS: 1321325 -> 1357101 (2.71 %) VGPRS: 936000 -> 932576 (-0.37 %) Spilled SGPRs: 24804 -> 24791 (-0.05 %) Code Size: 49827960 -> 49642232 (-0.37 %) bytes Max Waves: 242007 -> 242700 (0.29 %) Totals from affected shaders: SGPRS: 290989 -> 326765 (12.29 %) VGPRS: 244680 -> 241256 (-1.40 %) Spilled SGPRs: 1442 -> 1429 (-0.90 %) Code Size: 8126688 -> 7940960 (-2.29 %) bytes Max Waves: 80952 -> 81645 (0.86 %) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-12 17:25:54 +01:00
Samuel Pitoiset	8364ffe823	radv: keep track of the number of remaining user SGPRs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-12 17:25:52 +01:00
Samuel Pitoiset	5f9379ca35	radv: gather if shaders load dynamic offsets separately Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-12 17:25:49 +01:00
Samuel Pitoiset	5806d99984	radv: gather more info about push constants This is needed in order to inline some push constants when possible. This also adds a new helper for initializing the pass. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-12 17:25:34 +01:00
Samuel Pitoiset	129a9f4937	radv: fix compiler issues with GCC 9 "The C standard says that compound literals which occur inside of the body of a function have automatic storage duration associated with the enclosing block. Older GCC releases were putting such compound literals into the scope of the whole function, so their lifetime actually ended at the end of containing function. This has been fixed in GCC 9. Code that relied on this extended lifetime needs to be fixed, move the compound literals to whatever scope they need to accessible in." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109543 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Gustaw Smolarczyk <wielkiegie@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-12 14:48:08 +01:00
Dave Airlie	90c6880df7	radv: remove alloc parameter from pipeline init clang points out this isn't used. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-11 10:04:40 +10:00
Dave Airlie	a523ae0cac	radv/llvm: initialise passes member. Fixes coverity warning Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-11 08:59:02 +10:00
Lionel Landwerlin	f1bcb9be46	radv: assert that colorAttachment is valid for CmdClearAttachment This partially reverts a change from `b7a93cbded` ("radv: Handle VK_ATTACHMENT_UNUSED in CmdClearAttachment") which fixed actual issues but also started to accept invalid values for the colorAttachment field. This change asserts that the field is valid for the current pass. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b7a93cbded` ("radv: Handle VK_ATTACHMENT_UNUSED in CmdClearAttachment") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-02-08 00:18:16 +00:00
Bas Nieuwenhuizen	13ab63bb62	radv: Implement VK_EXT_buffer_device_address. v2: Also update the release notes. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:37:38 +01:00
Bas Nieuwenhuizen	3259e7b036	radv: Do not use the bo list for local buffers. The kernel already does it for us. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:36:19 +01:00
Bas Nieuwenhuizen	8a15950211	amd/common: Implement global memory accesses. Needed for VK_EXT_buffer_device_address. The pointers are implmemented as i8*, since I could not figure out how to emulate setting struct offsets in LLVM based on the SPIR-V offsets (and more weird stuff like row major matrices). Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:36:11 +01:00
Bas Nieuwenhuizen	5703ecf651	amd/common: Do not use 32-bit loads for shared memory. We use a straight glsl->llvm type conversion so types should already be right. Also even though the writemasks were changed we we not actually doing 32-bit things, so this fails miserably. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:36:06 +01:00
Bas Nieuwenhuizen	8d1718590b	amd/common: handle nir_deref_cast for shared memory from integers. Can happen e.g. after a phi. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:36:02 +01:00
Bas Nieuwenhuizen	830fd0efc1	amd/common: Handle nir_deref_type_ptr_as_array for shared memory. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:35:58 +01:00
Bas Nieuwenhuizen	dbdb44d575	amd/common: Fix stores to derefs with unknown variable. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:35:54 +01:00
Bas Nieuwenhuizen	3c24fc64c7	amd/common: Use correct writemask for shared memory stores. The check was for 1 bit being set, which is clearly not what we want. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:35:49 +01:00
Bas Nieuwenhuizen	00253ab2c4	radv: Fix the shader info pass for not having the variable. For example with VK_EXT_buffer_device_address or VK_KHR_variable_pointers. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:35:45 +01:00
Bas Nieuwenhuizen	58c8dadd32	amd/common: Implement ptr->int casts in ac_to_integer. For the implicit casts inherent in nir. This should probably have been done for shared memory for VK_KHR_variable_pointers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:35:40 +01:00
Bas Nieuwenhuizen	e00d9a9a72	amd/common: Add gep helper for pointer increment. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-06 22:35:36 +01:00

1 2 3 4 5 ...

3065 Commits