AlexIndustrial/mesa

Author	SHA1	Message	Date
Daniel Schürmann	d00fb7ce54	ac: add support for trinary_minmax instructions v2: Add missing break (Bas) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-29 01:29:35 +02:00
Dave Airlie	fe5d5d19b0	spirv: add support for SPV_AMD_shader_trinary_minmax Co-authored-by: Daniel Schürmann <daniel.schuermann@campus.tu-berlin.de> Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-29 01:29:29 +02:00
Dave Airlie	3e830a1af2	nir: add support for min/max/median of 3 srcs These are needed for SPV_AMD_shader_trinary_minmax, the AMD HW supports these. Co-authored-by: Daniel Schürmann <daniel.schuermann@campus.tu-berlin.de> Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-29 01:28:58 +02:00
Marek Olšák	025105453a	radeonsi: simplify DCC format categories Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-28 18:45:52 -04:00
Marek Olšák	3fea237c85	radeonsi: don't use the SPI barrier management bug workaround Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-28 18:45:52 -04:00
Marek Olšák	3045c5f274	radeonsi: use maximum OFFCHIP_BUFFERING on Vega12 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-28 18:45:52 -04:00
Bas Nieuwenhuizen	4503ff760c	ac/nir: Add workaround for GFX9 buffer views. On GFX9 whether the buffer size is interpreted as elements or bytes depends on whether IDXEN is enabled in the instruction. If the index is a constant zero, LLVM optimizes IDXEN to 0. Now the size in elements is interpreted in bytes which of course results in out of bounds accesses. The correct fix is most likely to disable the LLVM optimization, but we need something to work with LLVM <= 6.0. radeonsi does the max between stride and element count on the CPU but that results in the size intrinsics returning the wrong size for the buffer. This would cause CTS errors for radv. v2: Also include the store changes. Fixes: `e38685cc62` 'Revert "radv: disable support for VEGA for now."' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-29 00:03:03 +02:00
Marek Olšák	4f96747530	ac/surface: set AddrSurfInfoIn.format = ADDR_FMT_8 for stencil, add assertions Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105738 Tested-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-03-28 17:23:41 -04:00
Samuel Pitoiset	1c4fdcf444	radv: enable VK_EXT_sampler_filter_minmax Only enable for CIK+ because it's buggy on SI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-03-28 22:55:48 +02:00
Samuel Pitoiset	413d77e7f9	radv: add support for VK_EXT_sampler_filter_minmax The driver only supports the required formats for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-03-28 22:55:48 +02:00
Samuel Pitoiset	99b52aa1da	radv: rename VEGA10 device name Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-03-28 20:15:17 +02:00
Samuel Pitoiset	4d2c46dda3	radv: add support for Vega12 Based on RadeonSI. Untested. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-03-28 20:15:14 +02:00
Dylan Baker	2cfc68d984	autotools: Include intel/dev/meson.build in tarball Fixes: `272bef0601` ("intel: Split gen_device_info out into libintel_dev") Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2018-03-28 10:19:05 -07:00
Marek Olšák	20eb44ad65	radeonsi: add support for Vega12 Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2018-03-28 11:37:43 -04:00
Marek Olšák	5425d32fcf	amd/addrlib: update to the latest version for Vega12 Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2018-03-28 11:37:43 -04:00
Eric Engestrom	431a1d12cc	gbm: remove never-implemented function I assume this was implemented in a previous version of that commit, but was removed in the version that actually landed. Fixes: `8430af5ebe` "Add support for swrast to the DRM EGL platform" Cc: Giovanni Campagna <gcampagna@src.gnome.org> Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-03-28 16:25:52 +01:00
Stefan Schake	77ade10c86	android: Use new nir intrinsics python scripts Fixes: `76dfed8ae2` ("nir: mako all the intrinsics") Signed-off-by: Stefan Schake <stschake@gmail.com> Acked-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-03-28 14:48:47 +03:00
Eric Anholt	a691fa4a1b	broadcom/vc5: Fix padding of NPOT miplevels >= 2. The power-of-two padded size that gets minified is based on level 1's dimensions, not level 0's, which starts to differ at a width of 9. Fixes all failures on texelFetch fs sampler2D 1x1x1-64x64x1	2018-03-27 21:16:23 -07:00
Timothy Arceri	92fa89a08d	ac/radeonsi: pass bindless bool to load_sampler_desc() We also fix the base_index for bindless by using the driver location. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 12:56:16 +11:00
Timothy Arceri	5411b98d52	st/glsl_to_nir: set driver location for bindless images and samplers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 12:56:15 +11:00
Timothy Arceri	f94b6b79be	radeonsi/nir: set uses_bindless_samplers for samplers Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 12:56:15 +11:00
Timothy Arceri	5c810a2c05	nir: add bindless to nir data Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-03-28 12:56:15 +11:00
Kenneth Graunke	fb18d0dbe4	i965: Drop unnecessary bo->align field. bo->align is always 0; there's no need to waste 8 bytes storing it. Thanks to C99 initializers zeroing fields, we can completely drop the only read of the field altogether. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-03-27 18:41:44 -07:00
Kenneth Graunke	037d738a23	i965: Drop unused alignment parameter from brw_bo_alloc(). brw_bo_alloc no longer uses this parameter, so there's no point. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-03-27 18:41:44 -07:00
Kenneth Graunke	07ec3a2e0f	i965: Drop alignment parameter from bo_alloc_internal(). Buffers are always page aligned on 965+ hardware; I believe this extra parameter is a vestige from the Gen2-3 era. All callers pass 0, and in fact we assert that the alignment is 0 unless BO_ALLOC_BUSY is set (for some reason). We can just drop the parameter and set the value to 0 explicitly. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-03-27 18:41:44 -07:00
Kenneth Graunke	b9a54b18f6	i965: Drop BO_ALLOC_BUSY in intel_miptree_create_for_bo(). intel_miptree_create_for_bo does not actually allocate a BO, so specifying allocation flags accomplishes nothing and is confusing. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-03-27 18:41:44 -07:00
Kenneth Graunke	2c01215c1b	i965: Drop PIPE_CONTROL_NO_WRITE from various calls. This is just zero - passing nothing already gives us a post-sync operation of "nothing". Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-03-27 18:41:44 -07:00
Jason Ekstrand	5f21a7afe0	nir/intrinsics: Don't report negative dest_components I have no idea why but having dest_components == -1 was causing a memory leak somewhere. Without this, you can't get through a full shader-db run without running out of memory. Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-03-27 18:18:26 -07:00
Jason Ekstrand	7e38f49a8f	intel/fs: Don't emit a des copy for image ops with has_dest == false This was causing us to walk dest_components times over a thing with no destination. This happened to work because all of the image intrinsics without a destination also happened to have dest_components == 0. We shouldn't be reading dest_components if has_dest == false. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-03-27 18:18:21 -07:00
Ilia Mirkin	776e6af879	nvc0/ir: fix INTERP_* with indirect inputs There were two problems, both of which are fixed now: - The indirect address was not being shifted by 4 - The indirect address was being placed as an argument in the offset case This fixes some of the new interpolateAt* piglits which now test for these situations. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-03-27 20:41:11 -04:00
Timothy Arceri	629ee690ad	nir: fix crash in loop unroll corner case When an if nesting inside anouther if is optimised away we can end up with a loop terminator and following block that looks like this: if ssa_596 { block block_5: /* preds: block_4 / vec1 32 ssa_601 = load_const (0xffffffff / -nan /) break / succs: block_8 / } else { block block_6: / preds: block_4 / / succs: block_7 / } block block_7: / preds: block_6 */ vec1 32 ssa_602 = phi block_6: ssa_552 vec1 32 ssa_603 = phi block_6: ssa_553 vec1 32 ssa_604 = iadd ssa_551, ssa_66 The problem is the phis. Loop unrolling expects the last block in the loop to be empty once we splice the instructions in the last block into the continue branch. The problem is we cant move phis so here we lower the phis to regs when preparing the loop for unrolling. As it could be possible to have multiple additional blocks/ifs following the terminator we just convert all phis at the top level of the loop body for simplicity. We also add some comments to loop_prepare_for_unroll() while we are here. Fixes: `51daccb289` "nir: add a loop unrolling pass" Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105670	2018-03-28 09:59:38 +11:00
Timothy Arceri	48f6014903	st/glsl_to_nir: correctly handle arrays packed across multiple vars Fixes piglit test: tests/spec/arb_enhanced_layouts/execution/component-layout/vs-fs-array-interleave-range.shader_test Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 09:59:38 +11:00
Timothy Arceri	b260efbd5e	radeonsi/nir: fix input processing for packed varyings The location was only being incremented the first time we processed a location. This meant we would incorrectly skip some elements of an array if the first element was packed and proccessed previously but other elements were not. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 09:59:38 +11:00
Timothy Arceri	51f175028d	ac/nir_to_llvm: fix component packing for double outputs We need to wait until after the writemask is widened before we adjust it for component packing. Together with the previous patch this fixes a number of arb_enhanced_layouts component layout piglit tests. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 09:59:37 +11:00
Timothy Arceri	fc51fdbcde	st/glsl_to_nir: fix driver location for dual-slot packed doubles Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 09:59:37 +11:00
Timothy Arceri	47eee04556	radeonsi/nir: fix scanning of multi-slot output varyings This fixes tcs/tes varying arrays where we dont lower indirects and therefore don't split arrays. Here we also fix useagemask for dual slot doubles. Fixes a number of arb_tessellation_shader piglit tests. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-03-28 09:59:37 +11:00
Eric Anholt	9f1b4f6204	broadcom/vc5: Fix RG16I/UI texture sampling. How many times did I look at this table without noticing the missing 'G' in the texture column? Fixes KHR-GLES3.copy_tex_image_conversions.required.* on 7268.	2018-03-27 15:49:58 -07:00
Rob Clark	16581904b0	nir: fix generated nir_intrinsics.c for MSVC Apparently it is not happy about things like: .foo = {} So skip over initializers for empty lists. Fixes: `76dfed8ae2` Reported-by: Roland Scheidegger <sroland@vmware.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-03-27 15:01:11 -04:00
Rob Clark	76dfed8ae2	nir: mako all the intrinsics I threatened to do this a long time ago.. I probably should have done it a long time ago when there where many fewer intrinsics. But the system of macro/#include magic for dealing with intrinsics is a bit annoying, and python has the nice property of optional fxn params, making it possible to define new intrinsics while ignoring parameters that are not applicable (and naming optional params). And not having to specify various array lengths explicitly is nice too. I think the end result makes it easier to add new intrinsics. v2: couple small fixes found with a test program to compare the old and new tables v3: misc comments, don't rely on capture=true for meson.build, get rid of system_values table to avoid return value of intrinsic() and mostly remove side-effects, add autotools build support v4: scons build Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2018-03-27 08:36:37 -04:00
Rob Clark	cc3a88e81d	nir: fix per_vertex_output intrinsic This is supposed to have both BASE and COMPONENT but num_indices was inadvertantly set to 1. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-03-27 08:20:40 -04:00
Rob Clark	1e0a06000b	glsl_types: fix build break with intel/msvc compiler The VECN() macro was taking advantage of a GCC specific feature that is not available on lesser compilers, mostly for the purposes of avoiding a macro that encoded a return statement. But as suggested by Ian, we could just have the macro produce the entire method body and avoid the need for this. So let's do that instead. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105740 Fixes: `f407edf340` Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Timothy Arceri <tarceri@itsqueeze.com> Cc: Roland Scheidegger <sroland@vmware.com> Cc: Ian Romanick <idr@freedesktop.org> Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-03-27 08:17:11 -04:00
Lin Johnson	41cf30b8bc	mesa: add GL_HALF_FLOAT as supported type to readpixels EXT_color_buffer_float spec states: "An INVALID_OPERATION error is generated ... if the color buffer is a floating-point format and type is not FLOAT, HALF FLOAT, or UNSIGNED_INT_10F_11F_11F_REV." This means that GL_HALF_FLOAT type should be supported when color buffer has floating-point format. Fixes Android CTS test android.view.cts.PixelCopyTest. v2: remove comments of EXT_color_buffer_half_float as EXT_color_buffer_float can use type GL_HALF_FLOAT Signed-off-by: Lin Johnson <johnson.lin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-03-27 09:04:52 +03:00
Eric Anholt	0024b77e87	broadcom/vc5: Fix swizzling of RGB10_A2UI render targets. This is the actual hardware layout, and we were only swizzling R/B back around in texturing. Fixes part of KHR-GLES3.copy_tex_image_conversions.required.cubemap_negx_cubemap_negx in simulation.	2018-03-26 17:46:23 -07:00
Eric Anholt	c2b13627d9	broadcom/vc5: Fix extraneous register index in QIR dumping of TLBU writes. Just like TLB without a config uniform, we don't have a register index.	2018-03-26 17:46:23 -07:00
Eric Anholt	494da6c2dd	broadcom/vc5: Implement workaround for GFXH-1431. This should fix some blending errors, but doesn't impact any testcases in the CTS.	2018-03-26 17:46:19 -07:00
Eric Anholt	1bf466270d	broadcom/vc5: Fix EZ disabling and allow using GT/GE direction as well. Once we've disabled EZ for some draws, we need to not use EZ on future draws. Implementing that made implementing the GT/GE direction trivial. Fixes KHR-GLES3.shaders.fragdepth.compare.no_write on V3D 4.1 simulation.	2018-03-26 17:46:19 -07:00
Eric Anholt	262208eb3c	broadcom/vc5: Disable TF on V3D 4.x when drawing with queries disabled. On 3.x, we just don't flag the primitive as needing TF, but those primitive bits are now allocated to the new primitive types. Now we need to actually update the enable flag at draw time.	2018-03-26 17:46:19 -07:00
Eric Anholt	ef2cf9cc3c	broadcom/vc5: Disable transform feedback on V3D 4.x at the end of the job. The next job from this client will turn it back on unless TF gets disabled, but we don't want the state to leak from this client to another (which causes GPU hangs).	2018-03-26 17:46:19 -07:00
Eric Anholt	1fa820cef8	broadcom/vc5: Move the BCL epilogue code to a per-version compile. I need to do some new packets for transform feedback on 4.1.	2018-03-26 17:46:19 -07:00
Eric Anholt	3387864130	broadcom/vc5: Fix transform feedback in the presence of point size. I had this note to myself, and it turns out that a lot of CTS tests use XFB with points to get data out without using a fragment shader. Keep track of two sets of precomputed TF specs (point size in VPM prologue or not), and switch between them when we enable/disable point size.	2018-03-26 17:46:19 -07:00

1 2 3 4 5 ...

93219 Commits