AlexIndustrial/mesa

Author	SHA1	Message	Date
Jason Ekstrand	f97e251ab2	intel/genxml/bits: Emit per-field _start helpers Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-04-07 22:34:04 -07:00
Jason Ekstrand	430e697868	intel/genxml/bits: Pull the function emit code into a helper block The helper block is extremely general. It takes an string property name and an object that supports three methods: has_prop, iter_prop, and get_prop. This way we can easily generalize it to emit more different types of getter functions. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-04-07 22:34:04 -07:00
Jason Ekstrand	2d52e65d03	intel/genxml/bits: Refactor to add a container class Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-04-07 22:34:04 -07:00
Jason Ekstrand	bc68aa42bd	anv: Use subpass dependencies for flushes Instead of figuring it all out ourselves, just use the information given to us by the client. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-07 19:24:14 -07:00
Jason Ekstrand	e5bbf8be36	anv/pass: Record required pipe flushes Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-07 19:24:14 -07:00
Jason Ekstrand	0039d0cf27	anv/pass: Use anv_multialloc for allocating the anv_pass Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-07 19:24:14 -07:00
Jason Ekstrand	415633a722	anv/descriptor_set: Use anv_multialloc for descriptor set layouts Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-07 19:24:14 -07:00
Jason Ekstrand	e5c29b8c27	anv: Add a helper for doing mass allocations We tend to try to reduce the number of allocation calls the Vulkan driver uses by doing a single allocation whenever possible for a data structure. While this has certain downsides (usually code complexity), it does mean error handling and cleanup is much easier. This commit adds a nice little helper struct for getting rid of some of that complexity. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-07 19:24:14 -07:00
Jason Ekstrand	82695c32b6	anv: Add helpers for converting access flags to pipe bits Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-07 19:24:14 -07:00
Jason Ekstrand	4e17b59f6c	anv/query: Use snooping on !LLC platforms Commit `b2c97bc789` which made us start using a busy-wait for individual query results also messed up cache flushing on !LLC platforms. For one thing, I forgot the mfence after the clflush so memory access wasn't properly getting fenced. More importantly, however, was that we were clflushing the whole query range and then waiting for individual queries and then trying to read the results without clflushing again. Getting the clflushing both correct and efficient is very subtle and painful. Instead, let's side-step the problem by just snooping. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2017-04-07 12:17:20 -07:00
Emil Velikov	5318d1ff94	anv: provide anv_gem_busy() stub for the tests Otherwise linking way fail. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100600 Fixes: `f195d40eca` ("anv/device: Add a helper for querying whether a BO is busy") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com> Tested-by: Vinson Lee <vlee@freedesktop.org>	2017-04-07 19:45:58 +01:00
Samuel Iglesias Gonsálvez	1c934bc71b	anv/blorp: sample input attachments with resolves on BDW On Broadwell we still need to do a resolve between the subpass that writes and the subpass that reads when there is a self-dependency because HW could not see fast-clears and works on the render cache as if there was regular non-fast-clear surface. Fixes 16 tests on BDW: dEQP-VK.renderpass.formats..input.clear.store.self_dep Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-04-07 07:49:43 +02:00
Jordan Justen	0370350d11	intel/aubinator: Stop searching after a custom handler is found Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-04-06 13:26:08 -07:00
Jordan Justen	d5bd0e411e	intel/gen_decoder: return -1 for unknown command formats Decoding with aubinator encountered a command of 0xffffffff. With the previous code, it caused aubinator to jump 255 + 2 dwords to start decoding again. Instead we can attempt to detect the known instruction formats. If the format is not recognized, then we can advance just 1 dword. v2: * Update aubinator_error_decode * Actually convert the length variable returned into a signed integer in aubinator.c, intel_batchbuffer.c and aubinator_error_decode.c. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-04-06 13:26:08 -07:00
Jordan Justen	7c33372f82	intel/gen_decoder: Fix length for Media State/Object commands From BDW PRM, Volume 6: Command Stream Programming, 'Render Command Header Format'. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-04-06 13:26:08 -07:00
Jordan Justen	3c77a57222	intel/aubinator_error_decode: Fix structure decode data The call to gen_print_group should provide a pointer to the beginning of the the structure data, not the start of the batch data. Cc: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-04-06 13:25:38 -07:00
Jason Ekstrand	b2c97bc789	anv/query: Busy-wait for available query entries Before, we were just looking at whether or not the user wanted us to wait and waiting on the BO. Some clients, such as the Serious engine, use a single query pool for hundreds of individual query results where the writes for those queries may be split across several command buffers. In this scenario, the individual query we're looking for may become available long before the BO is idle so waiting on the query pool BO to be finished is wasteful. This commit makes us instead busy-loop on each query until it's available. This significantly reduces pipeline bubbles and improves performance of The Talos Principle on medium settings (where the GPU isn't overloaded with drawing) by around 20% on my SkyLake gt4. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Tested-by: Eero Tamminen <eero.t.tamminen@intel.com> Tested-by: Grazvydas Ignotas <notasas@gmail.com>	2017-04-05 21:17:11 -07:00
Jason Ekstrand	f195d40eca	anv/device: Add a helper for querying whether a BO is busy This is a bit more efficient than using GEM_WAIT with a timeout of 0. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2017-04-05 21:17:11 -07:00
Emil Velikov	a6840efc09	anv: provide required gem stubs for the tests Introduce stubs to anv_gem_stub.c that match the anv_gem.c ones. Otherwise we may get link-time errors, when building the tests. v2: Introduce all the missing stubs at once. Cc: Jason Ekstrand <jason@jlekstrand.net> Cc: Vinson Lee <vlee@freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100574 Fixes: `c964f0e485` ("anv: Query the kernel for reset status") Fixes: `651ec926fc` ("anv: Add support for 48-bit addresses") Fixes: `060a6434ec` ("anv: Advertise larger heap sizes") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> --- I've intentionally kept the order the same identical to the anv_gem.c. This way we can easily grep & diff in the future ;-)	2017-04-05 17:54:38 +01:00
Emil Velikov	e664cfc5a7	intel: genxml: automake: include gen_bits_header.py in the tarball Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-04-05 13:16:28 +01:00
Emil Velikov	e180680980	intel: genxml: automake: polish automake rules Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-04-05 13:16:28 +01:00
Jason Ekstrand	060a6434ec	anv: Advertise larger heap sizes Instead of just advertising the aperture size, we do something more intelligent. On systems with a full 48-bit PPGTT, we can address 100% of the available system RAM from the GPU. In order to keep clients from burning 100% of your available RAM for graphics resources, we have a nice little heuristic (which has received exactly zero tuning) to keep things under a reasonable level of control. Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	651ec926fc	anv: Add support for 48-bit addresses This commit adds support for using the full 48-bit address space on Broadwell and newer hardware. Thanks to certain limitations, not all objects can be placed above the 32-bit boundary. In particular, general and state base address need to live within 32 bits. (See also Wa32bitGeneralStateOffset and Wa32bitInstructionBaseOffset.) In order to handle this, we add a supports_48bit_address field to anv_bo and only set EXEC_OBJECT_SUPPORTS_48B_ADDRESS if that bit is set. We set the bit for all client-allocated memory objects but leave it false for driver-allocated objects. While this is more conservative than needed, all driver allocations should easily fit in the first 32 bits of address space and keeps things simple because we don't have to think about whether or not any given one of our allocation data structures will be used in a 48-bit-unsafe way. Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	439da38d18	anv: Replace anv_bo::is_winsys_bo with a uint32_t flags Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	5d1ba2cb04	anv/blorp: Align vertex buffers to 64B This fixes issues seen when adding support for full 48-bit addresses. The 48-bit addresses themselves have nothing to do with it other than that it caused the kernel to place buffers slightly differently so they interacted differently with the caches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	c964f0e485	anv: Query the kernel for reset status When a client causes a GPU hang (or experiences issues due to a hang in another client) we want to let it know as soon as possible. In particular, if it submits work with a fence and calls vkWaitForFences or vkQueueQaitIdle and it returns VK_SUCCESS, then the client should be able to trust the results of that rendering. In order to provide this guarantee, we have to ask the kernel for context status in a few key locations. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	82573d0f75	anv: Check for device loss at the end of WaitForFences It's possible that the device could have been lost while we were waiting. We should let the user know if this has happened. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-04 18:33:51 -07:00
Jason Ekstrand	c6f69eea6a	anv/pipeline: Properly handle unset gl_Layer and gl_ViewportIndex When the shader does not set one of these values, they are supposed to get a default value of 0. We have hardware bits in 3DSTATE_CLIP for this but haven't been setting them. This fixes the intermittent failure of dEQP-VK.geometry.layered.3d.render_to_default_layer. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:51 -07:00
Jason Ekstrand	3503b2714b	i965/fs: Always provide a default LOD of 0 for TXS and TXL We already provide a default LOD for textureQueryLevels and texture() on non-fragment stages. However, there are more cases where one is needed such as textureSize(gsampler2DMS*) in SPIR-V. Instead of trying to list out all of the cases one at a time, just provide the default for all TXS and TXL operations. This fixes a shader validation error in the new Sascha deferredmultisampling demo which uses textureSize(gsampler2DMS). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100391 Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:35 -07:00
Jason Ekstrand	1fde054b8f	intel/isl: Refactor and clerify gen8 alignment calculations Adding the actual table from the docs makes it clearer exactly what the restrictions are. In particular, it becomes clear that compressed textures ignore the alignment parameters in RENDER_SURFACE_STATE. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-04 14:51:57 -07:00
Lionel Landwerlin	e8d9b76f63	intel: tools: add aubinator_error_decode tool This is pretty much the same tool as what i-g-t has, only with a more fancy decoding of the instructions/registers. It also doesn't support anything before gen4. v2 (from Matt): Drop authors Remove undefined automake variable v3: Fix incorrect offsets for dword > 1 (Jordan) v4: Fix decompression error with large blobs (Jordan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	567d77885e	intel: genxml: add RING_BUFFER_CTL registers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	6f260ff049	intel: genxml: add FAULT_REG register Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	ca2771fa18	intel: genxml: add gen7 ERR_INT register v2: add register to gen7.5 (Matt) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	84613bf6d5	intel: genxml: add ACTHD registers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	0f195f22aa	intel: genxml: add GFX_ARB_ERROR_RPT register Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	d1a7a54d77	intel: genxml: add INSTDONE registers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Mauro Rossi	72175bd2a5	android: intel: genxml: fix genX_xml.h generation rules Recent changes in Makefile.sources merged the aubinator files in a unique list of generated files and genxml/genX_xml.h is now needed to avoid the following building error: ninja: error: '.../genxml/genX_xml.h', needed by '.../genxml/genX_xml.h', missing and no known rule to make it build/core/ninja.mk:148: recipe for target 'ninja_wrapper' failed Fixes: `0f83c05` "intel: genxml: compress all gen files into one" Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-04-04 09:10:46 +03:00
Jason Ekstrand	405ef7bb33	intel/vec4: Add some fall through comments Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-03 16:58:35 -07:00
Jason Ekstrand	0817110969	anv: Implement VK_KHR_incremental_present Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2017-04-03 13:51:08 -07:00
Jason Ekstrand	f82b6c6272	vulkan/wsi: Plumb present regions through the common code Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Acked-by: Dave Airlie <airlied@redhat.com>	2017-04-03 13:51:08 -07:00
Lionel Landwerlin	471c1bc7cc	aubinator/gen_decoder/i965: decode instructions from dword 0 Some packets like 3DSTATE_VF_STATISTICS, 3DSTATE_DRAWING_RECTANGLE, 3DPRIMITIVE, PIPELINE_SELECT, etc... have configurable fields in dword0, we probably want to print those. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-03 20:45:34 +01:00
Lionel Landwerlin	04f2e80257	intel: gen_decoder: store pointer to current decoded field in iterator Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-03 20:45:34 +01:00
Lionel Landwerlin	74a80d579d	intel: genxml: fix out of tree builds v2: use Emil's recommendation change rule to closer to genxml/genX_bits.h Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-03-31 15:29:57 +01:00
Tapani Pälli	3535b87a1a	anv: change BLOCK_POOL_MEMFD_SIZE to 1GB This allows us to run 32bit Vulkan apps on Android, ftruncate call would fail on 2GB (max size being 2GB - 1). Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-31 08:43:28 +03:00
Tapani Pälli	2398770c87	android: add libmesa_genxml as dep to libmesa_isl This is to fix following compile error with libmesa_isl: mesa/src/intel/isl/isl.c:28:10: fatal error: 'genxml/genX_bits.h' file not found Fixes: `f0eaf38` ("genxml: New generated header genX_bits.h (v6)") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emli Velikov <emil.velikov@collabora.com>	2017-03-31 08:42:54 +03:00
Lionel Landwerlin	469da094e1	aubinator: enable snb/ilk through --gen Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2017-03-31 01:25:33 +01:00
Lionel Landwerlin	0f83c05149	intel: genxml: compress all gen files into one Combining all the files into a single string didn't make any difference in the size of the aubinator binary. With this change we now also embed gen4/4.5/5 descriptions, which increases the aubinator size by ~16Kb. v2 (Lionel): rebase makefiles Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2017-03-31 01:24:56 +01:00
Kenneth Graunke	e113dfabad	intel: Add INTEL_CFLAGS to aubinator CFLAGS. It still needs intel_aub.h. Fixes the build.	2017-03-30 11:58:00 -07:00
Emil Velikov	3df993e1a2	intel: automake: move INTEL_CFLAGS as applicable Only common/decoder.[ch] requires it [for intel_aub.h]. v2: The code was moved to from intel/tools to intel/common, update accordingly. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-03-30 19:07:28 +01:00

1 2 3 4 5 ...

1549 Commits