AlexIndustrial/mesa

Author	SHA1	Message	Date
Marek Olšák	fc0416ef5d	radeonsi: unify CP DMA preparation logic Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-07 10:22:13 +01:00
Marek Olšák	89da3b4458	radeonsi: unify CP DMA code determining various flags v2: don't call get_flush_flags twice per function Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-07 10:22:12 +01:00
Marek Olšák	c3e527f93d	radeonsi: only enable write confirmation on the last CP DMA packet This should improve performance for big copies that need to be split. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-07 10:22:12 +01:00
Ilia Mirkin	8e9ade7eb3	nv50/ir: allow emission of immediates in imul/imad ops Nothing actually uses this yet (due to complications), but the emission logic is right. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-07 00:42:15 -05:00
Jason Ekstrand	a10d59c09a	nir/spirv: Increment num_ubos/ssbos when creating variables	2015-11-06 16:53:27 -08:00
Jason Ekstrand	046563167c	anv/apply_dynamic_offsets: Use the right sized immediate zero	2015-11-06 16:49:24 -08:00
Ilia Mirkin	393d0c336b	nv50/ir: properly set the type of the constant folding result This removes the hack used for merge, which only covers a fraction of the cases. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 19:39:32 -05:00
Ilia Mirkin	2f9aaed749	nv50/ir: add support for const-folding OP_CVT with F64 source/dest Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 19:39:32 -05:00
Jason Ekstrand	104525c33b	anv/pipeline: Set the right SSBO binding table start index for FS	2015-11-06 15:57:51 -08:00
Ilia Mirkin	76957389fc	nv50/ir: add fp64 opcode emission support for G200 (NVA0) Need to emulate rcp/rsq before providing full fp64 support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:36:25 -05:00
Jason Ekstrand	399d5314f6	anv/cmd_buffer: Rework the way we emit UBO surface state The new mechanism should be able to handle SSBOs as well as properly handle emitting surface state on gen7 where we need different strides depending on shader stage.	2015-11-06 15:14:12 -08:00
Hans de Goede	f979d3cfec	nv50/ir: Add support for 64bit immediates to checkSwapSrc01 Now that we support 64 bit immediates in insnCanLoad, we need to swap 64 bit immediate sources too for optimal effect. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Hans de Goede	9f2f8bda6e	nvc0/ir: Teach insnCanLoad about double immediates Teach insnCanLoad about double immediates, together with the "Add support for merge-s to the ConstantFolding pass" This turns the following (nvc0) code: 1: mov u32 $r2 0x00000000 (8) 2: mov u32 $r3 0x3fe00000 (8) 3: add f64 $r0d $r0d $r2d (8) Into: 1: add f64 $r0d $r0d 0.500000 (8) Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Hans de Goede	428506ece2	nv50/ir: Add support for merge-s to the ConstantFolding pass This allows later passes like LoadPropagation to properly deal with 64 bit immediates. If the new 64 bit load this introduces does not get optimized away then split64BitOpPostRA() will split this into 2 instructions again. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Ilia Mirkin	2437f00853	nv50/ir: disallow 64-bit immediates on nv50 targets No instructions are able to load short immediates like nvc0 can. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Ilia Mirkin	11e3dac36e	nv50/ir: allow movs with TYPE_F64 destinations to be split Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Jason Ekstrand	1b5c7e7ecd	anv/pipeline: Expose is_scalar_shader_stage	2015-11-06 15:12:33 -08:00
Jason Ekstrand	5ba281e794	nir/spirv: Add a helper for determining if a block is externally visable	2015-11-06 15:09:57 -08:00
Hans de Goede	b487b55f7d	gm107/ir: Add support for double immediates Add support for encoding double immediates (up to 20 bits of precision) into the generated gm107 machine-code. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 17:22:40 -05:00
Hans de Goede	12c850d01c	nvc0/ir: Add support for double immediates Add support for encoding double immediates (up to 20 bits of precision) into the generated nvc0 machine-code. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 17:22:40 -05:00
Jason Ekstrand	220261a0c9	anv: Use VkDescriptorType instead of anv_descriptor_type	2015-11-06 14:09:52 -08:00
Jason Ekstrand	612e35b2c6	anv: Do range-checking in the shader for dynamic buffers	2015-11-06 13:32:52 -08:00
Jason Ekstrand	f8052351ac	anv/device: Increase the block size for instructions	2015-11-06 13:29:47 -08:00
Francisco Jerez	5169407221	i965/nir/fs: Add comment for no-op memory barrier functions Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2015-11-06 13:19:56 -08:00
Jason Ekstrand	d7cc9929bb	anv: Remove all support for BufferViews We never actually supported them, we just used them for binding UBOs. Now that we have BufferInfo and we aren't supporting texture buffers yet, we should get rid of them until we can do them properly.	2015-11-06 13:16:18 -08:00
Jordan Justen	faa1193070	i965/nir/fs: Implement new barrier functions for compute shaders For these nir intrinsics, we emit the same code as nir_intrinsic_memory_barrier: * nir_intrinsic_memory_barrier_atomic_counter * nir_intrinsic_memory_barrier_buffer * nir_intrinsic_memory_barrier_image We treat these nir intrinsics as no-ops: * nir_intrinsic_group_memory_barrier * nir_intrinsic_memory_barrier_shared v3: * Add comment for no-op cases (curro) v4: * Moving comment to a separate patch authored by curro Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-11-06 13:16:11 -08:00
Jordan Justen	9d65f3208b	nir: Add new barrier functions for compute shaders When these functions are called in glsl-ir, we create a corresponding nir intrinsic function call. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-11-06 13:15:16 -08:00
Jordan Justen	91f188710a	glsl: Add new barrier functions for compute shaders When these functions are called in GLSL code, we create an intrinsic function call: * groupMemoryBarrier => __intrinsic_group_memory_barrier * memoryBarrierAtomicCounter => __intrinsic_memory_barrier_atomic_counter * memoryBarrierBuffer => __intrinsic_memory_barrier_buffer * memoryBarrierImage => __intrinsic_memory_barrier_image * memoryBarrierShared => __intrinsic_memory_barrier_shared v2: * Consolidate with memoryBarrier function/intrinsic creation (curro) v3: * Instead of add_memory_barrier_function, add an intrinsic_name parameter to _memory_barrier (curro) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-11-06 13:14:44 -08:00
Jason Ekstrand	0360c3608b	anv/device: Only support binding UBOs through BufferInfo	2015-11-06 12:52:12 -08:00
Jason Ekstrand	3aa2fc82dd	anv: Rework UpdateDescriptorSets Previously, UpdateDescriptorSets was wrong because it assumed that the binding was the offset into the descriptor set.	2015-11-06 12:28:03 -08:00
Jason Ekstrand	45b1bbe801	anv: Add a descriptor_index to anv_descriptor_set_binding_layout	2015-11-06 12:16:54 -08:00
Jason Ekstrand	f029e0ce13	anv: Add a layout to anv_descriptor_set	2015-11-06 12:16:54 -08:00
Boyuan Zhang	6bad554d98	radeon/uvd: fix VC-1 simple/main profile decode v2 We just needed to set the extra width/height fields to get this working. v2 (chk): rebased, CC stable added, commit message added, fixed coding style Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>	2015-11-06 20:07:23 +01:00
Boyuan Zhang	ed55def44f	st/vaapi: fix vaapi VC-1 simple/main corruption v2 Apply the start code fix only to advanced profile. v2 (chk): add commit message Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>	2015-11-06 20:07:23 +01:00
Julien Isorce	cc1e5c972e	st/va: add support for RGBX and BGRX in VPP Before it was only possible to convert a NV12 surface to RGBA or BGRA. This patch uses the same post processing function, "handleVAProcPipelineParameterBufferType", but add definitions for RGBX and BGRX. This patch also makes vlVaQuerySurfaceAttributes more generic to avoid copy and pasting the same lines. Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian K<C3><B6>nig <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-11-06 17:33:45 +00:00
Julien Isorce	42a5e143a8	vl/buffers: add RGBX and BGRX to the supported formats Useful is one wants to create RGBX or BGRX surfaces. The infrastructure is such that it required just a few definitions to support these formats. Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian K<C3><B6>nig <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-11-06 17:33:38 +00:00
Julien Isorce	bf6acbb2db	st/va: properly use brackets in vlVaAcquireBufferHandle's switch In "switch (mem_type)" the brackets were surrounding "case+default" instead of "case" only. Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian K<C3><B6>nig <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-11-06 17:33:16 +00:00
Julien Isorce	bfc245e9ac	st/va: properly indent buffer.c, config.c, image.c and picture.c Some lines were using 4 indentation spaces instead of 3. Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian K<C3><B6>nig <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-11-06 17:33:01 +00:00
Rob Clark	6459e780ae	freedreno/a4xx: fix blend color Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-06 11:19:04 -05:00
Rob Clark	7465e16124	freedreno: update generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-06 11:18:47 -05:00
Guillaume Charifi	6f5e0c08a4	freedreno: add a305 support Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-06 11:17:58 -05:00
Boyan Ding	8f55ebe802	freedreno/ir3: Use nir_foreach_variable Signed-off-by: Boyan Ding <boyan.j.ding@gmail.com> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-06 11:17:53 -05:00
Rob Clark	99597d033a	nir: some small cleanups The various cf nodes all get allocated w/ shader as their ralloc_parent, so lets make this more explicit. Plus couple other corrections/ clarifications. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-11-06 11:15:41 -05:00
Ilia Mirkin	d68226087c	nvc0: reintroduce BGRA4 format support Commit `342e68dc60` (nvc0: remove BGRA4 format support) removed the support to fix a WoW trace. However after further experimentation, I was able to get the blit to work by using a different "fake" format in the 2d engine. The reason why this worked on nv50 is that nv50 falls back to the 3d blit path in case either the src or the dst aren't "faithfully" supported, while nvc0 only does it for the dst format. RG8 is better supported by the nvc0 2d engine than R16. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 00:47:44 -05:00
Brian Paul	581111c4d6	mesa: report enum name in glClientActiveTexture() error string As we do for glActiveTexture(). Trivial.	2015-11-05 20:12:33 -07:00
Chad Versace	16119ad884	anv/meta: Finish load clears for stencil attachments Tested by Crucible "func.depthstencil.stencil_triangles.*" in commit c194292d5eadb84e9d7489fc01ce0b653cdd4ca5 (HEAD -> master) Author: Chad Versace <chad.versace@intel.com> Date: Wed Nov 4 16:19:24 2015 -0800 Subject: func.depthstencil: Remove stencil clear workaround for Mesa	2015-11-05 15:45:43 -08:00
Julien Isorce	497bde6727	st/va: fix memory leak on error in vlVaCreateSurfaces2 Found by coverity: CID #1337953 Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-11-05 23:39:45 +00:00
Julien Isorce	e0b896c86c	st/va: indent vlVaQuerySurfaceAttributes and vlVaCreateSurfaces2 Some lines were using 4 indentation spaces instead of 3. Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-11-05 23:39:43 +00:00
Kenneth Graunke	8dcf807cb4	i965: Fix scalar VS float[] and vec2[] output arrays. The scalar VS backend has never handled float[] and vec2[] outputs correctly (my original code was broken). Outputs need to be padded out to vec4 slots. In fs_visitor::nir_setup_outputs(), we tried to process each vec4 slot by looping from 0 to ALIGN(type_size_scalar(type), 4) / 4. However, this is wrong: type_size_scalar() for a float[2] would return 2, or for vec2[2] it would return 4. This looked like a single slot, even though in reality each array element would be stored in separate vec4 slots. Because of this bug, outputs[] and output_components[] would not get initialized for the second element's VARYING_SLOT, which meant emit_urb_writes() would skip writing them. Nothing used those values, and dead code elimination threw a party. To fix this, we introduce a new type_size_vec4_times_4() function which pads array elements correctly, but still counts in scalar components, generating correct indices in store_output intrinsics. Normally, varying packing avoids this problem by turning varyings into vec4s. So this doesn't actually fix any Piglit or dEQP tests today. However, if varying packing is disabled, things would be broken. Tessellation shaders can't use varying packing, so this fixes various tcs-input Piglit tests on a branch of mine. v2: Shorten the implementation of type_size_4x to a single line (caught by Connor Abbott), and rename it to type_size_vec4_times_4() (renaming suggested by Jason Ekstrand). Use type_size_vec4 rather than using type_size_vec4_times_4 and then dividing by 4. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-11-05 15:26:07 -08:00
Roland Scheidegger	5ae37ae615	llvmpipe: disable texture cache There are some weird problems with 8-wide vectors.	2015-11-05 18:00:42 +01:00

... 207 208 209 210 211 ...

85652 Commits