AlexIndustrial/mesa


Iago Toral Quiroga aa4796ae81 i965/fs/gen7: split instructions that run into exec masking bugs

In fp64 we can produce code like this:

mov(16) vgrf2<2>:UD, vgrf3<2>:UD

Our SIMD lowering pass would typically split this into instructions with
a width of 8, each writing to two consecutive registers. Unfortunately,
gen7 hardware has a bug affecting execution masking and, as a result, the
second GRF register write won't work properly. Curro verified this:

"The problem is that pre-Gen8 EUs are hardwired to use the QtrCtrl+1
 (where QtrCtrl is the 8-bit quarter of the execution mask signals
 specified in the instruction control fields) for the second
 compressed half of any single-precision instruction (for
 double-precision instructions it's hardwired to use NibCtrl+1,
 at least on HSW), which means that the EU will apply the wrong
 execution controls for the second sequential GRF write if the number
 of channels per GRF is not exactly eight in single-precision mode (or
 four in double-float mode)."

In practice, this means that we cannot write more than one
consecutive GRF in a single instruction if the number of channels
per GRF is not exactly eight in single-precision mode (or four
in double-float mode).

This patch makes our SIMD lowering pass split instructions of this kind
so that the split versions only write to a single register. In the
example above this means that we split the write into 4 instructions,
each one writing 4 UD elements (width = 4) to a single register.
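
The width restriction described above can be sketched as follows. This is
a minimal illustration, not the code from the patch: the struct, its
fields, and the function names are hypothetical, and it assumes a 32-byte
GRF. It takes the width the lowering pass would otherwise pick and clamps
it to the number of channels per GRF whenever that number differs from
what the hardware shifts for the second compressed half (8 channels in
single precision, 4 in double precision).

```c
#include <assert.h>

/* All names below are hypothetical; they are not Mesa identifiers. */
#define GRF_SIZE_BYTES 32u      /* assumed size of one GRF in bytes */

struct inst_info {
    unsigned exec_size;         /* channels executed, e.g. 16 */
    unsigned dst_stride;        /* destination stride in elements, e.g. 2 */
    unsigned dst_type_size;     /* destination element size in bytes */
    unsigned exec_type_size;    /* execution type size in bytes (4 or 8) */
    int writes_dst;             /* 0 if the instruction writes no register */
    int we_all;                 /* nonzero if WE_all is set */
};

static unsigned min_u(unsigned a, unsigned b)
{
    return a < b ? a : b;
}

/* Number of GRFs covered by the destination region, rounded up. */
static unsigned regs_written(const struct inst_info *inst)
{
    if (!inst->writes_dst)
        return 0;
    const unsigned bytes =
        inst->exec_size * inst->dst_stride * inst->dst_type_size;
    return (bytes + GRF_SIZE_BYTES - 1) / GRF_SIZE_BYTES;
}

/* Clamp max_width so that a split instruction writes a single GRF when
 * the channels-per-GRF count does not match the hardware's fixed shift. */
static unsigned gen7_limit_width(const struct inst_info *inst,
                                 unsigned max_width)
{
    const unsigned regs = regs_written(inst);

    /* Nothing to do for WE_all, for instructions that write no register
     * (this also avoids a division by zero), or for single-GRF writes. */
    if (inst->we_all || regs <= 1)
        return max_width;

    const unsigned channels_per_grf = inst->exec_size / regs;
    if (channels_per_grf == 0)
        return max_width;

    assert(inst->exec_type_size != 0);
    const unsigned expected = inst->exec_type_size == 8 ? 4 : 8;

    /* Lower the limit rather than returning early, so that other
     * restrictions on the same instruction can still lower it further. */
    if (channels_per_grf != expected)
        max_width = min_u(max_width, channels_per_grf);

    return max_width;
}
```

For the mov(16) example, the destination covers 16 * 2 * 4 = 128 bytes,
i.e. 4 GRFs with 4 channels each, so a starting width of 8 is clamped
to 4.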

v2 (Curro):
 - Make explicit that the thing about hardwiring NibCtrl+1 for the second
   compressed half is known to happen in Haswell and the issue with IVB
   might not be exactly the same.
 - Assign max_width instead of returning early so that we can handle
   multiple restrictions affecting the same instruction.
 - Avoid division by 0 if the instruction does not write any registers.
 - Ignore instructions that have WE_all set.
 - Use the instruction execution type size instead of the dst type size.

v3 (Curro):
 - Move the implementation down so it is not placed in the middle of another
   workaround.
 - Declare channels_per_grf as const.
 - Don't break the loop early if we find a BAD_FILE source.
 - Fix the number of channels that the hardware shifts for the second half
   of a compressed instruction to be 8 in single precision and 4 in double
   precision.

Reviewed-by: Francisco Jerez <currojerez@riseup.net>

2016-07-13 07:09:41 +02:00

bin

bugzilla_mesa.sh: Drop "Bug " from sed command

2016-07-07 15:58:46 +01:00

docs

docs: remove duplicated line in 12.0.1 release notes file

2016-07-12 09:42:42 -06:00

doxygen

doxygen: Plumb through gallium/ to automated documentation

2016-05-30 17:53:45 +01:00

include

i965: Removing PCI IDs that are no longer listed as Kabylake.

2016-06-29 11:14:19 -07:00

util: Add ATTRIBUTE_RETURNS_NONNULL.

2016-05-16 11:06:15 -07:00

scons

scons: support 2.5.0

2016-05-25 12:23:12 -06:00

scripts

scripts: bump git_reviewer.pl --git-min-percent default

2016-05-09 19:30:28 -04:00

src

i965/fs/gen7: split instructions that run into exec masking bugs

2016-07-13 07:09:41 +02:00

.dir-locals.el

dir-locals.el: set case-label offset to 0

2016-02-03 15:44:51 -05:00

.gitattributes

Disable autocrlf for Visual Studio project files.

2008-02-28 12:34:01 +09:00

.gitignore

automake: rework the git_sha1.h rule, include in tarball

2016-05-30 17:53:45 +01:00

.mailmap

.mailmap: Fixup my email address

2016-06-23 00:00:46 +02:00

.travis.yml

travis: Add a test build with scons.

2015-12-01 15:09:56 -08:00

Android.common.mk

Android: move libdrm settings to top-level Android.common.mk

2016-06-13 15:31:29 +01:00

Android.mk

isl: add support for Android libmesa_isl static library

2016-06-02 22:31:44 +01:00

appveyor.yml

appveyor: Run unit tests.

2016-04-14 07:19:04 +01:00

autogen.sh

autogen.sh: pass --force to autoreconf, quote ORIGDIR

2015-03-11 23:28:26 +00:00

CleanSpec.mk

android: Depend on gallium_dri from EGL, instead of linking in gallium.

2015-06-09 11:38:45 -07:00

common.py

scons: Allow building with Address Sanitizer.

2016-04-13 06:54:32 +01:00

configure.ac

clover: Bump required LLVM version to 3.6.

2016-07-11 20:19:14 -07:00

install-gallium-links.mk

install-gallium-links.mk: handle multiple libraries

2016-04-14 16:30:57 +01:00

install-lib-links.mk

install-lib-links: remove the .install-lib-links file

2015-02-24 15:33:25 +00:00

Makefile.am

automake: add SWR to `make distcheck' gallium drivers

2016-06-13 15:24:44 +01:00

REVIEWERS

add REVIEWERS and get_reviewer.pl script

2016-05-04 11:25:46 -04:00

SConstruct

scons: whitespace cleanup

2016-05-25 12:23:12 -06:00

VERSION

docs: add 12.1.0-devel release notes template, bump version

2016-05-30 20:03:19 +01:00

docs/README.WIN32

File: docs/README.WIN32

Last updated: 21 June 2013


Quick Start
----- -----

Windows drivers are built with SCons.  Makefiles or Visual Studio projects are
no longer shipped or supported.

Run

  scons libgl-gdi

to build the gallium-based GDI driver.

This works with both MSVS and MinGW.


Windows Drivers
------- -------

At this time, only the gallium GDI driver is known to work.

Source code also exists in the tree for other drivers in
src/mesa/drivers/windows, but the status of this code is unknown.

Recipe
------

Building on Windows requires several open-source packages. These are
steps that work as of this writing.

- install python 2.7
- install scons (latest)
- install mingw, flex, and bison
- install pywin32 from here: http://www.lfd.uci.edu/~gohlke/pythonlibs
  get pywin32-218.4.win-amd64-py2.7.exe
- install git
- download mesa from git
  see http://www.mesa3d.org/repository.html
- run scons

General
-------

After building, you can copy the above DLL files to a place in your
PATH such as $SystemRoot/SYSTEM32.  If you don't like putting things
in a system directory, place them in the same directory as the
executable(s).  Be careful about accidentally overwriting files of
the same name in the SYSTEM32 directory.

The DLL files are built so that the external entry points use the
stdcall calling convention.

Static LIB files are not built.  The LIB files that are built are
the linker import files associated with the DLL files.

The si-glu sources are used to build the GLU libs.  This was done
mainly to get the better tessellator code.

If you have a Windows-related build problem or question, please post
to the mesa-dev or mesa-users list.

Languages

C 75.5%

C++ 17.2%

Python 2.7%

Rust 1.8%

Assembly 1.5%

Other 1%