mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	freedreno: use OUT_RELOCW when buffer is written	Rob Clark	2014-05-21	1	-4/+4
\| \| \| \| \| \| \|	These aren't buffers we ever read back from CPU, so using incorrect reloc fxn wasn't really harming anything. But might as well be correct. Signed-off-by: Rob Clark <[email protected]>
*	rbug: add missing pipe->blit() entrypoint	Rob Clark	2014-05-21	1	-0/+21
\| \| \| \| \|	Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Jakob Bornecrantz <[email protected]>
*	nv50,nvc0: fix 3d blits with mipmap levels	Ilia Mirkin	2014-05-21	2	-11/+19
\| \| \| \| \| \| \| \| \| \|	Make sure to normalize the z coordinates as well as the x/y ones when there are mipmaps present. Fixes 3d mipmap generation, which now uses the blit path. Signed-off-by: Ilia Mirkin <[email protected]> Cc: "10.2" <[email protected]> Reviewed-by: Ben Skeggs <[email protected]>
*	nv50/ir: fix constant folding for OP_MUL subop HIGH	Ilia Mirkin	2014-05-21	1	-4/+43
\| \| \| \| \| \| \| \| \| \| \| \|	These instructions can come in either through IMUL_HI/UMUL_HI TGSI opcodes, or from OP_DIV constant folding. Also make sure that the constant foldings which delete the original instruction still get counted as having done something. Signed-off-by: Ilia Mirkin <[email protected]> Cc: "10.1 10.2" <[email protected]> Reviewed-by: Ben Skeggs <[email protected]>
*	nv50/ir: fix s32 x s32 -> high s32 multiply logic	Ilia Mirkin	2014-05-21	2	-11/+82
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Retrieving the high 32 bits of a signed multiply is rather annoying. It appears that the simplest way to do this is to compute the absolute value of the arguments, and perform a u32 x u32 -> u64 operation. If the arguments' signs differ, then negate the result. Since there is no u64 support in the cvt instruction, we have the perform the 2's complement negation "by hand". This logic can come into use by the IMUL_HI instruction (very unlikely to be seen), as well as from constant folding of division by a constant. Fixes dolphin's divisions by 255. Signed-off-by: Ilia Mirkin <[email protected]> Cc: "10.1 10.2" <[email protected]> Reviewed-by: Ben Skeggs <[email protected]>
*	freedreno: don't advertise texture arrays for now	Rob Clark	2014-05-20	1	-1/+1
\| \| \| \| \| \| \|	I think a3xx and later should support (it is part of GLES3), but this isn't needed for the time being and still needs to be reversed. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: shadow sampler support	Rob Clark	2014-05-19	2	-3/+46
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: refactor trans_samp()	Rob Clark	2014-05-19	1	-47/+90
\| \| \| \| \| \| \|	Split it up into some smaller fxns so it doesn't grow into a huge monster as we add things. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2014-05-19	4	-4/+10
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	llvmpipe: do IR counting for shader cache management after optimization.	Roland Scheidegger	2014-05-19	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	2ea923cf571235dfe573c35c3f0d90f632bd86d8 had the side effect of IR counting now being done after IR optimization instead of before. Some quick analysis shows that there's roughly 1.5 times more IR instructions before optimization than after, hence the effective shader cache size got quite a bit smaller. Could counter this with an increase of the instruction limit but it probably makes more sense to count them after optimizations, so move that code. Reviewed-by: Brian Paul <[email protected]>
*	nv50/ir: fix integer mul lowering for u32 x u32 -> high u32	Ilia Mirkin	2014-05-18	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \|	UNION appears to expect that all of its sources are conditionally defined. Otherwise it inserts an unpredicated mov instruction which overwrites the desired result. This fixes tests that use UMUL_HI, and much less directly, unsigned integer division by a constant, which uses this functionality in a peephole pass. Signed-off-by: Ilia Mirkin <[email protected]> Cc: "10.1 10.2" <[email protected]> Reviewed-by: Ben Skeggs <[email protected]>
*	nv50/ir: make sure that texprep/texquerylod's args get coalesced	Ilia Mirkin	2014-05-18	1	-0/+2
\| \| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Cc: "10.2" <[email protected]> Reviewed-by: Ben Skeggs <[email protected]>
*	freedreno/a3xx: use util_format_compose_swizzles()	Rob Clark	2014-05-18	1	-9/+9
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: 1D textures	Rob Clark	2014-05-18	1	-4/+25
\| \| \| \| \| \| \| \|	Gallium already gives us height==1 for these, so the texture state is already setup correctly to emulate 1D textures as a Nx1 2D texture. We just need to supply the .y coord. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: fix caps	Rob Clark	2014-05-18	1	-2/+2
\| \| \| \| \| \|	In particular, we want mesa to emulate primitive restart for us. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: fix index buffer offset	Rob Clark	2014-05-18	1	-1/+1
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: add sRBG texture support	Rob Clark	2014-05-16	2	-0/+15
\| \| \| \| \| \| \|	That was easy. Turns out it is just a matter of setting one bit. Enable sampling from sRGB texture, and therefore enable GL 2.1 :-) Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2014-05-16	4	-20/+21
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	gallivm: give more verbose names to modules	Roland Scheidegger	2014-05-16	7	-16/+21
\| \| \| \| \| \| \| \| \|	When we had just one module "gallivm" was an appropriate name. But now we have modules containing all functions for a particular variant, so give it a corresponding name (this is really just for helping debugging). Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Jose Fonseca <[email protected]>
*	gallium/radeon: link in libradeon.la at target level	Emil Velikov	2014-05-15	3	-20/+8
\| \| \| \| \| \| \| \| \| \| \| \|	It makes more sense to link the core and common parts of the driver as the target is build. Additionally this will help us drop duplicating symbols for targets that static link mulitple pipe-drivers. Only egl-static needs that currently with more to come. To simplify things a bit add HAVE_GALLIUM_RADEON_COMMON variable. Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Tom Stellard <[email protected]>
*	gallium/radeon: build only a single common library libradeon	Emil Velikov	2014-05-15	3	-12/+5
\| \| \| \| \| \| \|	Just fold libllvmradeon in libradeon. Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Tom Stellard <[email protected]>
*	freedreno/a3xx: fix write to bogus register	Rob Clark	2014-05-14	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	The loops for updating the multiple packed fields in SP_VS_OUT[] and SP_VS_VPC_DST[] will zero out one register beyond the last that on required. Which is normally not a problem (and is kinda convenient when looking at cmdstream dumps) unless we have maximum (16) varyings. Fix loop termination condition so that this does not happen. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: account for special inputs/outputs	Rob Clark	2014-05-14	1	-2/+2
\| \| \| \| \| \| \| \| \|	We need to size input/output tables big enough for special inputs/ outputs (gl_Position, gl_FrontFacing, etc) which, while they don't count towards the hw limit of 16 attributes or 16 varyings, we do still need to track them all the same. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: fix MAX_INPUTS shader cap	Rob Clark	2014-05-14	3	-1/+9
\| \| \| \| \| \| \| \| \| \|	Hardware only supports 16. Which fd3_shader_variant properly reflected, but the pipe cap did not, leading to array overflow (and shaders that could not possibly work). Also a bunch of asserts to make problems like this easier to see. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: add debug flag to expose glsl130	Rob Clark	2014-05-14	2	-3/+8
\| \| \| \| \| \| \| \| \| \|	We are starting to add integer support to the compiler, which does not get exercised with glsl feature level 120 and without advertising integer support. But doing so breaks too many things right now. So for now use a debug flag to conditionally expose the functionality while it is in development. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: add KILL_IF	Ryan Houdek	2014-05-14	1	-1/+35
\| \| \| \| \| \| \| \| \|	The KILL_IF opcode could potentially be merged in to the regular KILL opcode function. It was a pain to do so, so I've left is separated for cleanliness. Signed-off-by: Ryan Houdek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: start adding integer support	Ryan Houdek	2014-05-14	1	-0/+169
\| \| \| \| \| \| \| \| \| \| \| \| \|	Adds a large sum of TGSI opcodes to the a3xx compiler. For integer opcodes we have 28 opcodes added. Adds 4 floating point compare opcodes If GLSL 1.30 is enabled, this allows the GLSL 1.30 piglits to have a completion amount of 432/641. Signed-off-by: Ryan Houdek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	llvmpipe: improve setup shader names (for debugging)	Roland Scheidegger	2014-05-15	1	-38/+40
\| \| \| \| \| \| \| \| \| \| \| \| \|	The setup shaders were composed of both a fs shader number and a variant number. But since they aren't tied to a particular fragment shader, the former was a fixed zero while the latter was also always zero because it was never assigned. So, similar to what the fs code does, use a ever increasing number to give it a more catchy name (unlike fragment shaders though where this number is for each explicitly created shader, we just use it for the implicitly created variants). And while here, fix whitespace a bit. Reviewed-by: Jose Fonseca <[email protected]>
*	llvmpipe: kill off llvmpipe_variant_count	Roland Scheidegger	2014-05-15	4	-20/+4
\| \| \| \| \| \| \|	Unused except it was increased for both fs and setup shader variants created. Probably some leftover from ages ago. Reviewed-by: Jose Fonseca <[email protected]>
*	nvc0: enable support for maxwell boards	Ben Skeggs	2014-05-15	5	-19/+48
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: add maxwell (sm50) compiler backend	Ben Skeggs	2014-05-15	16	-5/+3588
\| \| \| \| \| \| \| \| \| \|	The big missing part here is proper sched data calculations, but hopefully the chosen placeholder will be sufficient for now. Passes piglit as well as GK107 does. Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: maxwell isa has no per-instruction join modifier	Ben Skeggs	2014-05-15	4	-19/+23
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: replace immd 0 with $rLASTGPR for emit/restart opcodes	Ben Skeggs	2014-05-15	1	-0/+1
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: move nvc0 lowering pass class definitions into header	Ben Skeggs	2014-05-15	3	-106/+136
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: bump sched data member to 32-bits	Ben Skeggs	2014-05-15	1	-1/+1
\| \| \| \| \| \| \|	SM50 backend requires 21 bits per instruction, not 8. Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: use vertex arrays for eng3d blit	Ben Skeggs	2014-05-15	1	-31/+64
\| \| \| \| \| \| \|	Maxwell doesn't have immediate-mode. Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: restrict "constant vbo" logic to fermi/kepler classes	Ben Skeggs	2014-05-15	1	-1/+1
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: replace some vb->stride checks with constant_vbo instead	Ben Skeggs	2014-05-15	1	-3/+3
\| \| \| \| \| \| \| \|	Maxwell no longer has the methods to set constant attributes, and we'll want to be treating stride 0 vtxbufs the same as for stride > 0. Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: add maxwell class	Ben Skeggs	2014-05-15	2	-0/+4
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: allow for easier modification of compiler library routines	Ben Skeggs	2014-05-15	13	-1057/+1057
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0: properly distribute macros in source form	Ben Skeggs	2014-05-15	5	-244/+365
\| \| \| \| \|	Signed-off-by: Ben Skeggs <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	radeonsi: Fix anisotropic filtering state setup	Michel Dänzer	2014-05-14	3	-13/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Bring it back in line with r600g. I broke this in the original radeonsi bringup. :( Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=78537 Cc: "10.1 10.2" <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Alex Deucher <[email protected]>
*	llvmpipe: Delete unneeded LLVM stuff earlier.	José Fonseca	2014-05-14	7	-34/+16
\| \| \| \| \| \|	Same as Frank's change to draw module but for llvmpipe module. Reviewed-by: Roland Scheidegger <[email protected]>
*	gallivm,draw,llvmpipe: Remove support for versions of LLVM prior to 3.1.	José Fonseca	2014-05-14	1	-28/+0
\| \| \| \| \| \| \|	Older versions haven't been tested probably don't work anyway. But more importantly, code supporting it is hindering further work. Reviewed-by: Roland Scheidegger <[email protected]>
*	freedreno/a3xx: occlusion query support	Rob Clark	2014-05-13	5	-3/+185
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add support for hw queries	Rob Clark	2014-05-13	10	-8/+734
\| \| \| \| \| \| \| \| \| \| \|	Real GPU queries need some infrastructure to track samples per tile and accumulate the results. But fortunately this can be shared across GPU generation. See: https://github.com/freedreno/freedreno/wiki/Queries#hardware-queries Signed-off-by: Rob Clark <[email protected]>
*	freedreno/query: allow multiple query implementations	Rob Clark	2014-05-13	6	-107/+269
\| \| \| \| \| \| \| \|	Split out fd_query into an abstract base class, to allow multiple implementations. The current sw based queries are moved into fd_sw_query. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: add point-size	Rob Clark	2014-05-13	1	-4/+14
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2014-05-13	4	-54/+252
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	nv50,nvc0: fix blit 3d path for 1d array textures	Ilia Mirkin	2014-05-11	1	-0/+6
\| \| \| \| \| \| \| \| \| \|	Need to adjust coordinates since the shader receives the array index as depth in z, but the TEX instruction expects it to be the second coordinate for a 1D array texture. This fixes fbo-generatemipmap-array. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Ben Skeggs <[email protected]> Cc: "10.2" <[email protected]>