mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	tgsi: provide a way to encode memory qualifiers for SSBO	Ilia Mirkin	2016-01-08	10	-2/+180
\| \| \| \| \| \| \| \| \| \|	Each load/store on most hardware can specify what caching to do. Since SSBO allows individual variables to also have separate caching modes, allow loads/stores to have the qualifiers instead of attempting to encode them in declarations. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	ureg: add buffer support to ureg	Ilia Mirkin	2016-01-08	6	-1/+69
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	tgsi: add ureg support for image decls	Ilia Mirkin	2016-01-08	12	-52/+153
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	glsl: Ensure 64bits shift is used.	Jose Fonseca	2016-01-08	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	I believe that `1u << x`, where x >= 32 yields undefined results according to the C standard. Particularly MSVC says `warning C4334: '<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)`. Reviewed-by: Brian Paul <[email protected]>
*	mesa/main: Avoid `void function returning a value` warning.	Jose Fonseca	2016-01-08	1	-2/+4
\| \| \| \| \| \|	Trivial. Reviewed-by: Brian Paul <[email protected]>
*	nine: allow fragment shader POSITION and FACE to be system values	Marek Olšák	2016-01-08	2	-12/+46
\| \| \| \|	Reported-by: Axel Davy <[email protected]>
*	vl: allow fragment shader POSITION to be a system value	Marek Olšák	2016-01-08	1	-4/+8
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected] Reviewed-by: Brian Paul <[email protected]>
*	util/pstipple: allow fragment shader POSITION to be a system value	Marek Olšák	2016-01-08	6	-11/+34
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected] Reviewed-by: Brian Paul <[email protected]>
*	st/mesa: add support for POSITION and FACE system values	Marek Olšák	2016-01-08	4	-16/+44
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected] Reviewed-by: Brian Paul <[email protected]>
*	tgsi/scan: update for POSITION and FACE sytem values	Marek Olšák	2016-01-08	1	-1/+4
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected] Reviewed-by: Brian Paul <[email protected]>
*	gallium: add caps for POSITION and FACE system values	Marek Olšák	2016-01-08	17	-6/+48
\| \| \| \| \| \| \|	v2: document the integer behavior Reviewed-by: Edward O'Callaghan <[email protected] Reviewed-by: Brian Paul <[email protected]>
*	program: add a helper for rewriting FP position input to sysval	Marek Olšák	2016-01-08	2	-0/+29
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected] Reviewed-by: Brian Paul <[email protected]>
*	glsl: optionally declare gl_FragCoord & gl_FrontFacing as system values	Marek Olšák	2016-01-08	5	-4/+18
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected] Reviewed-by: Brian Paul <[email protected]>
*	tgsi/ureg: handle redundant declarations in ureg_DECL_system_value	Marek Olšák	2016-01-08	1	-1/+9
\| \| \| \| \|	Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]>
*	tgsi/ureg: remove index parameter from ureg_DECL_system_value	Marek Olšák	2016-01-08	4	-13/+16
\| \| \| \| \| \| \| \|	It can be trivially derived from the number of already declared system values. This allows ureg users not to worry about which index to choose. Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]>
*	st/mesa: remove dead code from mesa_to_tgsi	Marek Olšák	2016-01-08	1	-51/+0
\| \| \| \| \| \| \|	These aren't part of ARB_fragment_program. Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]>
*	radeon, si: Use TGSI chan name defines in lp_build_emit_fetch() calls	Edward O'Callaghan	2016-01-08	2	-8/+8
\| \| \| \| \|	Signed-off-by: Edward O'Callaghan <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/aux: Use TGSI chan name defines inplace of literals	Edward O'Callaghan	2016-01-08	1	-6/+7
\| \| \| \| \|	Signed-off-by: Edward O'Callaghan <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	mesa: check that internalformat of CopyTexImage*D is not 1, 2, 3, 4	Nicolai Hähnle	2016-01-08	1	-0/+16
\| \| \| \| \| \| \| \| \|	The piglit copyteximage check has recently been augmented to test this, but apparently it hasn't been fixed in Mesa so far. This language also already appears in the OpenGL 2.1 spec (Ian). Reviewed-by: Ian Romanick <[email protected]>
*	i965/compiler: Enable more lowering in NIR	Jason Ekstrand	2016-01-07	1	-0/+7
\| \| \| \| \| \| \|	We don't need these for GLSL or ARB, but we need them for SPIR-V Reviewed-by: Iago Toral Quiroga <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	nir/algebraic: Add more lowering	Jason Ekstrand	2016-01-07	2	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \|	This commit adds lowering options for the following opcodes: - nir_op_fmod - nir_op_bitfield_insert - nir_op_uadd_carry - nir_op_usub_borrow Reviewed-by: Iago Toral Quiroga <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	nir/opcodes: Fix up uadd_carry and usub_borrow	Jason Ekstrand	2016-01-07	1	-2/+2
\| \| \| \| \| \| \| \|	Both were defined as returning bool but the gpu_shader5 functions are defined to return int. Also, we had the parameters for usub borrwo backwards in the folding expression. Reviewed-by: Matt Turner <[email protected]>
*	nvc0: add ARB_indirect_parameters support	Ilia Mirkin	2016-01-07	5	-6/+313
\| \| \| \| \| \| \|	I chose to make separate macros for this due to the additional complexity and extra scratch usage. Signed-off-by: Ilia Mirkin <[email protected]>
*	st/mesa: expose ARB_indirect_parameters when the backend driver allows	Ilia Mirkin	2016-01-07	2	-0/+2
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	mesa: add support for ARB_indirect_parameters draw functions	Ilia Mirkin	2016-01-07	3	-0/+234
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	mesa: add parameter buffer, used for ARB_indirect_parameters	Ilia Mirkin	2016-01-07	4	-0/+25
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	glapi: add ARB_indirect_parameters definitions	Ilia Mirkin	2016-01-07	7	-1/+63
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	nvc0: add support for real ARB_multi_draw_indirect	Ilia Mirkin	2016-01-07	4	-18/+47
\| \| \| \| \| \| \|	The draw groups are now split up into groups of 32 if there's a non-packed stride, or in groups of 400-500 if the draw data is packed. Signed-off-by: Ilia Mirkin <[email protected]>
*	nvc0: adjust indirect draw macros to handle multiple draws at once	Ilia Mirkin	2016-01-07	3	-52/+101
\| \| \| \| \| \| \|	These are still invoked one at a time, but the underlying macro can handle multiple draws. Signed-off-by: Ilia Mirkin <[email protected]>
*	st/mesa: add support for new mesa indirect draw interface	Ilia Mirkin	2016-01-07	3	-9/+84
\| \| \| \| \| \| \| \| \| \|	This shifts all indirect draws to go through the new function. If the driver doesn't have support for multi draws, we break those up and perform N draws. Otherwise, we pass everything through for just a single draw call. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	gallium: add caps to expose support for multi indirect draws	Ilia Mirkin	2016-01-07	16	-0/+35
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	gallium: add sufficient draw interface to allow new indirect features	Ilia Mirkin	2016-01-07	1	-1/+10
\| \| \| \| \| \| \| \|	This makes it possible to support indirect multidraws as well as having the number of such draws to come from a separate GPU resource. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	vbo: create a new draw function interface for indirect draws	Ilia Mirkin	2016-01-07	4	-75/+89
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	All indirect draws are passed to the new draw function. By default there's a fallback implementation which pipes it right back to draw_prims, but eventually both the fallback and draw_prim's support for indirect drawing should be removed. This should allow a backend to properly support ARB_multi_draw_indirect and ARB_indirect_parameters. Signed-off-by: Ilia Mirkin <[email protected]> Acked-by: Marek Olšák <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	llvmpipe: do 64bit plane calculations in the sse path	Roland Scheidegger	2016-01-08	3	-62/+150
\| \| \| \| \| \| \| \| \| \| \| \|	The sse path was pretty much disabled for practical purposes because the largest allowed fb size was 128x128. So, adapt it for 64bit plane calculations. This is actually not that difficult, though a problem is that we can't do a signed 32x32->64bit mul, only unsigned, so need to fix that up. Overall, the code still looks reasonable, though it's not like changes there in setup really make much of a difference in the end... Reviewed-by: Jose Fonseca <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	llvmpipe: don't store eo as 64bit int	Roland Scheidegger	2016-01-08	4	-11/+16
\| \| \| \| \| \| \| \| \| \| \|	eo, just like dcdx and dcdy, cannot overflow 32bit. Store it as unsigned though just in case (it cannot be negative, but in theory twice as big as dcdx or dcdy so this gives it one more bit). This doesn't really change anything, albeit it might help minimally on 32bit archs. Reviewed-by: Jose Fonseca <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	llvmpipe: use aligned data for the assembly program in setup	Roland Scheidegger	2016-01-08	1	-17/+21
\| \| \| \| \| \| \| \| \|	Back in the day (before 24678700edaf5bb9da9be93a1367f1a24cfaa471) the values were not actually in a struct but even then I can't see why we didn't simply align the values. Especially since it's trivial to do so. (Not that it actually matters since the code is pretty much unused for now.) Reviewed-by: Oded Gabbay <[email protected]>
*	draw: initialize prim header flags when clipping lines	Roland Scheidegger	2016-01-08	1	-0/+2
\| \| \| \| \| \| \| \| \|	Otherwise, clipped lines would have undefined stippling reset bit if line stippling is enabled. (Untested, and I just assume copying over the bits from the original line is actually the right thing to do.) Reviewed-by: Jose Fonseca <[email protected]>
*	draw: fix line stippling with unfilled prims	Roland Scheidegger	2016-01-08	1	-18/+38
\| \| \| \| \| \| \| \| \| \| \| \| \|	The unfilled stage was not filling in the prim header, and the line stage then decided to reset the stipple counter or not based on the uninitialized data. This causes some failures in conform linestipple test (albeit quite randomly happening depending on environment). So fill in the prim header in the unfilled stage - I am not entirely sure if anybody really needs determinant after that stage, but there's at least later stages (wide line for instance) which copy over the determinant as well. Reviewed-by: Jose Fonseca <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	glsl: replace null check with assert	Timothy Arceri	2016-01-08	1	-3/+1
\| \| \| \| \| \| \| \|	This was added in 54f583a20 since then error handling has improved. The test this was added to fix now fails earlier since 01822706ec Reviewed-by: Matt Turner <[email protected]>
*	i965: use _mesa_delete_buffer_object	Nicolai Hähnle	2016-01-07	1	-1/+1
\| \| \| \| \| \| \| \| \|	This is more future-proof, plugs the memory leak of Label and properly destroys the buffer mutex. Reviewed-by: Marek Olšák <[email protected]> Cc: "11.0 11.1" <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i915: use _mesa_delete_buffer_object	Nicolai Hähnle	2016-01-07	1	-1/+1
\| \| \| \| \| \| \| \| \|	This is more future-proof, plugs the memory leak of Label and properly destroys the buffer mutex. Reviewed-by: Marek Olšák <[email protected]> Cc: "11.0 11.1" <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	radeon: use _mesa_delete_buffer_object	Nicolai Hähnle	2016-01-07	1	-1/+1
\| \| \| \| \| \| \| \| \|	This is more future-proof, plugs the memory leak of Label and properly destroys the buffer mutex. Reviewed-by: Marek Olšák <[email protected]> Cc: "11.0 11.1" <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	st/mesa: use _mesa_delete_buffer_object	Nicolai Hähnle	2016-01-07	1	-3/+1
\| \| \| \| \| \| \|	This is more future-proof than the current code. Reviewed-by: Marek Olšák <[email protected]> Cc: "11.0 11.1" <[email protected]>
*	mesa/bufferobj: make _mesa_delete_buffer_object externally accessible	Nicolai Hähnle	2016-01-07	2	-1/+5
\| \| \| \| \| \| \| \| \|	gl_buffer_object has grown more complicated and requires cleanup. Using this function from drivers will be more future-proof. Reviewed-by: Marek Olšák <[email protected]> Cc: "11.0 11.1" <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	llvmpipe: use sse2 conv code for altivec	Oded Gabbay	2016-01-07	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In lp_build_conv() and lp_build_conv_auto(), there is a special case of conversion when sse2 is present. That code path is suitable without any changes to altivec, because all the functions that are called in that code path already support altivec. This patch increase the FPS in POWER arch across the board between 10%-25% I checked ipers, glxgears, glxspheres64, openarena, xonotic and glmark2. Signed-off-by: Oded Gabbay <[email protected]> Reviewed-by: Roland Scheidegger <[email protected]>
*	radeonsi: adjust the parameters of si_shader_dump	Marek Olšák	2016-01-07	3	-20/+11
\| \| \| \| \| \| \|	The function will be extended to dump all binaries shaders will consist of, so si_shader* makes sense here. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: move si_shader_dump call out of si_compile_llvm	Marek Olšák	2016-01-07	2	-2/+11
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: inline si_shader_binary_read	Marek Olšák	2016-01-07	3	-11/+3
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: move si_shader_dump call out of si_shader_binary_read	Marek Olšák	2016-01-07	3	-20/+21
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: separate shader dumping code to si_shader_dump and *_dump_stats	Marek Olšák	2016-01-07	1	-12/+30
\| \| \| \| \| \| \|	Eventually, I'd like to dump stats for several combined binaries, which is why you don't see a binary parameter in si_shader_dump_stats Reviewed-by: Nicolai Hähnle <[email protected]>