mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	glsl: implement double builtin functions	Dave Airlie	2015-02-19	1	-259/+492
\| \| \| \| \| \| \|	This implements the bulk of the builtin functions for fp64 support. Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/lower_instructions: add double lowering passes	Dave Airlie	2015-02-19	1	-0/+65
\| \| \| \| \| \| \|	This lowers double dot product and lrp to fma. Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: enable/disable certain lowering passes for doubles	Dave Airlie	2015-02-19	1	-3/+3
\| \| \| \| \| \| \| \|	We want to restrict some lowering passes to floats only, and enable other for doubles. Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: validate output types for shader stages	Tapani Pälli	2015-02-19	1	-0/+45
\| \| \| \| \| \| \| \| \| \| \|	Patch fixes Piglit test: arb_gpu_shader_fp64/preprocessor/fs-output-double.frag and adds additional validation for shader outputs. Signed-off-by: Tapani Pälli <[email protected]> Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: add double support to lower_mat_op_to_vec	Dave Airlie	2015-02-19	1	-0/+2
\| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: Linking support for doubles	Dave Airlie	2015-02-19	1	-1/+7
\| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: Support double loop control	Dave Airlie	2015-02-19	1	-2/+6
\| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: Support double inouts	Dave Airlie	2015-02-19	1	-4/+24
\| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/lexer: Support double floats	Dave Airlie	2015-02-19	1	-4/+27
\| \| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/parser: Support double floats	Dave Airlie	2015-02-19	1	-4/+29
\| \| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/ast: Support double floats	Dave Airlie	2015-02-19	4	-15/+90
\| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: Add ubo lowering support for doubles	Dave Airlie	2015-02-19	1	-24/+33
\| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: Add support doubles in optimization passes	Dave Airlie	2015-02-19	3	-4/+38
\| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/ir: Add builder support for functions with double floats	Dave Airlie	2015-02-19	2	-0/+28
\| \| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/ir: Add builtin constant function support for doubles	Dave Airlie	2015-02-19	1	-32/+215
\| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/ir: Add cloning support for doubles	Dave Airlie	2015-02-19	1	-0/+1
\| \| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/ir: Add printing support for doubles	Dave Airlie	2015-02-19	1	-0/+11
\| \| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl/ir: Add builtin function support for doubles	Dave Airlie	2015-02-19	3	-10/+195
\| \| \| \| \| \| \|	v2: add d2b, more ir_constant stuff (Ilia) Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: fix uniform linking logic in the presence of structs	Ilia Mirkin	2015-02-19	3	-31/+57
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add a enter/leave record callback so that the offset may be aligned to the proper value. Otherwise only leaf fields are called, and the first field needs to be aligned to the outer struct's base alignment while the last field needs to be aligned to the inner struct's base alignment. This removes most usage of the last field/record type values passed into visit_field. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Dave Airlie <[email protected]>
*	glsl: teach std140_base_alignment about samplers	Ilia Mirkin	2015-02-19	1	-0/+9
\| \| \| \| \| \| \| \| \|	These functions are about to be used more aggressively for determining uniform layout. Samplers may be inside of structs, and it's easier to reuse the existing base alignment logic. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Dave Airlie <[email protected]>
*	glsl: Uniform linking support for doubles	Dave Airlie	2015-02-19	1	-1/+6
\| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: Add double builtin type generation	Dave Airlie	2015-02-19	5	-23/+151
\| \| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: add ARB_gpu_shader_fp64 to the glsl extensions. (v2)	Dave Airlie	2015-02-19	4	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	v2: add define bit (Tapani Pälli) Patch makes following Piglit tests pass: arb_gpu_shader_fp64/preprocessor/define.vert arb_gpu_shader_fp64/preprocessor/define.frag Reviewed-by: Ian Romanick <[email protected]> Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	glsl: Add double builtin type	Dave Airlie	2015-02-19	1	-0/+9
\| \| \| \| \| \| \| \| \| \|	This causes a lot of warnings about unchecked type in switch statements - fix them later. Signed-off-by: Dave Airlie <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nir: Recognize and reduce duplicated fsats.	Eric Anholt	2015-02-18	1	-0/+2
\| \| \| \| \| \| \| \|	No effect on vc4 shader-db. v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add a flag for lowering fsat.	Eric Anholt	2015-02-18	2	-1/+3
\| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total instructions in shared programs: 44356 -> 44354 (-0.00%) instructions in affected programs: 55 -> 53 (-3.64%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add a flag for lowering ffma.	Eric Anholt	2015-02-18	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total uniforms in shared programs: 13966 -> 13791 (-1.25%) uniforms in affected programs: 435 -> 260 (-40.23%) total instructions in shared programs: 44732 -> 44356 (-0.84%) instructions in affected programs: 9599 -> 9223 (-3.92%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add a flag for lowering fneg/ineg.	Eric Anholt	2015-02-18	2	-0/+12
\| \| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total instructions in shared programs: 44911 -> 44732 (-0.40%) instructions in affected programs: 11371 -> 11192 (-1.57%) v2: Fix broken iabs(isub(0, a)) transformation. v3: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add a flag for lowering fsqrt(x) to frcp(frsqrt(x)).	Eric Anholt	2015-02-18	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total uniforms in shared programs: 13972 -> 13966 (-0.04%) uniforms in affected programs: 408 -> 402 (-1.47%) total instructions in shared programs: 44973 -> 44911 (-0.14%) instructions in affected programs: 1551 -> 1489 (-4.00%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add lowering of POW instructions if the lower flag is set.	Eric Anholt	2015-02-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	This could be done in a separate pass like we do in GLSL IR, but it seems to me like having the definitions of the transformations in the two directions next to each other makes a lot of sense. v2: Reorder the comment about the transformation. Reviewed-by: Kenneth Graunke <[email protected]>
*	nir: Conditionalize the POW reconstruction on shader compiler options.	Eric Anholt	2015-02-18	3	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	Mesa has a shader compiler struct flagging whether GLSL IR's opt_algebraic and other passes should try and generate certain types of opcodes or patterns. Extend that to NIR by defining our own struct, which is automatically generated from the Mesa struct in glsl_to_nir and provided directly by the driver in TGSI-to-NIR. v2: Split out the previous two prep patches. v3: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v2)
*	nir: Add an optional expression controlling nir_algebraic xforms.	Eric Anholt	2015-02-18	1	-7/+32
\| \| \| \| \| \| \| \| \| \| \| \|	This will be used so that we can customize the transforms for the target GPU, so we don't un-lower expressions that had already been lowered (or introduce new lowering transformations that not all GPUs want) v2: Drop the complication of having the condition->index dictionary, since we don't actually expect there to be many different conditions (change by Kenneth). Reviewed-by: Kenneth Graunke <[email protected]>
*	nir: Add a nir_shader_compiler_options struct pointed to by the shaders.	Eric Anholt	2015-02-18	3	-4/+38
\| \| \| \| \| \| \| \| \|	This will be used to give the optimization passes a chance to customize behavior for the particular target device. v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	Avoid fighting with Solaris headers over isnormal()	Alan Coopersmith	2015-02-17	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	When compiling in C99 or C++11 modes, Solaris defines isnormal() as a macro via <math.h>, which causes the function definition to become too mangled to compile. Signed-off-by: Alan Coopersmith <[email protected]> Cc: "10.5" <[email protected]> Reviewed-by: Emil Velikov <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	Remove extraneous ; after DECL_TYPE usage	Alan Coopersmith	2015-02-17	1	-33/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The macro is defined to provide a trailing ; so this caused the expansion to end in ";;" which made the Solaris Studio compilers issue warnings for every line of: "builtin_type_macros.h", line 113: Warning: extra ";" ignored. for every file that included the header, filling build logs with thousands of useless warnings. Signed-off-by: Alan Coopersmith <[email protected]> Cc: "10.5" <[email protected]> Reviewed-by: Emil Velikov <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	glsl: Reduce memory consumption of copy propagation passes.	Kenneth Graunke	2015-02-17	2	-6/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	opt_copy_propagation and opt_copy_propagation_elements create new ACP and Kill sets each time they enter a new control flow block. For if blocks, they also copy the entire existing ACP set contents into the new set. When we exit the control flow block, we discard the new sets. However, we weren't freeing them - so they lived on until the pass finished. This can waste a lot of memory (57MB on one pessimal shader). This patch makes the pass allocate ACP entries using this->acp as the memory context, and Kill entries out of this->kill. It also steals kill entries when moving them from the inner kill list to the parent. It then frees the lists, including their contents. v2: Move ralloc_free(this->acp) just before this->acp = orig_acp (suggested by Eric Anholt). Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Eric Anholt <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Cc: "10.5 10.4" <[email protected]>
*	glcpp: Silence GCC warning	Ian Romanick	2015-02-17	1	-1/+1
\| \| \| \| \| \| \| \| \|	glcpp/glcpp.c:124:1: warning: ‘static’ is not at beginning of declaration [-Wold-style-declaration] const static struct option ^ Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	glsl/tests: add IMAGE type.	Ilia Mirkin	2015-02-17	1	-0/+3
\| \| \| \| \| \| \| \|	This fixes a warning when running make check. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Dave Airlie <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	nir: Make gl_FrontFacing a system_value	Jason Ekstrand	2015-02-14	1	-2/+8
\| \| \| \| \| \| \| \| \|	GLSL IR labels gl_FrontFacing as an input variable and not a system value. This commit makes NIR silently translate gl_FrontFacing to a system value so that it properly gets translated into a load_system_value intrinsic. Reviewed-by: Connor Abbott <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	nir/lower_phis_to_scalar: Fix some logic in is_phi_scalarizable	Jason Ekstrand	2015-02-14	1	-3/+3
\| \| \| \| \|	Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	nir: add missing header to the sources list	Emil Velikov	2015-02-12	1	-0/+1
\| \| \| \| \| \| \|	Cc: "10.5" <[email protected]> Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Connor Abbott <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	nir: resolve nir.h dependency list (fix make distcheck)	Emil Velikov	2015-02-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Use nir/nir_opcodes.h as is (w/o the absolute path), as it is the target name used to generate the actual file. Otherwise the target is missing, the file won't get generated and the build will fail. Cc: "10.5" <[email protected]> Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	glsl: Optimize (f2i(trunc x)) into (f2i x).	Matt Turner	2015-02-11	1	-0/+9
\| \| \| \| \| \|	total instructions in shared programs: 5950326 -> 5949286 (-0.02%) instructions in affected programs: 88264 -> 87224 (-1.18%) helped: 692
*	glsl: Optimize round-half-up pattern.	Matt Turner	2015-02-11	1	-0/+33
\| \| \| \| \|	Hurts some Psychonauts shaders, but after the next patch (which this enables) they're fewer instructions than before this patch.
*	glsl: Add trunc() to ir_builder.	Matt Turner	2015-02-11	2	-0/+6
\|
*	nir: Recognize open-coded fmin/fmax.	Matt Turner	2015-02-11	1	-0/+2
\| \| \| \| \| \| \| \| \|	And unfortunately other shaders do the same thing but with >=/<= which we can't apply this optimization to because of NaNs. instructions in affected programs: 23309 -> 22938 (-1.59%) Reviewed-by: Kenneth Graunke <[email protected]>
*	nir: Add algebraic opt for int comparisons with identical operands.	Eric Anholt	2015-02-11	1	-0/+9
\| \| \| \| \| \| \| \| \|	No change on shader-db on i965. v2: Reword the comment due to feedback from Erik Faye-Lund Reviewed-by: Connor Abbott <[email protected]> (v1) Reviewed-by: Jason Ekstrand <[email protected]> (v1)
*	nir: Fix load_const comparisons for CSE.	Eric Anholt	2015-02-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We want the size of a float per component, not the size of a whole vec4. NIR instructions on i965: total instructions in shared programs: 1261937 -> 1261929 (-0.00%) instructions in affected programs: 114 -> 106 (-7.02%) Looking at one of these examples (tesseract), it's from vec4 load_consts for a MRT solid fill, which do get CSEed now that we don't memcmp off the end of the const value and into the SSA def. For the 1-component loads that are common in i965, we were only memcmping off into the rest of the usually zero-filled const_value. Reviewed-by: Connor Abbott <[email protected]>
*	glsl: Optimize 1/exp(x) into exp(-x).	Matt Turner	2015-02-10	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	Lots of shaders divide by exp2(...) which we turn into a multiplication by the reciprocal. We can avoid the reciprocal by simply negating exp2's argument. total instructions in shared programs: 5947154 -> 5946695 (-0.01%) instructions in affected programs: 118661 -> 118202 (-0.39%) helped: 380 Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	nir: Remove casts from void*.	Matt Turner	2015-02-10	4	-14/+13
\| \| \| \|	Reviewed-by: Connor Abbott <[email protected]>