mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	nir: Add native_integers to nir_shader_compiler_options.	Kenneth Graunke	2015-03-08	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \|	glsl_to_nir, tgsi_to_nir, and prog_to_nir all want to know whether the driver supports native integers. Presumably other passes may as well. Adding this to nir_shader_compiler_options is an easy way to provide that information, as it's accessible via nir_shader::options. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: Delete nir_shader::user_structures and num_user_structures.	Kenneth Graunke	2015-03-08	1	-4/+0
\| \| \| \| \| \| \| \| \| \| \|	Nothing actually uses these, and the only caller of glsl_to_nir() (brw_fs_nir.cpp) always passes NULL for the _mesa_glsl_parse_state pointer, meaning they'll always be NULL and 0, respectively. Just delete them. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	nir/register: Add a parent_instr field	Jason Ekstrand	2015-02-24	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a parent_instr field similar to the one for ssa_def. The difference here is that the parent_instr field on a nir_register can be NULL if the register does not have a unique definition or if that definition does not dominate all its uses. We set this field in the out-of-SSA pass so that backends can get SSA-like information even after they have gone out of SSA. Reviewed-by: Connor Abbott <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	nir: Drop dependency on mtypes.h for core NIR.	Eric Anholt	2015-02-20	1	-1/+3
\| \| \| \| \| \| \| \|	One less new directory necessary for gallium code that wants to interact with NIR. Reviewed-by: Connor Abbott <[email protected]> Reviewed-by: Jose Fonseca <[email protected]>
*	util: Move Mesa's bitset.h to util/.	Eric Anholt	2015-02-20	1	-1/+1
\| \| \| \|	Reviewed-by: Jose Fonseca <[email protected]>
*	nir: Add a global code motion (GCM) pass	Jason Ekstrand	2015-02-19	1	-0/+2
\| \| \| \| \| \| \| \| \|	v2 Jason Ekstrand <[email protected]>: - Use nir_dominance_lca for computing least common anscestors - Use the block index for comparing dominance tree depths - Pin things that do partial derivatives Reviewed-by: Reviewed-by: Connor Abbott <[email protected]>
*	nir/instr: Change "live" to a more generic "pass_flags" field	Jason Ekstrand	2015-02-19	1	-2/+4
\| \| \| \|	Reviewed-by: Connor Abbott <[email protected]>
*	nir: Make nir_[cf_node/instr]_[prev/next] return null if at the end	Jason Ekstrand	2015-02-19	1	-6/+22
\| \| \| \|	Reviewed-by: Connor Abbott <[email protected]>
*	nir/dominance: Add a constant-time mechanism for comparing blocks	Jason Ekstrand	2015-02-19	1	-0/+9
\| \| \| \| \| \| \| \| \|	This is mostly thanks to Connor. The idea is to do a depth-first search that computes pre and post indices for all the blocks. We can then figure out if one block dominates another in constant time by two simple comparison operations. Reviewed-by: Connor Abbott <[email protected]>
*	nir/dominance: Expose the dominance intersection function	Jason Ekstrand	2015-02-19	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Being able to find the least common anscestor in the dominance tree is a useful thing that we may want to do in other passes. In particular, we need it for GCM. v2: Handle NULL inputs by returning the other block Reviewed-by: Connor Abbott <[email protected]>
*	nir: Add a flag for lowering fsat.	Eric Anholt	2015-02-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total instructions in shared programs: 44356 -> 44354 (-0.00%) instructions in affected programs: 55 -> 53 (-3.64%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add a flag for lowering ffma.	Eric Anholt	2015-02-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total uniforms in shared programs: 13966 -> 13791 (-1.25%) uniforms in affected programs: 435 -> 260 (-40.23%) total instructions in shared programs: 44732 -> 44356 (-0.84%) instructions in affected programs: 9599 -> 9223 (-3.92%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add a flag for lowering fneg/ineg.	Eric Anholt	2015-02-18	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total instructions in shared programs: 44911 -> 44732 (-0.40%) instructions in affected programs: 11371 -> 11192 (-1.57%) v2: Fix broken iabs(isub(0, a)) transformation. v3: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Add a flag for lowering fsqrt(x) to frcp(frsqrt(x)).	Eric Anholt	2015-02-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	vc4 cse/algebraic-disabled stats: total uniforms in shared programs: 13972 -> 13966 (-0.04%) uniforms in affected programs: 408 -> 402 (-1.47%) total instructions in shared programs: 44973 -> 44911 (-0.14%) instructions in affected programs: 1551 -> 1489 (-4.00%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Conditionalize the POW reconstruction on shader compiler options.	Eric Anholt	2015-02-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Mesa has a shader compiler struct flagging whether GLSL IR's opt_algebraic and other passes should try and generate certain types of opcodes or patterns. Extend that to NIR by defining our own struct, which is automatically generated from the Mesa struct in glsl_to_nir and provided directly by the driver in TGSI-to-NIR. v2: Split out the previous two prep patches. v3: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v2)
*	nir: Add a nir_shader_compiler_options struct pointed to by the shaders.	Eric Anholt	2015-02-18	1	-2/+13
\| \| \| \| \| \| \| \| \|	This will be used to give the optimization passes a chance to customize behavior for the particular target device. v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <[email protected]> (v1)
*	nir: Mark nir_print_instr's instr pointer as const.	Kenneth Graunke	2015-02-10	1	-1/+1
\| \| \| \| \| \| \| \|	Printing instructions doesn't modify them, so we can mark the parameter const. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: add an optimization to remove useless phi nodes	Connor Abbott	2015-02-03	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This removes phi nodes whose sources all point to the same thing. Shader-db results: total NIR instructions in shared programs: 2045293 -> 2041209 (-0.20%) NIR instructions in affected programs: 126564 -> 122480 (-3.23%) helped: 615 HURT: 0 total FS instructions in shared programs: 4321840 -> 4320392 (-0.03%) FS instructions in affected programs: 24622 -> 23174 (-5.88%) helped: 138 HURT: 0 Reviewed-by: Jason Ekstrand <[email protected]> Tested-by: Jason Ekstrand <[email protected]> Signed-off-by: Connor Abbott <[email protected]>
*	nir: Add a pass to lower vector phi nodes to scalar phi nodes	Jason Ekstrand	2015-02-03	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	v2 Jason Ekstrand <[email protected]>: - Add better comments - Use nir_ssa_dest_init and nir_src_for_ssa more places - Fix some void * casts v3 Jason Ekstrand <[email protected]>: - Rework the way we determine whether or not to sccalarize a phi node to make the recursion non-bogus - Treat load_const instructions as scalarizable v4 Jason Ekstrand <[email protected]>: - Allow uniform and input loads to be scalarizable v5 Jason Ekstrand <[email protected]>: - Also consider loads of inputs (varying, uniform, or ubo) to be scalarizable. We were already doing this for load_var on uniforms and inputs. Reviewed-by: Kenneth Graunke <[email protected]>
*	nir: Add an invalid type	Jason Ekstrand	2015-01-29	1	-0/+1
\| \| \| \| \| \|	This allows us to indicate a concept of an invalid type. Reviewed-by: Kenneth Graunke <[email protected]>
*	nir: add a helper function for getting the number of source components	Connor Abbott	2015-01-26	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Unlike with non-SSA ALU instructions, where if they're per-component you have to look at the writemask to know which source channels are being used, SSA ALU instructions always have all the possible channels enabled so we can just look at the number of components in the SSA definition for per-component instructions to say how many source components are being used. v2: use new name nir_ssa_alu_instr_src_components() Reviewed-by: Jason Ekstrand <[email protected]> Signed-off-by: Connor Abbott <[email protected]>
*	nir: Use pointers for nir_src_copy and nir_dest_copy	Jason Ekstrand	2015-01-26	1	-2/+2
\| \| \| \| \| \| \| \|	This avoids the overhead of copying structures and better matches the newly added nir_alu_src_copy and nir_alu_dest_copy. Reviewed-by: Eric Anholt <[email protected]> Reviewed-by: Connor Abbott <[email protected]>
*	nir: use Python to autogenerate opcode information	Connor Abbott	2015-01-24	1	-14/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Before, we used a system where a file, nir_opcodes.h, defined some macros that were included to generate the enum values and the nir_op_infos structure. This worked pretty well, but for development the error messages were never very useful, Python tools couldn't understand the opcode list, and it was difficult to use nir_opcodes.h to do other things like autogenerate a builder API. Now, we store opcode information in nir_opcodes.py, and we have nir_opcodes_c.py to generate the old nir_opcodes.c and nir_opcodes_h.py to generate nir_opcodes.h, which contains all the enum names and gets included into nir.h like before. In addition to solving the above problems, using Python and Mako to generate everything means that it's much easier to add keep information centralized as we add new things like constant propagation that require per-opcode information. v2: - make Opcode derive from object (Dylan) - don't use assert like it's a function (Dylan) - style fixes for fnoise, use xrange (Dylan) - use iterkeys() in nir_opcodes_h.py (Dylan) - use pydoc-style comments (Jason) - don't make fmin/fmax commutative and associative yet (Jason) Signed-off-by: Connor Abbott <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]> v3 Jason Ekstrand <[email protected]> - Alphabetize source file lists - Generate nir_opcodes.h in the builddir instead of the source dir - Include $(builddir)/src/glsl/nir in the i965 build - Rework nir_opcodes.h generation so it generates a complete header file instead of one that has to be embedded inside an enum declaration
*	nir: Expose nir_print_instr() for debug prints	Eric Anholt	2015-01-23	1	-0/+1
\| \| \| \| \| \| \| \| \|	It's nice to have this present in your default cases so you can see what instruction is triggering an abort. v2: Just pass a NULL state, now that it won't crash when you do. Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: Add nir_lower_alu_to_scalar.	Eric Anholt	2015-01-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the equivalent of brw_fs_channel_expressions.cpp, which I wanted for vc4. v2: Use the nir_src_for_ssa() helper, and another instance of nir_alu_src_copy(). v3: Drop the non-SSA support. All intended callers will have SSA-only ALU ops. v4: Use insert_before, drop stale bcsel/fcsel comment, drop now-unused unsupported() function, drop lower_context struct. v5: Completely rename the pass to nir_lower_alu_to_scalar(), add an assert about weird input_sizes[]. Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: Make some helpers for copying ALU src/dests.	Eric Anholt	2015-01-23	1	-0/+4
\| \| \| \| \| \| \| \| \|	There aren't many users yet, but I wanted to do this from my scalarizing pass. v2: Constify the src arguments. Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: Make an easier helper for setting up SSA defs.	Eric Anholt	2015-01-22	1	-0/+2
\| \| \| \| \| \| \| \|	Almost all instructions we nir_ssa_def_init() for are nir_dests, and you have to keep from forgetting to set is_ssa when you do. Just provide the simpler helper, instead. Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: Replace assert(0) with unreachable().	Matt Turner	2015-01-21	1	-2/+1
\| \| \| \| \| \|	Fixes a couple of warnings in the process. Reviewed-by: Connor Abbott <[email protected]>
*	nir: Add src and dest constructors	Jason Ekstrand	2015-01-21	1	-0/+37
\| \| \| \|	Reviewed-by: Connor Abbott <[email protected]>
*	nir: Add a nir_foreach_phi_src helper macro	Jason Ekstrand	2015-01-20	1	-0/+3
\| \| \| \|	Reviewed-by: Connor Abbott <cwabbott02gmail.com>
*	util: Move main/set to util/hash_set	Jason Ekstrand	2015-01-15	1	-1/+1
\| \| \| \| \|	Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	nir: Add a pass for lowering copy instructions	Jason Ekstrand	2015-01-15	1	-0/+3
\| \| \| \|	Reviewed-by: Connor Abbott <[email protected]>
*	nir: Rename lower_variables to lower_vars_to_ssa	Jason Ekstrand	2015-01-15	1	-1/+1
\| \| \| \| \| \| \| \|	The original name wasn't particularly descriptive. This one indicates that it actually gives you SSA values as opposed to the old pass which lowered variables to registers. Reviewed-by: Connor Abbott <[email protected]>
*	nir/tex_instr: Add a nir_tex_src struct and dynamically allocate the src array	Jason Ekstrand	2015-01-15	1	-10/+14
\| \| \| \| \| \| \| \|	This solves a number of problems. First is the ability to change the number of sources that a texture instruction has. Second, it solves the delema that may occur if a texture instruction has more than 4 sources. Reviewed-by: Connor Abbott <[email protected]>
*	nir/validate: Only build in debug mode	Jason Ekstrand	2015-01-15	1	-0/+4
\| \| \| \| \| \|	Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Connor Abbott <[email protected]>
*	nir: Make intrinsic flags into an enum	Jason Ekstrand	2015-01-15	1	-14/+14
\| \| \| \| \| \| \| \|	This should be much better for debugging as GDB will pick up on the fact that it's an enum and actually tell you what you're looking at instead of giving you some arbitrary hex value you have to go look up. Reviewed-by: Connor Abbott <[email protected]>
*	nir: Use static inlines instead of macros for list getters	Jason Ekstrand	2015-01-15	1	-28/+81
\| \| \| \| \| \| \| \|	This should make debugging a lot easier as GDB handles static inlines much better than macros. Also, static inlines are typesafe. Reviewed-By: Glenn Kennard <[email protected]> Reviewed-by: Connor Abbott <[email protected]>
*	nir/variable: Remove the constant_value field	Jason Ekstrand	2015-01-15	1	-12/+2
\| \| \| \| \| \| \| \| \|	This was a left-over relic of GLSL IR that we aren't using for anything. If we ever want that value again, we can add it back, but NIR constant folding should be just as good as GLSL IR's if not better pretty soon, so I'm not worried about it. Reviewed-by: Connor Abbott <[email protected]>
*	nir: Add some documentation	Jason Ekstrand	2015-01-15	1	-22/+69
\| \| \| \| \|	Signed-off-by: Jason Ekstrand <[email protected]> Reviewed-by: Connor Abbott <[email protected]>
*	nir: Rename parallel_copy_copy to parallel_copy_entry and add a foreach macro	Jason Ekstrand	2015-01-15	1	-2/+11
\| \| \| \| \| \| \| \| \| \|	parallel_copy_copy was a silly name. Also, things were getting long and annoying, so I added a foreach macro. For historical reasons, several of the original iterations over parallel copy entries in from_ssa used the _safe variants of the loop. However, all of these no longer ever remove an entry so it's ok to make them all use the normal iterator. Reviewed-by: Connor Abbott <[email protected]>
*	nir/from_ssa: Clean up parallel copy handling and document it better	Jason Ekstrand	2015-01-15	1	-7/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, we were doing a lazy creation of the parallel copy instructions. This is confusing, hard to get right, and involves some extra state tracking of the copies. This commit adds an extra walk over the basic blocks to add the block-end parallel copies up front. This should be much less confusing and, consequently, easier to get right. This commit also adds more comments about parallel copies to help explain what all is going on. As a consequence of these changes, we can now remove the at_end parameter from nir_parallel_copy_instr. Reviewed-by: Connor Abbott <[email protected]>
*	nir: Rename nir_block_following_if to nir_block_get_following_if	Jason Ekstrand	2015-01-15	1	-1/+1
\| \| \| \| \| \|	The new name is a little longer but less confusing. Reviewed-by: Connor Abbott <[email protected]>
*	nir/opcodes: Remove the per_component info field	Jason Ekstrand	2015-01-15	1	-18/+15
\| \| \| \| \| \| \| \| \| \| \|	Originally, this field was intended for determining if the given instruction acted per-component or if it had mismatching source and destination sizes that would have to be interpreted specially. However, we can easily derive this from output_size == 0, so it's not really that useful. Also, the values we were setting in nir_opcodes.h for this field were completely bogus and it was never used. Reviewed-by: Connor Abbott <[email protected]>
*	nir/opcodes: Add algebraic properties metadata	Jason Ekstrand	2015-01-15	1	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit adds some algebraic properties to the metadata of each opcode in NIR. In particular, you now know, just from the metadata, if a given opcode is commutative or associative. This will be useful for algebraic transformation passes that want to be able to match a + b as well as b + a in one go. v2: Make algebraic properties all caps. This was more consistent with the intrinsics flags and seems better for flags in general. Also, the enums are now declared with (1 << n) rather then hex values. v3: fmin and fmax technically aren't commutative or associative. Things get funny when one of the arguments is a NaN. Reviewed-by: Connor Abbott <[email protected]>
*	nir: Make load_const SSA-only	Jason Ekstrand	2015-01-15	1	-16/+4
\| \| \| \| \| \| \| \|	As it was, we weren't ever using load_const in a non-SSA way. This allows us to substantially simplify the load_const instruction. If we ever need a non-SSA constant load, we can do a load_const and an imov. Reviewed-by: Connor Abbott <[email protected]>
*	nir: Make nir_ssa_undef_instr_create initialize the destination	Jason Ekstrand	2015-01-15	1	-1/+2
\| \| \| \|	Reviewed-by: Connor Abbott <[email protected]>
*	nir: Add a foreach_ssa_def function	Jason Ekstrand	2015-01-15	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	There are some functions whose destinations are SSA-only and so aren't a nir_dest. This provides a function that is capable of iterating over the SSA definitions defined by those functions. If you want registers, you should use the old iterator. v2: Kenneth Graunke <[email protected]>: - Fix nir_foreach_ssa_def's return value. Reviewed-by: Connor Abbott <[email protected]>
*	nir: Remove predication	Jason Ekstrand	2015-01-15	1	-14/+0
\| \| \| \| \| \| \| \|	We stopped generating predicates in glsl_to_nir some time ago. Right now, it's all dead untested code that I'm not convinced always worked in the first place. If we decide we want them back, we can revert this patch. Reviewed-by: Connor Abbott <[email protected]>
*	nir/metadata: Rename metadata_dirty to metadata_preserve	Jason Ekstrand	2015-01-15	1	-1/+1
\| \| \| \| \| \| \| \| \|	nir_metadata_dirty was a terrible name because the parameter it takes is the metadata to be preserved. This is really confusing because it looks like it's doing the opposite of what it is actually doing. Now it's named sensibly. Reviewed-by: Connor Abbott <[email protected]>
*	nir/tex_instr: Rename the indirect source type and add an array size	Jason Ekstrand	2015-01-15	1	-1/+10
\| \| \| \| \| \| \| \| \|	In particular, we rename nir_tex_src_sampler_index to _sampler_offset and add a sampler_array_size field to nir_tex_instr. This way we can pass the size of sampler arrays through to backends even after removing the variable information and, with it, the type. Reviewed-by: Connor Abbott <[email protected]>