mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	gallium: add CONSTBUF type to tgsi_file_type	Timothy Arceri	2017-09-15	1	-0/+1
\| \| \| \| \| \| \|	This will be use to distinguish between load types when using the TGSI_OPCODE_LOAD opcode. Reviewed-by: Marek Olšák <[email protected]>
*	gallium/{r600, radeonsi}: Fix segfault with color format (v2)	Denis Pauk	2017-09-14	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102552 v2: Patch cleanup proposed by Nicolai Hähnle. * deleted changes in si_translate_texformat. Cc: Nicolai Hähnle <[email protected]> Cc: Ilia Mirkin <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	radeonsi: optimize TCS epilog when invocation 0 writes tess factors	Marek Olšák	2017-09-11	1	-2/+0
\| \| \| \| \| \| \| \| \| \|	This removes the barrier and LDS stores and loads for tess factors when it's possible. The removal of the barrier seems more important to me though. In one shader, it removes 17 * 4 bytes from the shader binary. Reviewed-by: Nicolai Hähnle <[email protected]>
*	tgsi/scan: add a new pass that analyzes tess factor writes (v2)	Marek Olšák	2017-09-11	2	-0/+235
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pass tries to deduce whether tess factors are always written by all shader invocations. The implication for radeonsi is that it doesn't have to use a barrier near the end of TCS, and doesn't have to use LDS for passing the tess factors to the epilog. v2: Handle barriers and do the analysis pass for each code segment surrounded by barriers separately, and AND results from all such segments writing tess factors. The change is trivial in the main switch statement. Also, the result is renamed to "tessfactors_are_def_in_all_invocs" to make the name accurate. Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/u_blitter: use UTIL_BLITTER_ATTRIB_NONE (0) instead of 0 directly	Marek Olšák	2017-09-11	1	-2/+2
\| \| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]> Tested-by: Brian Paul <[email protected]>
*	gallium/u_blitter: don't pass GENERIC in VS if it's not needed	Marek Olšák	2017-09-11	1	-17/+45
\| \| \| \| \| \| \|	Now, depth-only clears and custom passes don't read memory in VS. Reviewed-by: Nicolai Hähnle <[email protected]> Tested-by: Brian Paul <[email protected]>
*	gallium/u_blitter: use draw_rectangle for all blits except cubemaps	Marek Olšák	2017-09-11	2	-88/+98
\| \| \| \| \| \| \|	Add ZW coordinates to the draw_rectangle callback and use it. Reviewed-by: Nicolai Hähnle <[email protected]> Tested-by: Brian Paul <[email protected]>
*	gallium/u_blitter: use draw_rectangle callback for layered clears	Marek Olšák	2017-09-11	2	-28/+29
\| \| \| \| \| \| \|	They are done with instancing. Reviewed-by: Nicolai Hähnle <[email protected]> Tested-by: Brian Paul <[email protected]>
*	gallium/u_blitter: add new union blitter_attrib to replace pipe_color_union	Marek Olšák	2017-09-11	2	-52/+53
\| \| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]> Tested-by: Brian Paul <[email protected]>
*	llvmpipe, draw: improve shader cache debugging	Roland Scheidegger	2017-09-09	2	-22/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With GALLIVM_DEBUG=perf set, output the relevant stats for shader cache usage whenever we have to evict shader variants. Also add some output when shaders are deleted (but not with the perf setting to keep this one less noisy). While here, also don't delete that many shaders when we have to evict. For fs, there's potentially some cost if we have to evict due to the required flush, however certainly shader recompiles have a high cost too so I don't think evicting one quarter of the cache size makes sense (and, if we're evicting based on IR count, we probably typically evict only very few or just one shader too). For vs, I'm not sure it even makes sense to evict more than one shader at a time, but keep the logic the same for now. Reviewed-by: Jose Fonseca <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	gallivm: fix gather implementation a bit	Roland Scheidegger	2017-09-09	1	-10/+48
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	gather is defined in terms of bilinear filtering, just without the filtering part. However, there's actually some subtle differences required in our implementation, because we use some tricks to simplify coord wrapping for the two coords per direction. For bilinear filtering, we don't care if we end up with an incorrect texel, as long as the filter weight is 0.0 for it. Likewise, the order of the texels doesn't actually matter (as long as they still have the correct filter weight). But for gather, these tricks lead to incorrect results. Fix this for CLAMP_TO_EDGE, and add some comments to the other wrap functions which look broken (the 3 mirror_clamp plus mirror_repeat) (too complex to fix right now, and noone really seems to care...). Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Jose Fonseca <[email protected]>
*	vl/compositor: make vl_compositor_set_yuv_layer() static	Leo Liu	2017-09-07	2	-44/+28
\| \| \| \| \| \| \|	Since it's no longer being called outside of compositor Signed-off-by: Leo Liu <[email protected]> Reviewed-by: Christian König <[email protected]>
*	vl/compositor: make a helper function for YUV deinterlacing	Leo Liu	2017-09-07	2	-0/+40
\| \| \| \| \| \| \| \|	The similar function is in OMX, and only used by OMX. Now have it moved to vl/compositor for other state tracker to use later. Signed-off-by: Leo Liu <[email protected]> Reviewed-by: Christian König <[email protected]>
*	llvmpipe, tgsi: hook up dx10 gather4 opcode	Roland Scheidegger	2017-09-07	2	-8/+25
\| \| \| \| \| \| \| \| \|	Trivial. We already support tg4 for legacy tex opcodes, so the actual texture sampling code already handles it. (Just like TG4, we don't handle additional capabilities and always sample red channel.) Reviewed-by: Jose Fonseca <[email protected]>
*	llvmpipe, draw: increase shader cache limits	Roland Scheidegger	2017-09-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We're not particularly concerned with memory usage, if the tradeoff is shader recompiles. And it's common for apps to have a lot of shaders nowadays (and, since our shaders include a LOT of context state of course we may create quite a bit more shaders even). So quadruple the amount of shaders draw will cache (from 128 to 512). For llvmpipe (fs shaders) quadruple the number of instructions, keep the number of variants the same for now (only with very simple, non-texturing shaders the variant limit could really be reached), and simplify the definition, it's probably easier to just have one different definition per branch... Reviewed-by: Jose Fonseca <[email protected]>
*	gallium/tests: always use two-dimensional constant references	Nicolai Hähnle	2017-09-04	1	-2/+2
\| \| \| \| \| \|	Acked-by: Roland Scheidegger <[email protected]> Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Timothy Arceri <[email protected]>
*	pp: always use two-dimensional constant references	Nicolai Hähnle	2017-09-04	1	-10/+10
\| \| \| \| \| \|	Acked-by: Roland Scheidegger <[email protected]> Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Timothy Arceri <[email protected]>
*	gallium/hud: always use two-dimensional constant references	Nicolai Hähnle	2017-09-04	1	-4/+4
\| \| \| \| \| \|	Acked-by: Roland Scheidegger <[email protected]> Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Timothy Arceri <[email protected]>
*	tgsi/build: always generate two-dimensional constant file accesses	Nicolai Hähnle	2017-09-04	2	-31/+45
\| \| \| \|	Reviewed-by: Timothy Arceri <[email protected]>
*	tgsi/ureg: always emit constants (and their decls) as 2D	Nicolai HÃ¤hnle	2017-09-04	1	-15/+7
\| \| \| \| \| \|	Acked-by: Roland Scheidegger <[email protected]> Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Timothy Arceri <[email protected]>
*	gallium: all drivers should accept two-dimensional constant buffer indexing	Nicolai Hähnle	2017-09-04	1	-1/+1
\| \| \| \| \| \| \| \| \|	Most older drivers seem to just ignore the Dimension setting, so virtually no changes should be needed. Acked-by: Roland Scheidegger <[email protected]> Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Timothy Arceri <[email protected]>
*	llvmpipe: lp_build_gather_elem_vec BE fix for 3x16 load	Ben Crocker	2017-09-01	1	-2/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix loading of a 3x16 vector as a single 48-bit load on big-endian systems (PPC64, S390). Roland Scheidegger's commit e827d9175675aaa6cfc0b981e2a80685fb7b3a74 plus Ray Strode's patch reduce pre-Roland Piglit failures from ~4000 to ~2000. This patch fixes three of the four regressions observed by Ray: - draw-vertices - draw-vertices-half-float - draw-vertices-half-float_gles2 One regression remains: - draw-vertices-2101010 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100613 Cc: "17.2" "17.1" <[email protected]> Signed-off-by: Ben Crocker <[email protected]> Reviewed-by: Roland Scheidegger <[email protected]>
*	gallivm: correct channel shift logic on big endian	Ray Strode	2017-09-01	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	lp_build_fetch_rgba_soa fetches a texel from a texture. Part of that process involves first gathering the element together from memory into a packed format, and then breaking out the individual color channels into separate, parallel arrays. The code fails to account for endianess when reading the packed values. This commit attempts to correct the problem by reversing the order the packed values are read on big endian systems. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100613 Cc: "17.2" "17.1" <[email protected]> Signed-off-by: Ray Strode <[email protected]> Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium/u_threaded: rename IGNORE_VALID_RANGE -> NO_INFER_UNSYNCHRONIZED	Marek Olšák	2017-08-28	2	-4/+4
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/u_threaded: disallow discard_range if map_buffer is unsynchronized	Marek Olšák	2017-08-28	1	-1/+3
\| \| \| \| \| \| \|	The discard range codepath takes precedence, so if we get both unsynchronized and discard_range, choose unsynchronized. Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/vbuf: fix buffer reference bugs	Brian Paul	2017-08-25	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In two places we called pipe_resource_reference() to remove a reference to a vertex buffer resource. But we neglected to check if the buffer was a user buffer and not a pipe_resource. This caused us to pass an invalid pipe_resource pointer to pipe_resource_reference(). Instead of calling pipe_resource_reference(&vbuf->resource, NULL), use pipe_vertex_buffer_unreference(&vbuf) which checks the is_user_buffer field and does the right thing. Also, explicity set the is_user_buffer field to false after setting the vbuf->resource pointer to out_buffer. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102377 Reviewed-by: Marek Olšák <[email protected]> Tested-by: Bruce Cherniak <[email protected]>
*	gallium/u_threaded: fix a typo	Marek Olšák	2017-08-25	1	-1/+1
\|
*	gallivm: remove unused variable	Brian Paul	2017-08-24	1	-2/+0
\| \| \| \|	Trivial.
*	pipe-loader: use MAYBE_UNUSED to silence warning	Brian Paul	2017-08-24	1	-1/+2
\| \| \| \|	Trivial.
*	tgsi: store opcode mnemonics in a separate table	Nicolai Hähnle	2017-08-23	2	-6/+16
\| \| \| \| \| \| \| \| \| \|	They are only used for debug info. Together with making tgsi_opcode_info::opcode a bitfield, this reduces the size of tgsi_opcode_info on 64-bit systems from 24 bytes to 4 bytes, and makes the whole data structure a bit more linker friendly. Reviewed-by: Marek Olšák <[email protected]>
*	gallium: use tgsi_get_opcode_name instead of tgsi_opcode_info::mnemonic	Nicolai Hähnle	2017-08-23	5	-7/+10
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	tgsi: macro-ify the opcodes table	Nicolai Hähnle	2017-08-23	3	-251/+263
\| \| \| \| \| \| \| \| \|	So we can easily re-arrange members of tgsi_opcode_info, and readers of the code don't have to guess what all the 0s mean. Mostly done with regex search&replace. Reviewed-by: Marek Olšák <[email protected]>
*	tgsi: remove post_indent from some 64-bit opcodes	Nicolai Hähnle	2017-08-23	1	-6/+6
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	tgsi: reduce tgsi_opcode_info::pre_dedent and post_indent to 1 bit	Nicolai Hähnle	2017-08-23	1	-2/+2
\| \| \| \| \| \| \| \|	It's not clear why they were ever 2 bits to begin with. Perhaps the original intent was to use signed values, but that doesn't seem to have ever been the case in master. Reviewed-by: Marek Olšák <[email protected]>
*	util: fix valgrind errors when dumping pipe_draw_info	Nicolai Hähnle	2017-08-23	1	-3/+9
\| \| \| \| \| \| \|	Various index-related fields are only initialized when required, so they should only be dumped in those cases. Reviewed-by: Marek Olšák <[email protected]>
*	tgsi/scan: fix uses_double	Marek Olšák	2017-08-22	1	-2/+8
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium: remove TGSI opcode SCS	Marek Olšák	2017-08-22	9	-164/+1
\| \| \| \| \| \| \|	use COS+SIN instead. Reviewed-by: Roland Scheidegger <[email protected]> Acked-by: Jose Fonseca <[email protected]>
*	gallium/u_blitter: don't use boolean, TRUE, FALSE	Marek Olšák	2017-08-22	2	-73/+73
\| \| \| \| \| \| \|	v2: cherry-picked from the bigger patch series Reviewed-by: Nicolai Hähnle <[email protected]> (v1) Tested-by: Brian Paul <[email protected]>
*	gallium/u_simple_shaders: do util_make_layered_clear_vertex_shader differently	Marek Olšák	2017-08-22	3	-25/+19
\| \| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]> Tested-by: Brian Paul <[email protected]>
*	gallium/u_blitter: remove get_next_surface_layer callback	Marek Olšák	2017-08-22	2	-15/+1
\| \| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]> Tested-by: Brian Paul <[email protected]>
*	gallium: remove TGSI opcode BREAKC	Marek Olšák	2017-08-22	7	-71/+2
\| \| \| \| \|	Reviewed-by: Roland Scheidegger <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium: remove TGSI opcode XPD	Marek Olšák	2017-08-22	8	-200/+1
\| \| \| \| \| \|	use MUL+MAD+MOV instead. Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium: remove TGSI opcode DPH	Marek Olšák	2017-08-22	9	-105/+6
\| \| \| \| \| \|	use DP4 or DP3 + ADD. Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium: remove TGSI opcode DP2A	Marek Olšák	2017-08-22	9	-111/+2
\| \| \| \| \| \|	use DP3 instead. Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium: remove TGSI_OPCODE_CALLNZ	Marek Olšák	2017-08-22	5	-5/+1
\| \| \| \|	Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium: remove TGSI FENCE opcodes	Marek Olšák	2017-08-22	1	-3/+3
\| \| \| \| \| \|	use MEMBAR instead Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium: remove TGSI opcodes PUSHA, POPA, SAD, TXQ_LZ	Marek Olšák	2017-08-22	8	-51/+4
\| \| \| \|	Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium/util: add new module that allocate "numbers"	Samuel Pitoiset	2017-08-22	3	-0/+160
\| \| \| \| \| \| \| \|	Will be used for allocating bindless descriptor slots for RadeonSI. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	util/log: add auto logger facility	Nicolai HÃ¤hnle	2017-08-22	2	-2/+75
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	util: add chunk logging module	Nicolai Hähnle	2017-08-22	3	-0/+263
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>