mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	st/clover: Add event to deps even if it has been triggered	Niels Ole Salscheider	2013-08-26	1	-1/+1
\| \| \| \| \| \| \| \|	The command is submitted once the event has been triggered, but it might not have completed yet. Therefore, we have to add it to deps in order to wait on it. Signed-off-by: Niels Ole Salscheider <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	st/clover: Profiling support	Niels Ole Salscheider	2013-08-26	3	-18/+142
\| \| \| \| \|	Signed-off-by: Niels Ole Salscheider <[email protected]> Acked-by: Francisco Jerez <[email protected]>
*	tgsi_build: fix order of arguments for ind register build	Dave Airlie	2013-08-27	1	-1/+1
\| \| \| \| \| \| \|	This was broken when arrayid was added. Reviewed-by: Brian Paul <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	tgsi: finish declaration parsing for arrays.	Dave Airlie	2013-08-27	1	-1/+31
\| \| \| \| \| \| \| \| \|	I previously fixed this partly in 9e8400f4c95bde1f955c7977066583b507159a10, however I didn't go far enough in testing it, now when I parse a TGSI shader with arrays in it my iterator can see the ArrayID set to the proper value. Reviewed-by: Brian Paul <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	svga: replace 0 with PIPE_OK in a few places	Brian Paul	2013-08-26	3	-5/+5
\|
*	swrast: init i0, i1 values to silence warnings	Brian Paul	2013-08-26	1	-0/+1
\| \| \| \|	Reviewed-by: Chad Versace <[email protected]>
*	mesa: init dst values in COPY_CLEAN_4V_TYPE_AS_FLOAT()	Brian Paul	2013-08-26	1	-1/+2
\| \| \| \| \| \|	to silence gcc 4.8.1 warnings. And improve the ASSERT(0) call. Reviewed-by: Chad Versace <[email protected]>
*	glsl: init limit=0 to silence uninitialized var warning	Brian Paul	2013-08-26	1	-1/+1
\| \| \| \|	Reviewed-by: Paul Berry <[email protected]>
*	i965/vs: Allocate register set once at context creation.	Kenneth Graunke	2013-08-26	3	-4/+6
\| \| \| \| \| \| \| \| \| \|	Now that we use a fixed set of register classes, we can set up the register set and conflict graphs once, at context creation, rather than on every VS compile. This is obviously less expensive, and also what we already do in the FS backend. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Paul Berry <[email protected]>
*	i965/vs: Move base_reg_count computation to brw_alloc_reg_set().	Kenneth Graunke	2013-08-26	1	-3/+4
\| \| \| \| \| \| \| \|	We're soon going to be calling brw_alloc_reg_set() from outside of the visitor, where we don't have the precomputed "max_grf" variable handy. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Paul Berry <[email protected]>
*	i965/vs: Expose the payload registers to the register allocator.	Kenneth Graunke	2013-08-26	2	-6/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For now, nothing else can get allocated over them. That may change at some point in the future. This also means that base_reg_count can be computed without knowing the number of registers used for the payload, which is required if we want to allocate the register set once at context creation time. See commit 551e1cd44f6857f7e29ea4c8f892da5a97844377, which implemented virtually identical code in the FS backend. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Paul Berry <[email protected]>
*	i965/vs: Use a fixed set of register classes.	Kenneth Graunke	2013-08-26	1	-37/+13
\| \| \| \| \| \| \| \| \| \| \|	Arrays, structures, and matrices use large VGRFs of arbitrary sizes. However, split_virtual_grfs() breaks those down into VGRFs of size 1. For reference, commit 5d90b988791e51cfb6413109271ad102fd7a304c is the analogous change to the FS backend. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Paul Berry <[email protected]>
*	i965: Allow C++ type safety in the use of enum brw_urb_write_flags.	Paul Berry	2013-08-26	7	-11/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(From a suggestion by Francisco Jerez) If an enum represents a bitfield of flags, e.g.: enum E { A = 1, B = 2, C = 4, D = 8, }; then C++ normally prohibits statements like this: enum E x = A \| B; because A and B are implicitly converted to ints before OR-ing them, and an int can't be stored in an enum without a type cast. C, on the other hand, allows an int to be implicitly converted to an enum without casting. In the past we've dealt with this situation by storing flag bitfields as ints. This avoids ugly casting at the expense of some type safety that C++ would normally have offered (e.g. we get no warning if we accidentally use the wrong enum type). However, we can get the best of both worlds if we override the \| operator. The ugly casting is confined to the operator overload, and we still get the benefit of C++ making sure we don't use the wrong enum type. v2: Remove unnecessary comment and unnecessary use of "enum" keyword. Use static_cast. Reviewed-by: Chad Versace <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	i965: Remove redundant (and uninitialized) field vec4_generator::ctx.	Paul Berry	2013-08-26	2	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	We never noticed that this field was uninitialized because it is only used in an error path that reports internal Mesa errors. But it's silly to have it around anyway because &brw->ctx is equivalent. Should fix Coverity defect CID 1063351: Uninitialized pointer field (UNINIT_CTOR) /src/mesa/drivers/dri/i965/brw_vec4_emit.cpp: 148 Reviewed-by: Ian Romanick <[email protected]>
*	i965: Don't try to fall back when creating unrecognized program targets.	Paul Berry	2013-08-26	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If brwNewProgram is asked to create a program for an unrecognized target, don't bother falling back on _mesa_new_program(). That just hides bugs. Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Anuj Phogat <[email protected]> v2: Use assert() rather than _mesa_problem(). Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	radeonsi: Also set the depth component mask bit for stencil-only exports	Michel Dänzer	2013-08-26	1	-1/+4
\| \| \| \| \| \| \| \|	The stencil values come out wrong without this for some reason. 50 more little piglits. Cc: [email protected]
*	glsl: Add built-in function prototypes for GLSL 3.30	Kenneth Graunke	2013-08-25	2	-0/+1169
\| \| \| \| \| \| \| \|	330.frag is a direct copy of 150.frag. 330.glsl is 150.glsl combined with ARB_shader_bit_encoding.glsl. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	glsl: Bump standalone compiler versions to 3.30.	Kenneth Graunke	2013-08-25	2	-3/+3
\| \| \| \| \| \| \|	These are necessary in order to compile the built-in functions. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	mesa: Set query->EverBound in glQueryCounter().	Kenneth Graunke	2013-08-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	glIsQuery is supposed to return false for names returned by glGenQueries until their first use. BeginQuery is a use, but QueryCounter is also a use. From the ARB_timer_query spec: "A timer query object is created with the command void QueryCounter(uint id, enum target); [...] If <id> is an unused query object name, the name is marked as used [...]" Fixes Piglit's spec/ARB_timer_query/query-lifetime. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Chad Versace <[email protected]> Cc: [email protected]
*	r600g: Implement the new float comparison instructions for Cayman as well.	Henri Verbeet	2013-08-25	1	-4/+4
\| \| \| \| \| \| \| \|	I assume this should have been part of commit 7727fbb7c5d64348994bce6682e681d6181a91e9. This (obviously) fixes a lot tests. Signed-off-by: Henri Verbeet <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	nv30: add forgotten PIPE_CAP_CUBE_MAP_ARRAY cap to list	Ilia Mirkin	2013-08-25	1	-0/+1
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Cc: "9.2" <[email protected]>
*	nouveau/video: avoid overwriting base codec init with template	Ilia Mirkin	2013-08-25	2	-2/+2
\| \| \| \| \| \| \| \| \| \|	Commit 53e20b8b introduced the use of a template to initialize some common fields. Move this copying of fields to before the common vp3 fields are initialized. Reported-by: Martin Peres <[email protected]> Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Christian König <[email protected]>
*	freedreno/a3xx: don't leak so much	Rob Clark	2013-08-24	1	-0/+11
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: fix SGT/SLT/etc	Rob Clark	2013-08-24	1	-29/+125
\| \| \| \| \| \| \| \| \| \|	The cmps.f.* instruction doesn't actually seem to give a float 1.0 or 0.0 output. It either needs a cov.u16f16 or add.s + sel.f16. This makes SGT/SLT/etc more similar to CMP, so handle them in trans_cmp(). This fixes a bunch of piglit tests. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: bit of re-arrange/cleanup	Rob Clark	2013-08-24	1	-61/+71
\| \| \| \| \| \| \| \|	It seems there are a number of cases where instructions have limitations about taking reading src's from const register file, so make get_unconst() a bit easier to use. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: make compiler errors more useful	Rob Clark	2013-08-24	2	-17/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We probably should get rid of assert() entirely, but at this stage it is more useful for things to crash where we can catch it in a debugger. With compile_error() we have a single place to set an error flag (to bail out and return an error on the next instruction) so that will be a small change later when enough of the compiler bugs are sorted. But re-arrange/cleanup the error/assert stuff so we at least get a dump of the TGSI that triggered it. So we see some useful output in piglit logs. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: fix segfault when no color buffer bound	Rob Clark	2013-08-24	7	-18/+40
\| \| \| \| \| \| \|	Don't crash when no color buffer bound. Something caught when starting to run piglit, fixes a hanful of piglit tests. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: cat4 cannot use const reg as src	Rob Clark	2013-08-24	1	-10/+27
\| \| \| \| \| \| \| \| \| \| \| \| \|	Category 4 instructions (rsq, rcp, sqrt, etc) seem to be unable to take a const register as src. In these cases we need to move the src to a temporary gpr first. This is the second case of such a restriction, where the instruction encoding appears to support a const src, but in fact the hw appears to ignore that bit. So split things out into a helper that can be re-used for any instructions which have this limitation. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: use max_reg rather than file_count	Rob Clark	2013-08-24	1	-7/+7
\| \| \| \| \| \| \| \| \| \|	Our current (rather naive) register assignment is based on mapping different register files (INPUT, OUTPUT, TEMP, CONST, etc) based on the max register index of the preceding file. But in some cases, the lowest used register in a file might not be zero. In which case file_count[file] != file_max[file] + 1. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: handle saturate on dst	Rob Clark	2013-08-24	1	-0/+49
\| \| \| \| \| \| \| \|	Sometimes things other than color dst need saturating, like if there is a 'clamp(foo, 0.0, 1.0)'. So for saturated dst add the extra instructions to fix up dst. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: fix CMP	Rob Clark	2013-08-24	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The 1st src to add.s needs (r) flag (repeat), otherwise it will end up: add.s dst.xyzw, tmp.xxxx -1 instead of: add.s dst.xyzw, tmp.xyzw, -1 Also, if we are using a temporary dst to avoid clobbering one of the src registers, we actually need to use that as the dst for the sel instruction. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: some texture fixes	Rob Clark	2013-08-24	1	-1/+24
\| \| \| \| \| \|	Stop hard coding bits that indicate texture type (2d/3d/cube/etc). Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update register headers	Rob Clark	2013-08-24	8	-111/+758
\| \| \| \| \| \|	resync w/ rnndb database Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add debug option to disable scissor optimization	Rob Clark	2013-08-24	3	-14/+22
\| \| \| \| \| \|	Useful for testing and debugging. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: fix viewport on gmem->mem resolve	Rob Clark	2013-08-24	1	-0/+8
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: fix color inversion on mem->gmem restore	Rob Clark	2013-08-24	1	-3/+3
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	radeonsi: Handle additional PIPE_COMPUTE_CAP_*	Niels Ole Salscheider	2013-08-23	1	-1/+14
\| \| \| \| \| \| \| \| \| \| \|	This patch adds support for: PIPE_COMPUTE_CAP_MAX_INPUT_SIZE PIPE_COMPUTE_CAP_MAX_LOCAL_SIZE Return the values reported by the closed source driver for now. Signed-off-by: Niels Ole Salscheider <[email protected]> Reviewed-by: Tom Stellard <[email protected]>
*	radeonsi: copy r600_get_timestamp	Niels Ole Salscheider	2013-08-23	1	-0/+9
\| \| \| \| \|	Signed-off-by: Niels Ole Salscheider <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: Implement PIPE_QUERY_TIMESTAMP	Niels Ole Salscheider	2013-08-23	4	-2/+46
\| \| \| \| \|	Signed-off-by: Niels Ole Salscheider <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	gallivm: fix min/mag switchover point for nearest/none mip filter	Roland Scheidegger	2013-08-23	5	-66/+81
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, the min/mag switchover point when using nearest/none mip filter was effectively -0.5 which can't be right. Looks like new OpenGL thinks it's ok if it's always 0.0 (older versions required 0.5 in some cases), let's hope everybody else thinks that's fine too. Refactor this slightly and get the per-quad/per-pixel min/mag decision values further down to sampling, though still only the first component is used yet. While here also fix code trying to skip lod bias application etc. when mipfilter is none, as this is still needed for determining min/mag filter. Reviewed-by: Jose Fonseca <[email protected]>
*	gallium/osmesa: Link, not copy, the shared library to the LIB_DIR.	Jon Severinsson	2013-08-23	1	-1/+1
\| \| \| \| \| \| \|	Just like all other mesa libraries... CC: "9.2" <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	gallium/osmesa: Always link with the c++ linker.	Jon Severinsson	2013-08-23	1	-9/+2
\| \| \| \| \| \| \|	Just like all other gallium targets... CC: "9.2" <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	gallium/osmesa: Make and install an osmesa.pc.	Jon Severinsson	2013-08-23	3	-3/+15
\| \| \| \| \| \| \| \| \|	As of "2f142d59 build: Add --enable-gallium-osmesa flag." the pkgconfig file from classic osmesa is no longer installed when building gallium osmesa, so copy it to gallium osmesa and install the copy instead. CC: "9.2" <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965/gs: Add a data structure for tracking VS output VUE map.	Paul Berry	2013-08-23	2	-1/+16
\| \| \| \| \| \|	Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965/vec4: Make a function for setting up vec4 program key clip info.	Paul Berry	2013-08-23	2	-6/+24
\| \| \| \| \| \| \| \|	This functionality will need to be reused by geometry shaders. Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965: Make prim_to_hw_prim accessible outside brw_draw.c.	Paul Berry	2013-08-23	2	-1/+3
\| \| \| \| \| \| \| \| \|	We will need access to this array in order to configure the geometry shader. Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965/gs: add GS visitors.	Paul Berry	2013-08-23	3	-0/+351
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch introduces the vec4_gs_visitor class, which translates geometry shaders from GLSL IR to back-end opcodes. This class is derived from vec4_visitor (which is also the base class for vec4_vs_visitor), so as a result most of the back end code is shared. The only parts that differ are: - Geometry shaders use a different input payload organization, since the inputs need to match up with the outputs of the previous pipeline stage (vec4_gs_visitor::setup_payload() and vec4_gs_visitor::setup_varying_inputs()). - Geometry shader input array dereferences need a special stride computation, since all geometry shader inputs are interleaved into one giant array (vec4_gs_visitor::compute_array_stride()). - There are no geometry shader system values (vec4_gs_visitor::make_reg_for_system_value()). - At the beginning of a geometry shader, extra data in R0 needs to be zeroed out, and a vertex counter needs to be initialized (vec4_gs_visitor::emit_prolog()). - When EmitVertex() appears in the shader, the current contents of output variables need to be emitted to the URB, and the vertex counter needs to be incremented (vec4_gs_visitor::visit(ir_emit_vertex )). - When generating a URB_WRITE message to output vertex data, the current state of the vertex counter needs to be used to store a write offset in the message header (vec4_gs_visitor::emit_urb_write_header()). - The URB_WRITE message that outputs vertex data needs to be sent using GS_OPCODE_URB_WRITE, since VS_OPCODE_URB_WRITE would overwrite the offsets in the message header (vec4_gs_visitor::emit_urb_write_opcode()). - At the end of a geometry shader, the final vertex count needs to be delivered using a URB WRITE message (vec4_gs_visitor::emit_thread_end()). - EndPrimitive() functionality is not implemented yet (vec4_gs_visitor::visit(ir_end_primitive )). - There is no support for assembly shaders (vec4_gs_visitor::emit_program_code()). v2: Make num_input_vertices const. Refer to registers as rN rather than gN, for consistency with the PRM. Fix misspelling. Improve comment in the ir_emit_vertex visitor explaining why we emit vertices inside a conditional. Enclose the conditional code in the ir_emit_vertex visitor between curly braces. Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965/gs: Add GS_OPCODE_SET_DWORD_2_IMMED.	Paul Berry	2013-08-23	4	-0/+28
\| \| \| \| \| \|	Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965/gs: Add GS_OPCODE_SET_VERTEX_COUNT.	Paul Berry	2013-08-23	4	-0/+45
\| \| \| \| \| \|	Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965/gs: Add GS_OPCODE_SET_WRITE_OFFSET.	Paul Berry	2013-08-23	4	-0/+57
\| \| \| \| \| \| \| \|	v2: Added a comment to vec4_generator::generate_gs_set_write_offset(). Reviewed-by: Ian Romanick <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>