mesa.git

	Commit message	Author	Age	Files	Lines
*	i965: Remove i915 chip names.	Kenneth Graunke	2013-07-09	1	-1/+0
\| \| \| \| \| \| \| \| \|	i915 chipsets shouldn't ever hit this driver. Signed-off-by: Kenneth Graunke <[email protected]> Acked-by: Chris Forbes <[email protected]> Acked-by: Paul Berry <[email protected]> Acked-by: Anuj Phogat <[email protected]>
*	i965: Replace intel_context:needs_ff_sync with intel->gen == 5.	Kenneth Graunke	2013-07-09	6	-14/+8
\| \| \| \| \| \| \| \| \| \| \| \| \|	Technically, needs_ff_sync was set on Gen5+, but it was only consulted in the clipper threads and quad/lineloop decomposition code, which are both Gen4-5 only. So in reality it only identified Ironlake. The named flag doesn't really clarify things, and seems like overkill. Signed-off-by: Kenneth Graunke <[email protected]> Acked-by: Chris Forbes <[email protected]> Acked-by: Paul Berry <[email protected]> Acked-by: Anuj Phogat <[email protected]>
*	i965: Add missing newline to blorp color clear perf_debug message.	Kenneth Graunke	2013-07-09	1	-1/+1
\| \| \| \| \| \| \|	perf_debug() doesn't add a newline for you; without this, all the INTEL_DEBUG=perf output was jumbled together. Signed-off-by: Kenneth Graunke <[email protected]>
*	glsl: Silence unused variable warning in the release build	Emil Velikov	2013-07-08	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Resolves the following gcc warning: opt_flip_matrices.cpp:84:32: warning: unused variable 'deref'. v2: keep the variable, but wrap it in an ifndef NDEBUG block (suggested by Ian). Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
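The pattern described in the message above, a variable that exists only to feed an assert() and is therefore unused once NDEBUG is defined, might be sketched roughly as follows. This is not the code from opt_flip_matrices.cpp; the function last_element() and the variable is_valid are hypothetical names chosen for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not Mesa code): returns the last element of a
 * non-empty array. */
static int last_element(const int *values, int count)
{
#ifndef NDEBUG
   /* Only referenced by the assert below. In a release build (NDEBUG
    * defined) the assert expands to nothing, so the declaration is
    * compiled out as well and gcc has no unused variable to warn about. */
   const int is_valid = (values != NULL) && (count > 0);
#endif
   assert(is_valid);

   return values[count - 1];
}
```

Building this with and without -DNDEBUG should produce no unused-variable warning in either configuration.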
*	glsl/ast: Silence uninitialized variable warnings in the release build	Emil Velikov	2013-07-08	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Resolves the following gcc warnings: warning: 'iface_type_name' may be used uninitialized in this function; warning: 'var_mode' may be used uninitialized in this function. Note: The variables are initialised to UNKNOWN and ir_var_auto. Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Add an assertion to brwProgramStringNotify.	Paul Berry	2013-07-08	1	-2/+16
\| \| \| \| \| \| \| \| \| \| \| \|	driver->ProgramStringNotify is only called for ARB programs, fixed function vertex programs, and ir_to_mesa (which isn't used by the i965 back-end). Therefore, even after geometry shaders are added, brwProgramStringNotify should only ever be called with a target of GL_VERTEX_PROGRAM_ARB or GL_FRAGMENT_PROGRAM_ARB. This patch adds an assertion to clarify that. Reviewed-by: Kenneth Graunke <[email protected]>
*	glsl: Allow non-constant expression initializers of const-qualified vars.	Matt Turner	2013-07-08	1	-11/+19
\| \| \| \| \| \|	Required by ARB_shading_language_420pack. Reviewed-by: Kenneth Graunke <[email protected]>
*	r600g: improve the mechanism for recognizing an empty CS	Marek Olšák	2013-07-08	3	-3/+8
\| \| \| \|	Reviewed-by: Alex Deucher <[email protected]>
*	r600g: explicitly flush caches for streamout-based buffer copying & clearing	Marek Olšák	2013-07-08	1	-0/+13
\| \| \| \| \| \| \|	It's done automatically for vertex buffers, but not for constant buffers, textures, and colorbuffers. Reviewed-by: Alex Deucher <[email protected]>
*	r600g: only flush the caches that need to be flushed during CP DMA operations	Marek Olšák	2013-07-08	3	-32/+117
\| \| \| \| \| \| \|	This should increase performance if constant uploads are done with the CP DMA, because only the cache that needs to be flushed is flushed. Reviewed-by: Alex Deucher <[email protected]>
*	r600g: split INVAL_READ_CACHES into vertex, tex, and const cache flags	Marek Olšák	2013-07-08	5	-27/+52
\| \| \| \| \| \| \|	also flushing any cache in evergreen_emit_cs_shader seems to be superfluous (we don't flush caches when changing the other shaders either) Reviewed-by: Alex Deucher <[email protected]>
*	r600g: adjust flush flags (v3)	Alex Deucher	2013-07-08	6	-7/+42
\| \| \| \| \| \| \| \| \| \| \| \| \|	1. flush SH with read caches 2. add flag for DB flushes 3. add flag for CB flushes v2: flush all CBs, remove redundant emit_state variable. v3: Marek: also set the new flags in r600_context_flush, the CP dma functions, and texture_barrier, and rename them Signed-off-by: Marek Olšák <[email protected]> Reviewed-by: Alex Deucher <[email protected]>
*	r600g: don't call buffer_wait in buffer_mmap_sync_with_rings	Marek Olšák	2013-07-08	1	-2/+1
\| \| \| \| \| \| \| \|	The winsys should do this, because it measures how much time we spend in buffer_map doing synchronization, which can be viewed with the gallium HUD. Reviewed-by: Alex Deucher <[email protected]>
*	r600g: don't read back the MSAA depth buffer if the read flag is not set	Marek Olšák	2013-07-08	1	-8/+8
\| \| \| \|	Reviewed-by: Alex Deucher <[email protected]>
*	r600g: don't flush the context in texture_transfer_map	Marek Olšák	2013-07-08	1	-5/+0
\| \| \| \| \| \|	the winsys does this automatically Reviewed-by: Alex Deucher <[email protected]>
*	r600g: fix texture offset computation for mapped MSAA depth buffers	Marek Olšák	2013-07-08	2	-16/+14
\| \| \| \| \| \| \| \| \|	It was wrong, because the offset shouldn't be applied to MSAA depth buffers. This small cleanup should prevent such issues in the future. This fixes a lockup in "piglit/fbo-depthstencil default_fb -samples=n". Reviewed-by: Alex Deucher <[email protected]>
*	r600g: fix color resolve for RGBX8 and RGBX16 integer formats	Marek Olšák	2013-07-08	1	-2/+2
\| \| \| \|	Reviewed-by: Alex Deucher <[email protected]>
*	r600g: enable fast MSAA color clear for array/3D/cube textures	Marek Olšák	2013-07-08	1	-4/+3
\| \| \| \|	Reviewed-by: Alex Deucher <[email protected]>
*	r600g: implement fast MSAA color clear for integer textures	Marek Olšák	2013-07-08	1	-9/+12
\| \| \| \| \| \| \|	this also fixes the fast clear with multiple colorbuffers and each having a different format Reviewed-by: Alex Deucher <[email protected]>
*	r600/uvd: fix check for UVD 2.x	Christian König	2013-07-08	1	-1/+1
\| \| \| \|	Signed-off-by: Christian König <[email protected]>
*	i965: fix alpha test for MRT	Chris Forbes	2013-07-06	4	-10/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Include src0 alpha in the RT write message when using MRT, so it is used for the alpha test instead of the normal per-RT alpha value. Fixes broken rendering in Dota2 under Wine [FDO #62647]. No Piglit regressions on Ivybridge. V2: reuse (and simplify) existing sample_alpha_to_coverage flag in the FS key, rather than adding another redundant one. Signed-off-by: Chris Forbes <[email protected]> Reviewed-by: Paul Berry <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=62647 NOTE: This is a candidate for the stable branches.
*	gallivm: (trivial) fix using one lod instead of per-quad lod for texel fetch	Roland Scheidegger	2013-07-05	1	-1/+2
\| \| \| \| \| \|	The logic for choosing number of lods was bogus. (The code should ultimately handle the case of only one lod even with multiple quads but currently can't.)
*	gallivm: Remove bogus assert.	José Fonseca	2013-07-05	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \|	It is perfectly valid for the swizzle to be bigger than 2. For example, the texel offsets could be SAMPLE ..., IMM[0].zzz. What is not correct is for chan_index to be bigger than 2. Trivial.
*	nvc0: enable very initial support for nvf0 (GK110)	Ben Skeggs	2013-07-05	5	-5/+76
\| \| \| \| \| \| \|	Shaders need a lot of work still. Basic stuff generally works, so this is basically just fine for gnome-shell, OA etc at this point. Signed-off-by: Ben Skeggs <[email protected]>
*	gallivm: (trivial) fix bogus assertion for per-element lod with 1d resources	Roland Scheidegger	2013-07-05	2	-2/+1
\| \| \| \| \| \|	The assertion was always broken, but the code was unused until the per-element lod code was enabled. Fixes piglit texelFetch vs isampler1D and similar tests (only run with GL 3.0 version override).
*	gallivm: do per-pixel lod calculations for explicit lod	Roland Scheidegger	2013-07-04	10	-126/+195
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	d3d10 requires per-pixel lod calculations for explicit lod, lod bias and explicit derivatives, and we should probably do it for OpenGL too - at least if they are used from vertex or geometry shaders (so this doesn't apply to lod bias); this doesn't just affect neighboring pixels. Some code was already there to handle this, so fix it up and enable it. There will no doubt be a performance hit, unfortunately; we could do better if we knew we had a real vector shift instruction (with variable shift count), but this requires AVX2 on x86 (or an AMD Bulldozer family CPU). Don't do anything for lod bias and explicit derivatives yet, though no special magic should be needed for them either. Likewise, the size query is still broken just the same. v2: Use the information whether lod is a (broadcast) scalar or not. The idea would be to base this on the actual value; for now just pretend it's a scalar in fs and not a scalar otherwise (so per-pixel lod is only used in gs/vs, but the same code is generated for fs as before). Reviewed-by: Jose Fonseca <[email protected]>
*	draw: fix overflows in the indexed rendering paths	Zack Rusin	2013-07-03	4	-43/+159
\| \| \| \| \| \| \| \| \| \| \| \| \|	The semantics for overflow detection are a bit tricky with indexed rendering. If the base index in the elements array overflows, then the index of the first element should be used; if the index with bias overflows, then it should be treated like a normal overflow. Also, overflows need to be checked for in all paths that use either the bias or the starting index location. Signed-off-by: Zack Rusin <[email protected]> Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Roland Scheidegger <[email protected]>
*	draw/llvm: index overflows if it's greater than elt max	Zack Rusin	2013-07-03	1	-1/+1
\| \| \| \| \| \| \| \| \|	The comparison, incorrectly, was greater-than-or-equal to elt max. Signed-off-by: Zack Rusin <[email protected]> Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Roland Scheidegger <[email protected]>
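The off-by-one described in the message above can be illustrated with a small sketch. It assumes, as the message implies, that an index equal to elt max is still valid. The function index_overflows() and its parameters are hypothetical; the real check is emitted as LLVM IR by the draw module and is not reproduced here.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper (not Mesa code). elt_max is assumed to be the
 * largest index that may still be fetched. */
static bool index_overflows(uint32_t index, uint32_t elt_max)
{
   /* Strictly greater-than: the earlier greater-than-or-equal test
    * would wrongly reject index == elt_max. */
   return index > elt_max;
}
```

Under that assumption, index_overflows(5, 5) is false and index_overflows(6, 5) is true.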
*	i965: Move the rest of intel_tex_layout.c into brw_tex_layout.c.	Kenneth Graunke	2013-07-03	6	-191/+102
\| \| \| \| \| \| \| \| \| \| \|	The texture alignment unit functions are called from brw_tex_layout.c, so it makes sense to put them there. Since the only caller of intel_get_texture_alignment_unit() is in brw_tex_layout.c, it could be made into a static function. However, this patch instead simply folds it into the caller, as it's only two lines anyway. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Push intel_get_texture_alignment_unit call into brw_miptree_layout	Kenneth Graunke	2013-07-03	2	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	intel_miptree_create_layout() calls intel_get_texture_alignment_unit() and then immediately calls brw_miptree_layout(). There are no other callers. intel_get_texture_alignment_unit() populates the miptree's alignment unit fields, which are used by brw_miptree_layout() to determine where to place each miplevel. Since brw_miptree_layout() needs those to be present, it makes sense to have it initialize them as the first step. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Declare for-loop counters in the loop in brw_tex_layout.c.	Kenneth Graunke	2013-07-03	1	-11/+7
\| \| \| \| \| \| \| \|	The driver is compiled in C99 mode, so this is not a problem. It's slightly tidier. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Remove use of GLuint/GLint in brw_tex_layout.c.	Kenneth Graunke	2013-07-03	1	-19/+19
\| \| \| \| \| \| \|	Using GL types is silly; this isn't even remotely API-facing. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Tidy the brw_tex_layout.c copyright and file header comments.	Kenneth Graunke	2013-07-03	1	-34/+31
\| \| \| \| \| \| \| \|	This uses Doxygen style for the file comments, and generally makes it more consistent with the rest of the driver. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Move i945_texture_layout_2d to brw_tex_layout.c	Kenneth Graunke	2013-07-03	3	-71/+72
\| \| \| \| \| \| \|	This consolidates the miptree layout logic in a single file. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Remove fallthrough for Gen4 cube map layout.	Kenneth Graunke	2013-07-03	1	-9/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now that both 2DArray and Cube layouts are taken care of by helper functions, it's easy to just call the right function for each generation. This is a little cleaner than falling through. This also reworks the comments. Referencing "Volume 1" of the BSpec isn't very helpful, since that's only available inside Intel, and it doesn't even use volume numbers. Also, "Ironlake...finally" sounds a bit strange considering that almost all hardware uses the 2D array approach. At this point, Gen4 is the only special case. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Combine GL_TEXTURE_CUBE_MAP_ARRAY case with the other array cases.	Kenneth Graunke	2013-07-03	1	-5/+2
\| \| \| \| \| \| \|	These do the exact same thing; combining them is tidier. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Pull 3D texture layout code out into a helper function.	Kenneth Graunke	2013-07-03	1	-77/+82
\| \| \| \| \| \| \|	A bit cleaner than having it in one giant function. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965: Replace maxBatchSize variable with BATCH_SZ define.	Kenneth Graunke	2013-07-03	4	-5/+3
\| \| \| \| \| \| \| \|	maxBatchSize was only ever initialized to BATCH_SZ, and a few places used BATCH_SZ directly anyway. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Move annotate_aub out of the vtable.	Kenneth Graunke	2013-07-03	3	-5/+2
\| \| \| \| \| \| \| \|	brw_annotate_aub() is the only implementation of this function, so it makes sense to just call it directly. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Move debug_batch hook out of the vtable.	Kenneth Graunke	2013-07-03	3	-4/+2
\| \| \| \| \| \| \| \|	brw_debug_batch() is the only implementation of this function, so it makes sense to just call it directly. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Remove render_target_supported from the vtable.	Kenneth Graunke	2013-07-03	5	-6/+3
\| \| \| \| \| \| \| \| \| \| \| \|	brw_render_target_supported() is the only implementation of this function, so it makes sense to just call it directly. Rather than adding an #include of brw_wm.h, this patch moves the prototype to brw_context.h. Prototypes seem to be in rather arbitrary places at the moment, and either place seems as good as the other. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Move is_hiz_depth_format out of the vtable.	Kenneth Graunke	2013-07-03	6	-31/+26
\| \| \| \| \| \| \| \|	brw_is_hiz_depth_format() is the only implementation of this function, so it makes sense to just call it directly. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Remove the invalidate_state() vtable hook.	Kenneth Graunke	2013-07-03	3	-12/+0
\| \| \| \| \| \| \|	The hook was a noop. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Replace fprintfs with assertions in GLenum comparison translators.	Kenneth Graunke	2013-07-03	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	These functions translate GLenum comparison operations into the hardware enumerations. They should never be passed something other than a GL comparison operator, or something is very broken. Assertions seem more appropriate than fprintf. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
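A rough sketch of the approach described above, where a translator asserts on an impossible input instead of printing a message. The enums api_compare and hw_compare and the function translate_compare() are hypothetical stand-ins; they are not the GLenum values or the hardware enumerations from brw_defines.h.

```c
#include <assert.h>

/* Hypothetical API-side and hardware-side enumerations. */
enum api_compare { API_NEVER, API_LESS, API_EQUAL, API_ALWAYS };
enum hw_compare  { HW_ALWAYS, HW_NEVER, HW_LESS, HW_EQUAL };

/* Translate an API comparison operator into the hardware encoding. */
static enum hw_compare translate_compare(enum api_compare func)
{
   switch (func) {
   case API_NEVER:  return HW_NEVER;
   case API_LESS:   return HW_LESS;
   case API_EQUAL:  return HW_EQUAL;
   case API_ALWAYS: return HW_ALWAYS;
   }

   /* Callers should only ever pass a valid operator, so reaching this
    * point indicates a bug elsewhere. Fail loudly in debug builds. */
   assert(!"Invalid comparison function.");
   return HW_ALWAYS;
}
```

In a release build the assert compiles away and the fallback return value is used, so the sketch still returns a defined value for an unexpected input.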
*	i965: Replace intel_state.c enums with those from brw_defines.h.	Kenneth Graunke	2013-07-03	3	-102/+46
\| \| \| \| \| \| \| \| \| \| \| \|	Both intel_context.h and brw_defines.h have #defines for comparison functions, stencil ops, blending logic ops, and blending factors. They're exactly the same values, so it makes sense to pick one. brw_defines.h is the logical place for this kind of stuff, so this patch converts intel_state.c to use the set defined there. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Delete pre-DRI2.3 viewport hacks.	Kenneth Graunke	2013-07-03	3	-25/+1
\| \| \| \| \| \| \| \| \| \|	The __DRI_USE_INVALIDATE extension was added on May 11th, 2010 by commit 4258e3a2e1c327. At this point, it's unlikely that anyone's using the right mix of new and old components to hit this path. Deleting it removes an untested code path and cleans up the driver a bit. Cc: Kristian Høgsberg <[email protected]> Cc: Keith Packard <[email protected]>
*	i965: Remove "There are probably better ways" comment.	Kenneth Graunke	2013-07-03	1	-5/+0
\| \| \| \| \| \| \|	There are always better ways to do things. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Delete brw_print_reg() function.	Kenneth Graunke	2013-07-03	3	-99/+0
\| \| \| \| \| \| \| \| \| \|	This wasn't called from anywhere; presumably it was used to examine brw_regs when debugging shader assembly. However, it prints registers in a different notation than brw_disasm.c which everyone is used to...which means I doubt anyone will want to use it. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Move contents of intel_clear.h to intel_context.h.	Kenneth Graunke	2013-07-03	4	-40/+2
\| \| \| \| \| \| \| \| \|	Having a header file for a single prototype seems rather excessive. Plus, the actual function is in brw_clear.c, not intel_clear.c, so there isn't even the .c/.h filename symmetry one might expect. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	i965: Move contents of intel_extensions.h to intel_context.h.	Kenneth Graunke	2013-07-03	4	-37/+3
\| \| \| \| \| \| \| \|	Having an entire header file for a single prototype seems a bit excessive. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>