mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	st/wgl: Clamp wglChoosePixelFormatARB's output nNumFormats to nMaxFormats.	José Fonseca	2014-07-29	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	While running https://github.com/nvMcJohn/apitest with apitrace I noticed that Mesa was producing bogus results: wglChoosePixelFormatARB(hdc, piAttribIList = {...}, pfAttribFList = &0, nMaxFormats = 1, piFormats = {19, 65576, 37, 198656, 131075, 0, 402653184, 0, 0, 0, 0, -573575710}, nNumFormats = &12) = TRUE However https://www.opengl.org/registry/specs/ARB/wgl_pixel_format.txt states <nNumFormats> returns the number of matching formats. The returned value is guaranteed to be no larger than <nMaxFormats>. Cc: "10.2" <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	gallium/radeon: Add some Emacs .dir-locals.el files	Michel Dänzer	2014-07-29	4	-0/+45
\| \| \| \| \| \|	Based on the toplevel one but adapted to the driver/winsys coding styles. Reviewed-by: Marek Olšák <[email protected]>
*	ilo: fix fb height of HiZ ops	Chia-I Wu	2014-07-29	1	-1/+1
\| \| \| \| \|	It was set to aligned width. It appears to be fine on GEN7+, but causes random hangs on GEN6.
*	glapi: add indexed blend functions (GL 4.0)	Tapani Pälli	2014-07-28	2	-5/+31
\| \| \| \| \| \| \| \| \| \| \|	This makes some of the UE4 engine demos (Stylized, Mobile Temple) render correctly, tested on Intel Haswell machine. Signed-off-by: Tapani Pälli <[email protected]> Acked-by: Anuj Phogat <[email protected]> Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=78716
*	r600g,radeonsi: switch all occurences of array_size to util_max_layer	Marek Olšák	2014-07-28	3	-6/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes 3D texture support in all these cases, because array_size is 1 with 3D textures and depth0 actually contains the "array size". util_max_layer is universal and returns the last layer index for any texture target. A lot of the cases below can't actually be hit with 3D textures, but let's be consistent. This fixes a failure in: piglit layered-rendering/clear-color-all-types 3d single_level for r600g and radeonsi, which was caused by an incorrect CMASK size calculation. Cc: [email protected] Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: fix occlusion queries on Hawaii	Marek Olšák	2014-07-28	1	-1/+7
\| \| \| \| \| \| \|	This was just a guess - and it worked! Cc: [email protected] Reviewed-by: Alex Deucher <[email protected]>
*	winsys/radeon: fix vram_size overflow with Hawaii	Marek Olšák	2014-07-28	3	-5/+7
\| \| \| \| \| \| \|	This fixes piglit spec/!OpenGL 3.1/minmax. Cc: [email protected] Reviewed-by: Alex Deucher <[email protected]>
*	radeonsi: fix a hang with streamout on Hawaii	Marek Olšák	2014-07-28	2	-1/+14
\| \| \| \| \| \| \| \| \| \| \|	I actually couldn't reproduce this one, but internal docs recommend this workaround. Better safe than sorry. Also, the number of dwords for the sync packets is increased by 4 instead of 2, because it wasn't bumped last time when a new packet was added there. Cc: [email protected] Reviewed-by: Alex Deucher <[email protected]>
*	radeonsi: fix a hang with instancing on Hawaii	Marek Olšák	2014-07-28	1	-1/+15
\| \| \| \| \| \| \|	This fixes "piglit/bin/arb_transform_feedback2-draw-auto instanced". Cc: [email protected] Reviewed-by: Alex Deucher <[email protected]>
*	gallium/util: add a helper for calculating primitive count from vertex count	Marek Olšák	2014-07-28	1	-0/+15
\| \| \| \| \| \|	This is needed by the following commit which is a candidate for stable too. Cc: [email protected]
*	radeonsi: fix CMASK and HTILE calculations for Hawaii	Marek Olšák	2014-07-28	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	This fixes the checkerboard pattern in glxgears and anything that triggers fast color clear. num_channels is always <= 8, but Hawaii has 16 pipes. Cc: [email protected] Reviewed-by: Alex Deucher <[email protected]>
*	r600g,radeonsi: add debug flags which disable tiling	Marek Olšák	2014-07-28	3	-13/+24
\| \| \| \|	Reviewed-by: Alex Deucher <[email protected]>
*	gallium: rename shader cap MAX_CONSTS to MAX_CONST_BUFFER_SIZE	Marek Olšák	2014-07-28	18	-39/+40
\| \| \| \| \| \| \| \| \| \|	This new name isn't so confusing. I also changed the gallivm limit, because it looked wrong. Reviewed-by: Brian Paul <[email protected]> v2: use sizeof(float[4])
*	r600g: switch SNORM conversion to DX and GLES behavior	Marek Olšák	2014-07-28	4	-7/+0
\| \| \| \| \| \| \| \| \|	it also matches GL 4.2 further discussion: http://lists.freedesktop.org/archives/mesa-dev/2013-August/042680.html Cc: [email protected]
*	util: Fix typo	Tom Stellard	2014-07-28	1	-1/+1
\| \| \| \|	Spotted by okias on IRC.
*	ilo: correctly propagate resource renames to hardware	Chia-I Wu	2014-07-28	3	-14/+30
\| \| \| \| \|	Not only should we mark states dirty when the underlying resource is renamed, we should also update the CSO bo when available.
*	ilo: add ilo_resource_get_bo() helper	Chia-I Wu	2014-07-28	2	-17/+18
\| \| \| \|	We will need it in the following commit.
*	radeonsi: Use util_memcpy_cpu_to_le32()	Tom Stellard	2014-07-28	2	-19/+8
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	util: Add util_memcpy_cpu_to_le32() v3	Tom Stellard	2014-07-28	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \|	v2: - Preserve word boundaries. v3: - Use const and restrict. - Fix indentation. Reviewed-by: Michel Dänzer <[email protected]>
*	clover: Add checks for image support to the image functions v2	Tom Stellard	2014-07-28	3	-0/+12
\| \| \| \| \| \| \| \| \| \|	Most image functions are required to return a CL_INVALID_OPERATION error when used on devices without image support. v2: - Simplified the code Reviewed-by: Francisco Jerez <[email protected]>
*	r600g/compute: Add debug information to promote and demote functions	Bruno Jiménez	2014-07-28	1	-4/+11
\| \| \| \| \| \| \|	v2: Add information about the item's starting point and size v3: Rebased on top of master Reviewed-by: Tom Stellard <[email protected]>
*	r600g/compute: Add documentation to compute_memory_pool	Bruno Jiménez	2014-07-28	2	-31/+86
\| \| \| \| \| \|	v2: Rebased on top of master Reviewed-by: Tom Stellard <[email protected]>
*	ilo: unblock an inline write with a staging bo	Chia-I Wu	2014-07-28	1	-13/+31
\| \| \| \|	This should allow a deeper pipeline.
*	ilo: try unblocking a transfer with a staging bo	Chia-I Wu	2014-07-28	4	-19/+209
\| \| \| \| \| \| \|	When mapping a busy resource with PIPE_TRANSFER_DISCARD_RANGE or PIPE_TRANSFER_FLUSH_EXPLICIT, we can avoid blocking by allocating and mapping a staging bo, and emit pipelined copies at proper places. Since the staging bo is never bound to GPU, we give it packed layout to save space.
*	ilo: enable persistent and coherent transfers	Chia-I Wu	2014-07-28	3	-8/+35
\| \| \| \|	Enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT and reorder caps a bit.
*	ilo: drop ptr from ilo_transfer	Chia-I Wu	2014-07-28	2	-35/+36
\| \| \| \| \|	With the recent clean-ups, we can pass the mapped pointer around between functions cleanly. Drop it to make ilo_transfer smaller.
*	ilo: s/TRANSFER_MAP_UNSYNC/TRANSFER_MAP_GTT_UNSYNC/	Chia-I Wu	2014-07-28	2	-6/+6
\| \| \| \| \|	It maps to drm_intel_gem_bo_map_unsynchronized(), which results in unsynchronized GTT mapping.
*	ilo: drop unused context param from transfer functions	Chia-I Wu	2014-07-28	1	-115/+100
\| \| \| \|	Many of the transfer functions do not need an ilo_context. Drop it.
*	ilo: tidy up transfer mapping/unmapping	Chia-I Wu	2014-07-28	1	-88/+89
\| \| \| \| \| \|	Add xfer_map() to replace map_bo_for_transfer(). Add xfer_unmap() and xfer_alloc_staging_sys() to simplify texture and buffer mapping/unmapping, and enable more code sharing between them.
*	ilo: tidy up choose_transfer_method()	Chia-I Wu	2014-07-28	1	-84/+164
\| \| \| \| \| \|	Add a bunch of helper functions and a big comment for choose_transfer_method(). This also fixes handling of PIPE_TRANSFER_MAP_DIRECTLY to not ignore tiling.
*	ilo: free transfers with util_slab_free()	Chia-I Wu	2014-07-28	1	-1/+1
\| \| \| \|	We used FREE() in one of the error path.
*	clover: Add clUnloadPlatformCompiler.	EdB	2014-07-28	2	-1/+6
\| \| \| \|	Reviewed-by: Francisco Jerez <[email protected]>
*	clover: Add clCreateProgramWithBuiltInKernels.	EdB	2014-07-28	2	-1/+23
\| \| \| \| \| \| \|	[ Francisco Jerez: Check for devices not associated with the specified context. Style fix. ] Reviewed-by: Francisco Jerez <[email protected]>
*	glsl/cs: Add several GLSL compute shader variables	Jordan Justen	2014-07-27	1	-0/+6
\| \| \| \| \| \| \| \|	With MESA_EXTENSION_OVERRIDE=GL_ARB_compute_shader, this fixes piglit: built-in-constants tests/spec/arb_compute_shader/minimum-maximums.txt Signed-off-by: Jordan Justen <[email protected]> Reviewed-by: Chris Forbes <[email protected]>
*	main/cs: Add additional compute shader constant values	Jordan Justen	2014-07-27	2	-0/+18
\| \| \| \| \| \| \| \|	With MESA_EXTENSION_OVERRIDE=GL_ARB_compute_shader, this fixes piglit: * arb_compute_shader-minmax Signed-off-by: Jordan Justen <[email protected]> Reviewed-by: Chris Forbes <[email protected]>
*	glsl: No longer require ubo block index to be constant in ir_validate	Chris Forbes	2014-07-26	1	-1/+0
\| \| \| \| \| \|	Signed-off-by: Chris Forbes <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	glsl: Accept nonconstant array references in lower_ubo_reference	Chris Forbes	2014-07-26	1	-11/+32
\| \| \| \| \| \| \| \| \| \| \| \|	Instead of falling back to just the block name (which we won't find), look for the first element of the block array. We'll deal with the rest in the backend by arranging for the blocks to be laid out contiguously. V2: Squashed together patches 3, 5 of V1, plus a naming tweak. Signed-off-by: Chris Forbes <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	glsl: Convert uniform_block in lower_ubo_reference to ir_rvalue.	Chris Forbes	2014-07-26	1	-7/+8
\| \| \| \| \| \| \| \| \| \|	Previously this was a block index with special semantics for -1. With ARB_gpu_shader5, this need not be a compile-time constant, so allow any rvalue here and convert the -1 to a NULL pointer. Signed-off-by: Chris Forbes <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	glsl: Mark entire UBO array active if indexed with non-constant.	Chris Forbes	2014-07-26	1	-19/+31
\| \| \| \| \| \| \| \| \|	Without doing a lot more work, we have no idea which indices may be used at runtime, so just mark them all. Signed-off-by: Chris Forbes <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	glsl: Allow non-constant UBO array indexing with GLSL4/ARB_gpu_shader5.	Chris Forbes	2014-07-26	1	-1/+2
\| \| \| \| \| \|	Signed-off-by: Chris Forbes <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	ilo: simplify ilo_flush()	Chia-I Wu	2014-07-26	3	-20/+30
\| \| \| \|	Move fence creation to the new ilo_fence_create().
*	r600g/compute: Defrag the pool at the same time as we grow it	Bruno Jiménez	2014-07-25	2	-23/+19
\| \| \| \| \| \| \| \| \| \| \| \|	This allows us two things: we now need less item copies when we have to defrag+grow the pool (to just one copy per item) and, even in the case where we don't need to defrag the pool, we reduce the data copied to just the useful data that the items use. Note: The fallback path is a bit ugly now, but hopefully we won't need it much. Reviewed-by: Tom Stellard <[email protected]>
*	r600g/compute: Try to use a temporary resource when growing the pool	Bruno Jiménez	2014-07-25	1	-18/+43
\| \| \| \| \| \| \| \| \| \| \| \| \|	Now, before moving everything to host memory, we try to create a new resource to use as a pool. I we succeed we just use this resource and delete the previous one. If we fail we fallback to using the shadow. This should make growing the pool faster, and we can also save 64KB of memory that were allocated for the 'shadow', even if they weren't used. Reviewed-by: Tom Stellard <[email protected]>
*	freedreno: fix typo in gpu version check	Rob Clark	2014-07-25	1	-1/+1
\| \| \| \| \| \| \|	Opps, I should use larger fonts, I guess. Reported-by: Ilia Mirkin <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: split out shader compiler from a3xx	Rob Clark	2014-07-25	25	-477/+580
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Move the bits we want to share between generations from fd3_program to ir3_shader. So overall structure is: fdN_shader_stateobj -> ir3_shader -> ir3_shader_variant -> ir3 \|- ... \- ir3_shader_variant -> ir3 So the ir3_shader becomes the topmost generation neutral object, which manages the set of variants each of which generates, compiles, and assembles it's own ir. There is a bit of additional renaming to s/fd3_compiler/ir3_compiler/, etc. Keep the split between the gallium level stateobj and the shader helper object because it might be a good idea to pre-compute some generation specific register values (ie. anything that is independent of linking). Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: rename ir3_shader to ir3	Rob Clark	2014-07-25	12	-55/+55
\| \| \| \| \| \| \| \|	First step of reoganization split out compiler (so it can be shared between a3xx and a4xx). Rename ir3_shader -> ir3 (since we'll want the name ir3_shader for a higher level object). Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: scheduler vs pred reg	Rob Clark	2014-07-25	2	-3/+51
\| \| \| \| \| \| \|	The scheduler also needs to be aware of predicate register (p0) in addition to address register (a0). Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: little cleanups	Rob Clark	2014-07-25	4	-39/+19
\| \| \| \| \| \|	Remove some obsolete comments, rename deref->addr. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: enable/disable wa's based on patch-level	Rob Clark	2014-07-25	5	-9/+35
\| \| \| \| \| \| \| \|	It seems like for the most part, different behaviors, workarounds, etc, should be conditional on GPU patch revision (ie. a320.0 vs a320.2) rather than GPU id (a320 vs a330). Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx/compiler: make IR heap dyanmic	Rob Clark	2014-07-25	2	-8/+43
\| \| \| \| \| \| \| \| \| \| \| \|	The fixed size heap is a remnant of the fdre-a3xx assembler. Yet it is convenient for being able to free the entire data structure in one shot without worrying about leaking nodes. Change it to dynamically grow the heap size (adding chunks) as needed so we don't have an artificial upper limit on shader size (other than hw limits) and don't always have to allocate worst-case size. Signed-off-by: Rob Clark <[email protected]>