mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	i965/miptree: Use cpu tiling/detiling when mapping	Scott D Phillips	2018-05-25	1	-4/+98
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rename the (un)map_gtt functions to (un)map_map (map by returning a map) and add new functions (un)map_tiled_memcpy that return a shadow buffer populated with the intel_tiled_memcpy functions. Tiling/detiling with the cpu will be the only way to handle Yf/Ys tiling, when support is added for those formats. v2: Compute extents properly in the x\|y-rounded-down case (Chris Wilson) v3: Add units to parameter names of tile_extents (Nanley Chery) Use _mesa_align_malloc for the shadow copy (Nanley) Continue using gtt maps on gen4 (Nanley) v4: Use streaming_load_memcpy when detiling v5: (edited by Ken) Move map_tiled_memcpy above map_movntdqa, so it takes precedence. Add intel_miptree_access_raw, needed after rebasing on commit b499b85b0f2cc0c82b7c9af91502c2814fdc8e67. Reviewed-by: Chris Wilson <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i915: Fix streaming loads for intel_tiled_memcpy	Chris Wilson	2018-05-25	1	-5/+5
\| \| \| \| \| \| \| \|	We stream from a tiled and aligned source into an unaligned user buffer, so we need to use _mm_storeu_si128. Fixes: d21c086d819d78fb3f6abcbb14aa492970f442aa (i965/tiled_memcpy: inline movntdqa loads in tiled_to_linear) Reviewed-by: Kenneth Graunke <[email protected]>
*	intel/blorp: Support blits and clears on surfaces with offsets	Jason Ekstrand	2018-05-25	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	For certain EGLImage cases, we represent a single slice or LOD of an image with a byte offset to a tile and X/Y intratile offsets to the given slice. Most of i965 is fine with this but it breaks blorp. This is a terrible way to represent slices of a surface in EGL and we should stop some day but that's a very scary and thorny path. This gets blorp to start working with those surfaces and fixes some dEQP EGL test bugs. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106629 Cc: [email protected] Reviewed-by: Kenneth Graunke <[email protected]>
*	st/mesa: simplify lastLevel determination in st_finalize_texture	Marek Olšák	2018-05-25	1	-13/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes shader images where we always bind stObj->pt and not individual gl_texture_images. Roughly based on i965 commit 845ad2667ab2466752f06ea30bdb9c837116c308 which does a similar thing but for a different reason. This fixes GL CTS assertion failures introduced by Ilia. Cc: 18.0 18.1 <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	i965/tiled_memcpy: inline movntdqa loads in tiled_to_linear	Scott D Phillips	2018-05-25	4	-5/+88
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The reference for MOVNTDQA says: For WC memory type, the nontemporal hint may be implemented by loading a temporary internal buffer with the equivalent of an aligned cache line without filling this data to the cache. [...] Subsequent MOVNTDQA reads to unread portions of the WC cache line will receive data from the temporary internal buffer if data is available. This hidden cache line sized temporary buffer can improve the read performance from wc maps. v2: Add mfence at start of tiled_to_linear for streaming loads (Chris) Reviewed-by: Chris Wilson <[email protected]> Reviewed-by: Matt Turner <[email protected]> Acked-by: Kenneth Graunke <[email protected]>
*	mesa: do not leak ctx->Shader.ReferencedProgram references	Jose Dapena Paz	2018-05-25	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	When glUseProgram is used, references to the included shaders are added in ctx->Shader.ReferencedProgram. But those references are not decreased when the shader data is deallocated. Thus, those shaders are leaked. Explicitely remove the pending references to these shaders. Fixes: e6506b3cd23 ("mesa: retain gl_shader_programs after glDeleteProgram if they are in use") Reviewed-by: Timothy Arceri <[email protected]>
*	i965: enable OES_texture_view for gen8+	Tapani Pälli	2018-05-24	1	-1/+2
\| \| \| \| \|	Signed-off-by: Tapani Pälli <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	mesa: changes to expose OES_texture_view extension	Tapani Pälli	2018-05-24	5	-6/+17
\| \| \| \| \| \| \| \| \| \| \|	Functionality already covered by ARB_texture_view, patch also adds missing 'gles guard' for enums (added in f1563e6392). Tested via arb_texture_view.*_gles3 tests and individual app utilizing texture view with ETC2. Signed-off-by: Tapani Pälli <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Use intel_bufferobj_buffer() wrapper in image surface state setup.	Francisco Jerez	2018-05-23	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of directly using intel_obj->buffer. Among other things intel_bufferobj_buffer() will update intel_buffer_object:: gpu_active_start/end, which are used by glBufferSubData() to decide which path to take. Fixes a failure in the Piglit ARB_shader_image_load_store-host-mem-barrier Buffer Update/WaW tests, which could be reproduced with a non-standard glGetTexSubImage implementation (see bug report). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105351 Reported-by: Nanley Chery <[email protected]> Cc: [email protected] Reviewed-by: Nanley Chery <[email protected]>
*	i965: Handle non-zero texture buffer offsets in buffer object range calculation.	Francisco Jerez	2018-05-23	1	-1/+3
\| \| \| \| \| \| \| \| \| \|	Otherwise the specified surface state will allow the GPU to access memory up to BufferOffset bytes past the end of the buffer. Found by inspection. v2: Protect against out-of-range BufferOffset (Nanley). Cc: [email protected] Reviewed-by: Nanley Chery <[email protected]>
*	i965: Move buffer texture size calculation into a common helper function.	Francisco Jerez	2018-05-23	1	-23/+32
\| \| \| \| \| \| \| \| \| \| \| \| \|	The buffer texture size calculations (should be easy enough, right?) are repeated in three different places, each of them subtly broken in a different way. E.g. the image load/store path was never fixed to clamp to MaxTextureBufferSize, and none of them are taking into account the buffer offset correctly. It's easier to fix it all in one place. Cc: [email protected] Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106481 Reviewed-by: Nanley Chery <[email protected]>
*	Revert "mesa: simplify _mesa_is_image_unit_valid for buffers"	Francisco Jerez	2018-05-23	1	-13/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit c0ed52f6146c7e24e1275451773bd47c1eda3145. It was preventing the image format validation from being done on buffer textures, which is required to ensure that the application doesn't attempt to bind a buffer texture with an internal format incompatible with the image unit format (e.g. of different texel size), which is not allowed by the spec (it's not allowed for any texture target, whether or not there is spec wording restricting this behavior specifically for buffer textures) and will cause the driver to calculate texel bounds incorrectly and potentially crash instead of the expected behavior. Cc: [email protected] Reviewed-by: Marek Olšák <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106465 Reviewed-by: Nanley Chery <[email protected]>
*	i965: add {X,A}BGR2101010 to 'intel_image_formats'	Miguel Casas	2018-05-23	1	-0/+6
\| \| \| \| \| \| \| \| \|	This patch adds {X,A}BGR2101010 entries to the list of supported 'intel_image_formats'. Bug: https://crbug.com/776093 Reviewed-by: Chad Versace <[email protected]> Reviewed-by: Tapani Pälli <[email protected]>
*	dri_util: Add R10G10B10{A,X}2 translation between DRI and mesa_format.	Miguel Casas	2018-05-23	1	-0/+8
\| \| \| \| \| \| \| \| \|	Add R10G10B10{A,X}2 translation between mesa_format and DRI format to driGLFormatToImageFormat() and driImageFormatToGLFormat(). Bug: https://crbug.com/776093 Reviewed-by: Chad Versace <[email protected]> Reviewed-by: Tapani Pälli <[email protected]>
*	i965: Remove ring switching entirely	Jason Ekstrand	2018-05-22	11	-105/+61
\| \| \| \| \|	Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965/miptree: Move the access_raw call to the individual map functions	Jason Ekstrand	2018-05-22	1	-3/+13
\| \| \| \| \| \| \| \| \| \| \|	The only function that doesn't need to call access_raw is map_blit. If it takes the blitter path, it will happen as part of intel_miptree_copy. If map_blit takes the blorp path, brw_blorp_copy_miptrees will handle doing whatever resolves are needed. This should save us resolves in quite a few cases and will probably help performance a bit. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Remove support for the BLT ring	Jason Ekstrand	2018-05-22	1	-9/+3
\| \| \| \| \| \| \|	We still support the blitter on gen4-5 but it's on the same ring as 3D. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965/miptree: Use blorp for blit maps on gen6+	Jason Ekstrand	2018-05-22	1	-11/+25
\| \| \| \| \|	Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965/miptree: Use blorp for validation tex copies on gen6+	Jason Ekstrand	2018-05-22	1	-11/+29
\| \| \| \| \| \| \| \|	It's faster than the blitter and can handle things like stencil properly so it doesn't require software fallbacks. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Delete the blitter path for CopyTexSubImage	Jason Ekstrand	2018-05-22	1	-58/+0
\| \| \| \| \| \| \| \|	The blorp path (called first) can do anything the blitter path can do so it's just dead code. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Don't fall back to the blitter in BlitFramebuffer	Jason Ekstrand	2018-05-22	1	-8/+0
\| \| \| \| \| \| \| \| \|	On gen4-5, we try the blitter before we even try blorp. On newer platforms, blorp can do everything the blitter can so there's no point in even having the blitter fall-back path. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Remove some unused includes of intel_blit.h	Jason Ekstrand	2018-05-22	4	-4/+0
\| \| \| \| \|	Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965/blit: Delete intel_emit_linear_blit	Jason Ekstrand	2018-05-22	2	-62/+0
\| \| \| \| \| \| \|	This function is no longer used. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Use meta for pixel ops on gen6+	Jason Ekstrand	2018-05-22	3	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \|	Using meta for anything is fairly aweful and definitely has more CPU overhead. However, it also uses the 3D pipe and is therefore likely faster in terms of GPU time than the blitter. Also, the blitter code has so many early returns that it's probably not buying us that much. We may as well just use meta all the time instead of working over-time to find the tiny case where we can use the blitter. We keep gen4-5 using the old blit paths to avoid perturbing old hardware too much. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Emit VF cache invalidates for 48-bit addressing bugs with softpin.	Kenneth Graunke	2018-05-22	2	-0/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We'd like to start using soft-pin to assign BO addresses up front, and never move them again. Our previous plan for dealing with 48-bit VF cache bugs was to relocate vertex buffers to the low 4GB, so we'd never have addresses that alias in the low 32 bits. But that requires moving buffers dynamically. This patch tracks the last seen BO address for each vertex/index buffer, and emits a VF cache invalidate if the high bits change. (Ideally, we won't hit this case very often.) This should work for the soft-pin case, but unfortunately won't work in the relocation case, as we don't actually know the addresses. So, we have to use both methods. v2: Mention that the cache uses a <VertexBufferIndex, Address> tuple more explicitly (suggested by Scott). Mention "single batch" too (suggested by Chris). Reviewed-by: Scott D Phillips <[email protected]>
*	i965: Introduce a "memory zone" concept on BO allocation.	Kenneth Graunke	2018-05-22	16	-38/+107
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We're planning to start managing the PPGTT in userspace in the near future, rather than relying on the kernel to assign addresses. While most buffers can go anywhere, some need to be restricted to within 4GB of a base address. This commit adds a "memory zone" parameter to the BO allocation functions, which lets the caller specify which base address the BO will be associated with, or BRW_MEMZONE_OTHER for the full 48-bit VMA. Eventually, I hope to create a 4GB memory zone corresponding to each state base address. Reviewed-by: Scott D Phillips <[email protected]> Reviewed-by: Jordan Justen <[email protected]>
*	mesa: stop hiding query parameters from OpenGL compat	Timothy Arceri	2018-05-21	1	-14/+7
\| \| \| \| \| \| \| \|	Just let the extension detection do its job as we will be adding compat profile support in future, also we want these to work with compat profile version overrides. Reviewed-by: Marek Olšák <[email protected]>
*	i965: isl: Move the MCS gen7+ assertion into ISL	Nanley Chery	2018-05-18	1	-2/+0
\| \| \| \| \| \| \|	This is useful for every user of ISL. Drop the comment along the way to match similar functions in ISL. Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965/miptree: Remove format assertion in alloc_aux	Nanley Chery	2018-05-18	1	-5/+0
\| \| \| \| \| \| \| \| \|	intel_miptree_supports_{ccs,mcs,hiz} ensures the format is valid for the color or depth miptree before the miptree is assigned an aux_usage. alloc_aux switches on the aux_usage so don't assert that the format is valid. Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965/miptree: Simplify the switch in supports_ccs	Nanley Chery	2018-05-18	1	-5/+1
\| \| \| \| \|	Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Tapani Pälli <[email protected]>
*	i965: Make get_ccs_surf succeed in alloc_aux	Nanley Chery	2018-05-18	2	-10/+11
\| \| \| \| \| \| \| \| \| \|	Synchronize the requirements listed in isl_surf_get_ccs_surf with intel_miptree_supports_ccs by importing a restriction from ISL. Some implications: * We successfully create every aux_surf in alloc_aux * We only return false from alloc_aux if we run out of memory Reviewed-by: Topi Pohjolainen <[email protected]>
*	st/mesa: only define GLSL 1.4 for compat if driver supports it	Christian Gmeiner	2018-05-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Currently GLSL 1.4 is defined for all gallium drivers even only GLSL 1.2 is supported as seen on etnaviv. v1 -> v2: - use _min(..) as suggested by Lucas Stach and Michel Dänzer Fixes: 4560aad780b ("mesa: add GLSLVersionCompat constant") Signed-off-by: Christian Gmeiner <[email protected]> Reviewed-by: Lucas Stach <[email protected]> Reviewed-by: Timothy Arceri <[email protected]>
*	vbo: remove MaxVertexAttribStride assert check.	Dave Airlie	2018-05-18	1	-1/+0
\| \| \| \| \| \| \| \|	Some drivers (virgl) don't support GL4.4 or GLES3.1 yet, so never fill in this const. Reviewed-by: Mathias Fröhlich <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	mesa: drop GL_EXT_polygon_offset support	Timothy Arceri	2018-05-18	6	-25/+0
\| \| \| \| \| \| \| \|	glPolygonOffset() has been part of the GL standard since 1.1. Also niether AMD or Nvidia support this in their binary drivers. Reviewed-by: Marek Olšák <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=61761
*	mesa: The glArrayElement api is independent of the current program.	Mathias Fröhlich	2018-05-17	2	-2/+2
\| \| \| \| \| \| \| \| \|	All the shader program dependent handling is done on the level of the gl_Context::Array._DrawVAO/_DrawVAOEnabledAttribs. So, skip array element invalidation on _NEW_PROGRAM. Reviewed-by: Brian Paul <[email protected]> Signed-off-by: Mathias Fröhlich <[email protected]>
*	mesa: Flag _NEW_ARRAY only if we are changing ctx->Array.VAO.	Mathias Fröhlich	2018-05-17	1	-6/+14
\| \| \| \| \| \| \| \| \|	For the VAO internal helper functions that may be called with a non current VAO, flag the _NEW_ARRAY state only if it is the current ctx->Array.VAO. Reviewed-by: Brian Paul <[email protected]> Signed-off-by: Mathias Fröhlich <[email protected]>
*	mesa: Remove flush_vertices argument from VAO methods.	Mathias Fröhlich	2018-05-17	9	-57/+51
\| \| \| \| \| \| \|	The flush_vertices argument is now unused, remove it. Reviewed-by: Brian Paul <[email protected]> Signed-off-by: Mathias Fröhlich <[email protected]>
*	mesa: Remove FLUSH_VERTICES from VAO state changes.	Mathias Fröhlich	2018-05-17	1	-59/+6
\| \| \| \| \| \| \| \| \|	Pending draw calls on immediate mode or display list calls do not depend on changes of the VAO state. So, remove calls to FLUSH_VERTICES and flag _NEW_ARRAY as appropriate. Reviewed-by: Brian Paul <[email protected]> Signed-off-by: Mathias Fröhlich <[email protected]>
*	i965/blorp: Disable BLORP clear color updates	Nanley Chery	2018-05-17	1	-2/+4
\| \| \| \| \| \| \|	With the previous patches, we now update the indirect clear color buffer every time the clear color changes. Avoid redundant updates. Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/blorp: Also skip the fast clear if the clear color differs	Nanley Chery	2018-05-17	1	-4/+3
\| \| \| \| \| \| \| \|	If the aux state is CLEAR and clear color value has changed, only the surface state must be updated. The bit-pattern in the aux buffer is exactly the same. Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/clear: Drop a stale comment in fast_clear_depth	Nanley Chery	2018-05-17	1	-4/+0
\| \| \| \| \| \| \| \| \|	This comment made more sense when it was above the calls to intel_miptree_slice_set_needs_depth_resolve(). We stopped using these functions at commit 554f7d6d02931ea45653c8872565d21c1678a6da ("i965: Move depth to the new resolve functions"). Reviewed-by: Jason Ekstrand <[email protected]>
*	i965: Update the indirect buffer in set_clear_color	Nanley Chery	2018-05-17	2	-37/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For depth buffers, we avoid fast-clearing if the aux_state is already CLEAR. We do the same for color buffers only if the clear color doesn't change. We require that the clear colors match because, in that case, we don't update the indirect clear color outside of BLORP. Update the indirect clear color for color buffers as well. We'll enable the same depth buffer optimization for color buffers in a later patch. Note that we're now actually updating the indirect clear color twice in the case where we use BLORP to perform the fast-clear. This is only temporary. In later patches, we'll prevent BLORP from performing the update. v2: Add more context to the commit message (Topi). Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965/clear: Remove an early return in fast_clear_depth	Nanley Chery	2018-05-17	1	-5/+0
\| \| \| \| \| \| \| \| \| \| \|	Reduce complexity and allow the next patch to delete some code. With this change, clear operations will still be skipped and setting the aux_state will cause no side-effects. Remove the associated comment which implies an early return. Reviewed-by: Rafael Antognolli <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965: Use set_clear_color for depth miptrees	Nanley Chery	2018-05-17	3	-19/+2
\| \| \| \| \| \| \|	Reduce code duplication now and prevent it in the following commits. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	Revert "i965: Make the miptree clear color setter take a gl_color_union"	Nanley Chery	2018-05-17	3	-7/+6
\| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit 1d94aa19877fb702ffacacde28ad7253cce72c97. The next patch will make depth miptrees use the clear color setter that was originally being used for color miptrees. Go back to using the isl_color_value parameter because it's the same type as the fast_clear_color field used by color and depth miptrees. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/miptree: Unify aux buffer allocation	Nanley Chery	2018-05-17	2	-142/+82
\| \| \| \| \| \| \| \| \| \|	There isn't much that changes between the aux allocation functions. Remove the duplicated code. v2: Inline the switch statement (Jason). Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965: Prepare to delete intel_miptree_alloc_ccs()	Nanley Chery	2018-05-17	3	-15/+16
\| \| \| \| \| \| \| \| \| \| \| \|	We're going to delete intel_miptree_alloc_ccs() in the next commit. With that in mind, replace the use of this function in do_single_blorp_clear() with intel_miptree_alloc_aux() and move the delayed allocation logic to it's callers. v2: Duplicate the delayed allocation comment (Topi Pohjolainen). Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/miptree: Drop the mt param from alloc_aux_buffer	Nanley Chery	2018-05-17	1	-5/+4
\| \| \| \| \| \|	Drop an unused parameter. Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/miptree: Drop the alloc_flags param from alloc_aux_buffer	Nanley Chery	2018-05-17	1	-15/+14
\| \| \| \| \| \| \|	We have enough information to determine the optimal flags internally. Reviewed-by: Topi Pohjolainen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/miptree: Drop the name param from alloc_aux_buffer	Nanley Chery	2018-05-17	1	-5/+4
\| \| \| \| \| \|	A name of "aux-miptree" should be sufficient. Reviewed-by: Jason Ekstrand <[email protected]>