mesa.git

	Commit message	Author	Age	Files	Lines
*	freedreno/ir3: handle arrays of images	Rob Clark	2018-06-19	1	-6/+30
\| \| \| \| \| \| \| \|	Unlike textures, this doesn't get lowered for us. (Would be nice if they were.. at least until we are ready to deal w/ indirect indexing..) Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: images can be arrays too	Rob Clark	2018-06-19	2	-22/+83
\| \| \| \| \| \|	Seems I previously totally forgot about 2d-arrays, etc.. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: use move_load_const pass	Rob Clark	2018-06-19	1	-0/+3
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	gallium: add support for programmable sample locations	Rhys Perry	2018-06-14	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Rhys Perry <[email protected]> Reviewed-by: Brian Paul <[email protected]> (v2) Reviewed-by: Marek Olšák <[email protected]> (v2)
*	freedreno/ir3: use pipe_image_view's cpp	Rob Clark	2018-06-11	1	-1/+6
\| \| \| \| \| \| \|	At least for PIPE_BUFFER, we could get the resource used as (for example) R32F imageBuffer. So using cpp=1 from the rsc is wrong. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: fix image dimensions offset	Rob Clark	2018-06-11	1	-1/+1
\| \| \| \| \| \|	copy-pasta fail from how SSBO sizes are handled. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a5xx: correct image/ssbo offset	Rob Clark	2018-06-11	1	-1/+1
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: use saml always if we have lod	Rob Clark	2018-06-11	1	-1/+1
\| \| \| \| \| \| \|	In some cases we get plain tex opcodes (but w/ a lod argument).. in this case always use the saml instruction. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: don't cp absneg into meta:fi	Rob Clark	2018-06-11	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \|	If using a fanin (collect) to collect consecutive registers together, we can CP mov's into the fanin, but not (abs) or (neg). No places that allow those modifiers are consuming a fanin anyways. But this caused an absneg to be lost between a ldgb and stgb for shaders like: outputs[n] = abs(input[n]) Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: rework size/type conversion instructions	Rob Clark	2018-06-11	1	-10/+156
\| \| \| \| \| \|	With 8b and 16b, there are a lot more to handle. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: propagate HALF flag across fanout	Rob Clark	2018-06-11	1	-1/+4
\| \| \| \| \| \| \| \| \| \|	If we have a fanout (split) meta instruction to split the result of a vector instruction, propagate the HALF flag back to the original instruction. Otherwise result ends up in a full precision register while instruction(s) that use the result look in a half-precision register. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a5xx: add sample-id/sample-mask-in	Rob Clark	2018-06-11	1	-3/+12
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add sample-id/sample-mask-in	Rob Clark	2018-06-11	1	-0/+21
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2018-06-11	8	-87/+213
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: image atomics use image-store path	Rob Clark	2018-06-11	1	-0/+8
\| \| \| \| \| \| \| \|	image reads are handled via tex state, whereas image writes and atomics are handled via SSBO state block. Previously we were only considering image write, and not image atomics which also uses the SSBO state block. Signed-off-by: Rob Clark <[email protected]>
*	gallium: add PIPE_CAP_GLSL_FEATURE_LEVEL_COMPATIBILITY	Marek Olšák	2018-05-29	1	-0/+1
\| \| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]> Reviewed-by: Timothy Arceri <[email protected]>
*	gallium/winsys: rename DRM_API_HANDLE_* to WINSYS_HANDLE_*	Dave Airlie	2018-05-30	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \|	This just renames this as we want to add an shm handle which isn't really drm related. Originally by: Marc-André Lureau <[email protected]> (airlied: I used this sed script instead) This was generated with: git grep -l 'DRM_API_' | xargs sed -i 's/DRM_API_/WINSYS_/g' Reviewed-by: Marek Olšák <[email protected]>
*	freedreno/a5xx: remove fd5_shader_stateobj	Rob Clark	2018-05-15	3	-23/+10
\| \| \| \| \| \|	Extra level of indirection that serves no purpose. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a4xx: remove fd4_shader_stateobj	Rob Clark	2018-05-15	3	-23/+10
\| \| \| \| \| \|	Extra level of indirection that serves no purpose. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: remove fd3_shader_stateobj	Rob Clark	2018-05-15	3	-26/+13
\| \| \| \| \| \|	Extra level of indirection that serves no purpose. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: fence should hold a ref to pipe	Rob Clark	2018-05-15	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since the fence can outlive the context, and all it really needs to wait on a fence is the pipe, use the new fd_pipe reference counting to hold a ref to the pipe and drop the ctx pointer. This fixes a crash seen with (for example) glmark2:
	#0 fd_pipe_wait_timeout (pipe=0xbf48678b3cd7b32b, timestamp=0, timeout=18446744073709551615) at freedreno_pipe.c:101
	#1 0x0000ffffbdf75914 in fd_fence_finish (pscreen=0x561110, ctx=0x0, fence=0xc55c10, timeout=18446744073709551615) at ../src/gallium/drivers/freedreno/freedreno_fence.c:96
	#2 0x0000ffffbde154e4 in dri_flush (cPriv=0xb1ff80, dPriv=0x556660, flags=3, reason=__DRI2_THROTTLE_SWAPBUFFER) at ../src/gallium/state_trackers/dri/dri_drawable.c:569
	#3 0x0000ffffbecd8b44 in loader_dri3_flush (draw=0x558a28, flags=3, throttle_reason=__DRI2_THROTTLE_SWAPBUFFER) at ../src/loader/loader_dri3_helper.c:656
	#4 0x0000ffffbecbc36c in glx_dri3_flush_drawable (draw=0x558a28, flags=3) at ../src/glx/dri3_glx.c:132
	#5 0x0000ffffbecd91e8 in loader_dri3_swap_buffers_msc (draw=0x558a28, target_msc=0, divisor=0, remainder=0, flush_flags=3, force_copy=false) at ../src/loader/loader_dri3_helper.c:827
	#6 0x0000ffffbecbcfc4 in dri3_swap_buffers (pdraw=0x5589f0, target_msc=0, divisor=0, remainder=0, flush=1) at ../src/glx/dri3_glx.c:587
	#7 0x0000ffffbec98218 in glXSwapBuffers (dpy=0x502bb0, drawable=2097154) at ../src/glx/glxcmds.c:840
	#8 0x000000000040994c in CanvasGeneric::update (this=0xfffffffff400) at ../src/canvas-generic.cpp:114
	#9 0x0000000000411594 in MainLoop::step (this=this@entry=0x5728f0) at ../src/main-loop.cpp:108
	#10 0x0000000000409498 in do_benchmark (canvas=...) at ../src/main.cpp:117
	#11 0x00000000004071b0 in main (argc=<optimized out>, argv=<optimized out>) at ../src/main.cpp:210
	Signed-off-by: Rob Clark <[email protected]>
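The ownership change described in the commit above can be illustrated with a minimal sketch. All names here (`sketch_pipe`, `sketch_fence`, and their helpers) are hypothetical stand-ins, not the real fd_pipe or fence API; the only point is that the fence takes its own reference to the pipe and stores no context pointer, so it stays usable after the context is gone.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-ins; these are NOT the real freedreno types. */
struct sketch_pipe {
    int refcnt;
};

struct sketch_fence {
    struct sketch_pipe *pipe;   /* strong reference; no context pointer kept */
    unsigned timestamp;
};

static struct sketch_pipe *sketch_pipe_new(void)
{
    struct sketch_pipe *pipe = calloc(1, sizeof(*pipe));
    if (pipe)
        pipe->refcnt = 1;       /* the creator (e.g. a context) owns one ref */
    return pipe;
}

static struct sketch_pipe *sketch_pipe_ref(struct sketch_pipe *pipe)
{
    assert(pipe->refcnt > 0);
    pipe->refcnt++;
    return pipe;
}

/* Drops one reference; returns 1 if this freed the pipe, 0 otherwise. */
static int sketch_pipe_unref(struct sketch_pipe *pipe)
{
    assert(pipe->refcnt > 0);
    if (--pipe->refcnt > 0)
        return 0;
    free(pipe);
    return 1;
}

/* The fence takes its own reference, so it can outlive whoever created it. */
static struct sketch_fence *sketch_fence_new(struct sketch_pipe *pipe,
                                             unsigned timestamp)
{
    struct sketch_fence *fence = calloc(1, sizeof(*fence));
    if (fence) {
        fence->pipe = sketch_pipe_ref(pipe);
        fence->timestamp = timestamp;
    }
    return fence;
}

/* Returns 1 if destroying the fence also released the last pipe reference. */
static int sketch_fence_destroy(struct sketch_fence *fence)
{
    int freed = sketch_pipe_unref(fence->pipe);
    free(fence);
    return freed;
}
```

In this model, dropping the creator's reference first leaves the pipe alive until the fence is destroyed, which is the situation the backtrace above shows going wrong when the fence only held a context pointer.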
*	freedreno: batch cache doesn't hold a ref to batch	Rob Clark	2018-05-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The cache doesn't hold a (strong) reference to the batch. So we shouldn't be trying to drop a reference, as that leads to:
	#0 0x0000ffffbecb37a0 in raise () from /lib64/libc.so.6
	#1 0x0000ffffbeca159c in abort () from /lib64/libc.so.6
	#2 0x0000ffffbecacf48 in __assert_fail_base () from /lib64/libc.so.6
	#3 0x0000ffffbecacfa8 in __assert_fail () from /lib64/libc.so.6
	#4 0x0000ffffbd28def0 in pipe_reference_described (ptr=0x4f47130, reference=0x0, get_desc=0xffffbd2e0f08 <__fd_batch_describe>) at ../src/gallium/auxiliary/util/u_inlines.h:88
	#5 0x0000ffffbd28e188 in fd_batch_reference_locked (ptr=0x4f40de0, batch=0x0) at ../src/gallium/drivers/freedreno/freedreno_batch.h:258
	#6 0x0000ffffbd28e9a8 in fd_bc_invalidate_resource (rsc=0x4f40ca0, destroy=true) at ../src/gallium/drivers/freedreno/freedreno_batch_cache.c:244
	#7 0x0000ffffbd293778 in fd_resource_destroy (pscreen=0xedc170, prsc=0x4f40ca0) at ../src/gallium/drivers/freedreno/freedreno_resource.c:644
	#8 0x0000ffffbd922674 in u_transfer_helper_resource_destroy (pscreen=0xedc170, prsc=0x4f40ca0) at ../src/gallium/auxiliary/util/u_transfer_helper.c:144
	#9 0x0000ffffbd29527c in pipe_resource_reference (ptr=0x4f455d8, tex=0x0) at ../src/gallium/auxiliary/util/u_inlines.h:144
	#10 0x0000ffffbd29548c in fd_surface_destroy (pctx=0x1012720, psurf=0x4f455d0) at ../src/gallium/drivers/freedreno/freedreno_surface.c:78
	#11 0x0000ffffbd1f9c48 in pipe_surface_reference (ptr=0x4f471d0, surf=0x0) at ../src/gallium/auxiliary/util/u_inlines.h:113
	#12 0x0000ffffbd1f9ef4 in util_copy_framebuffer_state (dst=0x4f471c8, src=0x0) at ../src/gallium/auxiliary/util/u_framebuffer.c:114
	#13 0x0000ffffbd2e0e30 in __fd_batch_destroy (batch=0x4f47130) at ../src/gallium/drivers/freedreno/freedreno_batch.c:225
	#14 0x0000ffffbd28e1b0 in fd_batch_reference_locked (ptr=0xfffffffff010, batch=0x0) at ../src/gallium/drivers/freedreno/freedreno_batch.h:262
	#15 0x0000ffffbd28e6b0 in fd_bc_invalidate_context (ctx=0x1012720) at ../src/gallium/drivers/freedreno/freedreno_batch_cache.c:190
	#16 0x0000ffffbd2e2b6c in fd_context_destroy (pctx=0x1012720) at ../src/gallium/drivers/freedreno/freedreno_context.c:139
	#17 0x0000ffffbd2c3280 in fd5_context_destroy (pctx=0x1012720) at ../src/gallium/drivers/freedreno/a5xx/fd5_context.c:56
	#18 0x0000ffffbd5b7a8c in st_destroy_context_priv (st=0xfd72f0, destroy_pipe=true) at ../src/mesa/state_tracker/st_context.c:281
	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: Fix ir3_cmdline.c build.	Eric Anholt	2018-05-01	1	-0/+1
\| \| \| \| \| \|	Fixes: 6487e7a30c9e ("nir: move GL specific passes to src/compiler/glsl") Reviewed-by: Timothy Arceri <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	gallium: add initial support for conservative rasterization	Rhys Perry	2018-04-30	1	-0/+10
\| \| \| \| \| \|	Signed-off-by: Rhys Perry <[email protected]> Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	nir: move GL specific passes to src/compiler/glsl	Timothy Arceri	2018-05-01	1	-1/+1
\| \| \| \| \| \| \|	With this we should have no passes in src/compiler/nir with any dependencies on headers from core GL Mesa. Reviewed-by: Alejandro Piñeiro <[email protected]>
*	nir: Offset vertex_id by first_vertex instead of base_vertex	Neil Roberts	2018-04-19	3	-5/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	base_vertex will be zero for non-indexed calls and in that case we need vertex_id to be offset by the ‘first’ parameter instead. That is what we get with first_vertex. This is true for both GL and Vulkan. The freedreno driver is also setting vertex_id_zero_based on nir_options. In order to avoid breakage this patch switches the relevant code to handle SYSTEM_VALUE_FIRST_VERTEX so that it can retain the same behavior. v2: change a3xx/fd3_emit.c and a4xx/fd4_emit.c from SYSTEM_VALUE_BASE_VERTEX to SYSTEM_VALUE_FIRST_VERTEX (Kenneth). Reviewed-by: Ian Romanick <[email protected]> Cc: Rob Clark <[email protected]> Acked-by: Marek Olšák <[email protected]>
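The difference between the two system values in the commit above can be sketched as follows. The struct and function names are hypothetical, and the indexed-draw case (where both values equal the index bias) is an assumption based on the usual GL definition rather than something the commit message states.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical draw description; not a Mesa or gallium struct. */
struct sketch_draw {
    bool indexed;
    int first;        /* 'first' parameter of a non-indexed draw */
    int index_bias;   /* base vertex of an indexed draw */
};

/* base_vertex is zero for non-indexed draws. */
static int sketch_base_vertex(const struct sketch_draw *d)
{
    assert(d != 0);
    return d->indexed ? d->index_bias : 0;
}

/* first_vertex falls back to 'first' for non-indexed draws (assumed to
 * equal the index bias for indexed draws). */
static int sketch_first_vertex(const struct sketch_draw *d)
{
    assert(d != 0);
    return d->indexed ? d->index_bias : d->first;
}

/* With a zero-based vertex id, the user-visible id adds first_vertex. */
static int sketch_vertex_id(const struct sketch_draw *d, int zero_based_id)
{
    return zero_based_id + sketch_first_vertex(d);
}
```

For a non-indexed draw starting at vertex 10, offsetting by base_vertex would leave the id at 0, while offsetting by first_vertex gives 10, which is the behavior the commit describes.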
*	freedreno/ir3: use lower_global_vars_to_local in cmdline compiler	Rob Clark	2018-04-07	1	-0/+1
\| \| \| \| \| \| \| \|	tgsi_to_nir emits things with arrays as global vars.. and nir->ir3 does lower_locals_to_regs. But nothing was lowering global to local, which breaks compiling tgsi shaders. Signed-off-by: Rob Clark <[email protected]>
*	nir+drivers: add helpers to get # of src/dest components	Rob Clark	2018-04-03	1	-5/+1
\| \| \| \| \| \| \| \| \|	Add helpers to get the number of src/dest components for an intrinsic, and update spots that were open-coding this logic to use the helpers instead. Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	freedreno/ir3: fix fallout of unused false-depth elimination	Rob Clark	2018-04-03	2	-17/+19
\| \| \| \| \| \| \| \| \| \|	Since we were using the MARK flag for both preventing loops, and tracking whether instructions were used, we could end up in an infinite loop due to bd2ca2bcdd. Instead invert the logic.. mark all instructions UNUSED up front and clear the flag as we visit them. Fixes: bd2ca2bcdd freedreno/ir3: eliminate unused false-deps Signed-off-by: Rob Clark <[email protected]>
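The inverted marking scheme from the commit above might look roughly like this. The instruction struct, the flag name, and both helpers are hypothetical and much simpler than the real ir3 data structures; the sketch only shows how one flag can serve as both the "unused" marker and the loop guard.

```c
#include <assert.h>
#include <stddef.h>

#define SKETCH_UNUSED   0x1u   /* hypothetical flag, not the ir3 definition */
#define SKETCH_MAX_SRCS 4

/* Hypothetical, heavily simplified instruction node. */
struct sketch_instr {
    unsigned flags;
    size_t num_srcs;
    struct sketch_instr *srcs[SKETCH_MAX_SRCS];
};

/* Step 1: assume every instruction is unused. */
static void sketch_mark_all_unused(struct sketch_instr *instrs, size_t count)
{
    for (size_t i = 0; i < count; i++)
        instrs[i].flags |= SKETCH_UNUSED;
}

/* Step 2: walk the sources of a live root. Clearing the flag records both
 * "used" and "already visited", so a cycle cannot recurse forever. */
static void sketch_visit(struct sketch_instr *instr)
{
    assert(instr != NULL);
    if (!(instr->flags & SKETCH_UNUSED))
        return;
    instr->flags &= ~SKETCH_UNUSED;
    for (size_t i = 0; i < instr->num_srcs; i++)
        sketch_visit(instr->srcs[i]);
}
```

Anything still carrying the flag after all roots have been visited is unreachable and can be dropped.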
*	freedreno/a5xx: don't align height for PIPE_BUFFER	Rob Clark	2018-04-01	1	-1/+1
\| \| \| \| \| \| \| \| \|	Buffers can be large, so we probably don't want to make them all 32x bigger. But they can't be rendered to (at least in GL) so we don't need this workaround to prevent page faults on mem<->gmem. Cc: "18.0" <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a5xx: fix page faults on last level	Rob Clark	2018-04-01	1	-0/+10
\| \| \| \| \| \| \| \| \| \|	We could alternatively fall back to using "old style" draw's for mem<->gmem (ie. what <= a4xx do) when height is not aligned to 32, but that is somewhat more work (and not really something that could be applied to stable) Cc: "18.0" <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: fix issue w/ glamor composite shaders	Rob Clark	2018-03-31	2	-2/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes an issue that became possible when we started lowering phi webs to regs (a7ea2b4e) (although was not really seen until we also switched to using peephole select pass (ec8bc54a) instead of lowering all if/else to select). If texture coord (or anything else that uses create_collect() to collect scalar values in a sequence of scalar registers) was consuming a value produced on either side of an if/else (ie. a phi lowered to nir reg, which in ir3 is an "array" of length 1) then register allocation would happen incorrectly and we'd end up sampling from garbage coordinates. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: more half-precision fixes	Rob Clark	2018-03-31	2	-8/+37
\| \| \| \| \| \| \| \|	Some instructions require src/dst to be in full or half precision register depending on src/dst type. So do a better job of propagating register type. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add helper to create immed of specified size	Rob Clark	2018-03-31	1	-4/+11
\| \| \| \| \| \| \|	We'll also need to be able to create a half-precision immediate. So re-work create_immed(). Prep work for following patch. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: pass ctx instead of block to create_collect()	Rob Clark	2018-03-31	1	-18/+19
\| \| \| \| \| \|	Prep work for following patch. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: eliminate unused false-deps	Rob Clark	2018-03-31	2	-11/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously false-dependencies would get flagged as used, even if the only "use" was a false dep to (for example) prevent a load from being scheduled after a store. In addition to being pointless instructions, in some cases they can cause problems. For example, ldg (and similar instructions) depend on an immed arg getting CP'd into the instruction, but this doesn't happen if an instruction is otherwise unused. Which can result in undefined results (overwriting unintended registers). Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add local_group_size	Rob Clark	2018-03-31	3	-2/+12
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: clear SSA flag when assigning "ARRAY" regs too	Rob Clark	2018-03-31	1	-0/+1
\| \| \| \| \| \|	Avoids a misleading "INVALID FLAGS" warning in debug builds. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: print array live ranges	Rob Clark	2018-03-31	1	-4/+10
\| \| \| \| \| \|	This is also useful to see if optmsgs are enabled. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Implement DP2 instruction	Wladimir J. van der Laan	2018-03-31	1	-0/+21
\| \| \| \| \| \| \| \|	Use DOT2ADDv instruction with 0.0f constant add. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
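The lowering named in the commit above amounts to a two-component dot product with a zero addend. The sketch below models that arithmetic only; `sketch_dot2add` is a hypothetical scalar model of a dot2add-style operation and makes no claim about the exact operand layout of the a2xx DOT2ADDv instruction.

```c
#include <assert.h>

struct sketch_vec2 {
    float x, y;
};

/* Hypothetical model of a dot2add-style op: a.x*b.x + a.y*b.y + c. */
static float sketch_dot2add(struct sketch_vec2 a, struct sketch_vec2 b, float c)
{
    return a.x * b.x + a.y * b.y + c;
}

/* DP2 expressed as dot2add with a 0.0f constant addend. */
static float sketch_dp2(struct sketch_vec2 a, struct sketch_vec2 b)
{
    return sketch_dot2add(a, b, 0.0f);
}
```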
*	freedreno: a2xx: implement SEQ/SNE instructions	Wladimir J. van der Laan	2018-03-31	1	-3/+20
\| \| \| \| \| \| \| \| \|	Extend translate_sge_slt to emit these, in analogous fashion but using CNDEv. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Compressed textures support	Wladimir J. van der Laan	2018-03-31	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for: PIPE_FORMAT_ETC1_RGB8, PIPE_FORMAT_DXT1_RGB, PIPE_FORMAT_DXT1_RGBA, PIPE_FORMAT_DXT3_RGBA, PIPE_FORMAT_DXT5_RGBA. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Support TEXTURE_RECT	Wladimir J. van der Laan	2018-03-31	3	-1/+4
\| \| \| \| \| \| \| \| \|	Denormalized texture coordinates are required for text rendering in GALLIUM_HUD. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Prevent crash in emit_texture if view is not set	Wladimir J. van der Laan	2018-03-31	1	-3/+10
\| \| \| \| \| \| \| \| \| \|	Textures will sometimes be updated if texture view state was un-set; without this change, that causes an assertion crash or segfault. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Fix fd2_tex_swiz	Wladimir J. van der Laan	2018-03-31	1	-9/+9
\| \| \| \| \| \| \| \| \| \| \|	Compose swizzles using util_format_compose_swizzles instead of the custom code (which somehow had a bug). This makes the GL_ALPHA internal format work. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Change use of BLEND_ to BLEND2_	Wladimir J. van der Laan	2018-03-31	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Change use of BLEND_ to BLEND2_: BLEND_* is a3xx_rb_blend_opcode, while BLEND2_* is a2xx_rb_blend_opcode. This makes no effective difference as the used enumerant has the same value (0), but the other enumerants do not match 1-to-1 so this will avoid future problems. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Update rnndb header for formats enumeration	Wladimir J. van der Laan	2018-03-31	1	-20/+13
\| \| \| \| \| \| \| \| \| \|	The format enumeration comes from the yamato register headers that are part of the amd-gpu kernel driver. (see freedreno envytools commit b8fb7978e7ae106d0d11d0b238ab2ba2d4dd9d43) Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	util: Move util_is_power_of_two to bitscan.h and rename to util_is_power_of_two_or_zero	Ian Romanick	2018-03-29	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	The new name makes the zero-input behavior more obvious. The next patch adds a new function with different zero-input behavior. Signed-off-by: Ian Romanick <[email protected]> Suggested-by: Matt Turner <[email protected]> Reviewed-by: Alejandro Piñeiro <[email protected]>
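The zero-input behavior that motivated the rename above is easy to see in a small sketch. The `sketch_` functions are illustrative re-implementations of the common bit trick, not copies of the Mesa helpers, and the strict variant's name is an assumption about what "different zero-input behavior" could mean.

```c
#include <assert.h>
#include <stdbool.h>

/* Classic bit trick: clears the lowest set bit and checks nothing is left.
 * Zero also passes, because 0 & (0 - 1) == 0 with unsigned wraparound. */
static bool sketch_is_power_of_two_or_zero(unsigned v)
{
    return (v & (v - 1)) == 0;
}

/* Hypothetical strict variant that rejects zero explicitly. */
static bool sketch_is_power_of_two_nonzero(unsigned v)
{
    return v != 0 && (v & (v - 1)) == 0;
}
```

A caller that uses the result to pick a shift amount or an alignment mask usually wants the strict form, since zero has no valid log2.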
*	nir: Rename image intrinsics to image_var	Jason Ekstrand	2018-03-23	2	-20/+20
\| \| \| \| \| \| \| \| \| \| \|	Generated with git grep -l nir_intrinsic_image | xargs sed -i 's/nir_intrinsic_image/nir_intrinsic_image_var/g' and some manual fixing in nir_intrinsics.h Reviewed-by: Timothy Arceri <[email protected]>
*	gallium: add packed uniform CAP	Timothy Arceri	2018-03-20	1	-0/+1
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>