mesa.git

	Commit message	Author	Age	Files	Lines
*	nir: nir_shader_compiler_options: drop native_integers	Christian Gmeiner	2019-05-07	12	-53/+11
	Drivers that do not support native integers should use a lowering pass to go from integers to floats. Signed-off-by: Christian Gmeiner <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	panfrost: Refactor blend descriptors	Alyssa Rosenzweig	2019-05-07	3	-120/+122
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit does a fairly large cleanup of blend descriptors, although there should not be any functional changes. In particular, we split apart the Midgard and Bifrost blend descriptors, since they are radically different. From there, we can identify that the Midgard descriptor as previously written was really two render targets' descriptors stuck together. From this observation, we split the Midgard descriptor into what a single RT actually needs. This enables us to correctly dump blending configuration for MRT samples on Midgard. It also allows the Midgard and Bifrost blend code to peacefully coexist, with runtime selection rather than a #ifdef. So, as a bonus, this will help the future Bifrost effort, eliminating one major source of compile-time architectural divergence. Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	lima/gpir: enable lowering for ftrunc	Vasily Khoruzhick	2019-05-07	1	-0/+1
\| \| \| \| \|	Reviewed-by: Qiang Yu <[email protected]> Signed-off-by: Vasily Khoruzhick <[email protected]>
*	lima/gpir: implement nir_op_fmov	Vasily Khoruzhick	2019-05-07	1	-0/+1
\| \| \| \| \|	Reviewed-by: Qiang Yu <[email protected]> Signed-off-by: Vasily Khoruzhick <[email protected]>
*	lima: use int_to_float lowering pass	Vasily Khoruzhick	2019-05-07	1	-2/+6
\| \| \| \| \| \| \| \|	Neither GP nor PP in Mali4x0 support integers, so utilize new pass and set native_integers to true for now until this flag is dropped. Reviewed-by: Qiang Yu <[email protected]> Signed-off-by: Vasily Khoruzhick <[email protected]>
*	nir: add int_to_float lowering pass	Vasily Khoruzhick	2019-05-07	4	-0/+215
	This new pass lowers ints and bools to floats. It allows hardware that doesn't have native integers (e.g. Mali4x0) to use the same code paths as modern hardware. It uses a newly introduced pass to gather SSA types and should be used as late as possible. Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Christian Gmeiner <[email protected]> Signed-off-by: Vasily Khoruzhick <[email protected]>
*	radeonsi: add config entry for Counter-Strike Global Offensive	Timothy Arceri	2019-05-07	1	-0/+3
\| \| \| \| \| \| \| \| \|	This fixes rendering issues with gun scopes which is rather important. Cc: "19.0" "19.1" <[email protected]> Acked-by: Bas Nieuwenhuizen <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100239
*	lima/gpir: fix float uniform alignment issue	Vasily Khoruzhick	2019-05-06	1	-2/+1
	If PIPE_CAP_PACKED_UNIFORMS is not set, uniforms are vec4 aligned, so lima_nir_lower_uniform_to_scalar should use the first channel of the vec4 for float uniforms. Reviewed-by: Qiang Yu <[email protected]> Signed-off-by: Vasily Khoruzhick <[email protected]>
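The vec4-aligned layout that this commit message describes can be sketched roughly as follows. This is only an illustration, not lima code: `scalar_uniform_index` and `float_uniform_index` are hypothetical names, and the sketch assumes each non-packed uniform occupies one four-channel slot.

```c
#include <assert.h>

/* Hypothetical helper: flat scalar index of channel `channel`
 * inside the vec4-aligned uniform slot `slot`. */
static unsigned scalar_uniform_index(unsigned slot, unsigned channel)
{
    assert(channel < 4); /* a vec4 has channels 0..3 */
    return slot * 4 + channel;
}

/* Assumption from the commit message: when uniforms are vec4 aligned,
 * a float uniform lives in the first channel of its slot. */
static unsigned float_uniform_index(unsigned slot)
{
    return scalar_uniform_index(slot, 0);
}
```

Under these assumptions, a float uniform in slot 3 would be read from scalar index 12, not from a tightly packed index 3.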
*	draw: flush when setting stream-out targets	Erik Faye-Lund	2019-05-06	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	We need to re-prepare the middle-end state to pick up changes to this state to react correctly to pausing/resuming stream-out. So let's add a flush here. Signed-off-by: Erik Faye-Lund <[email protected]> Fixes: ec8cbd79ac4 "draw/softpipe: EXT_transform_feedback support (v2)" Reviewed-by: Roland Scheidegger <[email protected]>
*	llvmpipe: pass stream-out targets to draw-module early	Erik Faye-Lund	2019-05-06	2	-11/+8
	We currently set this state in the draw-module twice on each draw, which trashes this state. So far that's not a problem, because we don't really do much from that function. But it turns out, we're going to have to do more; namely flush when the state changes. This will incur a large performance penalty due to the excessive setting. Instead, let's rely on the CSO caching making sure that llvmpipe_set_so_targets doesn't get called needlessly, and set up the state directly there instead. Signed-off-by: Erik Faye-Lund <[email protected]> Reviewed-by: Roland Scheidegger <[email protected]>
*	virgl: do not use inline writes for subdata	Chia-I Wu	2019-05-06	1	-4/+7
	Inline writes skip transfer map/unmap at the cost of an extra copy of the data during execbuffer. That is generally a win for small transfers. But the heuristic to use inline writes based on buffer sizes rather than transfer sizes makes little sense. More importantly, inline writes miss optimizations that are done for buffer transfers. Let's just use transfers. Signed-off-by: Chia-I Wu <[email protected]> Reviewed-By: Gert Wollny <[email protected]>
*	virgl: rework queries	Chia-I Wu	2019-05-06	1	-45/+71
	virglrenderer has been changed such that VIRGL_CCMD_GET_QUERY_RESULT is fenced and query buffers (PIPE_BIND_CUSTOM) are coherent. We can check if a query is ready using DRM_IOCTL_VIRTGPU_WAIT, and also avoid a synchronized transfer to retrieve the query result. When running against an older virglrenderer, it falls back to the old behavior automatically. TF2 @ 640x480 for pts4.dem went from 17fps to 40fps on my testing machine. Signed-off-by: Chia-I Wu <[email protected]> Reviewed-by: Gurchetan Singh <[email protected]>
*	virgl: export resource_is_busy from winsys	Chia-I Wu	2019-05-06	3	-11/+16
\| \| \| \| \|	Signed-off-by: Chia-I Wu <[email protected]> Reviewed-by: Gurchetan Singh <[email protected]>
*	radv: fix rowPitch for R32G32B32 formats on GFX9	Samuel Pitoiset	2019-05-06	1	-1/+13
\| \| \| \| \| \| \| \| \| \|	The pitch is actually the number of components per row. We found the problem when we implemented some meta operations for these formats and the wrong pitch has been confirmed with a small test case. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108325 Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
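One way to read "the pitch is actually the number of components per row" is sketched below. This is an illustration under stated assumptions, not the radv change itself: `row_pitch_bytes` is a hypothetical helper, and it assumes a three-component format whose texel size is evenly divisible by three (12 bytes for R32G32B32).

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: convert a pitch counted in components (not texels)
 * into a byte pitch for a three-component format. */
static uint64_t row_pitch_bytes(uint32_t pitch_components, uint32_t bytes_per_texel)
{
    /* Assumption: three equally sized components per texel. */
    assert(bytes_per_texel % 3 == 0);
    uint32_t bytes_per_component = bytes_per_texel / 3;
    return (uint64_t)pitch_components * bytes_per_component;
}
```

With this reading, multiplying the reported pitch by the full texel size would overstate the row pitch by a factor of three.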
*	iris: Enable PIPE_CAP_SURFACE_REINTERPRET_BLOCKS	Kenneth Graunke	2019-05-06	2	-6/+95
	This makes CompressedTexSubImage from a PBO source do proper GPU rendering to upload instead of stalling to map the PBO source on the CPU (then copying it on the CPU). Thanks Bas Nieuwenhuizen for pointing out that Vulkan includes this functionality, and to Jason Ekstrand for writing the code I adapted. Vulkan only supports a single layer, however, and this code tries to support multiple layers as long as it's miplevel 0. Improves performance in Sid Meier's Civilization VI: average frame time (ms): -3.67423% +/- 1.46201% (n=5); 99th percentile frame time (ms): -5.09910% +/- 3.87874% (n=5).
*	radv: Use given stride for images imported from Android.	Bas Nieuwenhuizen	2019-05-06	3	-0/+35
	Handled similarly to radeonsi. I checked the offsets are actually used. Acked-by: Samuel Pitoiset <[email protected]>
*	lima/ppir: abort compilation in case of unsupported intrinsic	Erico Nunes	2019-05-06	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Currently ppir continues compilation when there is an unsupported intrinsic, resulting in a shader that will surely not work as intended. This is a problem during piglit runs as some tests don't compile properly due to this but actually still get submitted to the gpu and leave the system in an unstable state after executing, causing further tests to fail. Signed-off-by: Erico Nunes <[email protected]> Reviewed-by: Qiang Yu <[email protected]>
*	lima/ir: print names of unsupported intrinsics	Erico Nunes	2019-05-06	2	-2/+4
\| \| \| \| \| \| \| \| \|	While lima still doesn't support some kinds of intrinsics, it is more helpful to display the name of the unsupported instr->intrinsic to make debugging easier. Signed-off-by: Erico Nunes <[email protected]> Reviewed-by: Qiang Yu <[email protected]>
*	mesa: Makefile.sources: Add nir_lower_fb_read.c to Makefile.sources list	John Stultz	2019-05-06	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In commit a99c360a4630 (nir: add pass to lower fb reads), a new file was added that needs to also be added to the Makefile.sources list used by the Android and SCons build system. Cc: Rob Clark <[email protected]> Cc: Emil Velikov <[email protected]> Cc: Amit Pundir <[email protected]> Cc: Sumit Semwal <[email protected]> Cc: Alistair Strachan <[email protected]> Cc: Greg Hartman <[email protected]> Cc: Tapani Pälli <[email protected]> Cc: Jason Ekstrand <[email protected]> Fixes: a99c360a463 ("nir: add pass to lower fb reads") Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Emil Velikov <[email protected]> Signed-off-by: John Stultz <[email protected]>
*	mesa: Makefile.sources: Add ir3_nir_lower_load_barycentric_at_sample/offset to Makefile.sources	John Stultz	2019-05-06	1	-0/+2
	In commit 2f0b9d22495 ("freedreno/ir3: lower load_barycentric_at_offset") a new file was added that needs to also be added to the Makefile.sources list used by the Android and SCons build systems. Cc: Rob Clark <[email protected]> Cc: Emil Velikov <[email protected]> Cc: Amit Pundir <[email protected]> Cc: Sumit Semwal <[email protected]> Cc: Alistair Strachan <[email protected]> Cc: Greg Hartman <[email protected]> Cc: Tapani Pälli <[email protected]> Cc: Jason Ekstrand <[email protected]> Fixes: 2f0b9d22495 ("freedreno/ir3: lower load_barycentric_at_offset") Reviewed-by: Emil Velikov <[email protected]> Signed-off-by: John Stultz <[email protected]>
*	mesa: android: freedreno: Fix build failure due to path change	John Stultz	2019-05-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The ir3_nir_trig.py file was moved in a previous commit, aa0fed10d3574 (freedreno: move ir3 to common location), so update the Android.gen.mk file to match. Cc: Rob Clark <[email protected]> Cc: Emil Velikov <[email protected]> Cc: Amit Pundir <[email protected]> Cc: Sumit Semwal <[email protected]> Cc: Alistair Strachan <[email protected]> Cc: Greg Hartman <[email protected]> Cc: Tapani Pälli <[email protected]> Cc: Jason Ekstrand <[email protected]> Fixes: aa0fed10d35 ("freedreno: move ir3 to common location") Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Emil Velikov <[email protected]> Signed-off-by: John Stultz <[email protected]>
*	mesa: android: freedreno: build libfreedreno_{drm,ir3} static libs	Amit Pundir	2019-05-06	6	-2/+131
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add libfreedreno_drm/ir3 to the build Cc: Rob Clark <[email protected]> Cc: Emil Velikov <[email protected]> Cc: Amit Pundir <[email protected]> Cc: Sumit Semwal <[email protected]> Cc: Alistair Strachan <[email protected]> Cc: Greg Hartman <[email protected]> Cc: Tapani Pälli <[email protected]> Cc: Jason Ekstrand <[email protected]> Fixes: b4476138d5a ("freedreno: move drm to common location") Fixes: aa0fed10d35 ("freedreno: move ir3 to common location") Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Emil Velikov <[email protected]> Signed-off-by: Amit Pundir <[email protected]> [jstultz: Tweaked to add extra ir3 files from master] Signed-off-by: John Stultz <[email protected]>
*	mesa: android: Remove unnecessary dependency tracking rules	Alistair Strachan	2019-05-06	2	-4/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current AOSP master build system breaks building mesa due to the following error: external/mesa3d/src/compiler/Android.glsl.gen.mk:94: error: writing to readonly directory: "external/mesa3d/src/compiler/glsl/ir.h" This error is bogus -- nothing "writes" to ir.h -- but the rule is unnecessary because the generated header that is a dependency of the non-generated header should be added to LOCAL_GENERATED_SOURCES and this will track if the dependency needs to be regenerated. (This change fixes a similar problem affecting nir.h too.) Cc: Rob Clark <[email protected]> Cc: Emil Velikov <[email protected]> Cc: Amit Pundir <[email protected]> Cc: Sumit Semwal <[email protected]> Cc: Alistair Strachan <[email protected]> Cc: Greg Hartman <[email protected]> Cc: Tapani Pälli <[email protected]> Cc: Jason Ekstrand <[email protected]> Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Emil Velikov <[email protected]> Signed-off-by: Alistair Strachan <[email protected]> [jstultz: Forward ported and tweaked commit subject] Signed-off-by: John Stultz <[email protected]>
*	radv: Implement cosited_even sampling.	Bas Nieuwenhuizen	2019-05-06	2	-2/+83
	Apparently cosited_even was the required one instead of midpoint. This adds a slight offset of 0.5 pixels to the coordinates (we also need the image size to convert to normalized coords). Fixes: 91702374d5d "radv: Add ycbcr lowering pass." Acked-by: Samuel Pitoiset <[email protected]>
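The arithmetic behind "0.5 pixels, converted to normalized coords" can be sketched as below. This is a minimal illustration, not the radv lowering pass: `cosited_even_adjust` is a hypothetical name, and which image's size is used (and on which axes) is an assumption the commit message does not spell out.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: shift a normalized coordinate by half a pixel.
 * The half-pixel offset is divided by the image size in pixels because
 * normalized coordinates span [0, 1] across the whole image. */
static float cosited_even_adjust(float coord, uint32_t size_px)
{
    assert(size_px > 0); /* the image size is required for the conversion */
    return coord + 0.5f / (float)size_px;
}
```

For a 4-pixel-wide image, half a pixel corresponds to 0.125 in normalized units, which is why the image size has to be available to the lowering.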
*	radv: Disable subsampled formats.	Bas Nieuwenhuizen	2019-05-06	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Broken on Polaris and since I discovered NV12 is not subsampled, but a 2-plane format I decided I don't really care. Work to do to re-enable: 1) Figure out which devices support it natively. 2) Write some software emulation for the others. Fixes: 52c1adda21b "radv: Add ycbcr format features." Reviewed-by: Samuel Pitoiset <[email protected]>
*	util/drirc: add workarounds for bugs in Doom 3: BFG	Timothy Arceri	2019-05-06	1	-0/+5
\| \| \| \| \| \| \| \|	This makes the game playable on radeonsi. Cc: "19.0" "19.1" <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110143
*	freedreno: remove unused forward struct declaration [19.1-branchpoint]	Rob Clark	2019-05-04	1	-2/+0
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	panfrost/midgard: iabs cannot run on mul	Alyssa Rosenzweig	2019-05-04	1	-1/+1
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard: Lower mixed csel (NIR)	Alyssa Rosenzweig	2019-05-04	2	-12/+83
	Basically, when the conditions of a csel diverge, we scalarize to avoid going into weird code paths during emit. We could be doing better, but this case can't occur organically from GLSL as far as I can tell, though it does fix lowered atan2. Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard: Fix RA when temp_count = 0	Alyssa Rosenzweig	2019-05-04	2	-50/+70
\| \| \| \| \| \| \| \|	A previous commit by Tomeu aborted RA early, which solves the memory corruption issue, but then generates an incorrect compile. This fixes that. Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard: Fix integer selection	Alyssa Rosenzweig	2019-05-04	2	-33/+10
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost: Support RGB565 FBOs	Alyssa Rosenzweig	2019-05-04	4	-29/+80
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard/disasm: Handle dest_override generalized	Alyssa Rosenzweig	2019-05-04	1	-22/+68
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard/disasm: Stub out 64-bit	Alyssa Rosenzweig	2019-05-04	1	-5/+15
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard/disasm: Print 8-bit sources	Alyssa Rosenzweig	2019-05-04	1	-23/+43
	This handles the usual case. 8-bit register access parallels 16-bit access, but with one major caveat: in 8-bit mode, only half of the register file is actually (directly) accessible as sources. In particular, for each 16-bit integer register (hrN), we can only index a single 8-bit integer (qrN), corresponding to the lower 8 bits. To get the upper 8 bits, it is required to do an explicit shift. For example, to add the bytes of a 16-bit integer hr0.x and get the result as an 8-bit qr0, you'd need to do something like: ilsr hr1.x, hr0.x, #8; iadd qr0.x, qr0.x, qr1.x. This scheme diverges from 32-bit registers, in that both the upper and lower halves of a 32-bit register are individually accessible as a pair of half registers. For contrast, to add the lower and upper 16 bits of a 32-bit integer r0.x, you can just: iadd hr0.x, hr0.x, hr1.x, since hr1.x = upper 16 bits of r0.x. Signed-off-by: Alyssa Rosenzweig <[email protected]>
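The shift-then-add sequence from this commit message can be modelled in plain C as a rough sketch. The function names are hypothetical and only mimic the semantics described in the message (qrN exposes the low 8 bits of hrN); they are not part of the disassembler.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical model of "ilsr hrD, hrS, #shift":
 * logical shift right on a 16-bit register. */
static uint16_t model_ilsr16(uint16_t src, unsigned shift)
{
    assert(shift < 16);
    return (uint16_t)(src >> shift);
}

/* Hypothetical model of reading qrN: per the message, only the
 * lower 8 bits of the matching 16-bit register are addressable. */
static uint8_t model_read_q(uint16_t hr)
{
    return (uint8_t)(hr & 0xff);
}

/* Add the two bytes of hr0 the way the example does:
 *   ilsr hr1.x, hr0.x, #8
 *   iadd qr0.x, qr0.x, qr1.x
 * The result wraps to 8 bits. */
static uint8_t model_add_bytes(uint16_t hr0)
{
    uint16_t hr1 = model_ilsr16(hr0, 8);
    return (uint8_t)(model_read_q(hr0) + model_read_q(hr1));
}
```

In this model the upper byte only becomes visible after the explicit shift moves it into the low half of hr1, which mirrors the caveat about 8-bit sources.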
*	panfrost/midgard/disasm: Support 8-bit destination	Alyssa Rosenzweig	2019-05-04	1	-18/+21
\| \| \| \| \| \| \| \|	Meanwhile, we're forced to disable dest_override, since it's not yet clear how this interacts with other bitnesses (it'll likely need to be overhauled in any case). Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard: Rename ilzcnt8 -> iclz	Alyssa Rosenzweig	2019-05-04	2	-2/+2
\| \| \| \| \| \|	Per OpenCL. Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard: Fix crash on unknown op	Alyssa Rosenzweig	2019-05-04	1	-2/+6
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard/disasm: Fill in .int mod	Alyssa Rosenzweig	2019-05-04	1	-1/+1
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard/disasm: Extend print_reg to 8-bit	Alyssa Rosenzweig	2019-05-04	1	-15/+34
\| \| \| \|	Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard/disasm: Catch mask errors	Alyssa Rosenzweig	2019-05-04	1	-0/+11
	We silently ignored certain bits of the mask, which causes issues when disassembling 8/64-bit ops. Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	panfrost/midgard: reg_mode_full -> reg_mode_32, etc	Alyssa Rosenzweig	2019-05-04	3	-16/+16
\| \| \| \| \| \| \|	In preparation for 8-bit and 64-bit operands, let's not reinforce the 32-bit-centric biases in the ISA. Signed-off-by: Alyssa Rosenzweig <[email protected]>
*	freedreno/a6xx: deduplicate a few lines	Rob Clark	2019-05-04	1	-6/+0
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add ubwc_enabled helper	Rob Clark	2019-05-04	6	-26/+28
\| \| \| \| \| \| \| \| \|	Since it is dependent on the tile mode (ie. disabled for smaller mipmap levels), we should handle it a similar way to fd_resource_level_linear(). The code previously mostly did the right thing because the old helper took the tile mode. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: move UBWC color offset to fd_resource_offset()	Rob Clark	2019-05-04	7	-18/+42
\| \| \| \| \| \| \| \| \| \|	Best to keep it encapsulated in the helper which returns layer/level offset (and actually use that helper everywhere) rather than spreading the logic around the code. Also add a helper to find UBWC offset, to complete the encapsulation. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: buffer resources cannot be compressed	Rob Clark	2019-05-04	1	-26/+5
\| \| \| \| \| \| \|	Small cleanup. They are just an array of data and only ever linear/ uncompressed. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: mark imported resources as valid	Rob Clark	2019-05-04	1	-0/+2
	If someone is importing a buffer, we can't really know the state of its contents, so assume it is valid. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: UBWC support for images	Rob Clark	2019-05-04	2	-19/+57
	There are still some fallbacks we'll need to handle before we can enable UBWC by default. I think we may need to fall back to uncompressed if image atomic operations are used. And we still need to sort out how to handle image and sampler views of compressed resources if the image/sampler view is using a format that does not support compression. (I think the latter should hopefully be uncommon outside of deqp/piglit.) But at least this gets us to the point where supertuxkart works properly with UBWC enabled ;-) Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: UBWC fixes	Rob Clark	2019-05-04	2	-11/+78
	A few fixes that get UBWC working for the games/benchmarks where I noticed problems before (in particular manhattan and stk, modulo image support for UBWC when compute shaders are used for post-process effects): (1) fix the size of the UBWC meta buffer (i.e. the offset to color pixel data) that is returned by ->fill_ubwc_buffer_sizes(); (2) correct the size/layout for 8 and 16 byte per pixel formats; (3) limit the supported formats, since not all formats that can be tiled can be compressed. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2019-05-04	8	-32/+58
\| \| \| \| \| \|	Corrects tex state ubwc pitch/size Signed-off-by: Rob Clark <[email protected]>