mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	mesa/util: add allow_glsl_builtin_const_expression driconf override	Timothy Arceri	2018-06-19	7	-1/+20
\| \| \| \| \| \| \|	Google Earth VR shaders uses builtins in constant expressions with GLSL 1.10. That feature wasn't allowed until GLSL 1.20. Reviewed-by: Dave Airlie <[email protected]>
*	util: manually extract the program name from program_invocation_name	Timothy Arceri	2018-06-19	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Glibc has the same code to get program_invocation_short_name. However for some reason the short name gets mangled for some wine apps. For example with Google Earth VR I get: program_invocation_name: "/home/tarceri/.local/share/Steam/steamapps/common/EarthVR/Earth.exe" program_invocation_short_name: "e" Acked-by: Eric Engestrom <[email protected]>
*	ac/surface: Set compressZ for stencil-only surfaces.	Bas Nieuwenhuizen	2018-06-19	1	-1/+1
\| \| \| \| \| \| \|	We HTILE compress stencil-only surfaces too. CC: 18.1 <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	anv: Use a single global API patch version	Jason Ekstrand	2018-06-18	2	-8/+11
\| \| \| \| \| \| \| \| \|	The Vulkan API has only one patch version shared among all of the major.minor versions. We should also advertise the same patch version regardless of major.minor. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106941 Reviewed-by: Lionel Landwerlin <[email protected]>
*	radeonsi: enable OpenGL 3.3 compat profile	Timothy Arceri	2018-06-19	1	-1/+1
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	mesa: add ff fragment shader support for geom and tess shaders	Timothy Arceri	2018-06-19	1	-1/+5
\| \| \| \| \| \| \|	This is required for compatibility profile support. Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	v3d: Set the SO offsets correctly if we have to re-emit.	Eric Anholt	2018-06-18	5	-4/+24
\| \| \| \| \| \|	This should fix TF across a glFlush() or TF pause/restart. Fixes dEQP-GLES3.functional.transform_feedback.array.interleaved.lines.highp_float and many, many others.
*	gallium/hud: = should rename the last added data source	Marek Olšák	2018-06-18	1	-1/+4
\| \| \| \|	Tested-by: Dieter Nützel <[email protected]>
*	anv: Disable constant buffer 0 being relative.	Rafael Antognolli	2018-06-18	2	-1/+29
\| \| \| \| \| \| \| \| \|	If we are on gen8+ and have context isolation support, just make that constant buffer address be absolute, so we can use it for push UBOs too. v2: Do not duplicate constant_buffer_0_is_relative flag (Jason) Reviewed-by: Jason Ekstrand <[email protected]>
*	anv/device: Check for kernel support of context isolation.	Rafael Antognolli	2018-06-18	2	-0/+4
\| \| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]>
*	intel/genxml: Add bitmasks for CS_DEBUG_MODE2/INSTPM.	Rafael Antognolli	2018-06-18	7	-0/+32
\| \| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]>
*	swr/rast: Clang-Format most rasterizer source code	Alok Hota	2018-06-18	114	-22174/+27802
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	radv: fix reported number of available VGPRs	Eric Engestrom	2018-06-18	1	-1/+1
\| \| \| \| \| \| \| \|	It's a bit late to round up after an integer division. Fixes: de889794134e6245e08a2 "radv: Implement VK_AMD_shader_info" Signed-off-by: Eric Engestrom <[email protected]> Reviewed-by: Alex Smith <[email protected]>
*	mesa: add missing return in error path	Eric Engestrom	2018-06-18	1	-1/+3
\| \| \| \| \| \| \| \|	Fixes: 67f40dadaa6666dacd90 "mesa: add support for ARB_sample_locations" Cc: Rhys Perry <[email protected]> Cc: Brian Paul <[email protected]> Signed-off-by: Eric Engestrom <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	radv: Use less conservative approximation for context rolls.	Bas Nieuwenhuizen	2018-06-18	1	-3/+6
\| \| \| \| \| \| \|	Drops the number of time we set the scissor by 4x for F1 2017, which results in a consistent performance improvement of about 4%. Reviewed-by: Samuel Pitoiset <[email protected]>
*	radv: fix bitwise check	Eric Engestrom	2018-06-18	1	-1/+1
\| \| \| \| \| \|	Fixes: 922cd38172b8a2bc286bd "radv: implement out-of-order rasterization when it's safe on VI+" Signed-off-by: Eric Engestrom <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]>
*	meson: fix i965/anv/isl genX static lib names	Eric Engestrom	2018-06-18	3	-3/+3
\| \| \| \| \| \| \| \| \| \| \|	Shouldn't make any functional difference, just that `liblibanv_gen90.a` will now be called `libanv_gen90.a`. Fixes: 3218056e0eb375eeda470 "meson: Build i965 and dri stack" Fixes: d1992255bb29054fa5176 "meson: Add build Intel "anv" vulkan driver" Signed-off-by: Eric Engestrom <[email protected]> Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Dylan Baker <[email protected]>
*	mesa: Unconditionally enable floating-point textures	Timothy Arceri	2018-06-18	2	-11/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	ARB_texture_float references US Patent #6,650,327 [1] which has a filing date of June 16 1998. According to [2], patents filed after 1995 expire 20 years from the filing date, giving an expiration of June 17 2018. [1] https://www.google.com/patents/US6650327 [2] https://en.wikipedia.org/wiki/Term_of_patent_in_the_United_States Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	intel/fs: shuffle_64bit_data_for_32bit_write is not used anymore	Jose Maria Casanova Crespo	2018-06-16	2	-36/+0
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Use new shuffle_32bit_write for all 64-bit storage writes	Jose Maria Casanova Crespo	2018-06-16	1	-7/+6
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: shuffle_32bit_load_result_to_64bit_data is not used anymore	Jose Maria Casanova Crespo	2018-06-16	2	-58/+0
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Use shuffle_from_32bit_read for 64-bit FS load_input	Jose Maria Casanova Crespo	2018-06-16	1	-4/+4
\| \| \| \| \| \| \| \| \|	As the previous use of shuffle_32bit_load_result_to_64bit_data had a source/destination overlap for 64-bit. Now a temporary destination is used for 64-bit cases to use shuffle_from_32bit_read that doesn't handle src/dst overlaps. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: shuffle_from_32bit_read at load_per_vertex_input at TCS/TES	Jose Maria Casanova Crespo	2018-06-16	1	-14/+8
\| \| \| \| \| \| \| \| \| \| \|	Previously, the shuffle function had a source/destination overlap that needs to be avoided to use shuffle_from_32bit_read. As we can use for the shuffle destination the destination of removed MOVs. This change also avoids the internal MOVs done by the previous shuffle to deal with possible overlaps. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Use shuffle_from_32bit_read at VS load_input	Jose Maria Casanova Crespo	2018-06-16	1	-10/+5
\| \| \| \| \| \| \| \| \| \|	shuffle_from_32bit_read manages 32-bit reads to 32-bit destination in the same way that the previous loop so now we just call the new function for all bitsizes, simplifying also the 64-bit load_input. v2: Add comment about future 16-bit support (Jason Ekstrand) Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Use shuffle_from_32bit_read for 64-bit gs_input_load	Jose Maria Casanova Crespo	2018-06-16	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This implementation avoids two unneeded MOVs for each 64-bit component. One was done in the old shuffle, to avoid cases of src/dst overlap but this is not the case. And the removed MOV was already being being done in the shuffle. Copy propagation wasn't able to remove them because shuffle destination values are defined with partial writes because they have stride == 2. v2: Reword commit log summary (Jason Ekstrand) Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: shuffle_from_32bit_read for 64-bit do_untyped_vector_read	Jose Maria Casanova Crespo	2018-06-16	1	-10/+2
\| \| \| \| \| \| \| \| \| \|	do_untyped_vector_read is used at load_ssbo and load_shared. The previous MOVs are removed because shuffle_from_32bit_read can handle storing the shuffle results in the expected destination just using the proper offset. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Remove old 16-bit shuffle/unshuffle functions	Jose Maria Casanova Crespo	2018-06-16	2	-73/+0
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Use shuffle_for_32bit_write for 16-bits store_ssbo	Jose Maria Casanova Crespo	2018-06-16	1	-5/+2
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Use shuffle_from_32bit_read to read 16-bit SSBO	Jose Maria Casanova Crespo	2018-06-16	1	-4/+2
\| \| \| \| \| \| \| \|	Using shuffle_from_32bit_read instead of 16-bit shuffle functions avoids the need of retype. At the same time new function are ready for 8-bit type SSBO reads. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: Use shuffle_from_32bit_read at VARYING_PULL_CONSTANT_LOAD	Jose Maria Casanova Crespo	2018-06-16	1	-15/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	shuffle_from_32bit_read can manage the shuffle/unshuffle needed for different 8/16/32/64 bit-sizes at VARYING PULL CONSTANT LOAD. To get the specific component the first_component parameter is used. In the case of the previous 16-bit shuffle, the shuffle operation was generating not needed MOVs where its results where never used. This behaviour passed unnoticed on SIMD16 because dead_code_eliminate pass removed the generated instructions but for SIMD8 they cound't be removed because of being partial writes. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: New shuffle_for_32bit_write and shuffle_from_32bit_read	Jose Maria Casanova Crespo	2018-06-16	2	-0/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These new shuffle functions deal with the shuffle/unshuffle operations needed for read/write operations using 32-bit components when the read/written components have a different bit-size (8, 16, 64-bits). Shuffle from 32-bit to 32-bit becomes a simple MOV. shuffle_src_to_dst takes care of doing a shuffle when source type is smaller than destination type and an unshuffle when source type is bigger than destination. So this new read/write functions just need to call shuffle_src_to_dst assuming that writes use a 32-bit destination and reads use a 32-bit source. As shuffle_for_32bit_write/from_32bit_read components take components in unit of source/destination types and shuffle_src_to_dst takes units of the smallest type component, we adjust components and first_component parameters. To enable this new functions it is needed than there is no source/destination overlap in the case of shuffle_from_32bit_read. That never happens on shuffle_for_32bit_write as it allocates a new destination register as it was at shuffle_64bit_data_for_32bit_write. v2: Reword commit log and add comments to explain why first_component and components parameters are adjusted. (Jason Ekstrand) Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/fs: general 8/16/32/64-bit shuffle_src_to_dst function	Jose Maria Casanova Crespo	2018-06-16	1	-0/+101
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This new function takes care of shuffle/unshuffle components of a particular bit-size in components with a different bit-size. If source type size is smaller than destination type size the operation needed is a component shuffle. The opposite case would be an unshuffle. Component units are measured in terms of the smaller type between source and destination. As we are un/shuffling the smaller components from/into a bigger one. The operation allows to skip first_component number of components from the source. Shuffle MOVs are retyped using integer types avoiding problems with denorms and float types if source and destination bitsize is different. This allows to simplify uses of shuffle functions that are dealing with these retypes individually. Now there is a new restriction so source and destination can not overlap anymore when calling this shuffle function. Following patches that migrate to use this new function will take care individually of avoiding source and destination overlaps. v2: (Jason Ekstrand) - Rewrite overlap asserts. - Manage type_sz(src.type) == type_sz(dst.type) case using MOVs from source to dest. This works for 64-bit to 64-bits operation that on Gen7 as it doesn't support Q registers. - Explain that components units are based in the smallest type. v3: - Fix unshuffle overlap assert (Jason Ekstrand) Reviewed-by: Jason Ekstrand <[email protected]>
*	ac: Clear meminfo to avoid valgrind warning.	Bas Nieuwenhuizen	2018-06-16	1	-1/+1
\| \| \| \| \| \| \|	Somehow valgrind misses that the value is initialized by the ioctl. Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]>
*	radv: fix emitting the TCS regs on GFX9	Samuel Pitoiset	2018-06-16	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	The primitive ID is NULL and this generates an invalid select instruction which crashes because one operand is NULL. This fixes crashes in The Long Journey Home, Quantum Break and Just Cause 3 with DXVK. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106756 CC: <[email protected]> Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	nir: Document a couple instances of parent_instr	Ian Romanick	2018-06-15	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	nir_ssa_def::parent_instr and nir_src::parent_instr have the same name, but they mean really different things. I choose to save the next person the hour+ that I just spent figuring that out. Even now that I know, I doubt I'd notice in code review that someone typed foo->parent_instr when they actually meant foo->ssa->parent_instr. v2: Minor wording tweak in nir_ssa_def::parent_instr. Suggested by Jason. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/fs: Propagate conditional modifiers from not instructions	Ian Romanick	2018-06-15	1	-1/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Skylake total instructions in shared programs: 14399081 -> 14399010 (<.01%) instructions in affected programs: 26961 -> 26890 (-0.26%) helped: 57 HURT: 0 helped stats (abs) min: 1 max: 6 x̄: 1.25 x̃: 1 helped stats (rel) min: 0.16% max: 0.80% x̄: 0.30% x̃: 0.18% 95% mean confidence interval for instructions value: -1.50 -0.99 95% mean confidence interval for instructions %-change: -0.35% -0.25% Instructions are helped. total cycles in shared programs: 532978307 -> 532976050 (<.01%) cycles in affected programs: 468629 -> 466372 (-0.48%) helped: 33 HURT: 20 helped stats (abs) min: 3 max: 360 x̄: 116.52 x̃: 98 helped stats (rel) min: 0.06% max: 3.63% x̄: 1.66% x̃: 1.27% HURT stats (abs) min: 2 max: 172 x̄: 79.40 x̃: 43 HURT stats (rel) min: 0.04% max: 3.02% x̄: 1.48% x̃: 0.44% 95% mean confidence interval for cycles value: -81.29 -3.88 95% mean confidence interval for cycles %-change: -1.07% 0.12% Inconclusive result (%-change mean confidence interval includes 0). All Gen6+ platforms, except Ivy Bridge, had similar results. (Haswell shown) total instructions in shared programs: 12973897 -> 12973838 (<.01%) instructions in affected programs: 25970 -> 25911 (-0.23%) helped: 55 HURT: 0 helped stats (abs) min: 1 max: 2 x̄: 1.07 x̃: 1 helped stats (rel) min: 0.16% max: 0.62% x̄: 0.28% x̃: 0.18% 95% mean confidence interval for instructions value: -1.14 -1.00 95% mean confidence interval for instructions %-change: -0.32% -0.24% Instructions are helped. total cycles in shared programs: 410355841 -> 410352067 (<.01%) cycles in affected programs: 578454 -> 574680 (-0.65%) helped: 47 HURT: 5 helped stats (abs) min: 3 max: 360 x̄: 85.74 x̃: 18 helped stats (rel) min: 0.05% max: 3.68% x̄: 1.18% x̃: 0.38% HURT stats (abs) min: 2 max: 242 x̄: 51.20 x̃: 4 HURT stats (rel) min: <.01% max: 0.45% x̄: 0.15% x̃: 0.11% 95% mean confidence interval for cycles value: -104.89 -40.27 95% mean confidence interval for cycles %-change: -1.45% -0.66% Cycles are helped. Ivy Bridge total instructions in shared programs: 11679351 -> 11679301 (<.01%) instructions in affected programs: 28208 -> 28158 (-0.18%) helped: 50 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.12% max: 0.54% x̄: 0.23% x̃: 0.16% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -0.27% -0.19% Instructions are helped. total cycles in shared programs: 257445362 -> 257444662 (<.01%) cycles in affected programs: 419338 -> 418638 (-0.17%) helped: 40 HURT: 3 helped stats (abs) min: 1 max: 170 x̄: 65.05 x̃: 24 helped stats (rel) min: 0.02% max: 3.51% x̄: 1.26% x̃: 0.41% HURT stats (abs) min: 2 max: 1588 x̄: 634.00 x̃: 312 HURT stats (rel) min: 0.05% max: 2.97% x̄: 1.21% x̃: 0.62% 95% mean confidence interval for cycles value: -97.96 65.41 95% mean confidence interval for cycles %-change: -1.56% -0.62% Inconclusive result (value mean confidence interval includes 0). No changes on Iron Lake or GM45. v2: Move 'if (cond != BRW_CONDITIONAL_Z && cond != BRW_CONDITIONAL_NZ)' check outside the loop. Suggested by Iago. Signed-off-by: Ian Romanick <[email protected]>
*	i965/fs: Rearrange code to remove most of the gotos	Ian Romanick	2018-06-15	1	-11/+3
\| \| \| \|	Signed-off-by: Ian Romanick <[email protected]>
*	i965/fs: Refactor propagation of conditional modifiers from compares to adds	Ian Romanick	2018-06-15	1	-57/+80
\| \| \| \|	Signed-off-by: Ian Romanick <[email protected]>
*	i965/vec4: Optimize OR with 0 into a MOV	Ian Romanick	2018-06-15	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	All of the affected shaders are geometry shaders... the same ones from the similar fs changes. The "No changes on any other platforms" comment below is not quite right. Without the previous change to register coalescing, this optimization caused quite a few regressions in tests that either used gl_ClipVertex or used different interpolation modes. I observed that with both patches applied, glsl-1.10/execution/interpolation/interpolation-none-gl_BackSecondaryColor-smooth-vertex.shader_test was one instruction shorter. I suspect other shaders would be similarly affected. Since this is all based on NOS, shader-db does not reflect it. Haswell total instructions in shared programs: 12954955 -> 12954918 (<.01%) instructions in affected programs: 3603 -> 3566 (-1.03%) helped: 37 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.21% max: 2.50% x̄: 1.99% x̃: 2.50% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -2.30% -1.69% Instructions are helped. total cycles in shared programs: 410012108 -> 410012098 (<.01%) cycles in affected programs: 3540 -> 3530 (-0.28%) helped: 5 HURT: 0 helped stats (abs) min: 2 max: 2 x̄: 2.00 x̃: 2 helped stats (rel) min: 0.28% max: 0.28% x̄: 0.28% x̃: 0.28% 95% mean confidence interval for cycles value: -2.00 -2.00 95% mean confidence interval for cycles %-change: -0.28% -0.28% Cycles are helped. Ivy Bridge total instructions in shared programs: 11679387 -> 11679351 (<.01%) instructions in affected programs: 3292 -> 3256 (-1.09%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.21% max: 2.50% x̄: 2.04% x̃: 2.50% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -2.34% -1.74% Instructions are helped. No changes on any other platforms. Signed-off-by: Ian Romanick <[email protected]>
*	i965/vec4: Don't register coalesce into source of VS_OPCODE_UNPACK_FLAGS_SIMD4X2	Ian Romanick	2018-06-15	1	-0/+9
\| \| \| \| \| \| \|	This prevents regressions in a bunch of clipping and interpolation tests caused by the next patch (i965/vec4: Optimize OR with 0 into a MOV). Signed-off-by: Ian Romanick <[email protected]>
*	i965/fs: Optimize OR with 0 into a MOV	Ian Romanick	2018-06-15	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fs_visitor::set_gs_stream_control_data_bits generates some code like "control_data_bits \| stream_id << ((2 * (vertex_count - 1)) % 32)" as part of EmitVertex. The first time this (dynamically) occurs in the shader, control_data_bits is zero. Many times we can determine this statically and various optimizations will collaborate to make one of the OR operands literal zero. Converting the OR to a MOV usually allows it to be copy-propagated away. However, this does not happen in at least some shaders (in the assembly output of shaders/closed/UnrealEngine4/EffectsCaveDemo/301.shader_test, search for shl). All of the affected shaders are geometry shaders. Broadwell and Skylake had similar results. (Skylake shown) total instructions in shared programs: 14375452 -> 14375413 (<.01%) instructions in affected programs: 6422 -> 6383 (-0.61%) helped: 39 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.14% max: 2.56% x̄: 1.91% x̃: 2.56% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -2.26% -1.57% Instructions are helped. total cycles in shared programs: 531981179 -> 531980555 (<.01%) cycles in affected programs: 27493 -> 26869 (-2.27%) helped: 39 HURT: 0 helped stats (abs) min: 16 max: 16 x̄: 16.00 x̃: 16 helped stats (rel) min: 0.60% max: 7.92% x̄: 5.94% x̃: 7.92% 95% mean confidence interval for cycles value: -16.00 -16.00 95% mean confidence interval for cycles %-change: -6.98% -4.90% Cycles are helped. No changes on earlier platforms. Signed-off-by: Ian Romanick <[email protected]>
*	v3d: Handle a no-intersection scissor even if it's outside of the VP.	Eric Anholt	2018-06-15	1	-10/+8
\| \| \| \| \| \|	The min/maxes ended up producing a negative clip width/height for dEQP-GLES3.functional.fragment_ops.scissor.outside_render_line. Just make sure they stay at 0 (or v3d 3.x's workaround) if that happens.
*	v3d: Use the proper depth texture type for sampling.	Eric Anholt	2018-06-15	1	-3/+3
\| \| \| \|	Fixes failing tests in dEQP-GLES3.functional.texture.shadow
*	v3d: Limit shader threading according to our maximum TMU fifo usage.	Eric Anholt	2018-06-15	1	-10/+24
\| \| \| \| \| \|	Fixes simulator assertion failures in dEQP-GLES3.functional.shaders.texture_functions.texture.samplercubeshadow_bias_fragment and similar complicated cases.
*	v3d: Fix shaders using pixel center W but no varyings.	Eric Anholt	2018-06-15	4	-16/+9
\| \| \| \| \| \| \| \|	The docs called this field "uses both center W and centroid W", but actually it's "do you need center W even if varyings don't obviously call for it?" Fixes dEQP-GLES3.functional.shaders.builtin_variable.fragcoord_w
*	intel/aubinator: Use int to store getopt_long flags.	Rafael Antognolli	2018-06-15	1	-2/+2
\| \| \| \| \| \| \| \|	getopt_long flag parameter is an int pointer, so if we use bool to store those values, when getopt_long writes to one of them, it might end up overwriting the next one. Reviewed-by: Ian Romanick <[email protected]>
*	Revert "radv: always set/load both depth and stencil clear values"	Samuel Pitoiset	2018-06-15	1	-5/+28
\| \| \| \| \| \| \| \| \|	This fixes a rendering regression with RoTR. This reverts commit 4bdad9faddc82a4560603936ce5ade5707ecb254. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv: don't check for linear images in emit_fast_color_clear()	Samuel Pitoiset	2018-06-15	1	-2/+0
\| \| \| \| \| \| \| \|	We don't enable CMASK for linear surfaces and addrlib only enables DCC for tiling surfaces. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv: allow RADV_PERFTEST=dccmsaa on GFX9	Samuel Pitoiset	2018-06-15	1	-2/+2
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv: add RADV_DEBUG=checkir	Samuel Pitoiset	2018-06-15	5	-3/+11
\| \| \| \| \| \| \|	This allows to run the LLVM verifier pass. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>