mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	freedreno: instanced drawing/compute not yet supported	Ilia Mirkin	2014-10-02	1	-3/+3
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>
*	mesa: fix GetTexImage for 1D array depth textures	Dave Airlie	2014-10-03	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While running piglit in virgl, I hit an assert in intel driver. "qemu-system-x86_64: intel_tex.c:219: intel_map_texture_image: Assertion `tex_image->TexObject->Target != 0x8C18 \|\| h == 1' failed." Thanks to Eric and Ken for pointing me in the right direction, Fix the get_tex_depth to do the same fixup as get_tex_rgba does for 1D array textures. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Dave Airlie <airlied@redhat.com>
*	st/mesa: Fix paths used in Android builds	Tomasz Figa	2014-10-03	3	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With current makefiles the build fails because source and build paths are generated incorrectly. With Android build system the top_srcdir and top_builddir variables are undefined and all paths are relative to where Android.mk is located. This ends up with path likes external/mesa/src/mesa/src/mesa/ for both source and build paths, which are obviously wrong. This patch fixes this by overriding resulting SRCDIR and BUILDDIR variables with empty string, so that paths end up being relative to Android.mk file again. Appending correct build path to generated files is already done in Android.gen.mk. Signed-off-by: Tomasz Figa <tomasz.figa@gmail.com> CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
*	st/mesa: Generate format_info.c in Android builds	Tomasz Figa	2014-10-03	1	-0/+9
\| \| \| \| \| \| \| \| \| \|	Current Android makefiles lack generation of format_info.c, which is a dependency of main/format.c. This patch adds necessary code to Android.gen.mk. Signed-off-by: Tomasz Figa <tomasz.figa@gmail.com> CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
*	util: Include in Android builds	Tomasz Figa	2014-10-03	8	-4/+114
\| \| \| \| \| \| \| \| \| \|	This patch fixes Android build failures by including src/util directory in compilation. Files inside of this directory are compiled into libmesa_util static library and linked with resulting libGLES_mesa. Signed-off-by: Tomasz Figa <tomasz.figa@gmail.com> CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
*	i965/fs: Use the correct base_mrf for spilling pairs in SIMD8	Jason Ekstrand	2014-10-02	1	-3/+4
\| \| \| \| \| \| \| \| \| \|	Before, we were hard-coding the base_mrf based on dispatch width not number of registers spilled at a time. This caused us to emit instructions with a base_mrf or 14 and a mlen of 3 so we used the magical non-existant m16 register. This fixes the problem. Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>
*	i965/fs: Add a MAX_GRF_SIZE define and use it various places	Jason Ekstrand	2014-10-02	4	-6/+9
\| \| \| \| \| \| \| \| \| \| \| \|	Previously, we had a MAX_SAMPLER_MESSAGE_SIZE which we used instead. However, some FB write messages can validly be longer than this so we need something different. Since MAX_SAMPLER_MESSAGE_SIZE is validly useful on its own, we leave it alone and add a new MAX_GRF_SIZE that's big enough for FB writes. Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=84539 Reviewed-by: Matt Turner <mattst88@gmail.com>
*	i965/fs: Use the actual regsister width in brw_reg_from_fs_reg	Jason Ekstrand	2014-10-02	1	-0/+13
\| \| \| \| \| \| \| \|	This fixes a bug where 1-wide operations don't properly translate down to 1-wide instructions. Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>
*	i965/fs_fp: Use null_reg from fs_visitor instead of rolling our own	Jason Ekstrand	2014-10-02	1	-6/+4
\| \| \| \| \| \|	Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=84529 Reviewed-by: Matt Turner <mattst88@gmail.com>
*	freedreno/a3xx: handle large shader program sizes	Rob Clark	2014-10-02	1	-11/+63
\| \| \| \| \| \| \|	Above a certain limit use CACHE mode instead of BUFFER mode. This should solve gpu hangs with large shader programs. Signed-off-by: Rob Clark <robclark@freedesktop.org>
*	freedreno: update generated headers	Rob Clark	2014-10-02	4	-8/+9
\| \| \| \|	Signed-off-by: Rob Clark <robclark@freedesktop.org>
*	freedreno: dual-source render targets are not supported	Ilia Mirkin	2014-10-02	1	-1/+1
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>
*	gallium/hud: use u_sampler_view_default_template helper	Ilia Mirkin	2014-10-02	1	-6/+3
\| \| \| \| \| \| \| \|	The existing code was not setting several fields, most importantly the target, which is required on nv50/nvc0. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>
*	glsl: Fix memory leak in builtin_builder::_image_prototype.	Iago Toral Quiroga	2014-10-02	1	-3/+5
\| \| \| \| \| \|	in_var calls the ir_variable constructor, which dups the variable name. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
*	mesa: relax draw api validation on ES2	Tapani Pälli	2014-10-02	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \|	Patch fixes failing test in WebGL conformance test 'point-no-attributes' when running Chrome on OpenGL ES. (Shader program may draw points using constant data in shader.) No Piglit regressions. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
*	glsl: make consistent use of DECLARE_RALLOC_CXX_OPERATORS	Ilia Mirkin	2014-10-02	2	-47/+3
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
*	vc4: Fix the mapping of the minification filter to HW values.	Eric Anholt	2014-10-01	1	-8/+8
\| \| \| \| \| \|	They're actually as documented in the HW specs and the GL mipmapping enums order. Fixes fbo-generatemipmap-filtering , and some other tests where we were off by a few bits due to unexpected linear filtering.
*	vc4: Make the last static array in vc4_program.c dynamically sized.	Eric Anholt	2014-10-01	2	-3/+13
\|
*	vc4: Fix some broken indentation.	Eric Anholt	2014-10-01	1	-1/+1
\|
*	vc4: Add support for the FACE semantic.	Eric Anholt	2014-10-01	5	-1/+24
\| \| \| \|	Fixes glsl-fs-frontfacing.
*	vc4: Add support for TGSI_OPCODE_CLAMP.	Eric Anholt	2014-10-01	1	-0/+12
\| \| \| \|	This will be used by the shared LIT lowering code.
*	vc4: Fix compiler warning	Eric Anholt	2014-10-01	1	-1/+1
\|
*	meta: Fix make check failures in setup_glsl_msaa_blit_scaled_shader()	Anuj Phogat	2014-10-01	1	-8/+9
\| \| \| \| \| \| \|	introduced by commit 68ee950. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reported-by: Mark Janes <mark.a.janes@intel.com>
*	mesa: fix _mesa_alloc_dispatch_table() declaration	Brian Paul	2014-10-01	1	-1/+1
\| \| \| \|	Insert 'void' parameter to match declaration in api_exec.h. Trivial.
*	meta: (trivial) remove accidental double semicolon	Roland Scheidegger	2014-10-01	1	-1/+1
\|
*	i965: Enable EXT_framebuffer_multisample_blit_scaled for gen8	Anuj Phogat	2014-10-01	1	-2/+1
\| \| \| \| \|	Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>
*	meta: Implement ext_framebuffer_multisample_blit_scaled extension	Anuj Phogat	2014-10-01	2	-13/+199
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Extension enables doing a multisample buffer resolve and buffer scaling using a single glBlitFrameBuffer() call. Currently, we have this extension implemented in BLORP which is only used by SNB and IVB. This patch implements the extension in meta path which makes it available to Broadwell. Implementation features: - Supports scaled resolves of 2X, 4X and 8X multisample buffers. - Avoids unnecessary shader compilations by storing the pre compiled shaders for each supported sample count. - Uses bilinear filtering for both GL_SCALED_RESOLVE_FASTEST_EXT and GL_SCALED_RESOLVE_NICEST_EXT filter options. This is an allowed behavior in the extension's spec. - I tried doing bicubic filtering for GL_SCALED_RESOLVE_NICEST_EXT filter. It made the edges in the image look little smoother but the image gets blurred causing no overall quality improvement. For now I have dropped the idea of doing different filtering for nicest filter. V2: - Minor changes to simplify the fragment shader. - Refactor the code to move i965 specific sample_map computation out of Meta. We now use ctx->Const.SampleMap{2,4,8}x variables initialized by the driver. - Use a simple msaa resolve shader for scaled resolves with scaling factor = 1.0. V3: - Make changes to create a string out of ctx->Const.SampleMap{2,4,8}x variables and use it in fragment shader. V4: - Make changes to use uint8_t type ctx->Const.SampleMap{2,4,8}x variables. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>
*	i965: Initialize the SampleMap{2,4,8}x variables	Anuj Phogat	2014-10-01	3	-0/+55
\| \| \| \| \| \| \| \| \| \| \| \| \|	with values specific to Intel hardware. V2: Define and use gen6_get_sample_map() function to initialize the variables. V3: Change the function name to gen6_set_sample_maps() and use memcpy() to fill in the data. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>
*	mesa: Add new variables in gl_context to store sample layout	Anuj Phogat	2014-10-01	1	-0/+32
\| \| \| \| \| \| \| \| \| \| \| \| \|	SampleMap{2,4,8}x variables are used in later patches to implement EXT_framebuffer_multisample_blit_scaled extension. V2: Use integer array instead of a string. Bump up the comment. V3: Use uint8_t type array. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>
*	st/va: implement vlVa(Query\|Create\|Get\|Put\|Destroy)Image	Leo Liu	2014-10-01	3	-9/+281
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch implements functions for images support, which basically supports copy data between video surface and user buffers, in this case supports SW decode, and other video output v2: fix buffer size for odd-sized image case expose I420 format as well v3: fix YUV 4:2:2 format data buffer size cleanup I420 format exposure Signed-off-by: Leo Liu <leo.liu@amd.com>
*	st/va: implement Picture functions for mpeg2 h264 and vc1	Christian König	2014-10-01	4	-6/+371
\| \| \| \| \| \| \| \|	This patch implements codec for mpeg2 h264 and vc1, populates codec parameters and pass them to HW driver. Signed-off-by: Christian König <christian.koenig@amd.com> Signed-off-by: Leo Liu <leo.liu@amd.com>
*	st/va: implement Context Surface and Buffer	Christian König	2014-10-01	4	-21/+320
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch implements context managements, relate it HW driver, functions for video surface managements, and functions for application data memory buffer managements. implemented functions: vlVa(Create\|Destroy)Context vlVa(Create\|Destroy\|Put)Surfaces vlVa(Create\|Destroy)Buffer Signed-off-by: Christian König <christian.koenig@amd.com> Signed-off-by: Leo Liu <leo.liu@amd.com>
*	st/va: implement vlVa(Create\|Destroy\|Query\|Get)Config	Christian König	2014-10-01	3	-5/+143
\| \| \| \| \| \| \| \| \| \| \|	This patch is for application to query configuration, such as profiles, entrypoints, and attributes v2: fix missing profile with query Signed-off-by: Michael Varga <michael.varga@amd.com> Signed-off-by: Christian König <christian.koenig@amd.com> Signed-off-by: Leo Liu <leo.liu@amd.com>
*	st/va: skeleton VAAPI state tracker	Christian König	2014-10-01	15	-0/+1015
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds a skeleton VA-API state tracker, which is filled with live in the subsequent patches. v2: fixes in configure.ac and va state_tracker Makefile.am v3: do not link against libva. detect libva version, and correctly set driver entrypoint name. rebase(cleanup) targets/va/Makefile.am v4: cleanup va version auto detection add back targets/va/va.sym Signed-off-by: Christian König <christian.koenig@amd.com> Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
*	st/vdpau: move common functions to util	Leo Liu	2014-10-01	2	-77/+78
\| \| \| \| \| \| \| \|	Break out these functions so that they can be shared with a other state trackers. They will be used in subsequent patches for the new VA-API state tracker. Signed-off-by: Leo Liu <leo.liu@amd.com>
*	freedreno: max-texture-lod-bias should be 15.0f	Rob Clark	2014-10-01	1	-1/+1
\| \| \| \| \| \|	Fixes piglit lodbias test. Signed-off-by: Rob Clark <robclark@freedesktop.org>
*	mesa: Avoid flagging _NEW_VIEWPORT on redundant viewport updates.	Kenneth Graunke	2014-10-01	1	-0/+6
\| \| \| \| \| \| \| \| \|	Cuts the number of i965 color calculator viewport uploads by 100x (11017983 -> 113385) in 'x11perf -gc' with Glamor in Xephyr. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
*	i965: Drop CACHE_NEW_VS_PROG from the gen7_sf_state atom.	Kenneth Graunke	2014-10-01	1	-1/+1
\| \| \| \| \| \| \| \| \|	I believe when I wrote this code, gen6_sf_state used CACHE_NEW_VS_PROG, which has since been replaced by BRW_NEW_VUE_MAP_GEOM_OUT. It's not needed here anyway - only SBE needs it. Just a copy and paste mistake. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
*	i965: Drop brwBindProgram driver hook.	Kenneth Graunke	2014-10-01	1	-20/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This function flagged BRW_NEW__PROGRAM When ctx->{Vertex,Geometry,Fragment}Program._Current changes, core Mesa calls the BindProgram driver hook, which flagged BRW_NEW__PROGRAM. However, brw_upload_state also checks for that changing, sets the same flags, and also updates brw->fragment_program and so on. So, this looks to be entirely redundant. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
*	i965: Add missing /* BRW_NEW_FRAGMENT_PROGRAM */ comments.	Kenneth Graunke	2014-10-01	3	-6/+7
\| \| \| \| \| \| \| \| \|	I had to dig a bit to figure out why this was necessary. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
*	i965: Use "1ull" instead of "1" in BRW_NEW_* defines.	Kenneth Graunke	2014-10-01	1	-32/+32
\| \| \| \| \| \| \| \| \| \|	Now that the bitfield is a uint64_t, we should use 1ull. Currently, we only have 32 entries, so 1 works fine, but it's not future-proof. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
*	i965: Use ~0ull when flagging all BRW_NEW_* dirty flags.	Kenneth Graunke	2014-10-01	3	-4/+4
\| \| \| \| \| \| \| \| \|	~0 is 0xFFFFFFFF, which only covers the first 32 bits. We need all 64. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
*	i965: Fix INTEL_DEBUG=state to work with 64-bit dirty bits.	Kenneth Graunke	2014-10-01	1	-16/+7
\| \| \| \| \| \| \| \| \| \| \|	This will keep INTEL_DEBUG=state working when we add BRW_NEW_* bits beyond 1 << 31. We missed doing this when widening the driver flags from uint32_t to uint64_t. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
*	i965: Delete CACHE_NEW_BLORP_CONST_COLOR_PROG.	Kenneth Graunke	2014-10-01	2	-3/+0
\| \| \| \| \| \| \| \| \|	Unused since krh rewrote fast clears to use meta. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
*	i965: Fix typo in comment	Chris Forbes	2014-10-01	1	-1/+1
\| \| \| \|	Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>
*	i965: Fix spelling of GEN7_SAMPLER_EWA_ANISOTROPIC_ALGORITHM	Chris Forbes	2014-10-01	2	-2/+2
\| \| \| \|	Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>
*	llvmpipe: Add missing LLVMGetGlobalContext() arg in lp_test_format.c.	Vinson Lee	2014-09-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix build error introduced with commit eedbce9c63a3f385908bdc8a69e8be98dd3522ff. lp_test_format.c: In function ‘test_format_unorm8’: lp_test_format.c:226:4: error: too few arguments to function ‘gallivm_create’ gallivm = gallivm_create("test_module_unorm8"); ^ In file included from ../../../../src/gallium/auxiliary/gallivm/lp_bld_format.h:38:0, from lp_test_format.c:42: ../../../../src/gallium/auxiliary/gallivm/lp_bld_init.h:58:1: note: declared here gallivm_create(const char *name, LLVMContextRef context); ^ Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=84538 Signed-off-by: Vinson Lee <vlee@freedesktop.org>
*	glx/dri3: Provide error diagnostics when DRI3 allocation fails	Keith Packard	2014-09-30	1	-8/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of just segfaulting in the driver when a buffer allocation fails, report error messages indicating what went wrong so that we can debug things. As a simple example, chromium wraps Mesa in a sandbox which doesn't allow access to most syscalls, including the ability to create shared memory segments for fences. Before, you'd get a simple segfault in mesa and your 3D acceleration would fail. Now you get: $ chromium --disable-gpu-blacklist [10618:10643:0930/200525:ERROR:nss_util.cc(856)] After loading Root Certs, loaded==false: NSS error code: -8018 libGL: pci id for fd 12: 8086:0a16, driver i965 libGL: OpenDriver: trying /local-miki/src/mesa/mesa/lib/i965_dri.so libGL: Can't open configuration file /home/keithp/.drirc: Operation not permitted. libGL: Can't open configuration file /home/keithp/.drirc: Operation not permitted. libGL error: DRI3 Fence object allocation failure Operation not permitted [10618:10618:0930/200525:ERROR:command_buffer_proxy_impl.cc(153)] Could not send GpuCommandBufferMsg_Initialize. [10618:10618:0930/200525:ERROR:webgraphicscontext3d_command_buffer_impl.cc(236)] CommandBufferProxy::Initialize failed. [10618:10618:0930/200525:ERROR:webgraphicscontext3d_command_buffer_impl.cc(256)] Failed to initialize command buffer. This made it pretty easy to diagnose the problem in the referenced bug report. Bugzilla: https://code.google.com/p/chromium/issues/detail?id=415681 Signed-off-by: Keith Packard <keithp@keithp.com> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Matt Turner <mattst88@gmail.com>
*	glx/dri3: Use four buffers until X driver supports async flips	Keith Packard	2014-09-30	2	-2/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A driver which doesn't have async flip support will queue up flips without any way to replace them afterwards. This means we've got a scanout buffer pinned as soon as we schedule a flip and so we need another buffer to keep from stalling. When vblank_mode=0, if there are only three buffers we do: current scanout buffer = 0 at MSC 0 Render frame 1 to buffer 1 PresentPixmap for buffer 1 at MSC 1 This is sitting down in the kernel waiting for vblank to become the next scanout buffer Render frame 2 to buffer 2 PresentPixmap for buffer 2 at MSC 1 This cannot be displayed at MSC 1 because the kernel doesn't have any way to replace buffer 1 as the pending scanout buffer. So, best case this will get displayed at MSC 2. Now we block after this, waiting for one of the three buffers to become idle. We can't use buffer 0 because it is the scanout buffer. We can't use buffer 1 because it's sitting in the kernel waiting to become the next scanout buffer and we can't use buffer 2 because that's the most recent frame which will become the next scanout buffer if the application doesn't manage to generate another complete frame by MSC 2. With four buffers, we get: current scanout buffer = 0 at MSC 0 Render frame 1 to buffer 1 PresentPixmap for buffer 1 at MSC 1 This is sitting down in the kernel waiting for vblank to become the next scanout buffer Render frame 2 to buffer 2 PresentPixmap for buffer 2 at MSC 1 This cannot be displayed at MSC 1 because the kernel doesn't have any way to replace buffer 1 as the pending scanout buffer. So, best case this will get displayed at MSC 2. The X server will queue this swap until buffer 1 becomes the scanout buffer. Render frame 3 to buffer 3 PresentPixmap for buffer 3 at MSC 1 As soon as the X server sees this, it will replace the pending buffer 2 swap with this swap and release buffer 2 back to the application Render frame 4 to buffer 2 PresentPixmap for buffer 2 at MSC 1 Now we're in a steady state, flipping between buffer 2 and 3 waiting for one of them to be queued to the kernel. ... current scanout buffer = 1 at MSC 1 Now buffer 0 is free and (e.g.) buffer 2 is queued in the kernel to be the scanout buffer at MSC 2 Render frames, flipping between buffer 0 and 3 When the system can replace a queued buffer, and we update Present to take advantage of that, we can use three buffers and get: current scanout buffer = 0 at MSC 0 Render frame 1 to buffer 1 PresentPixmap for buffer 1 at MSC 1 This is sitting waiting for vblank to become the next scanout buffer Render frame 2 to buffer 2 PresentPixmap for buffer 2 at MSC 1 Queue this for display at MSC 1 1. There are three possible results: 1) We're still before MSC 1. Buffer 1 is released, buffer 2 is queued waiting for MSC 1. 2) We're now after MSC 1. Buffer 0 was released at MSC 1. Buffer 1 is the current scanout buffer. a) If the user asked for a tearing update, we swap scanout from buffer 1 to buffer 2 and release buffer 1. b) If the user asked for non-tearing update, we queue buffer 2 for the MSC 2. In all three cases, we have a buffer released (call it 'n'), ready to receive the next frame. Render frame 3 to buffer n PresentPixmap for buffer n If we're still before MSC 1, then we'll ask to present at MSC 1. Otherwise, we'll ask to present at MSC 2. Present already does this if the driver offers async flips, however it does this by waiting for the right vblank event and sending an async flip right at that point. I've hacked the intel driver to offer this, but I get tearing at the top of the screen. I think this is because flips are always done from within the ring, and so the latency between the vblank event and the async flip happening can cause tearing at the top of the screen. That's why I'm keying the need for the extra buffer on the lack of 2D driver support for async flips. Signed-off-by: Keith Packard <keithp@keithp.com> Acked-by: Jason Ekstrand <jason.ekstrand@intel.com> Tested-by: Dylan Baker <baker.dylan.c@gmail.com>
*	i965/fs: Fix the build	Jason Ekstrand	2014-09-30	1	-1/+1
\|