mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	mesa: Completely remove QuerySamplesForFormat from driver func table	Eduardo Lima Mitev	2016-03-03	1	-1/+0
\| \| \| \| \| \| \|	At this point, all uses have been replaced by the more general hook QueryInternalFormat, introduced by ARB_internalformat_query2. Reviewed-by: Dave Airlie <[email protected]>
*	i965/formatquery: Respond queries SAMPLES and NUM_SAMPLE_COUNTS	Eduardo Lima Mitev	2016-03-03	3	-4/+15
\| \| \| \| \| \| \| \| \| \|	This effectively disables old QuerySamplesForFormat driver hook, since it is never called by Mesa anymore. v2: Call brw_query_samples_for_format() with a dummy buffer to calculate num samples, to avoid modifying the original buffer. Reviewed-by: Dave Airlie <[email protected]>
*	i965: Move brw_query_samples_for_format() to brw_queryformat.c	Eduardo Lima Mitev	2016-03-03	3	-38/+41
\| \| \| \| \| \| \|	Now that there is a dedicated source file for internal format queries, this function belongs there. Reviewed-by: Dave Airlie <[email protected]>
*	i965: Add boilerplate function for QueryInternalFormat driver hook	Eduardo Lima Mitev	2016-03-03	4	-0/+52
\| \| \| \| \| \| \|	By default, we call back the driver's hook fallback function that has generic implementations for the all the queries. Reviewed-by: Dave Airlie <[email protected]>
*	mesa: Add a default QueryInternalFormat() function for drivers	Eduardo Lima Mitev	2016-03-03	1	-0/+1
\| \| \| \| \| \| \|	This is a fallback function for drivers not implementing ARB_internalformat_query2. Reviewed-by: Dave Airlie <[email protected]>
*	meta: Remove the 'allocate_storage' parameter in _mesa_meta_pbo_GetTexSubImage()	Anuj Phogat	2016-03-02	4	-7/+3
\| \| \| \| \| \| \| \|	Texture is already allocated before calling this meta function. So, the value of 'allocate_storage' passed to the function is always false. Signed-off-by: Anuj Phogat <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Fix the pbo usage in meta for GLES{1,2} contexts	Anuj Phogat	2016-03-02	1	-8/+13
\| \| \| \| \| \| \| \| \| \| \| \| \|	OpenGL ES 1.0 doesn't support using GL_STREAM_DRAW and both ES 1.0 and 2.0 don't support GL_STREAM_READ in glBufferData(). So, handle it correctly by calling the _mesa_meta_begin() before create_texture_for_pbo(). V2: Remove the changes related to allocate_storage. (Ian) Cc: <[email protected]> Signed-off-by: Anuj Phogat <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	i965/gen6/gs: Replace V-immediate with VF-immediate.	Matt Turner	2016-03-02	1	-5/+5
\| \| \| \|	Reviewed-by: Francisco Jerez <[email protected]>
*	i965: set ctx->Const.MaxViewport{Width,Height} to 32k	Samuel Iglesias Gonsálvez	2016-03-02	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	From ARB_viewport_array spec: " * On GL3-capable hardware the VIEWPORT_BOUNDS_RANGE should be at least [-16384, 16383]. * On GL4-capable hardware the VIEWPORT_BOUNDS_RANGE should be at least [-32768, 32767]." This range is set using ctx->Const.MaxViewportWidth value, so just bump those constants to 32k for gen7+ which can support OpenGL 4.0. Signed-off-by: Samuel Iglesias Gonsálvez <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	program: Remove OPCODE_KIL_NV.	Matt Turner	2016-03-01	1	-20/+0
\| \| \| \| \| \|	Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Acked-by: Brian Paul <[email protected]>
*	mesa: Remove EmitCondCodes.	Matt Turner	2016-03-01	2	-2/+0
\| \| \| \| \| \|	Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Acked-by: Brian Paul <[email protected]>
*	i965/meta: Don't pollute the framebuffer namespace	Ian Romanick	2016-03-01	3	-27/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	tl;dr: For many types of GL object, we can NEVER use the Gen function. In OpenGL ES (all versions!) and OpenGL compatibility profile, applications don't have to call Gen functions. The GL spec is very clear about how you can mix-and-match generated names and non-generated names: you can use any name you want for a particular object type until you call the Gen function for that object type. Here's the problem scenario: - Application calls a meta function that generates a name. The first Gen will probably return 1. - Application decides to use the same name for an object of the same type without calling Gen. Many demo programs use names 1, 2, 3, etc. without calling Gen. - Application calls the meta function again, and the meta function replaces the data. The application's data is lost, and the app fails. Have fun debugging that. Signed-off-by: Ian Romanick <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92363 Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965/meta: Use _mesa_bind_framebuffers instead of _mesa_BindFramebuffer	Ian Romanick	2016-03-01	3	-8/+25
\| \| \| \| \|	Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Don't pollute the framebuffer namespace	Ian Romanick	2016-03-01	4	-54/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	tl;dr: For many types of GL object, we can NEVER use the Gen function. In OpenGL ES (all versions!) and OpenGL compatibility profile, applications don't have to call Gen functions. The GL spec is very clear about how you can mix-and-match generated names and non-generated names: you can use any name you want for a particular object type until you call the Gen function for that object type. Here's the problem scenario: - Application calls a meta function that generates a name. The first Gen will probably return 1. - Application decides to use the same name for an object of the same type without calling Gen. Many demo programs use names 1, 2, 3, etc. without calling Gen. - Application calls the meta function again, and the meta function replaces the data. The application's data is lost, and the app fails. Have fun debugging that. Fixes piglit tests: - object-namespace-pollution glGetTexImage-compressed framebuffer - object-namespace-pollution glGenerateMipmap framebuffer Signed-off-by: Ian Romanick <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92363 Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta/decompress: Track framebuffer using gl_framebuffer instead of GL API ↵	Ian Romanick	2016-03-01	2	-7/+12
\| \| \| \| \| \| \|	object handle Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta/generate_mipmap: Track framebuffer using gl_framebuffer instead of GL ↵	Ian Romanick	2016-03-01	2	-24/+16
\| \| \| \| \| \| \|	API object handle Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Use _mesa_bind_framebuffers instead of _mesa_BindFramebuffer	Ian Romanick	2016-03-01	3	-8/+40
\| \| \| \| \|	Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Use _mesa_CreateFramebuffers instead of _mesa_GenFramebuffers	Ian Romanick	2016-03-01	4	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	This enables later patches that will stop calling _mesa_GenFramebuffers or _mesa_CreateFramebuffers which pollute the framebuffer namespace. For framebuffers, the Bind call is still necessary. sed -i -e 's/_mesa_GenFramebuffers/_mesa_CreateFramebuffers/' \ src/mesa/drivers/common/*.c Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965/meta: Use _mesa_CreateFramebuffers instead of _mesa_GenFramebuffers	Ian Romanick	2016-03-01	3	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	This enables later patches that will stop calling _mesa_GenFramebuffers or _mesa_CreateFramebuffers which pollute the framebuffer namespace. For framebuffers, the Bind call is still necessary. sed -i -e 's/_mesa_GenFramebuffers/_mesa_CreateFramebuffers/' \ src/mesa/drivers/dri/i965/*.c Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Save and restore the framebuffer using gl_framebuffer instead of GL ↵	Ian Romanick	2016-03-01	2	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	API object handle Some meta operations can be called recursively. Future changes (the "Don't pollute the ... namespace" changes) will cause objects with invalid names to be used. If a nested meta operation tries to restore an object named 0xDEADBEEF, it will fail. This also fixes another latent bug in meta. In a multithreaded, multicontext application, one thread can delete an object that is bound in another thread. That object continues to exist until it is unbound (i.e., its refcount drops to zero). Meta unbinds objects all over the place. As a result, the rebind in _mesa_meta_end could fail because the object vanished! See https://bugs.freedesktop.org/show_bug.cgi?id=92363#c8. Using _mesa_reference_<object type> to save and restore the objects prevents the refcount from going to zero. Signed-off-by: Ian Romanick <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92363 Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Use _mesa_check_framebuffer_status instead of _mesa_CheckFramebufferStatus	Ian Romanick	2016-03-01	4	-12/+12
\| \| \| \| \| \| \| \| \| \| \| \|	sed -i -e 's/_mesa_CheckFramebufferStatus(GL_DRAW_FRAMEBUFFER/_mesa_check_framebuffer_status(ctx, ctx->DrawBuffer/' \ -e 's/_mesa_CheckFramebufferStatus(GL_FRAMEBUFFER[^)]*/_mesa_check_framebuffer_status(ctx, ctx->DrawBuffer/' \ -e 's/_mesa_CheckFramebufferStatus(GL_READ_FRAMEBUFFER/_mesa_check_framebuffer_status(ctx, ctx->ReadBuffer/' \ $(grep -rl _mesa_CheckFramebufferStatus src/mesa/drivers) The second expression catches both GL_FRAMEBUFFER and GL_FRAMEBUFFER_EXT. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Obvious refactor of _mesa_meta_framebuffer_texture_image	Ian Romanick	2016-03-01	1	-21/+5
\| \| \| \| \|	Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	meta: Convert _mesa_meta_bind_fbo_image to take a gl_framebuffer instead of ↵	Ian Romanick	2016-03-01	5	-49/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a GL API handle Also change the name of the function to _mesa_meta_framebuffer_texture_image. The function is basically a wrapper around _mesa_framebuffer_texture (which is used to implement glFramebufferTexture1D and friends), so it makes sense for it's name to be similar to that. The next patch will clean _mesa_meta_framebuffer_texture_image up considerably. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965: Push most TES inputs in vec4 mode.	Kenneth Graunke	2016-02-29	2	-34/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(This is commit 4a1c8a3037cd29938b2a6e2c680c341e9903cfbe for vec4 mode.) Using the push model for inputs is much more efficient than pulling inputs - the hardware can simply copy a large chunk into URB registers at thread creation time, rather than having the thread send messages to request data from the L3 cache. Unfortunately, it's possible to have more TES inputs than fit in registers, so we have to fall back to the pull model in some cases. However, it turns out that most tessellation evaluation shaders are fairly simple, and don't use many inputs. An arbitrary cut-off of 24 vec4 slots (12 registers) should suffice. (I chose this instead of the 32 vec4 slots used in the scalar backend to avoid regressing a few Piglit tests due to the vec4 register allocator being too stupid to figure out what to do. We probably ought to fix that, but it's a separate issue.) Improves performance in GPUTest's tessmark_x64 microbenchmark by 41.5394% +/- 0.288519% (n = 115) at 1024x768 on my Clevo W740SU (with Iris Pro 5200). Improves performance in Synmark's Gl40TerrainFlyTess microbenchmark by 38.3576% +/- 0.759748% (n = 42). v2: Simplify abs/negate handling, as requested by Matt. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chris Forbes <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	Android: clean-up and fix DRI module path handling	Rob Herring	2016-02-29	3	-5/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MESA_DRI_MODULE_PATH is only getting set for classic DRI drivers and may or may not be set correctly for gallium_dri.so depending on the makefile include ordering. For Android 6 and earlier it is fine, but with build system changes in AOSP master, it is not. Move the path variables to a single place at the top level and introduce MESA_DRI_MODULE_REL_PATH for Android 5 and later which require relative paths. With this, there is a single variable to change. Cc: "11.1 11.2" <[email protected]> Signed-off-by: Rob Herring <[email protected]> Reviewed-by: Emil Velikov <[email protected]>
*	Android: remove dependence on .SECONDEXPANSION	Rob Herring	2016-02-29	1	-9/+13
\| \| \| \| \| \| \| \| \| \|	With the Android build system changes to ninja/kati, the use of .SECONDEXPANSION is no longer supported. Fix this by avoiding rule specific variables and using $(transform-generated-source). Cc: "11.1 11.2" <[email protected]> Signed-off-by: Rob Herring <[email protected]> Reviewed-by: Emil Velikov <[email protected]>
*	Android: fix build break from nir/glsl move to compiler/	Rob Herring	2016-02-29	3	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Commits a39a8fbbaa12 ("nir: move to compiler/") and eb63640c1d38 ("glsl: move to compiler/") broke Android builds. Fix them. There is also a missing dependency between generated NIR headers and several libraries. This isn't a new issue, but seems to have been exposed by the NIR move. Built with i915, i965, freedreno, r300g, r600g, vc4, and virgl enabled. Cc: "11.2" <[email protected]> Cc: Mauro Rossi <[email protected]> Signed-off-by: Rob Herring <[email protected]> Reviewed-by: Emil Velikov <[email protected]>
*	i965/cfg: Fix comment list punctuation	Ian Romanick	2016-02-26	1	-1/+1
\| \| \| \| \| \|	Trivial Signed-off-by: Ian Romanick <[email protected]>
*	i965/cfg: Split out dead control flow paths to simplify both paths	Ian Romanick	2016-02-26	1	-55/+38
\| \| \| \| \| \| \|	v2: Fix some bad indentation. Suggested by Curro. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	i965/cfg: Don't handle fully empty if/else/endif	Ian Romanick	2016-02-26	1	-10/+4
\| \| \| \| \| \| \| \| \| \|	This will now never occur. The empty if-else part would have already been removed leaving an empty if-endif part. No shader-db changes. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	i965/cfg: Eliminate an empty then-branch of an if/else/endif	Ian Romanick	2016-02-26	1	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On BDW, total instructions in shared programs: 8448571 -> 8448367 (-0.00%) instructions in affected programs: 21000 -> 20796 (-0.97%) helped: 116 HURT: 0 v2: Remove spurious attempt to combine the if_block with the (removed!) else_block. Suggested by Matt and Curro. Correct the comment describing what the new pass does. Suggested by Matt. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	i965/cfg: Track prev_block and prev_inst explicitly in the whole function	Ian Romanick	2016-02-26	1	-5/+7
\| \| \| \| \| \| \| \|	This provides a trivial simplification now, and it makes some future changes more straight forward. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	i965/cfg: Slightly rearrange dead_control_flow_eliminate	Ian Romanick	2016-02-26	1	-56/+57
\| \| \| \| \| \| \| \| \|	'git diff -w' is a bit more illustrative. A couple declarations were moved, the continue was removed, and the code was reindented. This will simplify future changes. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	i965: Simplify brw_nir_lower_vue_inputs() slightly.	Kenneth Graunke	2016-02-26	1	-11/+6
\| \| \| \| \| \| \|	The same code appeared in both branches; pull it above the if statement. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Avoid recalculating the normal VUE map for IO lowering.	Kenneth Graunke	2016-02-26	4	-51/+30
\| \| \| \| \| \| \| \|	The caller already computes it. Now that we have stage specific functions, it's really easy to pass this in. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Avoid recalculating the tessellation VUE map for IO lowering.	Kenneth Graunke	2016-02-26	4	-26/+25
\| \| \| \| \| \| \| \|	The caller already computes it. Now that we have stage specific functions, it's really easy to pass this in. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Eliminate brw_nir_lower_{inputs,outputs,io} functions.	Kenneth Graunke	2016-02-26	7	-93/+33
\| \| \| \| \| \| \| \| \| \| \| \|	Now that each stage is directly calling brw_nir_lower_io(), and we have per-stage helper functions, it makes sense to just call the relevant one directly, rather than going through multiple switch statements. This also eliminates stupid function parameters, such as the two that only apply to vertex attributes. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Split brw_nir_lower_inputs/outputs into per-stage functions.	Kenneth Graunke	2016-02-26	1	-130/+174
\| \| \| \| \| \| \| \|	These functions are both giant switch statements where most cases don't overlap at all. Let's put the bulk of the work in per-stage helpers. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Remove catch-all nir_lower_io call with specific cases.	Kenneth Graunke	2016-02-26	1	-1/+4
\| \| \| \| \| \| \| \| \|	Most cases already call nir_lower_io explicitly for input and output lowering. This catch all isn't very useful anymore - we can just add it to the remaining cases. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Move optimizations from brw_nir_lower_io to brw_postprocess_nir.	Kenneth Graunke	2016-02-26	1	-1/+3
\| \| \| \| \| \| \| \| \| \|	This simplifies things. Every caller of brw_nir_lower_io() immediately calls brw_postprocess_nir(). The only real change this will have is that we get an extra brw_nir_optimize() call when compiling compute shaders, but that seems fine. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Always do NIR IO lowering at specialization time.	Kenneth Graunke	2016-02-26	2	-8/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We've now hit literally every case other than geometry shaders (and compute shaders, but those are a no-op). So, let's just move geometry shaders over too and be done with it. The only advantage to doing this at link time was to save the expense of running the pass on recompiles. But we're already running a lot of passes, and the extra code complexity isn't worth it. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Make an is_scalar boolean in brw_compile_gs().	Kenneth Graunke	2016-02-26	1	-4/+4
\| \| \| \| \| \| \| \| \|	Shorter than compiler->scalar_stage[MESA_SHADER_GEOMETRY], which can help with line-wrapping. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965/nir: Do lower_io late for fragment shaders	Jason Ekstrand	2016-02-26	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \|	The Vulkan driver wants to be able to delete fragment outputs that are beyond key.nr_color_regions; this is a lot easier if we lower outputs at specialization time rather than link time. (Rationale added to commit message by Ken) Signed-off-by: Jason Ekstrand <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Set dest type to UW for several send messages	Jordan Justen	2016-02-26	2	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Without this, on SIMD 16 the send instruction destination will appear to write more than one destination register, causing the simulator to report an error. Of course, the send instruction can actually write more than one destination register regardless of the type set for the destination, so this is a bit strange. Suggested-by: Kenneth Graunke <[email protected]> Signed-off-by: Jordan Justen <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	i965/fs: Allow saturate propagation to propagate negations into MADs.	Matt Turner	2016-02-25	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Allows us to transform mad res src0 src1 src2 mov.sat dst -res into mad.sat dst -src0 -src1 src2 instructions in affected programs: 3712 -> 3688 (-0.65%) helped: 24 Reviewed-by: Ian Romanick <[email protected]>
*	i965/fs: Allow saturate propagation to propagate negations into ADDs.	Matt Turner	2016-02-25	2	-4/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Allows us to transform add res src0 src1 mov.sat dst -res into add.sat dst -src0 -src1 No shader-db changes. Reviewed-by: Ian Romanick <[email protected]>
*	i965/fs: Allow saturate propagation to propagate negations into MULs.	Matt Turner	2016-02-25	2	-3/+137
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Allows us to transform mul res src0 src1 mov.sat dst -res into mul.sat dst src0 -src1 instructions in affected programs: 45246 -> 45054 (-0.42%) helped: 162 Reviewed-by: Ian Romanick <[email protected]>
*	i965/fs: Don't CSE negated multiplies with saturation.	Matt Turner	2016-02-25	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It's not correct to CSE these multiplies mul.sat dst1, -a, b mul.sat dst2, a, b by emitting a negated MOV from dst1 to dst2: mul.sat dst1, -a, b mov dst2, -dst1 Take 2.0*2.0 for example. The first multiply would produce 0.0 and the second would produce 1.0. Fixes bad generated code in 18 to 22 shaders: instructions in affected programs: 432 -> 464 (7.41%) helped: 4 HURT: 18 Cc: [email protected] Reviewed-by: Ian Romanick <[email protected]>
*	i965: Enable tiled mem_copy with sRGB-formatted resources	Nanley Chery	2016-02-24	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \|	RGBA8 and BGRA8 unorm formats are compatible with the various mem_copy functions. Their sRGB counterparts are also compatible because they're also color-renderable (of importance when the specified resource is a readbuffer) and they share the same physical layout. Signed-off-by: Nanley Chery <[email protected]> Reviewed-by: Anuj Phogat <[email protected]>
*	i965/fs: Return result of image atomic in a register of the expected type.	Francisco Jerez	2016-02-22	1	-1/+1
\| \| \| \| \| \| \| \|	So the result is of float type if we're implementing the float overload of imageAtomicExchange. This is the only back-end change required to support OES_shader_image_atomic AFAICT. Reviewed-by: Ilia Mirkin <[email protected]>