mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	automake: explicitly set TARGET_RADEON_{WINSYS,COMMON}	Emil Velikov	2014-10-14	3	-5/+5
\| \| \| \| \| \| \| \| \| \|	Originally the variables were set only once via the ?= operator but that causes issues when doing incremental builds. They appear to be undefined and missing from the dependency list despite their addition to LIBADD. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=84807 Signed-off-by: Emil Velikov <[email protected]>
*	vc4: Fix render target NPOT alignment at small miplevels.	Eric Anholt	2014-10-14	1	-3/+12
\| \| \| \| \| \| \| \|	The texturing hardware takes the POT level 0 width/height and minifies those. This is different from what we were doing, for example, for 273-wide's level 5: POT(273>>5) == 8, while POT(273)>>5 == 16. Fixes piglit-depthstencil-render-miplevels 273.
*	vc4: Add support for having 0 vertex elements used.	Eric Anholt	2014-10-14	2	-6/+47
\| \| \| \| \|	You have to load at least 1, according to the simulator. Fixes 4 piglit tests and even more ES2 conformance tests.
*	ilo: clear writer pointer after unmapping	Chia-I Wu	2014-10-14	1	-0/+1
\| \| \| \| \| \| \|	It does not look like an issue now but it is good to be future proof. Spotted by Courtney Goeltzenleuchter. Signed-off-by: Chia-I Wu <[email protected]>
*	vc4: Write the VPM read setup multiple times to queue all the inputs.	Eric Anholt	2014-10-13	1	-3/+18
\| \| \| \| \| \| \|	There's a 4-element fifo, and the size (number of dwords per vertex) field is just 4 bits. Fixes glsl-routing on sim.
*	vc4: Add support for the TXL opcode.	Eric Anholt	2014-10-13	1	-5/+15
\| \| \| \| \| \|	There's a bit at the bottom of cube map stride (which has some formatting bugs in the docs) which flips the bias coordinate to being an absolute LOD.
*	vc4: Improve the accuracy of SIN and COS.	Eric Anholt	2014-10-13	1	-11/+17
\| \| \| \| \| \| \| \| \|	This gets them to pass glsl-sin/cos. There was an obvious problem that I was using the FRC code on the scaled input value, which means that we had a range in [0, 1], while our taylor is most accurate across [-0.5, 0.5]. We can just slide things over, but that means flipping the sign of the coefficients. After that, it was just a matter of stuffing more coefficients in.
*	vc4: Match VS outputs to FS inputs.	Eric Anholt	2014-10-13	3	-18/+135
\| \| \| \| \| \| \| \| \|	If the VS doesn't output a value that the FS needs, we still need to read the right contents for the remaining FS inputs, by emitting padding. And if the VS outputs something the FS doesn't need, we shouldn't put it in the VPM at all (so the code producing it can get DCEed). Fixes 77 piglit tests.
*	vc4: Add support for the CEIL opcode.	Eric Anholt	2014-10-13	1	-0/+22
\| \| \| \|	Not as big of a deal as SSG, but still +9 piglit tests.
*	vc4: Add support for the SSG opcode.	Eric Anholt	2014-10-13	1	-0/+12
\|
*	r600g: Implement GL_ARB_sample_shading	Glenn Kennard	2014-10-12	10	-119/+383
\| \| \| \| \| \| \| \|	Also fixes two sided lighting which was broken at least on pre-evergreen by commit b1eb00. Signed-off-by: Glenn Kennard <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	radeonsi: use tgsi_shader_info in si_llvm_emit_fs_epilogue	Marek Olšák	2014-10-12	1	-71/+61
\| \| \| \| \| \| \| \| \|	This is the last use tgsi_parse_token in radeonsi. It looks ugly because the code was re-indented, but there is really no change in behavior. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: remove si_shader_output_values::index	Marek Olšák	2014-10-12	1	-17/+6
\| \| \| \| \| \| \| \|	It's redundant now. It led to a simplification in si_llvm_emit_streamout, because outidx == reg. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: use tgsi_shader_info in si_llvm_emit_vs_epilogue	Marek Olšák	2014-10-12	1	-26/+13
\| \| \| \| \| \|	That code was really ugly. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: remove shader->input[] and output[] arrays and dependencies	Marek Olšák	2014-10-12	3	-89/+2
\| \| \| \| \| \| \| \| \|	They were reinventing tgsi_shader_info. They are unused now. radeon_llvm_context::load_input can be NULL if input fetching is implemented in some other way. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: move param_offset out of shader->input[] and output[]	Marek Olšák	2014-10-12	3	-7/+10
\| \| \| \| \| \|	Those are going away. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: use tgsi_shader_info to get a list of GS outputs	Marek Olšák	2014-10-12	2	-14/+12
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: use tgsi_shader_info in si_update_spi_map	Marek Olšák	2014-10-12	1	-9/+13
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: simplify dereferences in si_update_spi_map	Marek Olšák	2014-10-12	1	-2/+2
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: use tgsi_shader_info in si_shader_vs	Marek Olšák	2014-10-12	1	-2/+3
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: use tgsi_shader_info in si_shader_ps	Marek Olšák	2014-10-12	3	-5/+5
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: use tgsi_shader_info in fetch_input_gs	Marek Olšák	2014-10-12	1	-4/+5
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: don't rely on shader->output in si_llvm_emit_fs_epilogue	Marek Olšák	2014-10-12	1	-1/+1
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: use tgsi_shader_info in si_llvm_emit_es_epilogue	Marek Olšák	2014-10-12	1	-17/+5
\| \| \| \| \| \|	tgsi_shader_info contains everything we need. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: don't recompile shaders when changing nr_cbufs from 0 to 1	Marek Olšák	2014-10-12	3	-4/+4
\| \| \| \| \| \|	Both cases are equivalent. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: remove vs.ucps_enabled from the shader key	Marek Olšák	2014-10-12	3	-15/+0
\| \| \| \| \| \|	Written CLIPDIST outputs are simply disabled in PA_CL_VS_OUT_CNTL. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: assume ClipDistance usage mask is always 0xf	Marek Olšák	2014-10-12	2	-8/+2
\| \| \| \| \| \| \| \| \| \| \| \|	No code in Mesa sets the usage mask to any other value. The final mask is AND'ed with enable bits from the rasterizer state anyway. If somebody implements setting usage masks in st/mesa, we can use tgsi_shader_info to get it more easily. This is a prerequisite for the following commit. Reviewed-by: Michel Dänzer <[email protected]>
*	ilo: disassemble compacted instructions	Chia-I Wu	2014-10-11	4	-2/+453
\| \| \| \|	Signed-off-by: Chia-I Wu <[email protected]>
*	vc4: Use the fnv1 hash function instead of gallium util's crc32.	Eric Anholt	2014-10-10	1	-2/+3
\| \| \| \| \|	Improves simulated norast performance on a little benchmark by 13.4012% +/- 2.08459% (n=13).
*	vc4: Don't look up the compiled shaders unless state has changed.	Eric Anholt	2014-10-10	3	-0/+28
\| \| \| \| \|	Improves simulated norast performance on a little benchmark by 38.0965% +/- 3.27534% (n=11).
*	vc4: Actually clear the context's dirty flags.	Eric Anholt	2014-10-10	1	-0/+1
\| \| \| \| \|	I was trying to skip state updates when !dirty, and suspiciously everything was always dirty.
*	vc4: Optimize the other case of SEL_X_Y wih a 0 -> SEL_X_0(a).	Eric Anholt	2014-10-10	1	-1/+23
\| \| \| \|	Cleans up some output to be more obvious in a piglit test I'm looking at.
*	vc4: Optimize out adds of 0.	Eric Anholt	2014-10-09	1	-0/+26
\|
*	vc4: Optimize fmul(x, 0) and fmul(x, 1).	Eric Anholt	2014-10-09	1	-0/+45
\| \| \| \| \|	This was being generated frequently by matrix multiplies of 2 and 3-channel vertex attributes (which have the 0 or 1 loaded in the shader).
*	vc4: Factor out the turn-it-into-a-mov in opt_algebraic.	Eric Anholt	2014-10-09	1	-10/+12
\| \| \| \|	This will be used more in the next commits.
*	vc4: Eliminate unused texture instructions.	Eric Anholt	2014-10-09	1	-1/+21
\|
*	vc4: Dead code eliminate unused SF instructions.	Eric Anholt	2014-10-09	1	-7/+26
\|
*	vc4: Prevent copy propagating out the MOVs from r4.	Eric Anholt	2014-10-09	1	-1/+11
\| \| \| \| \| \| \| \| \|	Copy propagating these might result in reading the r4 after some other instruction has written r4. Just prevent all copy propagation of this for now. Fixes bad rendering with upcoming indirect register access support, where the copy propagation was consistently happening across another read.
*	vc4: Split the coordinate shader to its own vc4_compiled_shader.	Eric Anholt	2014-10-09	3	-89/+54
\| \| \| \| \| \| \| \| \| \| \|	Merging VS and CS into the same struct wasn't winning us anything except for not allocating a separate BO (but if we want to pack programs into BOs, we should pack not just those 2 programs together). What it was getting us was a bunch of code duplication about hash table lookups and propagating vc4_compile contents into a vc4_compiled_shader. I was about to make the situation worse with indirect uniform buffer access.
*	vc4: Add #defines for the texture uniform fields.	Eric Anholt	2014-10-09	2	-19/+113
\| \| \| \| \| \|	I wanted to make another set of texture uploads for handling reladdr constants, and duplicating all the bitshifting looked like a terrible idea. In the process, this fixes a swap of the s/t texture wrap modes.
*	vc4: Initialize undefined temporaries to 0.	Eric Anholt	2014-10-09	1	-1/+6
\| \| \| \| \| \| \| \| \| \|	Under the simulator, reading registers before writing them triggers an assertion failure. c->undef gets treated as r0, which will usually be written, but not if it's used in the first instruction. We should definitely not be aborting in this case, and return some sort of undefined value instead. Fixes glsl-user-varying-ff.
*	r600g,radeonsi: Always use GTT again for PIPE_USAGE_STREAM buffers	Michel Dänzer	2014-10-09	1	-1/+3
\| \| \| \| \| \| \| \| \|	Putting those in VRAM can cause long pauses due to buffers being moved into / out of VRAM. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=84662 Cc: [email protected] Reviewed-by: Alex Deucher <[email protected]>
*	vc4: Optimize SF(ITOF(x)) -> SF(x).	Eric Anholt	2014-10-09	1	-0/+16
\| \| \| \| \|	This is a common production of st_glsl_to_tgsi, because CMP takes a float argument.
*	vc4: Add some optimization of FADD(FSUB(0, x)).	Eric Anholt	2014-10-09	1	-0/+31
\| \| \| \| \|	This is a common production of st_glsl_to_tgsi, which uses negate flags on source arguments to handle subtraction.
*	vc4: Mostly fix offset calculation for NPOT mipmap levels.	Eric Anholt	2014-10-09	2	-3/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The non-base NPOT levels are stored as POT-aligned images. We get that POT alignment by minifying the POT-aligned base level. This means that level strides are also POT aligned, so we have to tell the rendering mode config that our resource is larger than the actual requested area. Fixes the fbo-generatemipmap-formats NPOT cases. Regresses depthstencil-render-miplevels 273 * -- the texture presentation now works (where it was completely broken before), it looks like there's some overflow of image bounds happening at the lower miplevels.
*	vc4: Move the mirrored kernel code to a kernel/ directory.	Eric Anholt	2014-10-09	11	-258/+382
\| \| \| \|	Now this whole setup matches the kernel's file layout much more closely.
*	vc4: Enable LIT lowering in TGSI instead of our own code.	Eric Anholt	2014-10-08	1	-35/+1
\| \| \| \|	This brings us the -128/128 clamping on the w component.
*	vc4: Fix scalar math opcodes to replicate their result from the X channel.	Eric Anholt	2014-10-08	1	-4/+16
\| \| \| \| \|	Thanks to robclark for pointing out that I was probably failing to do this when I reported a "bug" in his lowering code.
*	ilo: fix rectlist on GEN7+	Chia-I Wu	2014-10-09	1	-0/+3
\| \| \| \| \| \|	It was broken by 343b014b57ecc5431477e090100e6a26edbda540. Signed-off-by: Chia-I Wu <[email protected]>
*	vc4: Add support for two-sided color.	Eric Anholt	2014-10-08	2	-18/+51
\| \| \| \| \| \| \| \| \| \|	It's fairly easy, thanks to Rob Clark's lowering code. Fixes two-sided-lighting and 4 vertex-program-two-side testcases, while regressing 8 testcases that involve enabling two-sided color while only initializing one of the two colors in the VS. If you're enabling two sided color, it's of course expected that you really do set up both colors, so this is still an improvement (and when we set up a linker for TGSI, we'll hopefully fix those 8 fails).