mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	gallium: initialize simple screen in drivers	Zack Rusin	2009-01-31	1	-0/+2
\|
*	gallium: make p_winsys internal	Zack Rusin	2009-01-30	6	-11/+11
\| \| \| \| \|	move it to pipe/internal/p_winsys_screen.h and start converting the state trackers to the screen usage
*	gallium: give the screen priority when it comes to buffer allocations	Zack Rusin	2009-01-29	3	-15/+15
\| \| \| \| \| \|	allows the driver to overwrite buffer allocation, first step on the way to making winsys interface internal to the drivers. state trackers and the code above it will go through the screen
*	gallium: standardize api on the prefix "nr"	Zack Rusin	2009-01-27	1	-1/+1
\|
*	gallium: standardize naming of masks	Zack Rusin	2009-01-27	2	-29/+29
\|
*	Merge commit 'origin/gallium-0.2' into gallium-xlib-rework	Keith Whitwell	2009-01-19	11	-522/+570
\|\
\| *	cell: Specify constant as float for CEILF().	Jonathan Adamczewski	2009-01-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Without the f, the constant is treated as a double, resulting in slower arithmetic and libgcc conversion calls each time CEILF() is used.
\| *	cell: Add missing suffix to SHUFFLE macro	Jonathan Adamczewski	2009-01-13	1	-1/+1
\| \|
\| *	cell: allocate batch buffers w/ 16-byte alignment	Jonathan Adamczewski	2009-01-12	8	-143/+111
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Replace cell_batch{align,alloc)*() with cell_batch_alloc16(), allocating multiples of 16 bytes that are 16 byte aligned. Opcodes are stored in preferred slot of SPU machine word. Various structures are explicitly padded to 16 byte multiples. Added STATIC_ASSERT().
\| *	cell: optimize unpack_colors() function, saving 12 cycles	Brian Paul	2009-01-11	1	-31/+34
\| \|
\| *	cell: move color unpacking code into separate function	Brian Paul	2009-01-11	1	-76/+89
\| \|
\| *	cell: re-order the z/stencil fetch/extract/convert instructions for better perf	Brian Paul	2009-01-11	1	-55/+51
\| \| \| \| \| \| \| \|	The new instruction order is 10 cycles faster.
\| *	cell: simplify the 'optional register' code	Brian Paul	2009-01-11	1	-62/+50
\| \|
\| *	cell: asst datatype clean-ups	Brian Paul	2009-01-11	1	-73/+75
\| \|
\| *	cell: move depth/stencil code into separate function	Brian Paul	2009-01-11	1	-179/+213
\| \|
\| *	cell: clean-up, re-indent, comments	Brian Paul	2009-01-11	1	-95/+139
\| \|
* \|	Merge commit 'origin/gallium-0.2' into gallium-xlib-rework	Keith Whitwell	2009-01-11	2	-101/+28
\|\\|
\| *	cell: use tgsi_dump_instruction() instead of spe_comment()	Brian Paul	2009-01-10	1	-101/+25
\| \|
\| *	cell: added rule to produce .s assembly files	Brian Paul	2009-01-10	1	-0/+3
\| \|
* \|	Merge commit 'origin/gallium-0.2' into gallium-xlib-rework	Keith Whitwell	2009-01-09	5	-745/+987
\|\\| \| \| \| \| \| \| \| \| \| \|	Conflicts: progs/glsl/Makefile
\| *	cell: replace 0 with -1 in SPE_COUNT_USABLE_SPES query	Jonathan Adamczewski	2009-01-06	1	-1/+1
\| \|
\| *	cell: fix code emit for RSQ/RCP when src arg == dst arg	Brian Paul	2009-01-05	1	-6/+8
\| \| \| \| \| \| \| \|	Fixes moire-like artifacts seen in fslight demo.
\| *	cell: SIMDize sorting in setup_sort_vertices()	Jonathan Adamczewski	2009-01-05	1	-55/+42
\| \| \| \| \| \| \| \| \| \| \| \|	Put setup.v{min,mid,max,provoke} into a union with qword vertex_headers. Rewrite vertex sorting to more efficiently handle the packed data items. Reduces spu_tri.o by ~128 bytes.
\| *	cell: SIMDize some subtractions	Jonathan Adamczewski	2009-01-05	1	-8/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Put edge.{dx,dy} into a union with a vector and perform subtractions in setup_sort_vertices() on vectors. Reduces spu_tri.o by ~300 bytes.
\| *	cell: improvements to spu_tri.c	Jonathan Adamczewski	2009-01-04	1	-42/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Replace int setup.span{left,right}[2] with vec_uint4 setup.span.quad SIMDize calculate_mask() and inline into into flush_spans() Set setup.span.quad members using spu_shuffle() or spu_sel(). Reduces spu_tri.o by ~116 bytes.
\| *	cell: new spu_shuffle.h header	Jonathan Adamczewski	2009-01-04	1	-0/+186
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Facilitates creation of shuffle patterns for use with spu_shuffle() and si_shufb() intrinsics. To be used by subsequent patches.
\| *	cell: initial codegen support for fragment shader loops	Brian Paul	2009-01-04	1	-26/+193
\| \| \| \| \| \| \| \| \| \| \| \|	Basic for/while loops work now. Only one level of loop nesting is supported at this time (same for if/else). The progs/glsl/mandelbrot demo works, but the colors are too dim.
\| *	cell: add support for PIPE_CAP_TEXTURE_MIRROR_REPEAT query	Brian Paul	2009-01-04	1	-0/+4
\| \| \| \| \| \| \| \|	We don't really support this texwrap mode yet, but this enables GL 2.1
\| *	cell: clean-up, improve SPU code generation	Brian Paul	2009-01-04	1	-622/+506
\| \| \| \| \| \| \| \|	Start on ARL and address-relative indexing too.
\| *	cell: fix typo (s/10/0)	Brian	2009-01-03	1	-2/+2
\| \|
* \|	cell: fix breakage from xlib re-org	Brian Paul	2009-01-08	1	-2/+1
\| \| \| \| \| \| \| \|	Some of these fixes are quick band-aids for now.
* \|	gallium: split driver-independent code out of xlib winsys	Keith Whitwell	2009-01-08	1	-40/+0
\|/ \| \| \|	Place in new xlib state-tracker. This is a statetracker for the GLX API.
*	CELL: use variant-length fragment ops programs	Robert Ellison	2008-11-21	8	-64/+216
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a set of changes that optimizes the memory use of fragment operation programs (by using and transmitting only as much memory as is needed for the fragment ops programs, instead of maximal sizes), as well as eliminate the dependency on hard-coded maximal program sizes. State that is not dependent on fragment facing (i.e. that isn't using two-sided stenciling) will only save and transmit a single fragment operation program, instead of two identical programs. - Added the ability to emit a LNOP (No Operation (Load)) instruction. This is used to pad the generated fragment operations programs to a multiple of 8 bytes, which is necessary for proper operation of the dual instruction pipeline, and also required for proper SPU-side decoding. - Added the ability to allocate and manage a variant-length struct cell_command_fragment_ops. This structure now puts the generated function field at the end, where it can be as large as necessary. - On the PPU side, we now combine the generated front-facing and back-facing code into a single variant-length buffer (and only use one if the two sets of code are identical) for transmission to the SPU. - On the SPU side, we pull the correct sizes out of the buffer, allocate a new code buffer if the one we have isn't large enough, and save the code to that buffer. The buffer is deallocated when the SPU exits. - Commented out the emit_fetch() static function, which was not being used.
*	CELL: improve twiddling/untwiddling error text	Robert Ellison	2008-11-20	1	-2/+2
\| \| \| \| \| \|	As suggested by Brian Paul: in the case of a twiddling error, instead of reporting the bad format number (which is all but unusable), report the more useful enum name.
*	CELL: fix stencil twiddling, stencil invert	Robert Ellison	2008-11-13	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Many stencil tests were failing because of a failure to read the stencil buffer, due to "twiddling" (or "untwiddling") "an unsupported texture format". This is fixed for the case of a stencil/Z S824Z format (which twiddles just like the 32-bit color formats). tests/stencilwrap.c was failing on the GL_INVERT test, because the emitted code for "spe_xori" turned out not to be an actual "xori" instruction, but rather a "stqd" instruction, because of a typo in the rtasm code. This is now fixed, and tests/stencil_wrap now works.
*	CELL: fix stencil test bugs	Robert Ellison	2008-11-12	2	-12/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixed a boneheaded error in the generation of SPU code that calculates the results of the stencil test. Basically, all the greater than/less than calculations were exactly inverted: they were coded as though the given comparison took the stencil value as a left-hand operand and the reference value as a right-hand operand, but the actual semantics always put the reference as the left-hand operand and the stencil as the right-hand operand. With this fix, tests/dinoshade runs, as do all the other Mesa tests and samples that use stencil (and that don't use texture formats unsupported by Cell).
*	cell: include cell_surface.h	Brian Paul	2008-11-12	1	-0/+1
\|
*	cell: include cell_pipe_state.h	Brian Paul	2008-11-12	1	-0/+1
\|
*	cell: implement NRM3 opcode	Brian Paul	2008-11-11	1	-0/+48
\|
*	CELL: two-sided stencil fixes	Robert Ellison	2008-11-11	10	-204/+110
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With these changes, the tests/stencil_twoside test now works. - Eliminate blending from the stencil_twoside test, as it produces an unneeded dependency on having blending working - The spe_splat() function will now work if the register being splatted and the destination register are the same - Separate fragment code generated for front-facing and back-facing fragments. Often these are the same; if two-sided stenciling is on, they can be different. This is easier and faster than generating code that does both tests and merges the results. - Fixed a cut/paste bug where if the back Z-pass stencil operation were different from all the other operations, the back Z-fail results were incorrect.
*	CELL: fix several stencil problems	Robert Ellison	2008-11-07	3	-88/+80
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This small set of changes repairs several different stenciling problems; now redbook/stencil also runs correctly (and maybe others - I haven't checked everything yet). - The number of instructions that had been allocated for fragment ops used to be 64 (in cell/common.h). With complicated stencil use, we managed to get up to 93, which caused a segfault before we noticed we'd overran our memory buffer. It's now been bumped to 128, which should be enough for even complicated stencil and fragment op usage. - The status of cell surfaces never changed beyond the initial PIPE_SURFACE_STATUS_UNDEFINED. When a user called glClear() to clear just the Z buffer (but not the stencil buffer), this caused the check_clear_depth_with_quad() function to return false (because the surface status was believed to be undefined), and so the device was instructed to clear the whole buffer (including the stencil buffer), instead of correctly using a quad to clear just the depth, leaving the stencil alone. This has been fixed similarly to the way the i915 driver handles the surface status: during cell_clear_surface(), the status is set to PIPE_SURFACE_STATUS_DEFINED. Then a partial buffer clear is handled with a quad, as expected. Note that we are not using PIPE_SURFACE_STATUS_CLEAR (also similar to the i915); technically, we should be setting the surface status to CLEAR on a clear, and to DEFINED when we actually draw something (say on cell_vbuf_draw()), but it's difficult to figure out exactly which surfaces are affected by a cell_vbuf_draw(), so for now we're doing the easy thing. - The fragment ops handling was very clever about only pulling out the parts of the Z/stencil buffer that it needed for calculations; but this failed when only part of the buffer was written, because the part that was never pulled out was inadvertently cleared. Now all the data from the combined Z/stencil buffer is pulled out, just so the proper values can be recombined later and written back to the buffer correctly. As a bonus, the fragment op code generation is simplified.
*	cell: minor reformatting, var renaming	Brian Paul	2008-11-05	1	-7/+11
\|
*	CELL: fix use of stencil value mask	Robert Ellison	2008-10-30	1	-42/+112
\| \| \| \| \| \| \|	The Cell stencil tests were completely ignoring the stencil value mask. Now the original code paths are still used if the stencil value mask is all 1s; but code to use the mask for the stencil value and reference value comparisons is now emitted if the mask is not all 1s.
*	cell: Protected use of non-initialized untile buffers	Jonathan White	2008-10-30	1	-4/+7
\|
*	CELL: stencil bug fixes	Robert Ellison	2008-10-30	5	-13/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Two definitive bugs in stenciling were fixed. The first, reversed registers in the generated Select Bytes (selb) instruction, caused the stenciling INCR and DECR operations to fail dramatically, putting new values in where old values were supposed to be and vice versa. The second caused stencil tiles to not be read and written from main memory by the SPUs. A per-spu flag, spu.read_depth, was used to indicate whether the SPU should be reading depth tiles, and was set only when depth was enabled. A second flag, spu.read_stencil, was set when stenciling was enabled, but never referenced. As stenciling and depth are in the same tiles on the Cell, and there is no corresponding TAG_WRITE_TILE_STENCIL to complement TAG_WRITE_TILE_COLOR and TAG_WRITE_TILE_Z, I fixed this by eliminating the unused "spu.read_stencil", renaming "spu.read_depth" to "spu.read_depth_stencil", and setting it if either stenciling or depth is enabled. I also added an optimization to the fragment ops generation code, that avoids calculating stencil values and/or stencil writemask when the stencil operations are all KEEP.
*	cell: Added check for PIPE_FLUSH_RENDER_CACHE to cell_flush to fix black ↵	Jonathan White	2008-10-30	1	-1/+1
\| \| \| \|	blocks during st_readpixels due to a flush wait not happening in order to allow any previous rendering to complete.
*	cell: add scalar param to emit_function_call() to indicate scalar function calls	Brian Paul	2008-10-29	1	-34/+69
\| \| \| \| \|	Scalar calls only use the X component of the src regs and smear the result across the dest register's X/Y/Z/W.
*	cell: use simd utilities for pow, exp2, log2	Brian Paul	2008-10-29	1	-22/+6
\|
*	cell: fix a number of fence issues	Brian Paul	2008-10-28	4	-7/+30
\| \| \| \|	Plus add assertions to check status, alignment, etc.
*	cell: don't include libmisc.h	Brian Paul	2008-10-28	1	-1/+0
\| \| \| \|	Doesn't seem to be needed and fixes compilation with SDK 3.1 beta.