mesa.git

	Commit message	Author	Age	Files	Lines
*	r600g: texture instructions also work fine with TGSI_FILE_INPUT	Christian König	2011-01-12	1	-1/+3
\|
*	r600g: DP4 also supports writemasking	Christian König	2011-01-12	1	-8/+6
\|
*	r600g: Why all this fiddling with tgsi_helper_copy?	Christian König	2011-01-12	1	-21/+41
\|	tgsi_helper_copy is used on several occasions to copy a temporary result into the real destination register to emulate writemasks for OP3 and reduction operations. According to the R600 ISA that's unnecessary. This patch fixes this use for MAD, CMP and DP4.
*	r600g: fix tex and vtx joining	Christian König	2011-01-12	1	-2/+2
\|
*	r600g: Fixed SIN/COS/SCS for the case where the operand is a literal.	Tilman Sauerbeck	2011-01-11	1	-2/+15
\|	Signed-off-by: Tilman Sauerbeck <[email protected]>
\|	Reviewed-by: Alex Deucher <[email protected]>
*	r600g: move user fence into base radeon structure	Jerome Glisse	2011-01-11	1	-3/+0
\|	This avoids any issue when the context is freed and we still try to access the fence through the radeon structure.
\|	Signed-off-by: Jerome Glisse <[email protected]>
*	r300g: add debug option for buffer upload logging	Marek Olšák	2011-01-10	3	-0/+9
\|
*	noop: make noop useable like trace or rbug	Jerome Glisse	2011-01-09	2	-47/+35
\|	If you want to enable noop, set GALLIUM_NOOP=1 as an environment variable. You first need to enable noop wrapping for your driver; see the change to src/gallium/targets/dri-r600/ in this commit as an example.
\|	Signed-off-by: Jerome Glisse <[email protected]>
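A minimal sketch of the kind of environment check described above. The helper names sketch_noop_requested and sketch_noop_enabled are hypothetical and are not taken from the commit; accepting only the literal value "1" is also an assumption, and the real driver may parse the variable differently.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: decides from the raw variable value whether noop
 * wrapping was requested. Assumes only the literal "1" enables it. */
static bool sketch_noop_requested(const char *value)
{
    return value != NULL && strcmp(value, "1") == 0;
}

/* Hypothetical wrapper that reads GALLIUM_NOOP from the environment. */
static bool sketch_noop_enabled(void)
{
    return sketch_noop_requested(getenv("GALLIUM_NOOP"));
}
```

A target would call sketch_noop_enabled() once at screen creation and wrap the real screen only when it returns true.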
*	r300g: do not upload the same user buffer several times	Marek Olšák	2011-01-09	1	-1/+3
\|	Performance++.
*	nvc0: implement queries	Christoph Bumiller	2011-01-09	10	-23/+432
\|
*	i965g: fix warnings	Dave Airlie	2011-01-09	2	-2/+1
\|
*	i965g: update intel_decode from upstream.	Dave Airlie	2011-01-09	4	-137/+432
\|
*	i965g: update disassembler code from classic.	Dave Airlie	2011-01-09	8	-34/+95
\|	Still a bit of work to do; the winsys gen setting is a bit of a hack.
*	i965g: update brw_defines.h from classic driver	Dave Airlie	2011-01-09	3	-37/+397
\|
*	i965g: update brw_structs.h from classic driver.	Dave Airlie	2011-01-09	3	-88/+288
\|
*	i965g: update to similar gen stuff as i965	Dave Airlie	2011-01-09	33	-151/+173
\|
*	r300g: fix crash when flushing ZMASK	Marek Olšák	2011-01-09	5	-92/+81
\|	https://bugs.freedesktop.org/show_bug.cgi?id=32912
\|	The fix is to call update_derived_state before user buffer uploads. I've also moved some code around.
\|	Unfortunately, there are still some ZMASK-related bugs which cause misrendering, i.e. flushing doesn't always work and glean/fbo fails.
*	nvfx,nv50: pipe_reference the constant buffers	Christoph Bumiller	2011-01-08	2	-6/+5
\|
*	nvc0: fix primitive restart in immediate mode	Christoph Bumiller	2011-01-08	2	-9/+18
\|
*	r300g: fix a surface leak when flushing ZMASK	Marek Olšák	2011-01-08	1	-0/+1
\|
*	r300g: rework command submission and resource space checking	Marek Olšák	2011-01-08	5	-97/+96
\|	The motivation behind this rework is to get some speed by reducing CPU overhead. The performance increase depends on many factors, but it's measurable (I think it's about a 10% increase in Torcs).
\|	This commit replaces libdrm's radeon_cs_gem with our own implementation. It's optimized specifically for r300g, but r600g could use it as well. Reloc writes and space checking are faster and simpler than their counterparts in libdrm (the time complexity of all the functions is O(1) in nearly all scenarios, thanks to hashing). (libdrm's radeon_bo_gem is still being used in the driver.)
\|	It works like this:
\|	cs_add_reloc(cs, buf, read_domain, write_domain) adds a new relocation and also adds the size of 'buf' to the used_gart and used_vram winsys variables based on the domains, which are simply or'd for accounting purposes. The adding is skipped if the reloc is already present in the list, but any newly-referenced domains are still accounted for.
\|	cs_validate is then called, which just checks: used_vram/gart < vram/gart_size * 0.8. The 0.8 factor allows for some memory fragmentation. If the validation fails, the pipe driver flushes the CS and tries to do the validation again, i.e. it validates only that one operation. If it fails again, it drops the operation on the floor and prints some nasty message to stderr.
\|	cs_write_reloc(cs, buf) just writes a reloc that has been added using cs_add_reloc. The read_domain and write_domain parameters have been removed, because we already specify them in cs_add_reloc.
\|	The space checking has been tested by putting small values in the vram/gart_size variables.
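The accounting described in this commit message can be sketched roughly as follows. This is an illustration under stated assumptions, not the driver's code: every sketch_ name, the DOMAIN_ flags, and the fixed-size array are hypothetical, and a linear search stands in for the hashing the commit mentions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical domain flags; the real driver uses the radeon DRM values. */
enum { DOMAIN_GTT = 1u << 0, DOMAIN_VRAM = 1u << 1 };

#define SKETCH_MAX_RELOCS 64

struct sketch_buf { uint64_t size; };

struct sketch_reloc {
    const struct sketch_buf *buf;
    unsigned domains;               /* read and write domains or'd together */
};

struct sketch_cs {
    struct sketch_reloc relocs[SKETCH_MAX_RELOCS];
    size_t num_relocs;
    uint64_t used_gart, used_vram;  /* bytes referenced so far */
    uint64_t gart_size, vram_size;  /* total bytes available */
};

/* Add a relocation and account its size once per newly referenced domain. */
static void sketch_cs_add_reloc(struct sketch_cs *cs, const struct sketch_buf *buf,
                                unsigned read_domain, unsigned write_domain)
{
    unsigned domains = read_domain | write_domain;
    unsigned added = domains;
    size_t i;

    /* Linear search for brevity; the commit describes a hashed lookup. */
    for (i = 0; i < cs->num_relocs; i++) {
        if (cs->relocs[i].buf == buf) {
            added = domains & ~cs->relocs[i].domains;
            cs->relocs[i].domains |= domains;
            break;
        }
    }
    if (i == cs->num_relocs) {
        assert(cs->num_relocs < SKETCH_MAX_RELOCS);
        cs->relocs[i].buf = buf;
        cs->relocs[i].domains = domains;
        cs->num_relocs++;
    }

    if (added & DOMAIN_GTT)
        cs->used_gart += buf->size;
    if (added & DOMAIN_VRAM)
        cs->used_vram += buf->size;
}

/* used < size * 0.8 for both pools, written with integer arithmetic. */
static bool sketch_cs_validate(const struct sketch_cs *cs)
{
    return cs->used_vram * 10 < cs->vram_size * 8 &&
           cs->used_gart * 10 < cs->gart_size * 8;
}
```

When sketch_cs_validate returns false, the caller would flush the command stream and retry the validation for that single operation, as the message describes.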
*	nvc0: fix reloc domain conflict on buffer migration	Christoph Bumiller	2011-01-08	1	-12/+12
\|	Occurred because the code assumed that buf->domain would remain equal to old_domain.
*	nvc0: upload user buffers only from draw info min to max index	Christoph Bumiller	2011-01-08	2	-3/+9
\|	There are actually applications that profit immensely from this.
*	nvc0: fix emission of first 3 u8 indices to RING_NI	Christoph Bumiller	2011-01-08	1	-1/+1
\|
*	nvc0: reset mt transfer address after read loop over layers	Christoph Bumiller	2011-01-08	1	-0/+1
\|
*	nvc0: tie buffer memory release to the buffer fence	Christoph Bumiller	2011-01-08	1	-4/+7
\|	... instead of the next fence to be emitted. This way we have a chance to reclaim the storage earlier.
*	r300g: Remove invalid assertion.	Łukasz Krotowski	2011-01-08	1	-1/+0
\|	Invalid after be1af4394e060677b7db6bbb8e3301e38a3363da (user buffer creation with width0 == ~0).
\|	Signed-off-by: Marek Olšák <[email protected]>
*	r600g: Also set const_offset if the buffer is not a user buffer in r600_upload_const_buffer().	Henri Verbeet	2011-01-07	1	-0/+2
\|
*	r600g: Update some comments for Evergreen.	Henri Verbeet	2011-01-07	1	-1/+3
\|
*	r600g: Split ALU clauses based on used constant cache lines.	Henri Verbeet	2011-01-07	2	-21/+129
\|
*	r600g: Consistently use the copy of the alu instruction in r600_bc_add_alu_type().	Henri Verbeet	2011-01-07	1	-9/+9
\|
*	r600g: Store kcache settings as an array.	Henri Verbeet	2011-01-07	3	-24/+25
\|
*	r300g: derive user buffer sizes at draw time	Marek Olšák	2011-01-07	9	-104/+144
\|	This only uploads the [min_index, max_index] range instead of [0, userbuf size], which greatly speeds up user buffer uploads.
\|	This is also a prerequisite for atomizing vertex arrays in st/mesa.
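A rough sketch of how the byte range for a [min_index, max_index] upload could be derived. The struct, the function name, and the parameters are all hypothetical and are not taken from the commit; the formula assumes tightly described vertex elements with a known stride and a known size for the largest element read per vertex.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical result type: byte range of a user buffer to upload. */
struct sketch_upload_range {
    size_t offset;  /* first byte needed */
    size_t size;    /* number of bytes needed */
};

/* Compute the bytes touched when drawing vertices min_index..max_index.
 * The last vertex only needs element_size bytes, not a full stride. */
static struct sketch_upload_range
sketch_user_buffer_range(size_t buffer_offset, size_t stride, size_t element_size,
                         unsigned min_index, unsigned max_index)
{
    struct sketch_upload_range r;

    assert(min_index <= max_index);
    r.offset = buffer_offset + (size_t)min_index * stride;
    r.size = (size_t)(max_index - min_index) * stride + element_size;
    return r;
}
```

For a draw that touches only indices 10 to 19 of a large array, this yields ten vertices' worth of data instead of the whole user buffer.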
*	r600g: allow constant buffers to be user buffers.	Dave Airlie	2011-01-07	6	-4/+44
\|	This provides an upload facility for the constant buffers, needed since Marek's change to put constants in user buffers.
\|	gears at least works on my Evergreen now.
\|	Signed-off-by: Dave Airlie <[email protected]>
*	r600g: add support for NI (Northern Islands) GPUs	Alex Deucher	2011-01-06	4	-0/+76
\|	This adds support for Barts, Turks, and Caicos ASICs.
*	svga: Ensure that the wrong vdecls don't get used in swtnl path	Jakob Bornecrantz	2011-01-06	3	-0/+19
\|	The draw module set new state that didn't require swtnl, which caused need_swtnl to be unset. This caused the call to svga_update_state(svga, SVGA_STATE_SWTNL_DRAW) from the vbuf backend to overwrite the vdecls we set up there with the real buffers' vdecls.
*	r300g: fix corruption when nr_cbufs==0 and multiwrites enabled	Marek Olšák	2011-01-06	1	-1/+2
\|	https://bugs.freedesktop.org/show_bug.cgi?id=32634
*	r300g: remove the buffer range checking	Marek Olšák	2011-01-06	2	-60/+1
\|	It's no longer needed because the upload buffer remains mapped while the CS is being filled (openarena, ut2004 and others that this code was for do not use VBOs by default).
*	r300g: skip buffer validation of upload buffers when appropriate	Marek Olšák	2011-01-06	5	-8/+36
\|	because the upload buffers are reused for subsequent draw operations.
*	gallium: drivers should reference vertex buffers	Marek Olšák	2011-01-06	14	-37/+58
\|	So that a state tracker can unreference them after set_vertex_buffers.
*	u_upload_mgr: new features	Marek Olšák	2011-01-06	4	-10/+23
\|	- Added a parameter to specify a minimum offset that should be returned. r300g needs this to better implement user buffer uploads. This weird requirement comes from the fact that the Radeon DRM doesn't support negative offsets.
\|	- Added a parameter to notify a driver that the upload flush occurred. A driver may skip buffer validation if there was no flush, resulting in better performance.
\|	- Added a new upload function that returns a pointer to the upload buffer directly, so that the buffer can be filled e.g. by the translate module.
*	nvc0: Fix typo of nvc0_mm.c in SConscript.	Vinson Lee	2011-01-06	1	-1/+1
\|
*	r600g: support up to 64 shader constants	Alex Deucher	2011-01-04	2	-1/+20
\|	From the r600 ISA: Each ALU clause can lock up to four sets of constants into the constant cache. Each set (one cache line) is 16 128-bit constants. These are split into two groups. Each group can be from a different constant buffer (out of 16 buffers). Each group of two constants consists of either [Line] and [Line+1] or [line + loop_ctr] and [line + loop_ctr +1].
\|	For supporting more than 64 constants, we need to break the code into multiple ALU clauses based on what sets of constants are needed in that clause.
\|	Note: This is a candidate for the 7.10 branch.
\|	Signed-off-by: Alex Deucher <[email protected]>
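One way to read the constraint quoted above is sketched below: a clause holds two locked groups, each covering cache lines [line] and [line+1] of 16 constants, for 64 constants in total. All names here are hypothetical, the loop_ctr addressing mode and per-group constant buffer selection are left out, and this is not the code from the commit.

```c
#include <assert.h>
#include <stdbool.h>

#define SKETCH_CONSTS_PER_LINE 16   /* 16 128-bit constants per cache line */
#define SKETCH_NUM_GROUPS      2    /* two locked groups per ALU clause */

/* Hypothetical record of one locked group: covers line and line + 1. */
struct sketch_kcache_group {
    bool used;
    unsigned line;
};

struct sketch_clause {
    struct sketch_kcache_group groups[SKETCH_NUM_GROUPS];
};

/* Try to make const_index addressable in this clause, locking a free group
 * if needed. Returns false when both groups are taken by other lines, which
 * is the point where a new ALU clause would have to be started. */
static bool sketch_clause_lock_const(struct sketch_clause *c, unsigned const_index)
{
    unsigned line = const_index / SKETCH_CONSTS_PER_LINE;
    int i;

    for (i = 0; i < SKETCH_NUM_GROUPS; i++) {
        if (c->groups[i].used &&
            (line == c->groups[i].line || line == c->groups[i].line + 1))
            return true;
    }
    for (i = 0; i < SKETCH_NUM_GROUPS; i++) {
        if (!c->groups[i].used) {
            c->groups[i].used = true;
            c->groups[i].line = line;
            return true;
        }
    }
    return false;
}
```

Under these assumptions, constants 0 to 63 fit in a single clause, and the first reference to constant 64 forces a split.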
*	Merge remote branch 'origin/nvc0'	Christoph Bumiller	2011-01-04	46	-11/+17354
\|\
\| *	nvc0: fix index size method value for u8 indices	Christoph Bumiller	2011-01-04	1	-8/+2
\| \|
\| *	nvc0: set the correct FP header bit for multiple colour outputs	Christoph Bumiller	2011-01-04	1	-1/+1
\| \|
\| *	nvc0: delete memory caches and fence on screen destruction	Christoph Bumiller	2011-01-04	4	-0/+50
\| \|
\| *	nvc0: use mov instead of ld for scalar const loads	Christoph Bumiller	2011-01-04	1	-1/+6
\| \|
\| *	nvc0: fix resource unmap after vertex push	Christoph Bumiller	2011-01-04	3	-10/+8
\| \|
\| *	nvc0: use the proper typed opcodes in constant folding	Christoph Bumiller	2011-01-04	1	-86/+92
\| \|