mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	st/mesa: optionally apply texture swizzle to border color v2	Christoph Bumiller	2013-04-18	2	-2/+15
\| \| \| \| \| \| \| \| \| \| \| \|	This is the only sane solution for nv50 and nvc0 (really, trust me), but since on other hardware the border colour is tightly coupled with texture state they'd have to undo the swizzle, so I've added a cap. The dependency of update_sampler on the texture updates was introduced to avoid doing the apply_depthmode to the swizzle twice. v2: Moved swizzling helper to u_format.c, extended the CAP to provide more accurate information.
*	gallium: Disambiguate TGSI_OPCODE_IF.	José Fonseca	2013-04-17	1	-4/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	TGSI_OPCODE_IF condition had two possible interpretations: - src.x != 0.0f - Mesa statetracker when PIPE_SHADER_CAP_INTEGERS was false either for vertex and fragment shaders - gallivm/llvmpipe - postprocess - vl state tracker - vega state tracker - most old drivers - old internal state trackers - many graw examples - src.x != 0U - Mesa statetracker when PIPE_SHADER_CAP_INTEGERS was true for both vertex and fragment shaders - tgsi_exec/softpipe - r600 - radeonsi - nv50 And drivers that use draw module also were a mess (because Mesa would emit float IFs, but draw module supports native integers so it would interpret IF arg as integers...) This sort of works if the source argument is limited to float +0.0f or +1.0f, integer 0, but would fail if source is float -0.0f, or integer in the float NaN range. It could also fail if source is integer 1, and hardware flushes denormalized numbers to zero. But with this change there are now two opcodes, IF and UIF, with clear meaning. Drivers that do not support native integers do not need to worry about UIF. However, for backwards compatibility with old state trackers and examples, it is advisable that native integer capable drivers also support the float IF opcode. I tried to implement this for r600 and radeonsi based on the surrounding code. I couldn't do this for nouveau, so I just shunted IF/UIF together, which matches the current behavior. Reviewed-by: Roland Scheidegger <[email protected]> Reviewed-by: Marek Olšák <[email protected]> v2: - Incorporate Roland's feedback. - Fix r600_shader.c merge conflict. - Fix typo in radeon, spotted by Michel Dänzer. - Incorporte Christoph Bumiller's patch to handle TGSI_OPCODE_IF(float) properly in nv50/ir.
*	gallium: Eliminate TGSI_OPCODE_IFC.	José Fonseca	2013-04-17	1	-5/+0
\| \| \| \| \| \|	Never used or implemented. Reviewed-by: Roland Scheidegger <[email protected]>
*	gallium: PIPE_COMPUTE_CAP_IR_TARGET - allow drivers to specify a processor v2	Tom Stellard	2013-04-05	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	This target string now contains four values instead of three. The old processor field (which was really being interpreted as arch) has been split into two fields: processor and arch. This allows drivers to pass a more a more detailed description of the hardware to compiler frontends. v2: - Adapt to libclc changes Reviewed-by: Francisco Jerez <[email protected]>
*	freedreno: document debug flag	Erik Faye-Lund	2013-04-04	1	-0/+4
\| \| \| \| \|	Signed-off-by: Erik Faye-Lund <[email protected]> Signed-off-by: Brian Paul <[email protected]>
*	gallium/docs: fix definition of PIPE_QUERY_SO_STATISTICS	Christoph Bumiller	2013-04-03	1	-3/+5
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	gallium: add PIPE_CAP_QUERY_PIPELINE_STATISTICS	Christoph Bumiller	2013-04-03	1	-0/+2
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	gallium/docs: document get_driver_query_info	Marek Olšák	2013-03-26	1	-0/+12
\|
*	gallium,st/mesa: don't use blit-based transfers with software rasterizers	Marek Olšák	2013-03-23	1	-0/+4
\| \| \| \| \| \| \| \| \|	The blit-based paths for TexImage, GetTexImage, and ReadPixels aren't very fast with software rasterizer. Now Gallium drivers have the ability to turn them off. Reviewed-by: Brian Paul <[email protected]> Tested-by: Brian Paul <[email protected]>
*	gallium: add TGSI_SEMANTIC_TEXCOORD,PCOORD v3	Christoph Bumiller	2013-03-20	3	-0/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This makes it possible to identify gl_TexCoord and gl_PointCoord for drivers where sprite coordinate replacement is restricted. The new PIPE_CAP_TGSI_TEXCOORD decides whether these varyings should be hidden behind the GENERIC semantic or not. With this patch only nvc0 and nv30 will request that they be used. v2: introduce a CAP so other drivers don't have to bother with the new semantic v3: adapt to introduction gl_varying_slot enum
*	tgsi: add ArrayID documentation v2	Christian König	2013-03-19	1	-0/+18
\| \| \| \| \| \|	v2: further improve the text with comments from Christoph Bumiller. Signed-off-by: Christian König <[email protected]>
*	d3d1x: Remove.	José Fonseca	2013-03-12	1	-1/+1
\| \| \| \| \| \|	Unused/unmaintained. Reviewed-by: Christoph Bumiller <[email protected]>
*	llvmpipe: bump glsl version to 140	Roland Scheidegger	2013-03-02	1	-11/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	texel offsets should have been the last missing feature for 130, and in fact 140 as well (last there were texture buffers). In any case we still don't do OpenGL 3.0 (missing MSAA which will be difficult, plus EXT_packed_float, ARB_depth_buffer_float and EXT_framebuffer_sRGB). v2: bump to 140 instead - we have everything except we crash when not writing to gl_Position (but softpipe crashes as well) so let's just say this is a bug instead. Also (by Dave Airlie's suggestion) update llvm-todo.txt. Reviewed-by: Jose Fonseca <[email protected]>
*	gallium/docs: improve text about resources a bit.	Roland Scheidegger	2013-02-22	1	-29/+33
\| \| \| \| \| \| \| \| \| \| \|	This clarifies some things and gets rid of some old stuff. The most significant one is probably that buffers cannot have formats (nearly all drivers completely ignored format and used width0 as byte size already in any case). There seems to be no use case for "structured" buffers. (Note while d3d11 has new Structured Buffers, these still aren't associated with a format, rather a byte stride, which we can't do yet either way.) Reviewed-by: Jose Fonseca <[email protected]>
*	gallivm/tgsi: fix src modifier fetching with non-float types.	Roland Scheidegger	2013-02-16	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Need to take the type into account. Also, if we want to allow mov's with modifiers we need to pick a type (assume float). v2: don't allow all modifiers on all type, in particular don't allow absolute on non-float types and don't allow negate on unsigned. Also treat UADD as signed (despite the name) since it is used for handling both signed and unsigned integer arguments and otherwise modifiers don't work. Also add tgsi docs clarifying this. Reviewed-by: Jose Fonseca <[email protected]>
*	gallium/docs: fix typos in sample opcode descriptions	Roland Scheidegger	2013-02-12	1	-2/+3
\|
*	gallium: fix tgsi SAMPLE_L opcode to use separate source for explicit lod	Roland Scheidegger	2013-02-12	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It looks like using coord.w as explicit lod value is a mistake, most likely because some dx10 docs had it specified that way. Seems this was changed though: http://msdn.microsoft.com/en-us/library/windows/desktop/hh447229%28v=vs.85%29.aspx - let's just hope it doesn't depend on runtime build version or something. Not only would this need translation (so go against the stated goal these opcodes should be close to dx10 semantics) but it would prevent usage of this opcode with cube arrays, which is apparently possible: http://msdn.microsoft.com/en-us/library/windows/desktop/bb509699%28v=vs.85%29.aspx (Note not only does this show cube arrays using explicit lod, but also the confusion with this opcode: it lists an explicit lod parameter value, but then states last component of location is used as lod). (For "true" hw drivers, only nv50 had code to handle it, and it appears the code was already right for the new semantics, though fix up the seemingly wrong c/d arguments while there.) v2: fix comment, separate out other changes. Reviewed-by: Jose Fonseca <[email protected]>
*	gallivm: fix up size queries for dx10 sviewinfo opcode	Roland Scheidegger	2013-02-08	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Need to calculate the number of mip levels (if it would be worthwile could store it in dynamic state). While here, the query code also used chan 2 for the lod value. This worked with mesa state tracker but it seems safer to use chan 0. Still passes piglit textureSize (with some handwaving), though the non-GL parts are (largely) untested. v2: clarify and expect the sviewinfo opcode to return ints, not floats, just like the OpenGL textureSize (dx10 supports dst modifiers with resinfo). Also simplify some code. Reviewed-by: Jose Fonseca <[email protected]>
*	gallium: add SQRT shader opcode	Brian Paul	2013-02-04	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The glsl-to-tgsi translater will emit SQRT to implement GLSL's sqrt() and distance() functions if the PIPE_SHADER_CAP_TGSI_SQRT_SUPPORTED query says it's supported by the driver. Otherwise, sqrt(x) is implemented with xrsq(x). The problem with this is sqrt(0) must be handled specially because rsq(0) might be Inf/NaN/undefined (and then 0rsq(0) is Inf/Nan/undefined). In the glsl-to-tgsi code we use an extra CMP to check if x is zero and then replace the result of x*rsq(x) with zero. In the end, this makes sqrt() generate much more reasonable code for drivers that can do square roots. Note that many of piglit's generated shader tests use the GLSL distance() function.
*	gallium: add PIPE_CAP_TEXTURE_BUFFER_OFFSET_ALIGNMENT	Christoph Bumiller	2013-01-30	1	-0/+3
\| \| \| \|	Reviewed-by: Brian Paul <[email protected]>
*	gallium: remove PIPE_CAP_DEPTHSTENCIL_CLEAR_SEPARATE	Marek Olšák	2013-01-15	2	-4/+1
\| \| \| \|	Reviewed-by: Brian Paul <[email protected]>
*	gallium: s/PIPE_CAP_TIMER_QUERY/PIPE_CAP_QUERY_TIME_ELAPSED/	José Fonseca	2012-12-20	1	-1/+1
\| \| \| \| \| \| \| \|	To better reflect what it is being advertised. Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	gallium: expose ARB_map_buffer_alignment on Radeon	Marek Olšák	2012-10-31	1	-0/+6
\| \| \| \| \| \| \| \|	Reviewed-by: Brian Paul <[email protected]> v2: update relnotes-9.1 v3: use align_malloc and align_free for malloced buffers in r300g v4: document the new CAP in the docs
*	gallium/docs: fix sphinx warning	Andreas Boll	2012-10-24	1	-1/+1
\| \| \| \| \| \| \|	src/gallium/docs/source/context.rst:495: WARNING: malformed hyperlink target. Reviewed-by: Brian Paul <[email protected]>
*	gallium/docs: update some distro information	Andreas Boll	2012-10-15	2	-8/+37
\| \| \| \|	Reviewed-by: Brian Paul <[email protected]>
*	gallium: unify transfer functions	Marek Olšák	2012-10-11	1	-13/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	"get_transfer + transfer_map" becomes "transfer_map". "transfer_unmap + transfer_destroy" becomes "transfer_unmap". transfer_map must create and return the transfer object and transfer_unmap must destroy it. transfer_map is successful if the returned buffer pointer is not NULL. If transfer_map fails, the pointer to the transfer object remains unchanged (i.e. doesn't have to be NULL). Acked-by: Brian Paul <[email protected]>
*	gallium: remove resource_resolve	Marek Olšák	2012-09-30	2	-15/+1
\| \| \| \| \| \| \|	The functionality is provided by the new blit function. Tested-by: Michel Dänzer <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	gallium: add blit into the interface	Marek Olšák	2012-09-30	1	-0/+9
\| \| \| \| \|	Tested-by: Michel Dänzer <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	gallium: Add PIPE_COMPUTE_CAP_MAX_MEM_ALLOC_SIZE v2	Tom Stellard	2012-09-25	1	-0/+2
\| \| \| \| \| \|	v2: - Add comment in screen.rst - Report OpenCL required minimum for r600g
*	gallium: add new pipe_screen::can_create_resource() function	Brian Paul	2012-09-17	1	-0/+10
\| \| \| \| \| \| \| \| \| \|	Used to implement proxy textures. If a gallium driver doesn't implement this function we'll just continue to use the core Mesa fallback code. Without this hook we really have no good way to implement OpenGL proxy textures with gallium drivers. Reviewed-by: Jose Fonseca <[email protected]>
*	gallium/docs: document interface changes for timestamp query	Marek Olšák	2012-07-10	1	-0/+10
\| \| \| \|	the query type is already documented
*	gallium: Add PIPE_CAP_START_INSTANCE	Fredrik Höglund	2012-06-19	1	-1/+2
\| \| \| \|	Reviewed-by: Brian Paul <[email protected]>
*	gallium/compute: Add PIPE_COMPUTE_CAP_IR_TARGET v4	Francisco Jerez	2012-06-01	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	v2: Tom Stellard - Update CAP description v3: Tom Stellard - TGSI targets should pass an empty string for this CAP. v4: Tom Stellard - TGSI targets can ignore this CAP. Reviewed-by: Francisco Jerez <[email protected]>
*	gallium/docs: beef up the docs related to color clamping	Brian Paul	2012-05-25	2	-3/+18
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	clover, gallium: add PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK	Christoph Bumiller	2012-05-12	1	-0/+5
\| \| \| \| \| \| \|	This is not necessarily the product of MAX_BLOCK_SIZE[i]. Reviewed-by: Tom Stellard <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	gallium/docs: remove documentation of redefine_user_buffer	Marek Olšák	2012-05-12	1	-14/+0
\|
*	Merge branch 'gallium-userbuf'	Marek Olšák	2012-05-11	1	-0/+9
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: src/gallium/docs/source/screen.rst src/gallium/drivers/nv50/nv50_state.c src/gallium/include/pipe/p_defines.h src/mesa/state_tracker/st_draw.c
\| *	gallium: add PIPE_CAP_CONSTANT_BUFFER_OFFSET_ALIGNMENT	Marek Olšák	2012-04-30	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is required for any serious constant buffer support. Constant buffer offsets on ATI and NVIDIA DX10 and DX11 GPUs must be a multiple of 256. In OpenGL, this can be queried via GL_UNIFORM_BUFFER_OFFSET_ALIGNMENT.
\| *	gallium: add PIPE_CAP_USER_INDEX_BUFFERS and PIPE_CAP_USER_CONSTANT_BUFFERS	Marek Olšák	2012-04-30	1	-0/+6
\| \|
* \|	gallium/compute: Drop TGSI dependency.	Francisco Jerez	2012-05-11	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add a shader cap for specifying the preferred shader representation. Right now the only supported value is TGSI, other enum values will be added as they are needed. This is mainly to accommodate AMD's LLVM compiler back-end by letting it bypass the TGSI representation for compute programs. Other drivers will keep using the common TGSI instruction set. Reviewed-by: Tom Stellard <[email protected]>
* \|	gallium/tgsi: Introduce the "LOCAL" register declaration modifier.	Francisco Jerez	2012-05-11	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This change will be useful to implement function parameter passing on top of TGSI. As we don't have a proper stack, a register-based calling convention will be used instead, which isn't necessarily a bad thing given that GPUs often have plenty of registers to spare. Using the same register space for local temporaries and inter-procedural communication caused some inefficiencies, because in some cases the register allocator would lose the freedom to merge temporary values together into the same physical register, leading to suboptimal register (and sometimes, as a side effect, instruction) usage. The LOCAL declaration modifier specifies that the value isn't intended for parameter passing and as a result the compiler doesn't have to give any guarantees of it being preserved across function boundaries. Ignoring the LOCAL flag doesn't change the semantics of a valid program in any way, because local variables are just supposed to get a more relaxed treatment. IOW, this should be a backwards-compatible change.
* \|	gallium/tgsi: Add support for atomic opcodes.	Francisco Jerez	2012-05-11	1	-0/+164
\| \|
* \|	gallium/tgsi: Add support for barriers.	Francisco Jerez	2012-05-11	1	-0/+49
\| \|
* \|	gallium/tgsi: Add resource write-back support.	Francisco Jerez	2012-05-11	2	-2/+29
\| \| \| \| \| \| \| \| \| \|	Define a new STORE opcode with a role dual to the LOAD opcode, and add flags to specify that a shader resource is intended for writing.
* \|	gallium/tgsi: Add support for raw resources.	Francisco Jerez	2012-05-11	1	-1/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Normal resource access (e.g. the LOAD TGSI opcode) is supposed to perform a series of conversions to turn the texture data as it's found in memory into the target data type. In compute programs it's often the case that we only want to access the raw bits as they're stored in some buffer object, and any kind of channel conversion and scaling is harmful or inefficient, especially in implementations that lack proper hardware support to take care of it -- in those cases the conversion has to be implemented in software and it's likely to result in a performance hit even if the pipe_buffer and declaration data types are set up in a way that would just pass the data through. Add a declaration flag that marks a resource as typeless. No channel conversion will be performed in that case, and the X coordinate of the address vector will be interpreted in byte units instead of elements for obvious reasons. This is similar to D3D11's ByteAddressBuffer, and will be used to implement OpenCL's constant arguments. The remaining four compute memory spaces can also be understood as raw resources.
* \|	gallium/tgsi: Move interpolation info from tgsi_declaration to a separate token.	Francisco Jerez	2012-05-11	1	-9/+15
\| \| \| \| \| \| \| \| \| \| \| \|	Move Interpolate, Centroid and CylindricalWrap from tgsi_declaration to a separate token -- they only make sense for FS inputs and we need room for other flags in the top-level declaration token.
* \|	gallium: Add context hooks for binding shader resources.	Francisco Jerez	2012-05-11	2	-1/+23
\| \|
* \|	gallium/tgsi: Split sampler views from shader resources.	Francisco Jerez	2012-05-11	1	-40/+81
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit splits the current concept of resource into "sampler views" and "shader resources": "Sampler views" are textures or buffers that are bound to a given shader stage and can be read from in conjunction with a sampler object. They are analogous to OpenGL texture objects or Direct3D SRVs. "Shader resources" are textures or buffers that can be read and written from a shader. There's no support for floating point coordinates, address wrap modes or filtering, and, unlike sampler views, shader resources are global for the whole graphics pipeline. They are analogous to OpenGL image objects (as in ARB_shader_image_load_store) or Direct3D UAVs. Most hardware is likely to implement shader resources and sampler views as separate objects, so, having the distinction at the API level simplifies things slightly for the driver. This patch introduces the SVIEW register file with a declaration token and syntax analogous to the already existing RES register file. After this change, the SAMPLE_* opcodes no longer accept a resource as input, but rather a SVIEW object. To preserve the functionality of reading from a sampler view with integer coordinates, the SAMPLE_I(_MS) opcodes are introduced which are similar to LOAD(_MS) but take a SVIEW register instead of a RES register as argument.
* \|	gallium: Basic compute interface.	Francisco Jerez	2012-05-11	2	-1/+66
\|/ \| \| \| \| \| \| \| \| \|	Define an interface that exposes the minimal functionality required to implement some of the popular compute APIs. This commit adds entry points to set the grid layout and other state required to keep track of the usual address spaces employed in compute APIs, to bind a compute program, and execute it on the device. Reviewed-by: Marek Olšák <[email protected]>
*	gallium/docs: document the new vertex fetch CAPs	Marek Olšák	2012-04-24	1	-0/+12
\|