| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
|
|
|
| |
Since atm our OPs aren't typed but instead values are, we need to
take care if they're used as different types (e.g. a load makes a
value u32 by default).
Maybe this should be changed (also to match TGSI), but it should
work as well if done properly.
|
| |
|
| |
|
|
|
|
|
|
|
|
|
| |
At some point we'll want to support real subroutines instead of
just inlining them into the main shader.
Since recursive calls are forbidden, we can just save all used
registers to a fixed local memory region and restore them on a
return, no need for a stack pointer.
|
| |
|
| |
|
|\
| |
| |
| |
| | |
Conflicts:
src/gallium/drivers/nv50/nv50_program.c
|
| |
| |
| |
| | |
Signed-off-by: Jerome Glisse <[email protected]>
|
| |
| |
| |
| | |
spotted by taiu on irc
|
| | |
|
| | |
|
| |
| |
| |
| | |
fixes at least 2 more piglits.
|
| |
| |
| |
| |
| |
| |
| |
| |
| | |
One can bind same texture or sampler to different slot,
each slot needs it own state. The solution implemented
here is not exactly beautifull or optimal need to think
to somethings better.
Signed-off-by: Jerome Glisse <[email protected]>
|
| |
| |
| |
| | |
fixes problems in valgrind with uninitialised values.
|
| |
| |
| |
| | |
Signed-off-by: Jerome Glisse <[email protected]>
|
| |
| |
| |
| |
| | |
Signed-off-by: Dave Airlie <[email protected]>
Signed-off-by: Jerome Glisse <[email protected]>
|
| |
| |
| |
| | |
Signed-off-by: Alex Deucher <[email protected]>
|
| |
| |
| |
| |
| | |
Free the buffers before the visuals. Fixes valgrind warning
reported in fd.o bug 29919.
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
Make state statically allocated, this kills a bunch of code
and avoid intensive use of malloc/free. There is still a lot
of useless duplicate function wrapping that can be kill. This
doesn't improve yet performance, needs to avoid memcpy states
in radeon_ctx_set_draw and to avoid rebuilding vs_resources,
dsa, scissor, cb_cntl, ... states at each draw command.
Signed-off-by: Jerome Glisse <[email protected]>
|
| |
| |
| |
| |
| |
| |
| |
| | |
This reverts commit 1fa7245c348cb7aced81f1672140f64cb6450e2f.
Conflicts:
src/gallium/drivers/r600/r600_state.c
|
| |
| |
| |
| | |
Signed-off-by: Patrice Mandin <[email protected]>
|
| |
| |
| |
| | |
This fixes fbo-3d and fbo-cubemap
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
This reverts commit de0b76cab22caa9fc7260f80acb8f151ccced6c5, its pre-computes the texture state wrong,
you can't just use an array of levels, since you can have FBOs to depth texture slices inside a level as well
it would get really messy quickly. Probably need to split commits like this up into pieces for each piece
of state, so we can revert bits easier in case of regressions.
This also break 5 piglit tests, and valgrind starts to warn about invalid read/writes after this.
|
| |
| |
| |
| |
| |
| |
| |
| | |
fixes warning that
r600_blit.c: In function ‘r600_resource_copy_region’:
r600_blit.c:136: warning: passing argument 1 of ‘util_resource_copy_region’ from incompatible pointer type
and also 7 more piglit tests.
|
| |
| |
| |
| | |
add cube and shadow support to the texture code.
|
| |
| |
| |
| | |
This, plus the previous commit fix fd.o bug 29806.
|
| | |
|
| | |
|
| | |
|
| | |
|
| |
| |
| |
| |
| |
| |
| |
| | |
For some reason r600c, emits extra instructions in the FP to do the depth write output swizzle,
I'm not sure this is required, so here I'm doing it in the exports.
this fixes the mesa trivial demos tri-depthwrite and tri-depthwrite2, it doesn't fix
the glsl1 gl_FragDepth writing test however.
|
| | |
|
| | |
|
| |
| |
| |
| | |
this fixes a few if the fs shader tests, 10 more piglits
|
| | |
|
| |
| |
| |
| |
| |
| |
| | |
this is a bit of a workaround, something is wrong with the literal emits here
so we just use the trig copy function to copy the immd to a temp at start of op.
fix VP/FP LIT tests
|
| |
| |
| |
| |
| |
| |
| |
| |
| | |
So as the trig functions used up the literal spots for the PI work, if the arg0 was an immediate
we'd hit failure, so copy the literal before starting.
add some tracking of max temp used to avoid trashing temp regs.
5 more piglits, fp1 COS,SCS,SIN tests
|
| | |
|
| |
| |
| |
| | |
constant splitting was broken for multi-constant cases, fixes fp1 CMP+MAD, vp1 CMP.
|
| | |
|
| |
| |
| |
| |
| |
| |
| | |
Also add an error if we hit this problem again, we need to do this better
possibly tying the literal addition to the last flag.
Signed-off-by: Dave Airlie <[email protected]>
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
Idea is to build hw state at pipe state creation and
reuse them while keeping a non PM4 packet interface
btw winsys & pipe driver. This commit also force rebuild
of pm4 packet on each call to radeon_state_pm4 which
in turn slow down everythings, this will be addressed.
Signed-off-by: Jerome Glisse <[email protected]>
|
| |
| |
| |
| | |
Signed-off-by: Jerome Glisse <[email protected]>
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
The current states code had an unhealthy relationship between
that had to somehow magically align themselves, editing either
place meant renumbering all states after the one you were on,
and it was pretty unapproachable code.
This replaces the huge types structures with a simple type + sub
type struct, which is keyed on an stype enum in radeon.h. Each
stype can have a per-shader type subclassing (4 types supported,
PS/VS/GS/FS), and also has a number of states per-subtype. So you
have 256 constants per 4 shaders per one CONSTANT stype.
The interface from the driver is changed to pass in the tuple,
(stype, id, shader_type), and we look for this. If
radeon_state_shader ever shows up on profile, it could use a
hashtable based on stype/shader_type to speed things up.
Signed-off-by: Dave Airlie <[email protected]>
|
| | |
|
| |
| |
| |
| |
| |
| |
| |
| | |
By default LLVM adds a signal handler to output a pretty stack trace.
This signal handler is never removed, causing problems when unloading
the shared object where the gallium driver resides.
Thanks to Chris Li for finding this.
|
| | |
|
| |
| |
| |
| |
| |
| |
| | |
The result is scalar, so when argument is zero/undef we can pass vector
zero/undef.
Also, support the scalar case.
|
| |
| |
| |
| |
| | |
SVGA3DOP_CMP is not supported for vertex shaders;
use SLT + LRP instead.
|
| |
| |
| |
| |
| |
| |
| |
| |
| | |
every command buffer.
Only non null resources.
To ensure that relocations are emitted for every resource currently
referred.
|