mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	radeon/ac: fix intrinsic version check	Dave Airlie	2017-03-06	1	-1/+1
\| \| \| \| \| \| \|	Reported-by: [email protected] Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100068 Signed-off-by: Dave Airlie <[email protected]>
*	ac: normalize build helper names	Marek Olšák	2017-03-03	3	-283/+282
\| \| \| \| \| \|	s/emit/build/ Reviewed-by: Dave Airlie <[email protected]>
*	ac: replace SI.vs.load.input with amdgcn.buffer.load.format	Marek Olšák	2017-03-03	1	-0/+20
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	radeonsi: move SI.vs.load.input building into amd/common	Marek Olšák	2017-03-03	2	-0/+23
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	ac: replace llvm.SI.tbuffer.store with llvm.amdgcn.buffer.store if ADD_TID=0	Marek Olšák	2017-03-03	3	-4/+62
\| \| \| \| \| \| \| \|	ADD_TID doesn't work. Needs more investigation. v2: remove leftover dead code Reviewed-by: Dave Airlie <[email protected]> (v1)
*	ac: remove offen parameter from ac_build_buffer_store_dword	Marek Olšák	2017-03-03	3	-10/+8
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	radeonsi: merge and simplify tbuffer_store functions	Marek Olšák	2017-03-03	3	-74/+38
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	radeonsi: replace AMDGPU.bfe.* with amdgcn.*bfe	Marek Olšák	2017-03-03	2	-0/+29
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	radeonsi: move kill intrinsic building into amd/common	Marek Olšák	2017-03-03	2	-0/+17
\| \| \| \| \| \|	just a cleanup Reviewed-by: Dave Airlie <[email protected]>
*	radeonsi: set readnone on reads from read-only memory	Marek Olšák	2017-03-03	2	-3/+11
\|
*	radeonsi: replace SI.packf16 with amdgcn.cvt.pkrtz	Marek Olšák	2017-03-03	2	-0/+20
\|
*	ac: replace old image intrinsics with new ones	Marek Olšák	2017-03-03	1	-0/+80
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	radeonsi: move image intrinsic building to amd/common	Marek Olšák	2017-03-03	2	-0/+97
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	ac: replace SI.export with amdgcn.exp.*	Marek Olšák	2017-03-03	1	-0/+31
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	radeonsi: move llvm.SI.export building to amd/common	Marek Olšák	2017-03-03	2	-0/+26
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	ac: unify build_type_name_for_intr functions	Marek Olšák	2017-03-03	3	-38/+42
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	gallivm, ac: add writeonly and inaccessiblememonly attributes	Marek Olšák	2017-03-03	2	-0/+4
\| \| \| \|	Reviewed-by: Dave Airlie <[email protected]>
*	amd/common: Fix build with new ac_add_function_attr()	Tobias Klausmann	2017-03-01	3	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	Fix usage of ac_add_function_attr() and make it known! common/ac_nir_to_llvm.c: In function 'create_llvm_function': common/ac_nir_to_llvm.c:265:4: error: implicit declaration of function 'ac_add_function_attr' [-Werror=implicit-function-declaration] ac_add_function_attr(main_function, i + 1, AC_FUNC_ATTR_BYVAL); ^~~~~~~~~~~~~~~~~~~~ Signed-off-by: Tobias Klausmann <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	gallivm,ac: add function attributes at call sites instead of declarations	Marek Olšák	2017-03-01	4	-48/+86
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	They can vary at call sites if the intrinsic is NOT a legacy SI intrinsic. We need this to force readnone or inaccessiblememonly on some amdgcn intrinsics. This is only used with LLVM 4.0 and later. Intrinsics only used with LLVM <= 3.9 don't need the LEGACY flag. gallivm and ac code is in the same patch, because splitting would be more complicated with all the LEGACY uses all over the place. v2: don't change the prototype of lp_add_function_attr. Reviewed-by: Jose Fonseca <[email protected]> (v1)
*	gallivm,ac: remove unused FUNC_ATTR_LAST enums	Marek Olšák	2017-03-01	1	-1/+0
\| \| \| \|	Reviewed-by: Jose Fonseca <[email protected]>
*	radv: fix txs for sampler buffers	Dave Airlie	2017-03-01	1	-1/+1
\| \| \| \| \| \| \| \| \|	I messed this up when I wrote it, this fixes: dEQP-VK.memory.pipeline_barrier.uniform_texel_buffer. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Cc: "17.0" <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	amd/common: fix ASICREV_IS_POLARIS11_M for Polaris12	Marek Olšák	2017-02-28	1	-1/+1
\| \| \| \| \| \|	Cc: 17.0 <[email protected]> Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Dave Airlie <[email protected]>
*	radv/ac: Use constants for immutable samplers.	Bas Nieuwenhuizen	2017-02-28	1	-0/+16
\| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Dave Airlie <[email protected]>
*	radeon/ac: make ac_shader_binary_config_start() available externally	Timothy Arceri	2017-02-28	2	-1/+8
\| \| \| \| \| \| \| \|	The read config functions are different for r600 and radeonsi so we can't just share the one in amd common. So just share this instead. Reviewed-by: Marek Olšák <[email protected]>
*	radeon/ac: add llvm_ir_string to ac_shader_binary struct	Timothy Arceri	2017-02-28	1	-0/+1
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radv/ac: Add integer->integer casts.	Bas Nieuwenhuizen	2017-02-26	1	-0/+18
\| \| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Dave Airlie <[email protected]> Acked-by: Edward O'Callaghan <[email protected]>
*	ac: silence a warning	Marek Olšák	2017-02-25	1	-2/+1
\| \| \| \|	trivial
*	radv: add sample mask output support	Dave Airlie	2017-02-24	2	-2/+7
\| \| \| \| \| \| \| \| \|	This adds support to write to sample mask from the fragment shader. We can optimise this later like radeonsi. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: refactor our fmask sample index fixup.	Dave Airlie	2017-02-24	1	-122/+107
\| \| \| \| \| \| \| \|	This refactors out the sample index fixup between txf and image load. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv: fetch sample index via fmask for image coord as well.	Dave Airlie	2017-02-24	1	-54/+126
\| \| \| \| \| \| \| \| \| \| \|	This follows the txf_ms code, I can't figure out why amdgpu-pro doesn't do this in their shaders, they must know someone we don't. This fixes: dEQP-VK.pipeline.multisample_shader_builtin.sample_id.* Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv: add sample mask input support	Dave Airlie	2017-02-24	1	-1/+6
\| \| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv: fix interpolation at wrong place for offset interp	Dave Airlie	2017-02-24	1	-2/+4
\| \| \| \| \| \| \| \| \|	The code was interpolating at the offset from the sample, not the offset from the center. Also fix for persample interpolation modes we should force the pixel center to be at the sample. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: handle gs->copy shader clip distances.	Dave Airlie	2017-02-23	1	-13/+68
\| \| \| \| \| \| \| \| \| \| \|	This fixes up the clip distance passing between the geometry shader and the copy shader. It packs the clip and cull distances into one or two consecutive slots, and avoids wasting space and make sure the gs output and copy shader input agree on where things are stored. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: pass clips properly from vertex->geometry shader stages.	Dave Airlie	2017-02-23	1	-6/+40
\| \| \| \| \| \| \| \| \| \| \|	This works out the geometry shader clip/cull inputs separately to the outputs, and uses that information to read from the ES->GS ring buffer. It stores the clip/cull distances packed into one or two slots. It fixes the es output emission and gs input reading to match. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: rename num clips/cull to output clips/culls	Dave Airlie	2017-02-23	1	-10/+10
\| \| \| \| \| \| \| \| \|	As geom shaders can have different ones on entry and exit. also move to uint8_t as these are never that big. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	ac/llvm: use min+max instead of AMDGPU.clamp on LLVM 5.0	Marek Olšák	2017-02-18	1	-0/+17
\| \| \| \| \| \| \|	It selects v_med3_f32, which has the same rate & size. Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: stop using TGSI_OPCODE_CLAMP by moving it amd/common	Marek Olšák	2017-02-18	2	-0/+16
\| \| \| \| \|	Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	ac/llvm: fix various findMSB bugs	Marek Olšák	2017-02-18	1	-2/+3
\| \| \| \| \| \|	sffbh needs to be suffixed with ".i32" Reviewed-by: Dave Airlie <[email protected]>
*	radv/ac: use shared umsb helper.	Dave Airlie	2017-02-16	1	-17/+1
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radeon/ac: add emit umsb shared code.	Dave Airlie	2017-02-16	2	-0/+29
\| \| \| \| \| \| \| \|	Since we shared imsb, makes sense to share umsb. Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radeon/ac: use llvm.amdgcn.sffbh intrinsic instead of AMDGPU.flbit.i32	Dave Airlie	2017-02-16	1	-1/+2
\| \| \| \| \| \| \| \|	Use the newer intrinsic. Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: use shader imsb emission code.	Dave Airlie	2017-02-16	1	-17/+1
\| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radeon/ac: add ac_emit_imsb helper.	Dave Airlie	2017-02-16	2	-0/+28
\| \| \| \| \| \| \| \| \|	We want to use a different intrinsic on newer llvm, so move this code to a shared area. Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv: fix warning since using common gs emit code	Dave Airlie	2017-02-14	1	-1/+0
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: use sendmsg emission interface.	Dave Airlie	2017-02-14	1	-26/+4
\| \| \| \| \| \| \|	This uses the common code to emit the correct intrinsic. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radeon/ac/llvm: add support for sendmsg emission	Dave Airlie	2017-02-14	2	-0/+25
\| \| \| \| \| \| \| \| \|	This lets us use the new intrinsic on the correct version of llvm. Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: use common interp code for new intrinsics	Dave Airlie	2017-02-14	1	-20/+41
\| \| \| \| \| \| \| \|	This uses the common fs interp code to use the new llvm intrinsics so llvm can drop the old ones. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv/ac: avoid the fmask path when doing txs.	Dave Airlie	2017-02-06	1	-1/+2
\| \| \| \| \| \| \| \|	This fixes the vulkan samples deferredmultisampling test. Cc: "17.0" <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radeon/ac: move common llvm build functions to a separate file.	Dave Airlie	2017-02-07	5	-851/+931
\| \| \| \| \| \| \| \| \|	Suggested by Marek. Reviewed-by: Marek Olšák <[email protected]> Acked-by: Nicolai Hähnle <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radv: fix shared memory load/stores.	Dave Airlie	2017-02-03	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \|	If we have an indirect index here we need to scale it by attribute slots e.g. is this is vec2[256] then we get an indir_index in the 0.255 range but the vec2 are aligned inside vec4 slots. So scale the indir index, then extract the channels. Reviewed-by: Bas Nieuwenhuizen <[email protected]> Cc: "17.0" <[email protected]> Signed-off-by: Dave Airlie <[email protected]>