mesa.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	Revert "anv/device: expose shaderInt16 support in gen8+"	Mark Janes	2018-05-03	1	-1/+1
\| \| \| \| \| \| \|	This reverts commit 0ba0ac815e078185c1f408ec7078fd1efac1a634. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106393 Reviewed-by: Scott D Phillips <[email protected]>
*	Revert "i965/compiler: handle conversion to smaller type in the lowering pass for that"	Mark Janes	2018-05-03	2	-7/+12
\| \| \| \| \| \| \| \| \|	This reverts commit 96b51537908cd2aace85f54b437eeb72e6346b7e. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106393 Reviewed-by: Scott D Phillips <[email protected]>
*	anv/device: expose shaderInt16 support in gen8+	Iago Toral Quiroga	2018-05-03	1	-1/+1
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	anv/pipeline: support SpvCapabilityInt16 in gen8+	Iago Toral Quiroga	2018-05-03	1	-0/+1
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: implement 16-bit pack/unpack opcodes	Iago Toral Quiroga	2018-05-03	1	-0/+10
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	compiler/lower_64bit_packing: rename the pass to be more generic	Iago Toral Quiroga	2018-05-03	1	-1/+1
\| \| \| \| \| \|	It can do 32-bit packing too now. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: fix 16-bit comparisons	Iago Toral Quiroga	2018-05-03	1	-8/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	NIR assumes that booleans are always 32-bit, but Intel hardware produces 16-bit booleans for 16-bit comparisons. This means that we need to convert the 16-bit result to 32-bit. In the future we want to add an optimization pass to clean this up and hopefully remove the conversions. v2 (Jason): use the type of the source for the temporary and use brw_reg_type_from_bit_size for the conversion to 32-bit. Reviewed-by: Jason Ekstrand <[email protected]>
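The widening step described in this commit can be pictured with a small sketch. It assumes the convention that a true boolean is all ones and a false boolean is zero; `cmp_lt_16` and `bool16_to_bool32` are hypothetical names for illustration and are not the compiler's code.

```c
#include <stdint.h>

/* Hypothetical model of a 16-bit comparison that yields a 16-bit boolean:
 * all ones (0xffff) for true, zero for false (assumed convention). */
static int16_t cmp_lt_16(int16_t a, int16_t b)
{
    return a < b ? (int16_t)-1 : (int16_t)0;
}

/* Widen the 16-bit boolean to 32 bits. Sign extension turns 0xffff into
 * 0xffffffff and leaves zero as zero, so the 32-bit consumer sees a
 * well-formed boolean. */
static int32_t bool16_to_bool32(int16_t b16)
{
    return (int32_t)b16;
}
```

A zero-extending conversion would instead give 0x0000ffff for true, which is why the sketch uses a signed conversion.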
*	intel/compiler: lower some 16-bit integer operations to 32-bit	Iago Toral Quiroga	2018-05-03	1	-0/+21
\| \| \| \| \| \| \| \| \|	These are not supported in hardware for 16-bit integers. We do the lowering pass after the optimization loop to ensure that we lower ALU operations injected by algebraic optimizations too. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: support negate and abs of half float immediates	Jose Maria Casanova Crespo	2018-05-03	1	-2/+4
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: fix brw_imm_w for negative 16-bit integers	Jose Maria Casanova Crespo	2018-05-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	16-bit immediates need to replicate the 16-bit immediate value in both words of the 32-bit value. This needs to be careful to avoid sign-extension, which the previous implementation was not handling properly. For example, with the previous implementation, storing the value -3 would generate imm.d = 0xfffffffd due to signed integer sign extension, which is not correct. Instead, we should cast to uint16_t, which gives us the correct result: imm.ud = 0xfffdfffd. We only had a couple of cases hitting this path in the driver until now, one with value -1, which would work since all bits are one in this case, and another with value -2 in brw_clip_tri(), which would hit the aforementioned issue (this case only affects gen4 although we are not aware of whether this was causing an actual bug somewhere). v2: Make explicit uint32_t casting for left shift (Jason Ekstrand) Reviewed-by: Jason Ekstrand <[email protected]> Cc: "18.0 18.1" <[email protected]>
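The sign-extension problem from this commit message can be reproduced in isolation. The sketch below is not the Mesa `brw_imm_w` code; `replicate_w_sign_extended` and `replicate_w` are hypothetical helpers that only compute the 32-bit immediate field, using the -3 example from the message.

```c
#include <stdint.h>

/* Problematic variant: the 16-bit value is sign-extended to 32 bits before
 * being combined, so the high bits of the low copy clobber the upper word. */
static uint32_t replicate_w_sign_extended(int16_t w)
{
    uint32_t wide = (uint32_t)(int32_t)w;   /* -3 becomes 0xfffffffd */
    return wide | (wide << 16);
}

/* Fixed variant: go through uint16_t first so only 16 bits survive, then
 * shift an explicit uint32_t to place the copy in the upper word. */
static uint32_t replicate_w(int16_t w)
{
    uint32_t low = (uint16_t)w;             /* -3 becomes 0x0000fffd */
    return low | (low << 16);
}
```

With -3 as input, the first helper yields 0xfffffffd and the second yields 0xfffdfffd. Both agree for -1, which matches the observation that the -1 case happened to work.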
*	intel/compiler: fix 16-bit int brw_negate_immediate and brw_abs_immediate	Jose Maria Casanova Crespo	2018-05-03	1	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	From Intel Skylake PRM, vol 07, "Immediate" section (page 768): "For a word, unsigned word, or half-float immediate data, software must replicate the same 16-bit immediate value to both the lower word and the high word of the 32-bit immediate field in a GEN instruction." This fixes the int16/uint16 negate and abs immediates that weren't taking into account the replication in lower and upper words. v2: Integer cases are different to Float cases. (Jason Ekstrand) Included reference to PRM (Jose Maria Casanova) v3: Make explicit uint32_t casting for left shift (Jason Ekstrand) Split half float implementation. (Jason Ekstrand) Fix brw_abs_immediate (Jose Maria Casanova) Cc: "18.0 18.1" <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
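A minimal sketch of negate and abs that respect the replication rule quoted from the PRM. The helper names are hypothetical and this is not the code from the commit; it assumes a two's-complement target and only handles the signed word case.

```c
#include <stdint.h>
#include <stdlib.h>

/* Replicate a 16-bit value into both words of a 32-bit immediate field. */
static uint32_t replicate16(uint16_t v)
{
    return (uint32_t)v | ((uint32_t)v << 16);
}

/* Negate a replicated signed word immediate: operate on the low word only,
 * then rebuild both copies. Negating the whole dword as one integer would
 * instead give 0xfffcfffd for 0x00030003, which is no longer replicated. */
static uint32_t negate_imm_w(uint32_t imm)
{
    int16_t w = (int16_t)(imm & 0xffff);
    return replicate16((uint16_t)(-(int)w));
}

/* Absolute value of a replicated signed word immediate. */
static uint32_t abs_imm_w(uint32_t imm)
{
    int16_t w = (int16_t)(imm & 0xffff);
    return replicate16((uint16_t)abs((int)w));
}
```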
*	intel/compiler: implement nir_instr_type_load_const for 16-bit constants	Jose Maria Casanova Crespo	2018-05-03	1	-0/+5
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: implement conversions from 16-bit int/float to bool	Iago Toral Quiroga	2018-05-03	1	-5/+11
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: implement conversion between float/int 16-bit types	Iago Toral Quiroga	2018-05-03	1	-0/+4
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/compiler: handle conversion to smaller type in the lowering pass for that	Iago Toral Quiroga	2018-05-03	2	-12/+7
\| \| \| \| \| \| \|	The lowering pass was specialized to act on 64-bit to 32-bit conversions only, but the implementation is valid for other cases. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: fix isign for 16-bit integers	Iago Toral Quiroga	2018-05-03	1	-5/+12
\| \| \| \| \| \| \| \| \| \| \| \| \|	We need to use 16-bit constants with 16-bit instructions, otherwise we get the following validation error: "Destination stride must be equal to the ratio of the sizes of the execution data type to the destination type" Because the execution data type is 4B due to the 32-bit integer constant. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel: aubinator: add an option to limit the number of decoded VBO lines	Lionel Landwerlin	2018-05-02	1	-16/+23
\| \| \| \| \|	Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	intel: decoder: limit to the number decoded lines from VBO	Lionel Landwerlin	2018-05-02	2	-8/+16
\| \| \| \| \| \| \| \|	By default we set no limit, but the debug batch decoder in i965 sets it to 100. Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	anv: Advertise variableMultisampleRate	Jason Ekstrand	2018-05-02	1	-1/+1
\| \| \| \| \| \| \| \|	Initially, I didn't understand this feature. Turns out that all it means is that you can switch multisample rates in the middle of a zero-attachment subpass. We've been able to do this since forever. Reviewed-by: Anuj Phogat <[email protected]>
*	intel: Fix 3DSTATE_CONSTANT buffer decoding.	Kenneth Graunke	2018-05-02	1	-15/+25
\| \| \| \| \| \| \| \| \| \|	First, this was iterating over the 3DSTATE_CONSTANT_* instruction but trying to process fields of the 3DSTATE_CONSTANT_BODY substructure. Secondly, the fields have been called Buffer[0] and Read Length[0], for a while now, and we were not handling the subscripts correctly. Reviewed-by: Lionel Landwerlin <[email protected]>
*	intel: fix aubinator include	Lionel Landwerlin	2018-05-02	1	-1/+1
\| \| \| \| \| \|	Signed-off-by: Lionel Landwerlin <[email protected]> Fixes: 7c22c150c40b3 ("intel: Move batch decoder/disassembler from tools/ to common/") Reviewed-by: Kenneth Graunke <[email protected]>
*	intel: Give the batch decoder a callback to ask about state size.	Kenneth Graunke	2018-05-02	4	-6/+28
\| \| \| \| \| \| \| \| \|	Given an arbitrary batch, we don't always know what the size of certain things are, such as how many entries are in a binding table. But it's easy for the driver to track that information, so with a simple callback we can calculate this correctly for INTEL_DEBUG=bat. Reviewed-by: Lionel Landwerlin <[email protected]>
*	intel: Move batch decoder/disassembler from tools/ to common/	Kenneth Graunke	2018-05-02	7	-11/+9
\| \| \| \| \| \| \| \|	Making these part of libintel_common allows us to use them in the DRI driver. The standalone tool binaries already link against the common library, too, so it's no harder for them. Reviewed-by: Lionel Landwerlin <[email protected]>
*	intel: batch-decoder: iterate VERTEX_BUFFER_STATE fields	Lionel Landwerlin	2018-05-02	1	-31/+39
\| \| \| \| \| \| \| \| \|	The gen_field_iterator only iterates the fields of a given gen_group. If we want to iterate the fields of another gen_group contained as field, we need to do it manually. Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	intel: decoder: fix starting dword of struct fields	Lionel Landwerlin	2018-05-02	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Struct fields might span several dwords, but iter_dword is incremented up to the last dword of the current field before we print out the struct's fields. We can't use iter_dword for computing the offset into the pointer of data to decode. v2: Fix displayed offset number (Ken) Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	intel: decoder: document when fields should be used	Lionel Landwerlin	2018-05-02	1	-3/+2
\| \| \| \| \|	Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	intel: decoder: identify groups with fixed length	Lionel Landwerlin	2018-05-02	2	-6/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	<register> & <struct> elements always have fixed length. The get_length() method implies that we're dealing with an instruction in which the length is encoded into the variable data but the field iterator uses it without checking what kind of gen_group it is dealing with. Let's make get_length() report the correct length regardless of the gen_group (register, struct or instruction). Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	intel: decoder: make the field iterator use more natural	Lionel Landwerlin	2018-05-02	2	-30/+36
\| \| \| \| \| \| \| \| \| \| \|	while (iter_next()) { ... } instead of do { ... } while (iter_next()); Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
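The iteration style this commit moves to can be sketched with a toy iterator. `struct field_iter`, `iter_init`, `iter_next` and `sum_fields` are hypothetical stand-ins and do not mirror the real decoder types; the point is only that `iter_next()` both advances and reports whether a field is available.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy iterator over an array of integer "fields" (illustrative only). */
struct field_iter {
    const int *fields;
    size_t count;
    size_t next;     /* index of the next field to hand out */
    int current;     /* valid only after iter_next() returned true */
};

static void iter_init(struct field_iter *it, const int *fields, size_t count)
{
    it->fields = fields;
    it->count = count;
    it->next = 0;
    it->current = 0;
}

/* Advance to the next field; returns false once the fields are exhausted. */
static bool iter_next(struct field_iter *it)
{
    if (it->next >= it->count)
        return false;
    it->current = it->fields[it->next++];
    return true;
}

/* The loop body never runs for an empty group, which a do/while form
 * cannot guarantee without an extra check. */
static int sum_fields(const int *fields, size_t count)
{
    struct field_iter it;
    int sum = 0;

    iter_init(&it, fields, count);
    while (iter_next(&it))
        sum += it.current;
    return sum;
}
```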
*	intel: activate the gl_BaseVertex lowering	Antia Puentes	2018-05-02	5	-16/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Surplus code related to the basevertex is removed. The Vertex Elements contain now: * VE 1: <firstvertex, BaseInstance, VertexID, InstanceID> * VE 2: <DrawID, is_indexed_draw, 0, 0> Also fixes unreachable message. Fixes OpenGL CTS tests: * KHR-GL46.shader_draw_parameters_tests.ShaderDrawArraysInstancedParameters * KHR-GL46.shader_draw_parameters_tests.ShaderMultiDrawArraysParameters * KHR-GL46.shader_draw_parameters_tests.MultiDrawArraysIndirectCountParameters * KHR-GL46.shader_draw_parameters_tests.ShaderDrawArraysParameters * KHR-GL46.shader_draw_parameters_tests.ShaderMultiDrawArraysIndirectParameters Fixes Piglit tests: * arb_shader_draw_parameters-drawid-indirect baseinstance * arb_shader_draw_parameters-basevertex Reviewed-by: Jason Ekstrand <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102678
*	intel: emit is_indexed_draw in the same VE than gl_DrawID	Antia Puentes	2018-05-02	3	-8/+19
\| \| \| \| \| \| \| \| \| \| \|	The Vertex Elements are now: * VE 1: <BaseVertex/firstvertex, BaseInstance, VertexID, InstanceID> * VE 2: <DrawID, is-indexed-draw, 0, 0> VE1 is kept as it was before, VE2 additionally contains the new system value. Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: Add uses_is_indexed_draw flag	Antia Puentes	2018-05-02	2	-0/+5
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	anv: Allow lookup of vkEnumerateInstanceVersion without an instance	Jason Ekstrand	2018-05-01	1	-0/+1
\| \| \| \| \|	Fixes: cbab2d1da5edfe9df27a010adf8b1aa9dbee473b Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	anv: Don't advertise Float64 or Int64 on HW without 64-bit types	Jason Ekstrand	2018-05-01	1	-2/+4
\| \| \| \|	Reviewed-by: Anuj Phogat <[email protected]>
*	intel: fix check for 48b ppgtt support	Scott D Phillips	2018-04-30	4	-48/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The previous logic of the supports_48b_addresses wasn't actually checking if i915.ko was running with full_48bit_ppgtt. The ENOENT it was checking for was actually coming from the invalid context id provided in the test execbuffer. There is no path in the kernel driver where the presence of EXEC_OBJECT_SUPPORTS_48B_ADDRESS leads to an error. Instead, check the default context's GTT_SIZE param for a value greater than 4 GiB v2 (Ken): Fix in i965 as well. v3 Check GTT_SIZE instead of HAS_ALIASING_PPGTT (Chris Wilson) Reviewed-by: Kenneth Graunke <[email protected]>
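The replacement check reduces to a size comparison once the GTT size is known. The sketch below assumes the size has already been queried from the kernel for the default context; that query is omitted, and `gtt_supports_48b` is a hypothetical name rather than the function in the driver.

```c
#include <stdbool.h>
#include <stdint.h>

/* Treat the device as having 48-bit addressing when the reported GTT size
 * is strictly greater than 4 GiB (assumed to be supplied by the caller). */
static bool gtt_supports_48b(uint64_t gtt_size)
{
    const uint64_t four_gib = UINT64_C(4) << 30;
    return gtt_size > four_gib;
}
```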
*	meson: fix race condition revealed by using 0.44	Dylan Baker	2018-04-27	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously there was a special target that blocked for the generation of anv_entrypoints.h, with meson 0.44 we don't need this, we can use a new language feature instead. The problem is that previously that blocking target would hide a race condition for the generation of another header, anv_extensions.h. Now the build sometimes fails when anv_extensions.h is not generated in time. v2: - clarify the race condition in the commit message (Emil) CC: Mark Janes <[email protected]> Fixes: 92550d9b16d2b295bdac087f31b1fd6d0f808e02 ("meson: remove workaround for custom target creating .h and .c files") Signed-off-by: Dylan Baker <[email protected]> Reviewed-by: Scott D Phillips <[email protected]> Reviewed-by: Emil Velikov <[email protected]>
*	anv/icl: Enable Vulkan on Ice Lake	Anuj Phogat	2018-04-26	1	-0/+2
\| \| \| \| \| \| \| \|	This patch enables the Vulkan driver on Ice Lake h/w with added warning about preliminary support. Signed-off-by: Anuj Phogat <[email protected]> Reviewed-by: Nanley Chery <[email protected]>
*	anv: enable VK_EXT_shader_viewport_index_layer	Caio Marcelo de Oliveira Filho	2018-04-26	2	-0/+2
\| \| \| \| \|	Reviewed-by: Iago Toral Quiroga <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	anv/allocator: Don't shrink either end of the block pool	Jason Ekstrand	2018-04-26	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, we only tried to ensure that we didn't shrink either end below what was already handed out. However, due to the way we handle relocations with block pools, we can't shrink the back end at all. It's probably best to not shrink in either direction. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105374 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106147 Tested-by: Eero Tamminen <[email protected]> Reviewed-by: Scott D Phillips <[email protected]> Cc: [email protected]
*	meson: remove dummy_cpp	Dylan Baker	2018-04-24	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	meson has gotten pretty smart about tracking C and C++ dependencies (internal and external), and using the right linker. This wasn't always the case and we created empty c++ files to force the use of the c++ linker. We don't need that any more. Signed-off-by: Dylan Baker <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	meson: remove workaround for custom target creating .h and .c files	Dylan Baker	2018-04-24	1	-13/+4
\| \| \| \| \| \| \| \| \|	In more modern versions of meson a custom_target returns an index-able object. This allows us to create accurate dependency models for targets that rely only on the header and not on the code from anv_entrypoints. Signed-off-by: Dylan Baker <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	intel/compiler: Add scheduler deps for instructions that implicitly read g0	Ian Romanick	2018-04-24	2	-0/+28
\| \| \| \| \| \| \| \| \| \| \| \| \|	Otherwise the scheduler can move the writes after the reads. Signed-off-by: Ian Romanick <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95009 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95012 Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]> Tested-by: Mark Janes <[email protected]> Cc: Clayton A Craft <[email protected]> Cc: [email protected]
*	intel/compiler: Silence unused parameter warnings in empty vec4_instruction_scheduler methods	Ian Romanick	2018-04-24	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	src/intel/compiler/brw_schedule_instructions.cpp: In member function ‘virtual void vec4_instruction_scheduler::count_reads_remaining(backend_instruction)’: src/intel/compiler/brw_schedule_instructions.cpp:764:72: warning: unused parameter ‘be’ [-Wunused-parameter] vec4_instruction_scheduler::count_reads_remaining(backend_instruction be) ^~ src/intel/compiler/brw_schedule_instructions.cpp: In member function ‘virtual void vec4_instruction_scheduler::setup_liveness(cfg_t)’: src/intel/compiler/brw_schedule_instructions.cpp:769:51: warning: unused parameter ‘cfg’ [-Wunused-parameter] vec4_instruction_scheduler::setup_liveness(cfg_t cfg) ^~~ src/intel/compiler/brw_schedule_instructions.cpp: In member function ‘virtual void vec4_instruction_scheduler::update_register_pressure(backend_instruction)’: src/intel/compiler/brw_schedule_instructions.cpp:774:75: warning: unused parameter ‘be’ [-Wunused-parameter] vec4_instruction_scheduler::update_register_pressure(backend_instruction be) ^~ src/intel/compiler/brw_schedule_instructions.cpp: In member function ‘virtual int vec4_instruction_scheduler::get_register_pressure_benefit(backend_instruction)’: src/intel/compiler/brw_schedule_instructions.cpp:779:80: warning: unused parameter ‘be’ [-Wunused-parameter] vec4_instruction_scheduler::get_register_pressure_benefit(backend_instruction be) ^~ src/intel/compiler/brw_schedule_instructions.cpp: In member function ‘virtual int vec4_instruction_scheduler::issue_time(backend_instruction)’: src/intel/compiler/brw_schedule_instructions.cpp:1550:61: warning: unused parameter ‘inst’ [-Wunused-parameter] vec4_instruction_scheduler::issue_time(backend_instruction inst) ^~~~ Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: Silence unused parameter warning in compile_cs_to_nir	Ian Romanick	2018-04-24	1	-4/+3
\| \| \| \| \| \| \| \| \| \|	src/intel/compiler/brw_fs.cpp: In function ‘nir_shader* compile_cs_to_nir(const brw_compiler, void, const brw_cs_prog_key, brw_cs_prog_data, const nir_shader, unsigned int)’: src/intel/compiler/brw_fs.cpp:7205:44: warning: unused parameter ‘prog_data’ [-Wunused-parameter] struct brw_cs_prog_data prog_data, ^~~~~~~~~ Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/compiler: Silence unused parameter warnings in generate_foo methods	Ian Romanick	2018-04-24	2	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since all of the fs_generator::generate_foo methods take a fs_inst * as the first parameter, just remove the name to quiet the compiler. src/intel/compiler/brw_fs_generator.cpp: In member function ‘void fs_generator::generate_barrier(fs_inst, brw_reg)’: src/intel/compiler/brw_fs_generator.cpp:743:41: warning: unused parameter ‘inst’ [-Wunused-parameter] fs_generator::generate_barrier(fs_inst inst, struct brw_reg src) ^~~~ src/intel/compiler/brw_fs_generator.cpp: In member function ‘void fs_generator::generate_discard_jump(fs_inst)’: src/intel/compiler/brw_fs_generator.cpp:1326:46: warning: unused parameter ‘inst’ [-Wunused-parameter] fs_generator::generate_discard_jump(fs_inst inst) ^~~~ src/intel/compiler/brw_fs_generator.cpp: In member function ‘void fs_generator::generate_pack_half_2x16_split(fs_inst, brw_reg, brw_reg, brw_reg)’: src/intel/compiler/brw_fs_generator.cpp:1675:54: warning: unused parameter ‘inst’ [-Wunused-parameter] fs_generator::generate_pack_half_2x16_split(fs_inst inst, ^~~~ src/intel/compiler/brw_fs_generator.cpp: In member function ‘void fs_generator::generate_shader_time_add(fs_inst, brw_reg, brw_reg, brw_reg)’: src/intel/compiler/brw_fs_generator.cpp:1743:49: warning: unused parameter ‘inst’ [-Wunused-parameter] fs_generator::generate_shader_time_add(fs_inst inst, ^~~~ src/intel/compiler/brw_vec4_generator.cpp: In function ‘void generate_set_simd4x2_header_gen9(brw_codegen, brw::vec4_instruction, brw_reg)’: src/intel/compiler/brw_vec4_generator.cpp:1412:52: warning: unused parameter ‘inst’ [-Wunused-parameter] vec4_instruction inst, ^~~~ src/intel/compiler/brw_vec4_generator.cpp: In function ‘void generate_mov_indirect(brw_codegen, brw::vec4_instruction, brw_reg, brw_reg, brw_reg, brw_reg)’: src/intel/compiler/brw_vec4_generator.cpp:1430:41: warning: unused parameter ‘inst’ [-Wunused-parameter] vec4_instruction inst, ^~~~ src/intel/compiler/brw_vec4_generator.cpp:1432:63: warning: unused parameter ‘length’ [-Wunused-parameter] struct brw_reg indirect, struct brw_reg length) ^~~~~~ Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/fs: Return mlen * 8 for size_read() for INTERPOLATE_AT_*	Jason Ekstrand	2018-04-23	1	-0/+2
\| \| \| \| \| \| \| \| \|	They are send messages and this makes size_read() and mlen agree. For both of these opcodes, the payload is just a dummy so mlen == 1 and this should decrease register pressure a bit. Reviewed-by: Francisco Jerez <[email protected]> Cc: [email protected]
*	anv/blorp: Do the gen11 BTI flush	Jason Ekstrand	2018-04-20	1	-0/+14
\| \| \| \| \|	Reviewed-by: Anuj Phogat <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965/fs: retype offset_reg to UD at load_ssbo	Jose Maria Casanova Crespo	2018-04-20	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	All operations with offset_reg at do_vector_read are done with UD type. So copy propagation was not working through the generated MOVs: mov(8) vgrf9:UD, vgrf7:D This change allows removing the MOV generated for reading the first components for 16-bit and 64-bit ssbo reads with non-constant offsets. Reviewed-by: Iago Toral Quiroga <[email protected]>
*	nir: Offset vertex_id by first_vertex instead of base_vertex	Neil Roberts	2018-04-19	2	-7/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	base_vertex will be zero for non-indexed calls and in that case we need vertex_id to be offset by the ‘first’ parameter instead. That is what we get with first_vertex. This is true for both GL and Vulkan. The freedreno driver is also setting vertex_id_zero_based on nir_options. In order to avoid breakage this patch switches the relevant code to handle SYSTEM_VALUE_FIRST_VERTEX so that it can retain the same behavior. v2: change a3xx/fd3_emit.c and a4xx/fd4_emit.c from SYSTEM_VALUE_BASE_VERTEX to SYSTEM_VALUE_FIRST_VERTEX (Kenneth). Reviewed-by: Ian Romanick <[email protected]> Cc: Rob Clark <[email protected]> Acked-by: Marek Olšák <[email protected]>
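The difference between the two system values can be written out as a small model. `struct draw_params` and the helper names are hypothetical and not taken from NIR or any driver; the sketch only encodes the rule stated above, that base_vertex is zero for non-indexed draws while first_vertex carries the `first` parameter.

```c
#include <stdbool.h>
#include <stdint.h>

/* Illustrative description of a draw call. */
struct draw_params {
    bool indexed;        /* true for indexed draws */
    int32_t first;       /* 'first' parameter of a non-indexed draw */
    int32_t basevertex;  /* base vertex of an indexed draw */
};

/* base_vertex: zero for non-indexed draws. */
static int32_t base_vertex_value(const struct draw_params *d)
{
    return d->indexed ? d->basevertex : 0;
}

/* first_vertex: same as base_vertex for indexed draws, 'first' otherwise. */
static int32_t first_vertex_value(const struct draw_params *d)
{
    return d->indexed ? d->basevertex : d->first;
}

/* Rebuild the API-visible vertex id from a zero-based id by adding
 * first_vertex, so non-indexed draws with first != 0 come out right. */
static int32_t vertex_id(const struct draw_params *d, int32_t zero_based_id)
{
    return zero_based_id + first_vertex_value(d);
}
```

For a non-indexed draw with `first = 10`, offsetting by base_vertex would give ids starting at 0, whereas offsetting by first_vertex gives ids starting at 10.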
*	spirv: Lower BaseVertex to FIRST_VERTEX instead of BASE_VERTEX	Neil Roberts	2018-04-19	2	-4/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	The base vertex in Vulkan is different from GL in that for non-indexed primitives the value is taken from the firstVertex parameter instead of being set to zero. This coincides with the new SYSTEM_VALUE_FIRST_VERTEX instead of BASE_VERTEX. v2 (idr): Add comment describing why SYSTEM_VALUE_FIRST_VERTEX is used for SpvBuiltInBaseVertex. Suggested by Jason. Reviewed-by: Ian Romanick <[email protected]> [v1] Reviewed-by: Jason Ekstrand <[email protected]>
*	intel: Handle firstvertex in an identical way to BaseVertex	Antia Puentes	2018-04-19	3	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \|	Until we set gl_BaseVertex to zero for non-indexed draw calls both have an identical value. The Vertex Elements are kept like that: * VE 1: <BaseVertex/firstvertex, BaseInstance, VertexID, InstanceID> * VE 2: <Draw ID, 0, 0, 0> v2 (idr): Mark nir_intrinsic_load_first_vertex as "unreachable" in emit_system_values_block and fs_visitor::nir_emit_vs_intrinsic.