aboutsummaryrefslogtreecommitdiffstats
path: root/src/amd/compiler
Commit message (Expand)AuthorAgeFilesLines
* aco: improve SCC handling in some SALU combinesRhys Perry2020-02-121-6/+4
* aco: disable some instruction combining if it could change an exec operandRhys Perry2020-02-121-2/+13
* aco: fix creating v_madak if v_mad_f32 has two sgpr literalsSamuel Pitoiset2020-02-111-1/+1
* aco: fix waiting for scalar stores before "writing back" data on GFX8-GFX9Samuel Pitoiset2020-02-101-6/+9
* aco: do not use ds_{read,write}2 on GFX6Samuel Pitoiset2020-02-071-4/+6
* aco: gfx10_wave64_bpermute reduce op to print_irRhys Perry2020-02-061-0/+1
* aco: fix gfx10_wave64_bpermuteRhys Perry2020-02-061-1/+6
* aco/optimizer: Don't combine uniform bool s_and to s_andn2.Timur Kristóf2020-02-051-0/+3
* aco: fix image_atomic_cmp_swapDaniel Schürmann2020-01-312-2/+3
* aco: fix MUBUF VS input loads when expanding vec3 to vec4 on GFX6Samuel Pitoiset2020-01-311-1/+6
* aco: Fix the meaning of is_atomic.Timur Kristóf2020-01-292-7/+7
* aco: simplify gathering of MIMG address componentsDaniel Schürmann2020-01-291-129/+79
* aco: simplify adjust_sample_index_using_fmask() & get_image_coords()Daniel Schürmann2020-01-291-26/+27
* aco: fix register allocation with multiple live-range splitsDaniel Schürmann2020-01-291-7/+13
* aco: reorder VMEM operands in ACO IRDaniel Schürmann2020-01-2911-132/+153
* aco: fix exec mask consistency issuesRhys Perry2020-01-291-3/+7
* aco: parallelcopy exec mask before s_wqmRhys Perry2020-01-291-0/+7
* aco: fill reg_demand with sensible information in add_coupling_code()Rhys Perry2020-01-291-14/+22
* aco: improve assertion at the end of spillerRhys Perry2020-01-291-1/+1
* aco: set exec_potentially_empty after continues/breaks in nested IFsRhys Perry2020-01-292-15/+51
* aco: error when block has no logical preds but VGPRs are live at the startRhys Perry2020-01-291-1/+2
* aco: don't always add logical edges from continue_break blocks to headersRhys Perry2020-01-291-1/+2
* aco: only create parallelcopy to restore exec at loop exit if neededRhys Perry2020-01-291-2/+29
* aco: don't update demand in add_coupling_code() for loop headersRhys Perry2020-01-291-3/+9
* aco: don't consider loop header blocks branch blocks in add_coupling_codeRhys Perry2020-01-291-1/+1
* aco: fix target calculation when vgpr spilling introduces sgpr spillingRhys Perry2020-01-291-2/+1
* aco: fix VS input loads with MUBUF on GFX6Samuel Pitoiset2020-01-291-1/+7
* aco: run p_wqm instructions in WQMRhys Perry2020-01-291-2/+3
* aco: ensure predecessors' p_logical_end is in WQM when a p_phi is in WQMRhys Perry2020-01-291-1/+10
* aco: implement VK_AMD_shader_explicit_vertex_parameterSamuel Pitoiset2020-01-292-5/+42
* aco: fix combine_salu_not_bitwise() when SCC is usedDaniel Schürmann2020-01-281-0/+1
* aco: fix rebase error from GS copy shader supportRhys Perry2020-01-281-0/+15
* aco: use MUBUF in some situations instead of splitting vertex fetchesRhys Perry2020-01-281-10/+25
* aco: value-number MUBUF instructionsRhys Perry2020-01-281-2/+16
* aco: handle unaligned vertex fetch on GFX10Rhys Perry2020-01-281-1/+2
* aco: skip unused channels at the start when fetching verticesRhys Perry2020-01-281-1/+10
* aco: rework vertex fetching a bitRhys Perry2020-01-283-74/+128
* aco: fix literal application with v_cndmask_b32/v_addc_co_u32/etcRhys Perry2020-01-271-0/+3
* aco: always add sgprs to sgpr_ids when choosing literalsRhys Perry2020-01-271-3/+5
* aco: fix operand to scc when selecting SGPR ufind_msb/ifind_msbRhys Perry2020-01-271-1/+1
* aco: fix WaR check for >64-bit FLAT/GLOBAL instructionsRhys Perry2020-01-271-2/+2
* aco: copy the literal offset of SMEM instructions to a temporarySamuel Pitoiset2020-01-241-5/+10
* aco: fix a hazard with v_interp_* and v_{read,readfirst}lane_* on GFX6Samuel Pitoiset2020-01-242-0/+27
* aco: fix a hardware bug for MRTZ exports on GFX6Samuel Pitoiset2020-01-241-0/+9
* aco: combine MRTZ (depth, stencil, sample mask) exportsSamuel Pitoiset2020-01-242-280/+259
* aco/gfx10: Fix VcmpxExecWARHazard mitigation.Timur Kristóf2020-01-242-3/+2
* aco: Transform uniform bitwise instructions to 32-bit if possible.Timur Kristóf2020-01-241-0/+87
* aco: use nir_move_copiesRhys Perry2020-01-241-1/+2
* aco: implement GS copy shadersRhys Perry2020-01-244-148/+327
* aco: remove needs_instance_idRhys Perry2020-01-242-6/+0