path: root/src/amd/compiler
Commit message | Author | Age | Files | Lines
* aco: add Instruction::usesModifiers() and add more checks in the optimizer | Rhys Perry | 2019-11-08 | 2 | -7/+23
* radv: adjust loop unrolling heuristics for int64 | Rhys Perry | 2019-11-07 | 1 | -7/+1
* aco: workaround Tonga/Iceland hardware bug | Daniel Schürmann | 2019-11-07 | 1 | -5/+5
* radv: determine shaders wavesize at pipeline level | Samuel Pitoiset | 2019-11-06 | 2 | -5/+4
* aco: fix accidential reordering of instructions when scheduling | Daniel Schürmann | 2019-11-04 | 1 | -10/+47
* aco: only use single-dword loads/stores for spilling | Daniel Schürmann | 2019-11-04 | 1 | -41/+10
* aco: fix immediate offset for spills if scratch is used | Daniel Schürmann | 2019-11-04 | 1 | -6/+6
* aco: implement VGPR spilling | Daniel Schürmann | 2019-10-30 | 1 | -7/+162
* aco: always set scratch_offset in startpgm | Daniel Schürmann | 2019-10-30 | 3 | -23/+22
* aco: omit linear VGPRs as spill variables | Daniel Schürmann | 2019-10-30 | 1 | -4/+8
* aco: ensure that spilled VGPR reloads are done after p_logical_start | Daniel Schürmann | 2019-10-30 | 1 | -34/+43
* aco: simplify calculation of target register pressure when spilling | Daniel Schürmann | 2019-10-30 | 1 | -39/+12
* aco: fix new_demand calculation for first instructions | Rhys Perry | 2019-10-30 | 1 | -4/+7
* aco: don't add interferences between spilled phi operands | Daniel Schürmann | 2019-10-30 | 1 | -8/+8
* aco: consider loop_exit blocks like merge blocks, even if they have only one ... | Daniel Schürmann | 2019-10-30 | 1 | -2/+2
* aco: don't insert the exec mask into set of live-out variables when spilling | Daniel Schürmann | 2019-10-30 | 1 | -14/+6
* aco: fix transitive affinities of spilled variables | Daniel Schürmann | 2019-10-30 | 1 | -25/+79
* aco: fix live-range splits of phis | Daniel Schürmann | 2019-10-30 | 1 | -14/+23
* aco: remove potential critical edge on loops. | Daniel Schürmann | 2019-10-30 | 2 | -18/+23
* aco: improve live variable analysis | Daniel Schürmann | 2019-10-30 | 1 | -25/+64
* aco: Lower to CSSA | Daniel Schürmann | 2019-10-30 | 4 | -41/+268
* aco: rename README to README.md | Rhys Perry | 2019-10-30 | 1 | -0/+0
* aco: a couple loop handling fixes for GFX10 hazard pass | Rhys Perry | 2019-10-30 | 1 | -3/+3
* aco: try to group together VMEM loads of the same resource | Rhys Perry | 2019-10-30 | 1 | -10/+56
* aco: don't schedule instructions through depending VMEM instructions | Daniel Schürmann | 2019-10-30 | 1 | -0/+3
* aco: add can_reorder flags to load_ubo and load_constant | Daniel Schürmann | 2019-10-30 | 1 | -5/+9
* aco: only skip RAR dependencies if the variable is killed somewhere | Daniel Schürmann | 2019-10-30 | 1 | -21/+46
* aco: restrict scheduling depending on max_waves | Daniel Schürmann | 2019-10-30 | 1 | -9/+15
* aco: Introduce vgpr_limit to keep track of available VGPRs. | Timur Kristóf | 2019-10-28 | 6 | -5/+12
* aco: Implement subgroup shuffle in GFX10 wave64 mode. | Timur Kristóf | 2019-10-28 | 6 | -16/+113
* aco: Remove dead code in reduction lowering. | Rhys Perry | 2019-10-28 | 1 | -16/+14
* aco: Fix reductions on GFX10. | Rhys Perry | 2019-10-28 | 3 | -18/+95
* aco: Refactor hazard mitigations, separate pass for GFX10. | Timur Kristóf | 2019-10-25 | 1 | -70/+113
* aco/gfx10: Fix mitigation of VMEMtoScalarWriteHazard. | Timur Kristóf | 2019-10-25 | 1 | -10/+20
* aco/gfx10: Mitigate LdsBranchVmemWARHazard. | Timur Kristóf | 2019-10-25 | 2 | -0/+66
* aco/gfx10: Mitigate SMEMtoVectorWriteHazard. | Timur Kristóf | 2019-10-25 | 2 | -0/+70
* aco/gfx10: Mitigate VcmpxExecWARHazard. | Timur Kristóf | 2019-10-25 | 2 | -0/+59
* aco/gfx10: Mitigate VcmpxPermlaneHazard. | Timur Kristóf | 2019-10-25 | 2 | -0/+28
* aco/gfx10: Add notes about some GFX10 hazards. | Timur Kristóf | 2019-10-25 | 2 | -2/+37
* aco: take LDS into account when calculating num_waves | Rhys Perry | 2019-10-23 | 4 | -7/+42
* aco: increase accuracy of SGPR limits | Rhys Perry | 2019-10-23 | 6 | -28/+95
* Revert "aco: only emit waitcnt on loop continues if we there was some load or... | Rhys Perry | 2019-10-22 | 1 | -1/+1
* aco: add missing bld.scc() | Rhys Perry | 2019-10-22 | 1 | -1/+1
* aco: keep can_reorder/barrier when combining addition into SMEM | Rhys Perry | 2019-10-22 | 1 | -0/+2
* aco: add a few missing checks in value numbering | Rhys Perry | 2019-10-22 | 1 | -1/+4
* aco: use ds_read2_b64/ds_write2_b64 | Rhys Perry | 2019-10-22 | 1 | -7/+24
* aco: properly combine additions into ds_write2_b64/ds_read2_b64 | Rhys Perry | 2019-10-22 | 1 | -1/+2
* aco: fix sparse store_lds() | Rhys Perry | 2019-10-22 | 1 | -8/+40
* aco: create load_lds/store_lds helpers | Rhys Perry | 2019-10-22 | 1 | -176/+195
* aco: fix 64-bit p_extract_vector on 32-bit p_create_vector | Rhys Perry | 2019-10-22 | 1 | -1/+2