radv: enable denorms for 64-bit and 16-bit floats

Similar to RadeonSI.

This fixes:
dEQP-VK.image.texel_view_compatible.graphic.basic.attachment_read.bc*r16g16b16a16_sfloat
dEQP-VK.image.extended_usage_bit.attachment_write.r16_sfloat

Signed-off-by: Samuel Pitoiset <[email protected]>
Reviewed-by: Bas Nieuwenhuizen <[email protected]>
author: Samuel Pitoiset <[email protected]> 2017-12-28 22:55:27 +0100
committer: Samuel Pitoiset <[email protected]> 2018-01-05 09:51:33 +0100
commit: ec63ab39be0b48550ec6da58537566d0041a628e (patch)
tree: 9485e0325a9279bb27df527f89f4eebcd6fe84e9 /src
parent: 7643c715278caa8c98246c2359d7a70ecb3b7455 (diff)
1 files changed, 14 insertions, 0 deletions
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index ac0a2e67d40..48e2920a158 100644
--- a/src/amd/common/ac_nir_to_llvm.c
+++ b/src/amd/common/ac_nir_to_llvm.c
@@ -6892,6 +6892,20 @@ static void ac_compile_llvm_module(LLVMTargetMachineRef tm,
 	/* +3 for scratch wave offset and VCC */
 	config->num_sgprs = MAX2(config->num_sgprs,
 	                         shader_info->num_input_sgprs + 3);
+
+	/* Enable 64-bit and 16-bit denormals, because there is no performance
+	 * cost.
+	 *
+	 * If denormals are enabled, all floating-point output modifiers are
+	 * ignored.
+	 *
+	 * Don't enable denormals for 32-bit floats, because:
+	 * - Floating-point output modifiers would be ignored by the hw.
+	 * - Some opcodes don't support denormals, such as v_mad_f32. We would
+	 *   have to stop using those.
+	 * - SI & CI would be very slow.
+	 */
+	config->float_mode |= V_00B028_FP_64_DENORMS;
 }
 
 static void
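
The comment in the patch implies that 64-bit and 16-bit denormals share one control, separate from the 32-bit one. A minimal sketch of that bit manipulation follows. It assumes a float-mode layout in which bits 5:4 control 32-bit denormals and bits 7:6 control 64-bit and 16-bit denormals. The `sketch_` names and the numeric values are illustrative stand-ins, not copied from the driver headers.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-ins for the V_00B028_FP_*_DENORMS values. The numbers
 * are an assumption: bits 5:4 for 32-bit, bits 7:6 for 64-bit and 16-bit. */
#define SKETCH_FP_32_DENORMS 0x30u
#define SKETCH_FP_64_DENORMS 0xc0u

/* Hypothetical, reduced version of the shader config struct. */
struct sketch_config {
	uint32_t float_mode;
};

/* Turn on 64-bit/16-bit denormals and leave every other bit as it was. */
static void sketch_enable_fp64_fp16_denorms(struct sketch_config *config)
{
	uint32_t other_bits = config->float_mode & ~SKETCH_FP_64_DENORMS;

	config->float_mode |= SKETCH_FP_64_DENORMS;

	/* The 32-bit denormal bits and the rounding bits must be unchanged. */
	assert((config->float_mode & ~SKETCH_FP_64_DENORMS) == other_bits);
}

/* True when both 32-bit denormal bits are set. */
static bool sketch_fp32_denorms_enabled(const struct sketch_config *config)
{
	return (config->float_mode & SKETCH_FP_32_DENORMS) == SKETCH_FP_32_DENORMS;
}

/* True when both 64-bit/16-bit denormal bits are set. */
static bool sketch_fp64_fp16_denorms_enabled(const struct sketch_config *config)
{
	return (config->float_mode & SKETCH_FP_64_DENORMS) == SKETCH_FP_64_DENORMS;
}
```

Because the patch uses `|=` rather than plain assignment, any float-mode bits set earlier (for example, rounding mode) are preserved, and the 32-bit denormal bits stay cleared unless something else sets them.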