util: return 0 for NaNs in float_to_ubyte

d3d10 requires NaNs to get converted to 0 for float->unorm conversions (and float->int etc.). GL spec probably doesn't care in general, but it would make sense to have reasonable behavior in any case imho - the old code was converting negative NaNs to 0, and positive NaNs to 255. (Note that using float comparison isn't actually all that much more effort in any case, at least with sse2 it's just float comparison (ucommiss) instead of int one - I converted the second comparison to float too simply because it saves the probably somewhat expensive transfer of the float from simd to int domain (with sse2 via stack), so the generated code actually has 2 less instructions, although float comparisons are more expensive than int ones.) Reviewed-by: Brian Paul <[email protected]>
author: Roland Scheidegger <[email protected]> 2018-08-03 05:34:07 +0200
committer: Roland Scheidegger <[email protected]> 2018-08-03 17:07:38 +0200
commit: c72f91debab68d2c0ac7f71807a451b696859afd (patch)
tree: 7a8e2bb9aeba697e0f55c8fa85e9bc51e39f4f15 /src/gallium
parent: 1d900e55fd5587d06184e01e57859cedbbdc9af8 (diff)
1 files changed, 5 insertions, 6 deletions
diff --git a/src/gallium/auxiliary/util/u_math.h b/src/gallium/auxiliary/util/u_math.h
index 79869a119af..712305c98a4 100644
--- a/src/gallium/auxiliary/util/u_math.h
+++ b/src/gallium/auxiliary/util/u_math.h
@@ -360,7 +360,6 @@ uif(uint32_t ui)
 
 /**
  * Convert ubyte to float in [0, 1].
- * XXX a 256-entry lookup table would be slightly faster.
  */
 static inline float
 ubyte_to_float(ubyte ub)
@@ -375,16 +374,16 @@ ubyte_to_float(ubyte ub)
 static inline ubyte
 float_to_ubyte(float f)
 {
-   union fi tmp;
-
-   tmp.f = f;
-   if (tmp.i < 0) {
+   /* return 0 for NaN too */
+   if (!(f > 0.0f)) {
       return (ubyte) 0;
    }
-   else if (tmp.i >= 0x3f800000 /* 1.0f */) {
+   else if (f >= 1.0f) {
       return (ubyte) 255;
    }
    else {
+      union fi tmp;
+      tmp.f = f;
       tmp.f = tmp.f * (255.0f/256.0f) + 32768.0f;
       return (ubyte) tmp.i;
    }
author	Roland Scheidegger <[email protected]>	2018-08-03 05:34:07 +0200
committer	Roland Scheidegger <[email protected]>	2018-08-03 17:07:38 +0200
commit	c72f91debab68d2c0ac7f71807a451b696859afd (patch)
tree	7a8e2bb9aeba697e0f55c8fa85e9bc51e39f4f15 /src/gallium
parent	1d900e55fd5587d06184e01e57859cedbbdc9af8 (diff)