diff options
author | Matt Turner <[email protected]> | 2017-09-28 20:59:49 -0700 |
---|---|---|
committer | Matt Turner <[email protected]> | 2017-10-04 14:08:54 -0700 |
commit | 7e88f934697fd4ce05093c077b5f11454e377ed0 (patch) | |
tree | 42ebb0420d905cf3df1501645708e62da44eaa10 /doxygen | |
parent | b541945c2027990ac571184bbf8e01285be0e33a (diff) |
i965/fs: Rewrite fsign64 to skip the float -> double conversion
... without the float -> double conversion. Low power parts have
additional restrictions when it comes to operating on 64-bit types, and
the instruction used to do the conversion violates one of them:
specifically, the restriction that "Source and Destination horizontal
stride must be aligned to the same qword".
Previously we generated a float and then converted, but we can avoid the
conversion by using the same extract-the-sign-bit + or-in-1.0 algorithm
by directly operating on the high four bytes of each double-precision
component in the result.
In SIMD8 and SIMD16 this cuts one instruction from the implementation,
and more importantly that instruction is the one which violated the
regioning restriction.
Along the way I removed some comments that I did not think helped, and
some code about double comparisons which does not seem to be necessary
today.
This prevents validation failures caught by the new EU validation code
added in later patches.
Reviewed-by: Iago Toral Quiroga <[email protected]>
Diffstat (limited to 'doxygen')
0 files changed, 0 insertions, 0 deletions