[PATCH v2 3/3] msvcrt: Add an SSE2 memset_aligned_32 implementation.

Rémi Bernon rbernon at codeweavers.com
Tue Sep 14 06:13:35 CDT 2021


On 9/14/21 1:10 PM, Piotr Caban wrote:
> On 9/14/21 12:55 PM, Rémi Bernon wrote:
>> On 9/14/21 12:15 PM, Marvin wrote:
>> * guard it with #ifdef __SSE2__, and in that case now that we import 
>> x86intrin.h we could use intel intrinsics instead of inline assembly,
> I didn't check that __SSE2__ is not defined when i686-w64-mingw is used. 
> I think that the SSE enabled version should be available in this case.
> 

It may be defined, and it is for me when I build locally, but not on the 
testbot for some reason. You can have the same result with -mno-sse.

>> * or make it an external assembly function.
> We can also consider moving the ERMS version to separate function to 
> avoid using inline assembly.

That too. The call has an overhead though (but for large sizes that may 
not matter much).
-- 
Rémi Bernon <rbernon at codeweavers.com>



More information about the wine-devel mailing list