On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and vinsertf128...
authorNadav Rotem <nrotem@apple.com>
Fri, 18 Jan 2013 23:10:30 +0000 (23:10 +0000)
committerNadav Rotem <nrotem@apple.com>
Fri, 18 Jan 2013 23:10:30 +0000 (23:10 +0000)
commit48177ac90fb940833b9deea1a6716092348cfe82
tree5252f0617e256f0dd1f8b26082f05a088d7232b9
parent7336f7febb5170b374a4cbffee273ad82ff8a1a3
On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and vinsertf128) is faster than using a single vmovups instruction.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@172868 91177308-0d34-0410-b5e6-96231b3b80d8
lib/Target/X86/X86ISelLowering.cpp
test/CodeGen/X86/sandybridge-loads.ll [new file with mode: 0644]
test/CodeGen/X86/v8i1-masks.ll