Programming Languages Research Group: Git

author	Chandler Carruth <chandlerc@gmail.com>
	Tue, 23 Sep 2014 10:08:29 +0000 (10:08 +0000)
committer	Chandler Carruth <chandlerc@gmail.com>
	Tue, 23 Sep 2014 10:08:29 +0000 (10:08 +0000)
commit	8f637786d825f631ecdd58e3c773f06505310048
tree	026457e7f3155db6e325f1533bf96f11acc4e30a	tree \| snapshot
parent	5f843038fbc274615ddc113e421c379552e212c8	commit \| diff

[x86] Teach the AVX1 path of the new vector shuffle lowering one more
trick that I missed.

VPERMILPS has a non-immediate memory operand mode that allows it to do
asymetric shuffles in the two 128-bit lanes. Use this rather than two
shuffles and a blend.

However, it turns out the variable shuffle path to VPERMILPS (and
VPERMILPD, although that one offers no functional differenc from the
immediate operand other than variability) wasn't even plumbed through
codegen. Do such plumbing so that we can reasonably emit
a variable-masked VPERMILP instruction. Also plumb basic comment parsing
and printing through so that the tests are reasonable.

There are still a few tests which don't show the shuffle pattern. These
are tests with undef lanes. I'll teach the shuffle decoding and printing
to handle undef mask entries in a follow-up. I've looked at the masks
and they seem reasonable.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@218300 91177308-0d34-0410-b5e6-96231b3b80d8

lib/Target/X86/Utils/X86ShuffleDecode.cpp		diff \| blob \| history
lib/Target/X86/Utils/X86ShuffleDecode.h		diff \| blob \| history
lib/Target/X86/X86ISelLowering.cpp		diff \| blob \| history
lib/Target/X86/X86ISelLowering.h		diff \| blob \| history
lib/Target/X86/X86InstrFragmentsSIMD.td		diff \| blob \| history
lib/Target/X86/X86InstrSSE.td		diff \| blob \| history
lib/Target/X86/X86MCInstLower.cpp		diff \| blob \| history
test/CodeGen/X86/vector-shuffle-256-v8.ll		diff \| blob \| history