[X86, AVX] instcombine common cases of vperm2* intrinsics into shuffles
vperm2* intrinsics are just shuffles.
In a few special cases (when the immediate's zero bits are set), they're not even shuffles.
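
As a rough illustration (not the code in this patch), here is a minimal
sketch of the immediate-to-shuffle-mask decoding, assuming the documented
VPERM2F128/VPERM2I128 encoding: imm bits [1:0] and [5:4] select one of the
four 128-bit source lanes for the destination's low and high lanes, and
bits [3] and [7] zero the corresponding lane instead. The helper name
decodeVPerm2Mask is made up for the example, not the name the patch adds:

#include <cstdint>
#include <cstdio>
#include <optional>
#include <vector>

// NumElts is the element count of one source vector (e.g. 4 for <4 x double>).
// Returns nullopt when a zero bit is set -- the case that is "not even a
// shuffle" and is left as a TODO here, mirroring the TODOs in the patch.
std::optional<std::vector<int>> decodeVPerm2Mask(uint8_t Imm, int NumElts) {
  if (Imm & 0x88)              // bits 3/7 request a zeroed destination lane
    return std::nullopt;
  int HalfElts = NumElts / 2;  // elements per 128-bit lane
  std::vector<int> Mask(NumElts);
  for (int Half = 0; Half != 2; ++Half) {
    int Sel = (Imm >> (4 * Half)) & 0x3; // which of the 4 source lanes
    for (int i = 0; i != HalfElts; ++i)
      Mask[Half * HalfElts + i] = Sel * HalfElts + i; // index into <V1, V2>
  }
  return Mask;
}

int main() {
  // 0x31: destination low half <- high lane of V1, high half <- high lane
  // of V2, i.e. the shufflevector mask <2, 3, 6, 7> over concat(V1, V2).
  if (auto Mask = decodeVPerm2Mask(0x31, 4))
    for (int Idx : *Mask)
      std::printf("%d ", Idx); // prints: 2 3 6 7
  std::printf("\n");
}
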
Optimizing intrinsics in InstCombine is better than
handling this in the front-end for at least two reasons:
1. Doing this in the front-end would transform custom-written SSE intrinsic
code even at -O0, which makes vector coders really angry (and so I have
regrets about some patches from last week).
2. Mask-conversion logic in header files is hard to write and subsequently
hard to read.
There are a couple of TODOs in this patch to complete this optimization.
Differential Revision: http://reviews.llvm.org/D8486
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@232852 91177308-0d34-0410-b5e6-96231b3b80d8