optimize "p != gep p, ..." better. This allows us to compile
authorChris Lattner <sabre@nondot.org>
Tue, 22 Apr 2008 02:53:33 +0000 (02:53 +0000)
committerChris Lattner <sabre@nondot.org>
Tue, 22 Apr 2008 02:53:33 +0000 (02:53 +0000)
commit10c0d9179e596dd295d85cad468f6d68015f5a46
tree62085a8028d110d43fd1c9a7000deded8f85dd1e
parenteb411291fa7614496ec8b51b0ced9f6cc2b78313
optimize "p != gep p, ..." better.  This allows us to compile
getelementptr-seteq.ll into:

define i1 @test(i64 %X, %S* %P) {
%C = icmp eq i64 %X, -1 ; <i1> [#uses=1]
ret i1 %C
}

instead of:

define i1 @test(i64 %X, %S* %P) {
%A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1]
%C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1]
ret i1 %C
}

And fixes the second half of PR2235.  This speeds up the insertion sort
case by 45%, from 1.12s to 0.77s.  In practice, this will significantly
speed up for loops structured like:

for (double *P = Base + N; P != Base; --P)
  ...

Which happens frequently for C++ iterators.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50079 91177308-0d34-0410-b5e6-96231b3b80d8
lib/Transforms/Scalar/InstructionCombining.cpp
test/Transforms/InstCombine/getelementptr-seteq.ll [new file with mode: 0644]