X-Git-Url: http://demsky.eecs.uci.edu/git/?a=blobdiff_plain;f=lib%2FTarget%2FSparc%2FREADME.txt;h=b4991fe5790b705e6dfa5a8e4bfc5aa351ded1ea;hb=15a1a226be46dfaaa15c537daa9722b6216a981d;hp=5f9f85f814d9b14a97d843542587d7050010764d;hpb=0d8fcd3218ed93e338a2e7b845f358f1c6f74d58;p=oota-llvm.git diff --git a/lib/Target/Sparc/README.txt b/lib/Target/Sparc/README.txt index 5f9f85f814d..b4991fe5790 100644 --- a/lib/Target/Sparc/README.txt +++ b/lib/Target/Sparc/README.txt @@ -1,86 +1,59 @@ -Meta TODO list: -1. Convert asmprinter to use lib/CodeGen/AsmPrinter for global init printing - constant pool printing, etc. -2. Create a new DAG -> DAG instruction selector, by adding patterns to the - instructions. -3. profit! - - -SparcV8 backend skeleton ------------------------- - -This directory houses a 32-bit SPARC V8 backend employing an expander-based -instruction selector. It is not yet functionally complete. Watch -this space for more news coming soon! - -Current expected test failures ------------------------------- - -Here are the currently-expected SingleSource failures for V8 -(Some C++ programs are crashing in libstdc++ at the moment; -I'm not sure why.) - - (llc) SingleSource/Regression/C++/EH/exception_spec_test - (llc) SingleSource/Regression/C++/EH/throw_rethrow_test - -Here are the currently-expected MultiSource failures for V8: - - (llc,cbe) MultiSource/Applications/d/make_dparser - (llc,cbe) MultiSource/Applications/hexxagon - (llc) MultiSource/Benchmarks/Fhourstones - (llc,cbe) MultiSource/Benchmarks/McCat/03-testtrie - (llc) MultiSource/Benchmarks/McCat/18-imp - (llc,cbe) MultiSource/Benchmarks/Prolangs-C/bison/mybison - (llc,cbe) MultiSource/Benchmarks/Prolangs-C/fixoutput - (llc,cbe) MultiSource/Benchmarks/Prolangs-C/gnugo - (llc,cbe) MultiSource/Benchmarks/Prolangs-C/plot2fig - (llc,cbe) MultiSource/Benchmarks/Ptrdist/anagram - (llc,cbe) MultiSource/Benchmarks/FreeBench/analyzer - * DANGER * analyzer will run the machine out of VM - (I don't know whether the following fail in cbe:) - (llc) MultiSource/Benchmarks/FreeBench/distray - (llc) MultiSource/Benchmarks/FreeBench/fourinarow - (llc) MultiSource/Benchmarks/FreeBench/pifft - (llc) MultiSource/Benchmarks/MallocBench/gs - (llc) MultiSource/Benchmarks/Prolangs-C++/deriv1 - (llc) MultiSource/Benchmarks/Prolangs-C++/deriv2 - -Known SPEC failures for V8 (probably not an exhaustive list): - - (llc) 134.perl - (llc) 177.mesa - (llc) 188.ammp -- FPMover bug? - (llc) 256.bzip2 - (llc,cbe) 130.li - (native,llc,cbe) 126.gcc - (native,llc,cbe) 255.vortex - To-do ----- -* support shl on longs (fourinarow needs this) -* support casting 64-bit integers to FP types (fhourstones needs this) -* support FP rem (call fmod) - * Keep the address of the constant pool in a register instead of forming its address all of the time. - -* Change code like this: - or %o0, %lo(.CPI_main_0), %o0 - ld [%o0+0], %o0 - into: - ld [%o0+%lo(.CPI_main_0)], %o0 - for constant pool access. - * We can fold small constant offsets into the %hi/%lo references to constant pool addresses as well. - -* Directly support select instructions, and fold setcc instructions into them - where possible. I think this is what afflicts the inner loop of Olden/tsp - (hot block = tsp():no_exit.1.i, overall GCC/LLC = 0.03). - -* Generate fsqrtd for calls to sqrt() (~ 4% speedup on Olden/tsp). - -$Date$ - +* When in V9 mode, register allocate %icc[0-3]. +* Add support for isel'ing UMUL_LOHI instead of marking it as Expand. +* Emit the 'Branch on Integer Register with Prediction' instructions. It's + not clear how to write a pattern for this though: + +float %t1(int %a, int* %p) { + %C = seteq int %a, 0 + br bool %C, label %T, label %F +T: + store int 123, int* %p + br label %F +F: + ret float undef +} + +codegens to this: + +t1: + save -96, %o6, %o6 +1) subcc %i0, 0, %l0 +1) bne .LBBt1_2 ! F + nop +.LBBt1_1: ! T + or %g0, 123, %l0 + st %l0, [%i1] +.LBBt1_2: ! F + restore %g0, %g0, %g0 + retl + nop + +1) should be replaced with a brz in V9 mode. + +* Same as above, but emit conditional move on register zero (p192) in V9 + mode. Testcase: + +int %t1(int %a, int %b) { + %C = seteq int %a, 0 + %D = select bool %C, int %a, int %b + ret int %D +} + +* Emit MULX/[SU]DIVX instructions in V9 mode instead of fiddling + with the Y register, if they are faster. + +* Codegen bswap(load)/store(bswap) -> load/store ASI + +* Implement frame pointer elimination, e.g. eliminate save/restore for + leaf fns. +* Fill delay slots + +* Implement JIT support