Stacker: An Example Of Using LLVM

For example, suppose we have a global variable whose type is [24 x int]. The variable itself represents a pointer to that array. To subscript the
@@ -374,9 +367,9 @@ functions in the LLVM IR that make things easier. Here's what I learned:


Definition Of Operation Of Built In Words
LOGICAL OPERATIONS

@@ -579,7 +577,7 @@ using the following construction:

@@ -621,7 +619,7 @@ using the following construction:

are bitwise exclusive OR'd together and pushed back on the stack. For example, the sequence 1 3 XOR yields 2.
@@ -702,7 +700,7 @@ using the following construction:

@@ -789,7 +787,7 @@ using the following construction:

@@ -847,7 +845,7 @@ using the following construction:

how much to rotate. That is, ROLL with n=1 is the same as ROT and ROLL with n=2 is the same as ROT2.
@@ -900,7 +898,7 @@ using the following construction:

pushed back on the stack so this doesn't count as a "use ptr" in the FREE idiom.
@@ -948,26 +946,30 @@ using the following construction:

executed. In either case, after the (words....) have executed, execution continues immediately following the ENDIF.
+ 10 WHILE >d -- END
+ This will print the numbers from 10 down to 1. 10 is pushed on the
+ stack. Since that is non-zero, the while loop is entered. The top of
+ the stack (10) is printed out with >d. The top of the stack is
+ decremented, yielding 9, and control is transferred back to the WHILE
+ keyword. The process starts all over again and repeats until
+ the top of stack is decremented to 0, at which point the WHILE test
+ fails and control is transferred to the word after the END.
@@ -1294,13 +1296,26 @@ remainder of the story.

Directory Structure

The source code, test programs, and sample programs can all be found
-under the LLVM "projects" directory. You will need to obtain the LLVM sources
-to find it (either via anonymous CVS or a tarball. See the
-Getting Started document).
-
-Under the "projects" directory there is a directory named "Stacker". That
-directory contains everything, as follows:
+in the LLVM repository named llvm-stacker. This should be checked out to
+the projects directory so that it will auto-configure. To do that, make
+sure you have the llvm sources in llvm
+(see Getting Started) and then use these
+commands:


+% svn co http://llvm.org/svn/llvm-project/llvm-top/trunk llvm-top
+% cd llvm-top
+% make build MODULE=stacker
+


+Under the projects/llvm-stacker directory you will find the
+implementation of the Stacker compiler, as follows:

lib - contains most of the source code
- sample - contains the sample programs

The Lexer

See projects/Stacker/lib/compiler/Lexer.l

See projects/llvm-stacker/lib/compiler/Lexer.l

The Parser

See projects/Stacker/lib/compiler/StackerParser.y

See projects/llvm-stacker/lib/compiler/StackerParser.y

The Compiler

See projects/Stacker/lib/compiler/StackerCompiler.cpp

See projects/llvm-stacker/lib/compiler/StackerCompiler.cpp

The Runtime

See projects/Stacker/lib/runtime/stacker_rt.c

See projects/llvm-stacker/lib/runtime/stacker_rt.c

Compiler Driver

See projects/Stacker/tools/stkrc/stkrc.cpp

See projects/llvm-stacker/tools/stkrc/stkrc.cpp

Test Programs

See projects/Stacker/test/*.st

See projects/llvm-stacker/test/*.st

Exercise

@@ -1374,16 +1392,9 @@ interested, here are some things that could be implemented better:

Write an LLVM pass to compute the correct stack depth needed by the program. Currently the stack is set to a fixed number which means programs with large numbers of definitions might fail.

- Enhance to run on 64-bit platforms like SPARC. Right now the size of a
- pointer on 64-bit machines will cause incorrect results because of the
- 32-bit size of a stack element currently supported. This feature was not
- implemented because LLVM needs a union type to be able to support the
- different sizes correctly (portably and efficiently).

Write an LLVM pass to optimize the use of the global stack. The code emitted currently is somewhat wasteful. It gets cleaned up a lot by existing passes but more could be done.

- Add -O -O1 -O2 and -O3 optimization switches to the compiler driver to
- allow LLVM optimization without using "opt."

Make the compiler driver use the LLVM linking facilities (with IPO) before depending on GCC to do the final link.

Clean up parsing. It doesn't handle errors very well.

@@ -1409,7 +1420,7 @@ interested, here are some things that could be implemented better:

Reid Spencer
- LLVM Compiler Infrastructure
+ LLVM Compiler Infrastructure
Last modified: $Date$

Definition Of Operation Of Built In Words
LOGICAL OPERATIONS
Word	Name	Operation	Description
<	LT	w1 w2 -- b	-- b	The boolean value TRUE (-1) is pushed on to the stack.
BITWISE OPERATORS
Word	Name
ARITHMETIC OPERATORS
Word	Name	Two values are popped off the stack. The larger value is pushed back on to the stack.
STACK MANIPULATION OPERATORS
Word	Name
RROT	RROT	w1 w2 w3 -- w2 w3 w1	w1 w2 w3 -- w3 w1 w2	Reverse rotation. Like ROT, but it rotates the other way around. Essentially, the third element on the stack is moved to the top of the stack.
MEMORY OPERATORS
Word	Name
CONTROL FLOW OPERATORS
Word	Name
WHILE (words...) END	WHILE (words...) END
WHILE word END	WHILE word END	b -- b
- The boolean value on the top of the stack is examined. If it is non-zero then the
- "words..." between WHILE and END are executed. Execution then begins again at the WHILE where another
- boolean is popped off the stack. To prevent this operation from eating up the entire
- stack, you should push on to the stack (just before the END) a boolean value that indicates
- whether to terminate. Note that since booleans and integers can be coerced you can
- use the following "for loop" idiom:
- `(push count) WHILE (words...) -- END`
+ The boolean value on the top of the stack is examined (not popped). If
+ it is non-zero then the "word" between WHILE and END is executed.
+ Execution then begins again at the WHILE where the boolean on the top of
+ the stack is examined again. The stack is not modified by the WHILE...END
+ loop, only examined. It is imperative that the "word" in the body of the
+ loop ensure that the top of the stack contains the next boolean to examine
+ when it completes. Note that since booleans and integers can be coerced
+ you can use the following "for loop" idiom:
+ `(push count) WHILE word -- END`
For example:
- `10 WHILE DUP >d -- END`
- This will print the numbers from 10 down to 1. 10 is pushed on the stack. Since that is
- non-zero, the while loop is entered. The top of the stack (10) is duplicated and then
- printed out with >d. The top of the stack is decremented, yielding 9 and control is
- transfered back to the WHILE keyword. The process starts all over again and repeats until
- the top of stack is decremented to 0 at which the WHILE test fails and control is
- transfered to the word after the END.
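
The revised WHILE...END semantics (the top of the stack is examined, not popped) can be illustrated with a small simulation. This is only a sketch in Python, not the Stacker compiler or runtime: the names `run_while` and `print_and_decrement` are hypothetical, and the loop body is modeled the way the prose describes the countdown example (record the top of the stack, then decrement it in place). It makes no claim about how `>d` is actually implemented.

```python
def run_while(stack, body):
    """Simulate WHILE ... END as described in the revised text.

    The top of the stack is only examined, never popped. The body is
    responsible for leaving the next boolean on top of the stack.
    """
    while stack and stack[-1] != 0:
        body(stack)


printed = []  # stands in for console output in this sketch


def print_and_decrement(stack):
    """Hypothetical loop body: record the top value, then decrement it."""
    printed.append(stack[-1])  # assumption: printing leaves the value in place
    stack[-1] -= 1             # models the "--" (decrement) word


stack = [10]                   # "10" pushes the initial count
run_while(stack, print_and_decrement)

print(printed)  # [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
print(stack)    # [0]
```

Because the loop never pops the tested value, the final 0 is still on the stack when the loop exits, which is why the body must leave the next boolean on top each time it completes.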
INPUT & OUTPUT OPERATORS
Word	Name