Context: I need to write a mostly stateless compiler which transforms VM bytecode into machine codes. Most VM commands can be translated statelessly with pure function like the following:
compilePop = ["mov ax, @sp", "dec ax", "mov @sp, ax"]
compile :: VM_COMMAND -> [String]
compile STACK_POP = compilePop
-- compile whole program
compileAll :: [VM_COMMAND] -> [String]
compileAll = flatMap compile
But some commands need inserting labels which should be different for every call.
I understand how to do this with a state object "global" for entire compiler:
compileGt n = [label ++ ":", "cmp ax,bx", "jgt " ++ label]
where label = "cmp" ++ show n
compile :: Int -> COMPILER_STATE -> VM_COMMAND -> (COMPILER_STATE, [String])
-- here state currently contains only single integer, but it will grow larger
compile lcnt STACK_POP = (lcnt, compilePop)
compile lcnt CMP_GT = (lcnt + 1, compileGt lcnt)
compileAll commands = snd $ foldr compile commands 0
-- incorrect, but you get the idea
But I think this is bad because every specialized compile function needs only little piece of a state or even none at all. For example in not such a purely functional JavaScript I'd implement specialized compile functions with local state in a closure.
// compile/gt.js
var i = 0;
export default const compileGt = () => {
const label = "cmp" + i++;
return [label ++ ":", "cmp ax,bx", "jgt " ++ label];
};
// index.js
import compileGt from './compile/gt';
function compile (cmd) {
switch (cmd) {
case CMP_GT: return compileGt();
// ...
}
}
export default const compileAll = (cmds) => cmds.flatMap(compile);
So the question is how can I do the same in Haskell or an explanation why it's really bad idea. Should it be something like that?
type compileFn = State -> VM_COMMAND -> [String]
(compileFn, State) -> VM_COMMAND -> ([String], (compileFn, State))
If you have...
data Big = Big { little :: Little, stuff :: Whatever }
... you can define your...
littleProcessor :: State Little [String]
... and then use a function like this one...
innerState :: Monad m
=> (s -> i) -> (i -> s -> s) -> StateT i m a -> StateT s m a
innerState getI setI (StateT m) = StateT $ \s -> do
(a, i) <- m (getI s)
return (a, setI i s)
... to lift it to the bigger state:
bigProcessor :: State Big [String]
bigProcessor = innerState little (\l b -> b {little = l}) littleProcessor
(Add auxiliary definitions to taste.)
The use of the getter/setter pair in innerState
makes it look like it should be possible to phrase it in terms of lenses. Indeed, zoom
from lens is basically innerState
with minimised boilerplate:
{-# LANGUAGE TemplateHaskell #-}
import Control.Lens
data Big = Big { _little :: Little, _stuff :: Whatever }
makeLenses ''Big -- little is now a lens.
bigProcessor :: State Big [String]
bigProcessor = zoom little littleProcessor
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With