
Parser for Calculator example - student project

Hello all,

I'm working on a lab for a human-computer interface course. We're supposed to implement a calculator, in the language of our choice, illustrating good separation of concerns. Guess what language I chose to implement the business logic. :) (My UI is WPF-based; I use an M-V-VM architecture for the presentation tier.)

I know Calculator is a classic example, but unfortunately it's so classic that I can't find any sufficiently simple examples online to help me write a parser to generate an abstract syntax tree.

Here is what I have so far (very MSDN-ish; also, I've borrowed a bit from the Channel 9 lectures):

type Expr =
    | Num of double
    | Add of Expr * Expr
    | Sub of Expr * Expr
    | Mul of Expr * Expr
    | Div of Expr * Expr

let rec Eval exp =
    match exp with
    | Num n     -> n
    | Add (x,y) -> Eval(x) + Eval(y)
    | Sub (x,y) -> Eval(x) - Eval(y)
    | Mul (x,y) -> Eval(x) * Eval(y)
    | Div (_,y) when Eval(y) = 0.0 -> failwith "Divisor cannot be zero."
    | Div (x,y) -> Eval(x) / Eval(y)

//BEGIN: this part is broken
let (|Number|OpAdd|OpSub|OpMul|OpDiv|_|) (s : string) =
    let mutable floatvalue = 0.0

    if System.Double.TryParse(str, &floatvalue) then Some(floatvalue)
    elif String.Contains("+") then OpAdd(?,?)
    elif String.Contains("-") then OpSub(?,?)
    elif String.Contains("*") then OpMul(?,?)
    elif String.Contains("/") then OpDiv(?,?)
    else None

let Parse (s : string) : (Expr option) =
    match s with
    | Number      -> Num(System.Double.Parse(s))
    | OpAdd(l,r)  -> Add(Parse l,Parse r)
    | OpSub(l,r)  -> Sub(Parse l,Parse r)
    | OpMul(l,r)  -> Mul(Parse l,Parse r)
    | OpDiv(l,r)  -> Div(Parse l,Parse r)
    | None        -> None

//END: The Evaluate function, fortunately, is not broken (I think). :)

let Evaluate s : double =
    match Parse s with
    | Some expr -> Eval expr
    | None      -> failwith "Invalid expression."

Obviously, my partial active pattern and my Parse function are in deep trouble. :)

I guess I have two questions. 1) What is a good way to solve this (short, concise, preferably no use of libraries so I can learn more)? 2) What is "the best" way to solve this?

(Oh, and I have looked at [link:www.devx.com] but that's a bit over my head at this point.)

Any help is greatly appreciated!

Recursive descent parsers are the simplest thing. The code below is a parser for a simple expression language with a left-associative '+', a right-associative '^', parens, and integers. This covers the most common patterns; after you write a few such parsers, you'll be able to write them in your sleep, provided you know the language grammar.

 

open System

type Expr =
    | Add of Expr * Expr
    | Pow of Expr * Expr
    | Num of int

// every parser takes the string and the current index, returns Expr and next index
let rec ParseExpr(s:string, i:int) =
    // Start with lowest operator precedence and work way up
    ParseLowPrecOp(s,i)
and ParseLowPrecOp(s:string, i:int) =
    // left-associative operators use a while loop to accumulate values of next-highest precedence
    let mutable e, j = ParseHighPrecOp(s,i)
    while j < s.Length && s.[j] = '+' do
        let e2, k = ParseHighPrecOp(s,j+1)
        j <- k
        e <- Add(e,e2)
    e, j
and ParseHighPrecOp(s:string, i:int) =
    // right-associative operators parse one expr of next-highest precedence, then recurse as needed to accumulate
    let leftExpr, j = ParseParenExpr(s,i)
    if j >= s.Length || s.[j] <> '^' then
        leftExpr, j
    else
        let rightExpr, k = ParseHighPrecOp(s,j+1)
        Pow(leftExpr, rightExpr), k
and ParseParenExpr(s:string, i:int) =
    // parentheses bind tightest
    if i < s.Length && s.[i] = '(' then
        let e, j = ParseExpr(s, i+1)
        if j >= s.Length || s.[j] <> ')' then
            failwith "parse error"
        else
            e, j+1
    else
        // if gone through all ops, literals are only remaining forms
        ParseNum(s, i)    
and ParseNum(s:string, i:int) =
    if i >= s.Length || not(Char.IsDigit(s.[i])) then
        failwith "parse error"
    let mutable j = i
    // stop at the end of the string as well as at the first non-digit
    while j < s.Length && Char.IsDigit(s.[j]) do
        j <- j + 1
    Num(int(s.Substring(i, j-i))), j

let s = "1+2+3^4+5+6+((7+8)^9^10)"
printfn "%s" s
printfn "%A" (ParseExpr(s,0))
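
The same two patterns can be carried over to the calculator grammar from the question. What follows is only a rough sketch under some assumptions: the input has no whitespace and no unary minus, '+' and '-' share the lowest precedence, '*' and '/' bind tighter, and all four are left-associative. The names ParseSum, ParseProduct, ParseAtom and TryParseExpr are made up for this illustration; they are not from the question or from any library.

```fsharp
open System
open System.Globalization

type Expr =
    | Num of double
    | Add of Expr * Expr
    | Sub of Expr * Expr
    | Mul of Expr * Expr
    | Div of Expr * Expr

// Each parser takes the string and a start index, returns Expr and next index.
// Assumption: no whitespace and no unary minus in the input.
let rec ParseSum (s: string, i: int) : Expr * int =
    // lowest precedence: left-associative '+' and '-'
    let first, next = ParseProduct (s, i)
    let mutable e = first
    let mutable j = next
    while j < s.Length && (s.[j] = '+' || s.[j] = '-') do
        let op = s.[j]
        let rhs, k = ParseProduct (s, j + 1)
        e <- (if op = '+' then Add(e, rhs) else Sub(e, rhs))
        j <- k
    e, j
and ParseProduct (s: string, i: int) : Expr * int =
    // next precedence level: left-associative '*' and '/'
    let first, next = ParseAtom (s, i)
    let mutable e = first
    let mutable j = next
    while j < s.Length && (s.[j] = '*' || s.[j] = '/') do
        let op = s.[j]
        let rhs, k = ParseAtom (s, j + 1)
        e <- (if op = '*' then Mul(e, rhs) else Div(e, rhs))
        j <- k
    e, j
and ParseAtom (s: string, i: int) : Expr * int =
    if i < s.Length && s.[i] = '(' then
        // parenthesized sub-expression: restart at the lowest precedence
        let e, j = ParseSum (s, i + 1)
        if j >= s.Length || s.[j] <> ')' then failwith "parse error: expected ')'"
        e, j + 1
    else
        // number literal: a run of digits and '.' characters
        let mutable j = i
        while j < s.Length && (Char.IsDigit(s.[j]) || s.[j] = '.') do
            j <- j + 1
        if j = i then failwith "parse error: expected a number"
        Num(Double.Parse(s.Substring(i, j - i), CultureInfo.InvariantCulture)), j

// Returns None for malformed input or leftover characters,
// so it has the same shape as the Parse function in the question.
let TryParseExpr (s: string) : Expr option =
    try
        let e, j = ParseSum (s, 0)
        if j = s.Length then Some e else None
    with _ -> None

printfn "%A" (TryParseExpr "1+2*(3-4)/5")
```

Since TryParseExpr returns an Expr option, it should slot in where the question's Evaluate function calls Parse, with Eval left unchanged.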

Parsing is a whole sub-field of CS, one of the oldest and most studied. Some would say it is a "solved problem", but it is still a very rich and active area of CS innovation (especially regarding error diagnostics for failed parses, error recovery, etc.). I am personally a great fan of the monadic parser combinator approach, a la

[link:lorgonblog.spaces.live.com]

but there are a variety of parsing strategies, including recursive descent parsers (like the one above), table-driven parsers (like those created by lex/yacc DSLs), and EDSLs like monadic parser combinators, all of which have various strengths and weaknesses along such axes as performance, diagnostics, and the classes of languages they can easily handle.
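
To give a flavor of the combinator style, here is a very small sketch. It is not the code from the linked post; the Parser type and the names satisfy, ret, bind, map, many1, number and sum are all invented for this illustration, and it evaluates a sum of integers directly instead of building a tree, just to keep it short.

```fsharp
open System

// A parser takes the input and a start index; on success it returns
// a value plus the index just past what it consumed.
type Parser<'a> = string * int -> ('a * int) option

// Succeeds on a single character that satisfies the predicate.
let satisfy (pred: char -> bool) : Parser<char> =
    fun (s: string, i: int) ->
        if i < s.Length && pred (s.[i]) then Some (s.[i], i + 1) else None

// Succeeds without consuming anything.
let ret (x: 'a) : Parser<'a> =
    fun (_: string, i: int) -> Some (x, i)

// Monadic bind: run p, then let its result choose the next parser.
let bind (p: Parser<'a>) (f: 'a -> Parser<'b>) : Parser<'b> =
    fun (s: string, i: int) ->
        match p (s, i) with
        | Some (x, j) -> f x (s, j)
        | None -> None

// Transforms the result of a parser.
let map (f: 'a -> 'b) (p: Parser<'a>) : Parser<'b> =
    bind p (fun x -> ret (f x))

// One or more repetitions of p.
let rec many1 (p: Parser<'a>) : Parser<'a list> =
    bind p (fun x ->
        fun (s: string, i: int) ->
            match many1 p (s, i) with
            | Some (xs, j) -> Some (x :: xs, j)
            | None -> Some ([x], i))

// number := digit+
let number : Parser<int> =
    many1 (satisfy Char.IsDigit)
    |> map (fun (ds: char list) -> int (System.String(List.toArray ds)))

// sum := number ('+' number)*, folded to the left as it goes.
let sum : Parser<int> =
    bind number (fun first ->
        let rec rest (acc: int) : Parser<int> =
            fun (s: string, i: int) ->
                let plusThenNumber = bind (satisfy (fun c -> c = '+')) (fun _ -> number)
                match plusThenNumber (s, i) with
                | Some (n, j) -> rest (acc + n) (s, j)
                | None -> Some (acc, i)
        rest first)

printfn "%A" (sum ("1+22+3", 0))
```

The point of the style is that number and sum are assembled from small reusable pieces (satisfy, bind, many1) rather than written as hand-rolled index loops.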

The parsing fool,
Brian

By brianmcn on 3/9/2010 10:51 PM

Thanks Brian!

I will try working through (and understanding!) that code.

If I understand correctly (I'm a software+electrical engineering major, not a computer science major ;)), "combinatorial" parsers are those that are combined from simple parser functions (like those in chapter 8 of this book: [link:www.cs.nott.ac.uk]). There, the parser is combined from a series of small and individually simple functions. That's very appealing to me; the problem with the parser above is that I have a hard time decomposing it.

Also, from a brief excursion on Wikipedia, I believe I'm going after a top-down parser; is that correct?

[link:en.wikipedia.org]

Well, I'm off to try this out. Thanks again!

By LarsKemmann on 3/10/2010 6:07 AM
