Cannot compute minimal length of a parser - uu-parsinglib in Haskell

Asked 16/8, 2013 at 13:59 Answered 17/8, 2013 at 20:36

Solved parsing haskell parsec parser-combinators uu-parsinglib

Lets see the code snippet:

pSegmentBegin p i   = pIndentExact i *> ((:) <$> p i <*> ((pEOL *> pSegment p i) <|> pure []))

if I change this code in my parser to:

pSegmentBegin p i   = do
    pIndentExact i
    ((:) <$> p i <*> ((pEOL *> pSegment p i) <|> pure []))

I've got an error:

canot compute minmal length of a parser due to occurrence of a moadic bind, use addLength to override

I thought the above parser should behave the same way. Why this error can occur?

EDIT

The above example is very simple (to simplify the question) and as noted below it is not necessary to use do notation here, but the real case I wanted it to use is as follows:

pSegmentBegin p i   = do
    j <- pIndentAtLast i
    (:) <$> p j <*> ((pEOL *> pSegments p j) <|> pure [])

I have noticed that adding "addLength 1" before the do statement solves the problem, but I'm unsure if its a correct solution:

pSegmentBegin p i   = addLength 2 $ do
    j <- pIndentAtLast i
    (:) <$> p j <*> ((pEOL *> pSegments p j) <|> pure [])

Beater answered 16/8, 2013 at 13:59 Comment(0)

As I have mentioned many times the monadic interface should be avoided whenever possible. let me try to explain why the applicative interface is to be preferred.

One of the distinctive features of my library is that it performs error correction by inserting or deleting problems. Of course we can take an umlimited look-ahead here but that would make the process VERY expensive. So we take only a limited lookahead of three steps.

Now suppose we have to insert an expression and one of the expression alternatives is:

expr := "if" expr "then" expr "else" expr

then we want to exclude this alternative since choosing this alternative would necessitate the insertion of another expression etc. So we perform an abstract interpretation of the alternatives and make sure that in case of a draw (i.e. equal costs for the limited lookahead) we take one of the non-recursive alternatives.

Unfortunately this scheme breaks down when one writes monadic parsers, since the length of the right hand side of the bind may depend on the result of the left-hand side. So we issue the error message, and ask some help from the programmer to indicate the number of tokens this alternative might consume. The actual value does not matter so much, as long as you do not provide a finite length for something which is recursive and may lead to infinite insertions. It is only used to select the shortest alternative in case of an insertion.

This abstract interpretation has some costs and if you write all your parsers in monadic style it is unavoidable that this analysis is repeated over an over again. so: DO NOT WRITE MONADIC STYLE PARSERS WHEN USING THIS LIBRARY IF THERE IS AN APPLICATIVE ALTERNATIVE.

Mordy answered 17/8, 2013 at 19:0 Comment(3)

Thank you! It clarifies a lot. I think in my case I have to use monadic style parser combinator (pSegmentBegin consumes spaces, count the new indentation level and then forces all lines below to have this indentation used), so it is not possible to write it in "pure applicative" style – Beater 17/8, 2013 at 22:51

According to your question about StackOverflow - I do not know if there is any question subscription available - if there is I havent used it so far, but I think, posting questions here is very good idea - when I was searching anything about uu-parsinglib - the only results I found were on StackOverflow and additional a lot of programmers ude it - it is a lot more accessible than any mailing list. – Beater 17/8, 2013 at 22:54

Hi @Doaitse! You can get email notifications of all future SO questions which include uu-parsinglib tag by hovering over the tag and clicking "Subscribe". I'll edit your question to clean up irrelevant points and sharpen the punchline :) All the best. – Puncheon 15/1, 2015 at 13:31

It's trying to statically analyze how much input needs to be read in order to optimize performance, but that kind of optimization requires a statically known parser structure—the kind that can be built by Applicatives since the parser effect cannot depend upon the parser value such what (>>=) does.

So that's what goes wrong—when you use do notation it translates to a Monadic bind which breaks the Applicative predictor. It'd be nice if the library only exposed one of the two interfaces so that this kind of error cannot happen, but instead there's some inconsistency if you use both interfaces together in the same parser.

Since this use of do is strictly unnecessary—you're not using the extra power the monadic interface gives you—it's probably better to just avoid it.

Godship answered 16/8, 2013 at 16:50 Comment(6)

Aren't (>>) and (*>) supposed to be equivalent? – Alchemy 16/8, 2013 at 16:55

There's an unwritten rule that Applicative and Monad instances should match, else things get confusing (like here), but if you're using properties of Applicative that you can get in Monad then it's possible to write an Applicative instance that cannot have a Monad. – Godship 16/8, 2013 at 17:32

It'll become a written rule when every Monad becomes Applicative. And you can prove that the default bindings for (>>) and (*>) are equivalent, using only the monad laws and the assumption that under the assumption that ap = (<*>). – Alchemy 16/8, 2013 at 19:18

Thank you for the answer :) Currently I wanted to use the "power of do style" (see the updated example). Did I solve it in proper way? – Beater 16/8, 2013 at 20:54

@Rhymoid it's precisely the assumption that ap == (<*>) that causes problems. When we have Applicative f => Monad f it'll be even more evident that those should align... but still just an unwritten rule. You need something dependently typed to make it compiler checked. – Godship 16/8, 2013 at 21:38

@J.Abrahamson: Thank you, you are right -in my original example the do was not necessery (I've oversimplied the example to be easy read), but I've updated it already :) – Beater 17/8, 2013 at 23:15

I have a workaround I use with monadic parsers in uuparsinglib. Its a self-answer here: Monadic parse with uu-parsinglib

You may find it useful

Mane answered 17/8, 2013 at 20:36 Comment(0)

Recommended topics

Hot tags