Hacker Read

jgalt212 · 2017-11-17 15:08:30

a parser that is 97% correct is broken.

likeclockwork | karma 713 | avg karma 1.35 · | 2013-02-19 17:01:54

My parser seems to be broken.

auggierose | karma 4751 | avg karma 1.77 · | 2021-04-07 08:32:13

Now that's a pretty big fail for a parser generator, as it should be at its best ESPECIALLY when the parse gets complicated.

nijiko | karma 219 | avg karma 1.39 · | 2014-05-05 19:01:13+00:00

The parser needs to be rewritten really.

Vivtek | karma 10545 | avg karma 3.96 · | 2016-06-01 13:54:31

Well, that is just choosing the semantically correct parse from multiple syntactically correct parses. This parser isn't even finding syntactically correct parses.

vesinisa | karma 5581 | avg karma 5.92 · | 2019-11-08 16:59:51

Nevertheless, the parser failing at only 100 levels of nesting is shockingly bad.

babuskov | karma 5151 | avg karma 4.35 · | 2016-06-10 10:33:30

I looked at the code... writing my own parser would be faster than fixing it. But that's beside the point. I decided to use a 3rd party library because I did not want to invest my time into that. The moment I had to even look at the source code, that was broken.

thfuran | karma 7179 | avg karma 1.96 · | 2024-02-21 21:00:43

Yeah, natural languages don't have a specification or canonical parser implementation, so they cannot be reliably parsed.

andrewflnr | karma 11547 | avg karma 2.11 · | 2018-10-16 02:19:05+00:00

I don't think you actually disagree with the author. I think they would basically agree with everything you wrote and just add on, "therefore write your parser so it actually does recover correctly." Which is what most of the post boils down to.

ralphb | karma 141 | avg karma 3.71 · | 2022-11-21 11:03:10

I'm confused and very far from an expert here. What is wrong with parsers, and what is the alternative?

Kurtz79 | karma 3355 | avg karma 2.92 · | 2017-11-20 10:26:31

Tell me of some parsers that do not deal with deterministic inputs and have 100% accuracy, then.

badlogic | karma 823 | avg karma 5.56 · | 2016-05-23 06:22:12

The parsers available are good enough for English. Sadly, that's absolutely not true for other languages.

jheriko | karma 1542 | avg karma 0.94 · | 2023-07-19 17:19:40

Good to see someone avoiding horrible parser generators, even if the code is ugly, poorly styled, and bug prone.

"In short, there are a few reasons that parsing is a mess, and none of those reasons are actually resolvable by parser generators."

I'm pretty sure this is untrue /and/ part of the problem. Build quality on these tools is appalling...

reply

xapata | karma 5059 | avg karma 1.56 · | 2021-02-27 06:50:09+00:00

If four popular parsers all had serious bugs, 6 years seems not too shabby.

grabcocque | karma 2424 | avg karma 6.43 · | 2017-03-20 14:43:06+00:00

It's a shame parsers are such a PITA to write. So many problems could be trivially solved if writing a grammar and generating a parser for it were in any way a pleasant process.

robryan | karma 6222 | avg karma 1.7 · | 2012-09-01 20:13:18

Without having investigated it, I would guess that the parser isn't abstract and modular enough so they end up with a mess of code trying to handle all the different possible combinations of syntax.

mariusmg | karma 1780 | avg karma 3.02 · | 2016-01-13 13:09:38+00:00

They have a parser , not a entire compiler

Goladus | karma 4410 | avg karma 1.97 · | 2014-01-10 04:21:39

the parser is usually really good at spotting those errors and once they've been pointed out are trivial to correct.

Go0the0gophers | karma -9 | avg karma -0.45 · | 2018-09-11 12:47:05+00:00

Just fuzz every parser you write. And the problem is solved.

cratermoon | karma 12830 | avg karma 2.05 · | 2023-01-07 20:10:51

So what you're saying is that, except for all the times you come back to work on the code because something broke, it works reliably? Nothing about the parser could be improved so that it doesn't break on data format changes? Nothing could be improved such that instead of alerting you to failures, it could be pro-actively adjusted to accept new formats before something fails?