Is the project still active ? #147

sleroy · 2021-02-02T13:36:16Z

Hi, I don't know if the project is still active.

I am a user of Parboiled and I would like to lend a hand in maintaining or upgrading the framework.

Sylvain

sirthias · 2021-02-02T13:41:02Z

Hi Sylvain,
the project is not really active anymore.
A lot has happened since it was started 12 years ago.

What exactly would you need it for?

sleroy · 2021-02-02T13:56:12Z

I use it any time I need to write a small parser, and recently even a big one (ActionScript 3). I did not find anything close to Parboiled apart from the SonarQube/SonarSource parsing framework.

I am quite fluent in parsing, and I like the test-friendly approach of your framework, which allows an incremental writing process, in contrast to ANTLR, JavaCC, etc.

nmcb · 2021-02-02T14:03:07Z

fyi: after having used (and learned from the excellent) parboiled library up till about 4 years ago, i switched to the (also) excellent (scala) fastparse. https://www.lihaoyi.com/fastparse/

sirthias · 2021-02-02T14:07:45Z

Yes, fastparse is great and can be a nice solution if you are writing parsers in Scala.

@sleroy Are you using parboiled's Java or Scala side?

sleroy · 2021-02-02T14:19:13Z

I am using the Java side. Most projects I am dealing with are written in Java or a Java-compatible language.

sleroy · 2021-02-02T14:20:15Z

One of the modifications I would like to make is either upgrading ASM or embedding it, to avoid conflicts with Spring, Hibernate, etc.

sirthias · 2021-02-02T14:23:16Z

Unfortunately I haven't been writing Java for many years now and, as such, am totally out of touch with the latest developments with regard to the language and the library eco-system.
But I'd be more than happy to support a fork and further development on your side, similarly to what I've already done with pegdown.

So, if you'd like to take over: Just fork, hack and cut a new release and I'll happily put in a pointer and promotion, if you'd like.

sleroy · 2021-02-02T14:25:08Z

Thank you, that's really nice of you. By the way, it's sad that the blog articles have been deleted or removed from the blog (in the Wiki).

Garagoth · 2021-03-13T18:35:49Z

Hi,
Can you recommend any other Java project that is at least a bit similar to parboiled?
Bonus if there is a not very complicated migration path from parboiled rules (which I have used extensively to parse somewhat structured outputs of various commands and configuration dumps). I am having trouble googling anything even remotely similar...
Regards,
Garagoth.

sleroy · 2021-03-13T20:03:30Z

@Garagoth I am still using Parboiled for this purpose. Scala alternatives have been suggested by @sirthias.
You could write them using ANTLR, JavaCC, or the SonarSource parsing framework if you have some time to invest.

lukehutch · 2021-06-27T00:02:50Z

> Can you recommend any other Java project that is at least a bit similar to parboiled?

I have developed a packrat PEG parser (called the pika parser) that works bottom-up, and has some interesting properties (see the linked paper for details):

https://github.com/lukehutch/pikaparser

I am currently working on a new packrat PEG parser (the squirrel parser) that works top-down. However, I haven't written documentation yet for it (I'm writing the paper for this parsing algorithm now):

https://github.com/lukehutch/squirrelparser

Both of the above parsers fully support both direct and indirect left recursion. The benefit of the pika parser is that it supports optimal error recovery, because it works bottom-up, so it can find all grammatically-correct structure fragments, no matter what sort of syntax error is present. The benefit of the squirrel parser is that, at least according to my benchmarks so far, it is the fastest PEG parser for the JVM ecosystem.

zhong-j-yu · 2021-08-19T21:41:34Z

(shameless plug) I have written a PEG parser generator for Java 17 that derives grammar rules from datatypes of parse trees. See Rekex.

The basic idea is that alternation/concatenation grammar rules correspond to sum/product datatypes, or sealed/record types in Java; therefore the datatypes of the parse tree can reflect the grammar precisely, and it is not necessary to define the grammar rules as a separate step. The constructors of the datatypes are used to construct tree nodes; with record types this is very succinct.
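As a rough illustration of the sealed/record correspondence described here, the sketch below models a tiny grammar with Java 17 types. It does not use Rekex's actual API: the grammar, the class name `DatatypeGrammarSketch`, and the hand-written recursive descent parser standing in for a generated one are all assumptions made for the example.

```java
class DatatypeGrammarSketch {
    // Alternation  Expr <- Add / Num  maps to a sealed interface (sum type).
    sealed interface Expr permits Add, Num {}

    // Concatenation  Add <- Num '+' Expr  maps to a record (product type).
    record Add(Num left, Expr right) implements Expr {}

    // Terminal rule  Num <- [0-9]+
    record Num(int value) implements Expr {}

    // Hand-written stand-in for the parser a generator could derive from the types.
    private final String input;
    private int pos = 0;

    private DatatypeGrammarSketch(String input) { this.input = input; }

    static Expr parse(String text) {
        DatatypeGrammarSketch p = new DatatypeGrammarSketch(text);
        Expr e = p.expr();
        if (e == null || p.pos != text.length()) {
            throw new IllegalArgumentException("syntax error at position " + p.pos);
        }
        return e;
    }

    // Expr <- Add / Num : PEG ordered choice, backtracking if Add fails.
    private Expr expr() {
        int start = pos;
        Add add = add();
        if (add != null) return add;
        pos = start;
        return num();
    }

    // Add <- Num '+' Expr : returns null on failure.
    private Add add() {
        Num left = num();
        if (left == null || pos >= input.length() || input.charAt(pos) != '+') return null;
        pos++;
        Expr right = expr();
        return right == null ? null : new Add(left, right);
    }

    // Num <- [0-9]+ : returns null if no digit is present.
    private Num num() {
        int start = pos;
        while (pos < input.length() && Character.isDigit(input.charAt(pos))) pos++;
        return pos == start ? null : new Num(Integer.parseInt(input.substring(start, pos)));
    }

    // The parse tree is ordinary data, so it can be processed with plain Java.
    static int eval(Expr e) {
        if (e instanceof Add a) return eval(a.left()) + eval(a.right());
        return ((Num) e).value();
    }

    public static void main(String[] args) {
        System.out.println(eval(parse("1+2+3"))); // prints 6
    }
}
```

The record constructors double as the tree-node constructors, which is the succinctness the comment refers to.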

Note that in parboiled (and others like jparsec), rules are Java objects, which raises a question: how does a rule reference itself recursively? Some magic is needed to build a cyclic object graph.
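To make the self-reference problem concrete, here is a minimal sketch of rules as plain Java objects. It is not parboiled's or jparsec's API: the `Rule` interface, the combinators, and the `lazy` helper are hypothetical names invented for the example, and real libraries may solve this differently (the thread mentions parboiled's use of ASM).

```java
import java.util.function.Supplier;

class RecursiveRuleSketch {
    /** A rule returns the position after a successful match, or -1 on failure. */
    interface Rule { int match(String in, int pos); }

    static final Rule EMPTY = (in, pos) -> pos;

    static Rule ch(char c) {
        return (in, pos) -> pos < in.length() && in.charAt(pos) == c ? pos + 1 : -1;
    }

    static Rule seq(Rule... rules) {
        return (in, pos) -> {
            int p = pos;
            for (Rule r : rules) {
                p = r.match(in, p);
                if (p < 0) return -1;
            }
            return p;
        };
    }

    static Rule firstOf(Rule a, Rule b) {
        return (in, pos) -> {
            int r = a.match(in, pos);
            return r >= 0 ? r : b.match(in, pos);
        };
    }

    /** Looks up the target rule only at match time, which breaks the construction cycle. */
    static Rule lazy(Supplier<Rule> target) {
        return (in, pos) -> target.get().match(in, pos);
    }

    // Parens <- '(' Parens ')' / empty
    // Calling parens() directly inside its own body would recurse forever while
    // the rule object is being built; lazy(...) defers the call until matching.
    static Rule parens() {
        return firstOf(seq(ch('('), lazy(RecursiveRuleSketch::parens), ch(')')), EMPTY);
    }

    static boolean matchesAll(String in) {
        return parens().match(in, 0) == in.length();
    }

    public static void main(String[] args) {
        System.out.println(matchesAll("((()))")); // prints true
        System.out.println(matchesAll("(()"));    // prints false
    }
}
```

The lazy indirection rebuilds the nested rule on every match, which is simple but wasteful; caching the rule object is the usual refinement.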

In Rekex, rules are Java types, and types can reference themselves recursively. It is as natural to define a recursive type as to define a recursive grammar. The two share the same model with which we conceptualize a context-free language.

Status of the project: I'm pretty sure it's production ready; I've done a lot of testing myself. But the main concern right now is whether this new approach to parsing is adequate for real-world applications, and whether it will be accepted by the public. I would really appreciate any feedback, thanks!

lukehutch · 2021-08-19T23:38:06Z

> (shameless plug) I have written a PEG parser generator for Java 17 that derives grammar rules from datatypes of parse trees. See Rekex.

Wow, this is very cool, and I am designing a programming language right now that will do exactly this -- all algebraic data types will be able to be serialized and deserialized according to a bijective mapping between their field values and some syntax.

I like how you used parameter annotations on record parameters to achieve this with Java! That's a genius idea.

> Note that in parboiled (and others like jparsec), rules are Java objects, which raises a question - how does a rule reference itself recursively. Some magics are needed to build a cyclic object graph.

This is also known as the left recursion problem. Specifically, a top-down parsing function cannot recurse directly or indirectly into itself, if the parser does not make forward progress by consuming at least one character between nested recursive calls to the same function.
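A small sketch of the failure mode just described, together with the classic workaround of rewriting the rule as a loop. This is the textbook grammar rewrite, not the approach used by the pika or squirrel parsers; the class and method names are made up for the example.

```java
class LeftRecursionSketch {
    private final String in;
    private int pos = 0;

    LeftRecursionSketch(String in) { this.in = in; }

    // Left-recursive rule:  Expr <- Expr '-' Num / Num
    // The method calls itself before consuming any input, so every nested call
    // starts at the same position and the recursion never bottoms out.
    int exprLeftRecursive() {
        int left = exprLeftRecursive(); // never returns normally
        if (pos < in.length() && in.charAt(pos) == '-') {
            pos++;
            return left - num();
        }
        return left;
    }

    // Equivalent rule without left recursion:  Expr <- Num ('-' Num)*
    int exprIterative() {
        int value = num();
        while (pos < in.length() && in.charAt(pos) == '-') {
            pos++;
            value -= num(); // fold to the left, so 10-3-2 means (10-3)-2
        }
        return value;
    }

    // Num <- [0-9]+
    int num() {
        int start = pos;
        while (pos < in.length() && Character.isDigit(in.charAt(pos))) pos++;
        if (pos == start) throw new IllegalArgumentException("digit expected at " + pos);
        return Integer.parseInt(in.substring(start, pos));
    }

    public static void main(String[] args) {
        System.out.println(new LeftRecursionSketch("10-3-2").exprIterative()); // prints 5
        try {
            new LeftRecursionSketch("10-3-2").exprLeftRecursive();
        } catch (StackOverflowError e) {
            System.out.println("left-recursive version overflowed the stack");
        }
    }
}
```

The rewrite keeps left associativity by folding inside the loop, but the parse tree shape no longer mirrors the original rule, which is one reason direct support for left recursion is attractive.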

> Status of the project - I'm pretty sure it's production ready; I've done a lot of testing myself. But the main concern right now is whether this new approach to parsing is adequate for real world application, whether it is acceptable by the public. I would really appreciate any feedback, thanks!

Structurally it's a brilliant way to define a grammar, in my opinion, because it uses existing language features. But how do you solve left recursion in Rekex?

There are several complex workarounds for the left recursion problem. Both the parsers I link above have (different) very clean solutions to this problem. We can discuss more over there if you are interested: https://github.com/lukehutch/squirrelparser/discussions

zhong-j-yu · 2021-08-20T01:23:07Z

Thank you, Luke. Serializing an AST to text is an interesting topic, I'll keep up with your project.

Left recursion would cause my parser to overflow the stack at runtime, as you would expect :) Note that the official PEG paper frowns upon left recursion: a well-formed grammar is one that contains no directly or mutually left-recursive rules. Apparently there are some techniques that can deal with left-recursive PEGs automatically, but I have not looked into them closely. In any case, the grammar itself, derived from datatype definitions, can allow left recursion; it is up to the parser implementation how to handle it, and I may provide other parser implementations in the future besides the current recursive descent implementation.

What's more important is the realization that there is a relationship between grammar rules and datatypes, an idea that can be applied to other types of grammars. Rekex picks PEG as a practical matter.

binkley · 2021-08-29T03:20:26Z

I have an OSS project relying on Parboiled. It is not yet unmaintainable with current Java versions, but it is approaching that point.

Will this project get updated?
Should I migrate to another project?

I am rather comfortable with the API I get from Parboiled, and would be disappointed to lose that.

kinow · 2021-11-18T05:44:52Z

I'm using it too, trying to replace a regular expression parser written ages ago for tap4j (used in a Jenkins plugin too). It would be great if pull requests and fixes could be applied, @sirthias. Maybe offer co-maintainership to some frequent collaborators? Thanks for parboiled anyway!

sirthias · 2021-11-18T08:54:35Z

@kinow Thank you!
See my comment above: #147 (comment)

kinow · 2021-11-18T09:24:08Z

> @kinow Thank you! See my comment above: #147 (comment)

Hi @sirthias

I saw that comment about a fork 🙂

What I was trying to suggest was to keep using the repository and maven groupId/artifactId, if possible. I've seen other projects being forked successfully, but also a fair share that had a few forks active but that stalled or didn't form enough community. And it would be a shame if the same happened to parboiled. I'm truly enjoying using its API, the documentation is really great, it appears to have a good user base and community, and the code appears to be good too from the little I could see.

Thanks!
Bruno

sleroy · 2021-11-18T15:25:33Z

@kinow

I made a fork, which I am currently using in some projects, at https://github.com/byoskill/parboiled. However, I am not publishing to the parboiled repository.

Since I am more of a Java user than a Scala user, my focus is on the former.

I am basically keeping it working with the latest versions of Java and adding some small features (@FunctionalInterface) to ease writing parsers in Java.

imagingbook · 2023-02-07T15:18:42Z

Thanks for sharing parboiled - the Java implementation is brilliant and I would not want to see it go! Unfortunately, as others noted before, its artifacts cannot be loaded as dependencies in a modular (Maven) project because of overlapping packages. I therefore also made a fork, plus some minimal refactoring to make it modular, mainly for my own purposes, but anyone interested can find it at https://github.com/imagingbook/parboiled-modular.

One additional minor modification is the use of binary search for the contains() method of character sets (Characters). Seems natural, not sure it makes much difference in practice.
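For readers unfamiliar with the idea, a binary-search membership test over a sorted character array could look roughly like the sketch below. This is not the code from the fork: `SortedCharSet` is a hypothetical stand-in for parboiled's `Characters` class, whose internals are not shown in this thread.

```java
import java.util.Arrays;

class SortedCharSet {
    private final char[] chars; // kept sorted so that binary search is valid

    SortedCharSet(String members) {
        chars = members.toCharArray();
        Arrays.sort(chars);
    }

    /** O(log n) membership test; a linear scan over the array would be O(n). */
    boolean contains(char c) {
        return Arrays.binarySearch(chars, c) >= 0;
    }

    public static void main(String[] args) {
        SortedCharSet vowels = new SortedCharSet("uoiea");
        System.out.println(vowels.contains('e')); // prints true
        System.out.println(vowels.contains('z')); // prints false
    }
}
```

For the small sets typical of hand-written grammars the difference from a linear scan is likely modest, which matches the caveat in the comment above.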

eabase mentioned this issue May 17, 2021

[JDK9] Illegal reflective access by org.parboiled.transform.AsmUtils to method java.lang.ClassLoader.findLoadedClass(java.lang.String) #109

Closed

binkley mentioned this issue Feb 13, 2023

How close to a drop-in replacement? imagingbook/parboiled-modular#6

Closed
