Idea regarding the exhaustive integration testing of the parser #8

lagodiuk · 2016-05-15T14:39:39Z

As far as parser could handle potentially all possible context free grammars (CFG), and can parse all possible valid statements of the corresponding CFG, it was a bit unclear how to find the proper and exhaustive approach for testing the implementation of the parser.

Few days ago I've came up with an idea regarding the exhaustive testing of the parser.
Of course, as far as there is an infinite amount of CFGs, the idea of testing is based on randomization principle.

Hence, the algorithm:

Repeat multiple times:
1. Generate random CFG
1.1. Using generated CFG - produce the random valid statement, which belongs to the given CFG. It can be done using the recursive top-down approach (also, meanwhile, the top-down approach - helps to generate the correct parse tree as well).
1.2. Parse generated statement using the implementation of the parsing algorithm. Assert, that expected parse tree is present inside the generated parse forest (because, in case of ambiguous CFG - there might be generated multiple parse trees).
1.3. go to 1.1.

gitmh · 2016-05-15T15:16:26Z

We tried something similar just without generating random CFG. We started with a set of grammars at step 1.1. which was not so easy. Let's take a real world example for a calculator grammar like this:

S -> Input
Input      -> Expression | Input Expression 
Expression -> Expression * Expression 
            | Expression + Expression 
            | Expression - Expression 
            | Expression / Expression 
            | Number
Number     -> Digit | Number Digit 
Digit      -> 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

We used a weighted random generator for generating valid statements. So that it will be more likely to choose a path not yet chosen and so that the algorithm prefers rules that do not contain them self (Expression -> Number over Expression -> Expression ...). It is still not easy to configure the algorithm so that we get real world input statements (not too short, not too long and no infinit runtime).

As soon as we find a grammar not working with your parser implementation I'll post it here :-)

lagodiuk · 2016-05-15T15:55:05Z

@gitmh thank you for sharing your experience regarding the randomized testing, and thank you for collaboration!

lagodiuk changed the title ~~Idea regarding the Unit Testing of the Parser~~ Idea regarding the exhaustive Unit Testing of the Parser May 15, 2016

lagodiuk mentioned this issue May 15, 2016

Grammars with EPSILON issues #6

Closed

lagodiuk changed the title ~~Idea regarding the exhaustive Unit Testing of the Parser~~ Idea regarding the exhaustive integration testing of the parser May 15, 2016

lagodiuk added the enhancement label May 27, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Idea regarding the exhaustive integration testing of the parser #8

Idea regarding the exhaustive integration testing of the parser #8

lagodiuk commented May 15, 2016

gitmh commented May 15, 2016

lagodiuk commented May 15, 2016

Idea regarding the exhaustive integration testing of the parser #8

Idea regarding the exhaustive integration testing of the parser #8

Comments

lagodiuk commented May 15, 2016

gitmh commented May 15, 2016

lagodiuk commented May 15, 2016