Refactored grammar to remove need for second transformer #385

sognetic · 2023-11-06T13:56:41Z

Hi,

as promised (with some delay) I've had another think about the feature introduced in #100 which slowed down processing quite a bit by introducing an unnecessary transformer. I've removed this transformer again and refactored the grammar such that the MODEL_NAME terminal doesn't need to include the white space / semicolon anymore but just matches on a word boundary (\b). This simplifies the grammar quite a bit and speeds up parsing of the LHCb / Belle II decay files by 15%.
I've removed the tests corresponding to the removed transformer, I'm not sure if / how I should replace them with something in this PR. All other tests pass.

Let me know what you think!

EDIT: I don't know why the pre_commit check in the CI doesn't work, everything is okay locally.

eduardo-rodrigues · 2023-11-08T11:33:01Z

Hello @sognetic, nice to hear back from you and thank you very much already for this new contribution 👍!

This PR pushed me to continue on task #357, see the latest #387 from today. One model in particular, rather unique in the way it span many lines, needed to be tested.

I will get to your PR early next week at the very latest. In the meantime can you rebase to (1) get some more important tests run and (2) get a fix for the pre-commit hook CI (failing for you as well abvove)? Thanks.

…el_alias_transformer_replacement

sognetic · 2023-11-08T14:30:00Z

Hi @eduardo-rodrigues!
Wow, #357 seems like a huge effort but also extremely useful to corral the EvtGen model zoo. I've merged in the current master branch so hopefully that fixes the failing CI thing. Everything still passes locally.

I've also worked on a second feature which removes the need to maintain two lists of models in both dec/enums.py and the grammar definition itself. I'll open a PR with this once this one is merged since I've based it on the grammar changes already included here.

eduardo-rodrigues · 2023-11-08T15:13:31Z

Sounds great!

Yeah, task #357 is a bit of a pain and repetitive work, but worth to ensure the parsing not only succeedes but also gives correctly parsed info. So far I've found 2-3 issues and made several improvements along the way. It does take time.

src/decaylanguage/data/decfile.lark

eduardo-rodrigues

Brilliant 👍!

I'm requesting trivial changes but that's trivial.

eduardo-rodrigues

Thanks again for this!

Refactored grammar to remove need for second transformer

d38a472

eduardo-rodrigues self-assigned this Nov 8, 2023

eduardo-rodrigues added the enhancement New feature or request label Nov 8, 2023

Merge branch 'master' of github.com:scikit-hep/decaylanguage into mod…

e8e5797

…el_alias_transformer_replacement

eduardo-rodrigues reviewed Nov 8, 2023

View reviewed changes

src/decaylanguage/data/decfile.lark Show resolved Hide resolved

eduardo-rodrigues requested changes Nov 8, 2023

View reviewed changes

Removing %import common.NEWLINE from the Lark grammar.

8a9cdf5

eduardo-rodrigues approved these changes Nov 9, 2023

View reviewed changes

eduardo-rodrigues merged commit d31b68a into scikit-hep:master Nov 9, 2023
10 checks passed

sognetic deleted the model_alias_transformer_replacement branch November 9, 2023 09:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactored grammar to remove need for second transformer #385

Refactored grammar to remove need for second transformer #385

sognetic commented Nov 6, 2023 •

edited

Loading

eduardo-rodrigues commented Nov 8, 2023

sognetic commented Nov 8, 2023

eduardo-rodrigues commented Nov 8, 2023

eduardo-rodrigues left a comment

eduardo-rodrigues left a comment

Refactored grammar to remove need for second transformer #385

Refactored grammar to remove need for second transformer #385

Conversation

sognetic commented Nov 6, 2023 • edited Loading

eduardo-rodrigues commented Nov 8, 2023

sognetic commented Nov 8, 2023

eduardo-rodrigues commented Nov 8, 2023

eduardo-rodrigues left a comment

Choose a reason for hiding this comment

eduardo-rodrigues left a comment

Choose a reason for hiding this comment

sognetic commented Nov 6, 2023 •

edited

Loading