Refactored and Optimized Logic:: Parser Logic, Latex Based Output & Shared Modules #283

IIITM-Jay · 2024-09-16T19:49:08Z

This PR refactors parse.py to improve maintainability and scalability.

Scopes Covered

Modularization: Breaking the code into small, maintainable helper functions that are easy to read
Optimization: Reducing the number of lines of code with techniques such as list comprehensions
Removing Redundancy: Removing duplication and reusing shared logic
Refactoring: Splitting one long script into dedicated scripts for scalability
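
As a small illustration of the list-comprehension point above, here is a sketch (not code taken from this PR) of collapsing a filtering loop into a single expression. The function names and the `#` comment convention are assumptions made for the example.

```python
def strip_comments_loop(lines):
    """Loop version: keep non-blank lines that are not comments."""
    kept = []
    for line in lines:
        line = line.strip()
        if line and not line.startswith("#"):
            kept.append(line)
    return kept


def strip_comments(lines):
    """Same behaviour as the loop version, written as a list comprehension."""
    stripped = (line.strip() for line in lines)
    return [s for s in stripped if s and not s.startswith("#")]
```

Both functions return the same list for the same input; the second simply has fewer moving parts to maintain.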

Approach Followed

parse.py now contains only the main function, which calls the respective scripts to generate output for the selected extensions.
Common functions are moved to shared_utils.py so that the various extension scripts can reuse them.
The LaTeX output is refactored and optimized in latex_utils.py.
parse.py and shared_utils.py are refactored, modularized, and optimized as well.
A separate dedicated script is created for each extension.
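
The layout described above, a thin main function that hands off to one module per output, could look roughly like the sketch below. This is not the code from this PR: the generator functions are hypothetical stand-ins for modules such as latex_utils.py and c_utils.py, and the flag names are assumed.

```python
import sys


# Hypothetical stand-ins for the per-extension modules (latex_utils.py, c_utils.py, ...).
def make_latex(instr_dict):
    return f"latex table with {len(instr_dict)} instructions"


def make_c(instr_dict):
    return f"C header with {len(instr_dict)} instructions"


# Map a command-line flag to the generator that handles it (flag names are assumed).
GENERATORS = {
    "-latex": make_latex,
    "-c": make_c,
}


def main(argv, instr_dict):
    """Run every generator whose flag appears in argv; results follow GENERATORS order."""
    return [gen(instr_dict) for flag, gen in GENERATORS.items() if flag in argv]


if __name__ == "__main__":
    # Placeholder for the dictionary that shared_utils.py would build.
    demo_dict = {"add": {}, "sub": {}}
    for result in main(sys.argv[1:], demo_dict):
        print(result)
```

With a table like `GENERATORS`, adding a new output means adding one module and one dictionary entry, while `main` stays unchanged.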

Needs to be done

The code for the other extension outputs (everything except LaTeX) has only been moved from parse.py into its own module, for example c_utils.py for generating C output. It still requires improvements and enhancements.
In shared_utils.py, the method create_inst_dict() is yet to be refactored.


IIITM-Jay · 2024-09-16T19:53:56Z

Hi @aswaterman and @rpsene, I have made an attempt to refactor parse.py. For now, only latex_utils.py, shared_utils.py, and parse.py are refactored, optimized, and modularized.

The test cases are failing because tests.py also needs to be modified to match the refactored code, once it is accepted.
P.S. For the time being, I have imported the shared module methods so that the test cases can run; all checks now pass.

Requesting feedback on the modified code and suggestions on how best to achieve maintainability and scalability.

IIITM-Jay · 2024-09-21T07:31:07Z

Hi @aswaterman, in addition to the written explanation of the parsing logic in the "Flow of parse.py" section of the README, I believe including flowcharts would greatly enhance the clarity and readability of the process. By visualizing the flow, readers can get a clearer understanding of the key steps involved in parsing instruction encodings.

For example, there are three main steps, each with its own sequence of procedures:
1. In the first pass, we cover only the regular instructions and follow the steps below:

flowchart TD
    A[Start: parse.py] --> B[Create list of all rv* files]
    B --> C{File contains regular instructions?}
    C -->|Yes| D[Parse file line by line]
    D --> E[Perform checks on regular instructions]
    
    E --> F[Check 1: msb > lsb in range assignment]
    E --> G[Check 2: Value representable in range]
    E --> H[Check 3: No multiple assignments to same bit]
    E --> I[Check 4: All bit positions must be accounted for]
    
    F --> J[Pass checks?]
    G --> J
    H --> J
    I --> J
    
    J -->|Yes| K[Create dictionary for regular instruction]
    K --> L[Add encoding, extension, mask, match, variable_fields]
    L --> M[Add to instr_dict]
    
    M --> N[Process next regular instruction]
    N -->|All regular instructions processed| O[End of Regular Instruction Parsing]
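
The four checks in the first-pass flowchart can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation in parse.py: the token syntax (`msb..lsb=value`, `bit=value`, or a field name), the `FIELD_POSITIONS` table, and the function name `parse_encoding` are all hypothetical.

```python
import re

# Hypothetical (msb, lsb) positions for variable fields; assumed for this sketch only.
FIELD_POSITIONS = {"rd": (11, 7), "rs1": (19, 15), "rs2": (24, 20)}

RANGE_RE = re.compile(r"^(\d+)\.\.(\d+)=(0x[0-9a-fA-F]+|\d+)$")
SINGLE_RE = re.compile(r"^(\d+)=(0x[0-9a-fA-F]+|\d+)$")


def parse_encoding(tokens, width=32):
    """Return (encoding, mask, match, variable_fields) or raise ValueError."""
    bits = [None] * width  # index 0 holds the most significant bit
    variable_fields = []

    def claim(msb, lsb, chars):
        if msb >= width:
            raise ValueError(f"bit {msb} is outside a {width}-bit encoding")
        for offset, pos in enumerate(range(msb, lsb - 1, -1)):
            idx = width - 1 - pos
            if bits[idx] is not None:
                raise ValueError(f"bit {pos} assigned more than once")  # check 3
            bits[idx] = chars[offset]

    for tok in tokens:
        ranged = RANGE_RE.match(tok)
        single = SINGLE_RE.match(tok)
        if ranged:
            msb, lsb = int(ranged.group(1)), int(ranged.group(2))
            value = int(ranged.group(3), 0)
            if msb <= lsb:
                raise ValueError(f"{tok}: msb must be greater than lsb")  # check 1
        elif single:
            msb = lsb = int(single.group(1))
            value = int(single.group(2), 0)
        elif tok in FIELD_POSITIONS:
            msb, lsb = FIELD_POSITIONS[tok]
            claim(msb, lsb, "-" * (msb - lsb + 1))  # variable bits are shown as '-'
            variable_fields.append(tok)
            continue
        else:
            raise ValueError(f"unrecognised token {tok!r}")
        nbits = msb - lsb + 1
        if value >= (1 << nbits):
            raise ValueError(f"{tok}: value does not fit in {nbits} bits")  # check 2
        claim(msb, lsb, format(value, f"0{nbits}b"))

    if None in bits:
        raise ValueError("not every bit position is accounted for")  # check 4

    encoding = "".join(bits)
    mask = int("".join("0" if b == "-" else "1" for b in bits), 2)
    match = int(encoding.replace("-", "0"), 2)
    return encoding, mask, match, variable_fields
```

For instance, calling `parse_encoding(["rd", "rs1", "rs2", "31..25=0", "14..12=0", "6..2=0x0C", "1..0=3"])` passes all four checks, while dropping the last token fails check 4 because bits 1 and 0 are left unassigned.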

IIITM-Jay · 2024-09-21T07:33:26Z

2. In the second pass, we run the checks for pseudo_instr, following a similar procedure.

3. The last step is output generation:

flowchart TD
    A[Start Output Generation] --> B[Generate LaTeX tables]
    B --> C[Generate encoding.h file]
    C --> D[Generate other artifacts]

    D --> E[Output files generated]
    E --> F[End]

IIITM-Jay · 2024-09-21T07:35:44Z

@aswaterman and @rpsene, these flowcharts are Markdown-friendly. I think they will also help give an at-a-glance view of what we are doing. Let me know your feedback and suggestions on whether they would be a useful enhancement to the existing repository.

IIITM-Jay · 2024-09-25T17:19:05Z

@aswaterman and @rpsene, I have refactored and optimized the method used for creating the instruction dictionary.

P.S. The second point under the "Needs to be done" header in the PR description is now ticked (marked as completed).

Refactored and Optimized Logic:: Parser Logic, Latex Based Output & Shared Modules

1b0ef5d

IIITM-Jay requested review from aswaterman and rpsene September 16, 2024 19:49

modified test.py for running test cases

3dd1273

Optimized and modularized method for Instruction Dictionary

88e9809
