mg-lc-parse

A basic left-corner parser for minimalist grammars, implemented in Python. The parsing algorithm is based on the following paper.

Usage

Basic Run

The main function is parse, defined in the module lc_parser.py. It takes an input sentence (a list of words) and returns a list of all possible parses. Each parse is a final, successful configuration reached by the parser, together with the history of rules applied to reach it.

Here is a basic template for using the parser:

g1 = MG('input/g1.json')
parser = LCParser(g1)
results = parser.parse(['Aca', 'knows', 'what', 'Bibi', 'likes'])

General flow

Load the grammar from a JSON file.
Create a parser object with the loaded grammar.
Parse a sentence:
1. Create a stack containing the initial configuration.
2. While the stack is not empty:
  1. Pop a configuration from the stack.
  2. Apply every applicable rule to the configuration.
  3. For each rule that produces a result, create a new configuration and push it onto the stack.
  4. If the configuration is successful, add it to the results list.
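
The loop above can be sketched roughly as follows. This is a minimal illustration, not the code in lc_parser.py: the Config shape, the search function, and the toy consume rule are hypothetical stand-ins for the real configurations and rules.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical configuration for illustration: (remaining input, rule history).
Config = Tuple[Tuple[str, ...], Tuple[str, ...]]
Rule = Callable[[Config], Optional[Config]]


def search(initial: Config, rules: List[Rule],
           is_success: Callable[[Config], bool]) -> List[Config]:
    """Stack-based search over configurations."""
    results: List[Config] = []
    stack: List[Config] = [initial]
    while stack:
        config = stack.pop()
        for rule in rules:
            new_config = rule(config)
            if new_config is not None:  # the rule produced a result
                stack.append(new_config)
        if is_success(config):
            results.append(config)
    return results


def consume(config: Config) -> Optional[Config]:
    """Toy rule: consume the next word, or return None if there is none."""
    remaining, history = config
    if not remaining:
        return None
    return remaining[1:], history + ("consume:" + remaining[0],)


# A configuration counts as successful here once the input is empty.
found = search((("a", "b"), ()), [consume], lambda c: not c[0])
```

In this toy setting there is a single rule, so exactly one successful configuration is found, and its history records the two applications of consume.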

Grammar Rules

Note that by default the parser loads its rules from the grammar. Empty-shift rules are shift rules in which the shifted lexical item is not consumed from the remaining input. For each empty lexical item in the grammar (for example '': ["=v,c", "=v,+wh,c"]), the corresponding empty-shift rules are created and added to the list of parsing rules.
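
As a rough illustration of how empty-shift rules might be derived, the sketch below assumes the lexicon is available as a dictionary mapping each phonetic form to a list of comma-separated feature strings, as in the example above. The EmptyShift type and the empty_shift_rules function are hypothetical names, not part of this repository.

```python
from typing import Dict, List, NamedTuple, Tuple


class EmptyShift(NamedTuple):
    """Hypothetical rule record: shift an empty item with these features."""
    features: Tuple[str, ...]


def empty_shift_rules(lexicon: Dict[str, List[str]]) -> List[EmptyShift]:
    """Create one empty-shift rule per entry listed under the empty string."""
    return [EmptyShift(tuple(entry.split(","))) for entry in lexicon.get("", [])]


# Only the '' entry comes from the example above; nothing else is needed here.
lexicon = {"": ["=v,c", "=v,+wh,c"]}
rules = empty_shift_rules(lexicon)
```

With the example lexicon, two rules are produced, one for each feature list of the empty item.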

Another working assumption is that shift rules are never executed consecutively. That is, if a shift rule has just been applied, the next rule cannot be another shift rule; any such rule is skipped with the following log message:
Skipping rule: {rule} because it follows a shift rule!
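
A minimal sketch of this filter is shown below. It assumes rules are identified by name strings and that shift rules can be recognised by a "shift" prefix; both are assumptions made for illustration, as are the helper names is_shift and applicable_rules and the logger name.

```python
import logging
from typing import List, Optional

logger = logging.getLogger("lc_sketch")  # hypothetical logger name


def is_shift(rule: str) -> bool:
    """Assumption for this sketch: shift rules have names starting with 'shift'."""
    return rule.startswith("shift")


def applicable_rules(rules: List[str], previous_rule: Optional[str]) -> List[str]:
    """Drop shift rules when the previously applied rule was also a shift."""
    kept = []
    for rule in rules:
        if previous_rule is not None and is_shift(previous_rule) and is_shift(rule):
            logger.info("Skipping rule: %s because it follows a shift rule!", rule)
            continue
        kept.append(rule)
    return kept
```

For example, applicable_rules(["shift", "lc"], "shift") keeps only "lc", while the same call with a previous rule of "lc" keeps both.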

Parsing Modes

The parse function supports two more modes of running:

rules: A list of rules to be used in the parsing process. This replaces the default list of rules loaded from the grammar.
manual: If set to true, alongside a list of rules, the parser will apply them in the order given (for directly testing the correct parsing process).

In either case, when a rule's condition is not met, or when the rule is applied but produces nothing new (its result is None), you can expect a log message ending in returning same config.
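
The behaviour of applying a fixed list of rules in order can be pictured with the sketch below. It is a simplified illustration under stated assumptions: configurations are plain integers, rules are (name, function) pairs, and a rule that yields None is logged and the configuration is carried forward unchanged. Whether the real manual mode continues or stops after such a rule is not specified here, and apply_in_order is a hypothetical helper.

```python
import logging
from typing import Callable, List, Optional, Tuple

logger = logging.getLogger("lc_sketch")  # hypothetical logger name

NamedRule = Tuple[str, Callable[[int], Optional[int]]]


def apply_in_order(config: int, rules: List[NamedRule]) -> Tuple[int, List[str]]:
    """Apply rules in the given order, keeping the config when a rule yields None."""
    history: List[str] = []
    for name, rule in rules:
        result = rule(config)
        if result is None:
            # The message prefix is made up; only the ending mirrors the text above.
            logger.info("%s did not apply, returning same config", name)
            continue
        config = result
        history.append(name)
    return config, history


# Toy rules on integers, purely for illustration.
toy_rules: List[NamedRule] = [
    ("inc", lambda n: n + 1),
    ("halve", lambda n: n // 2 if n % 2 == 0 else None),
    ("halve", lambda n: n // 2 if n % 2 == 0 else None),
]
final_config, applied = apply_in_order(1, toy_rules)
```

Here the second halve does not apply, so the configuration it received is returned unchanged and the rule is left out of the history.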

Other

Log levels are currently used to show the parsing process in detail and to display it in color (which is why rule applications are logged as "warnings").

Credits

Based on the original implementation in Prolog by Miloš Stanojević.
