Increasing range of terms parsable into MathExpr #289

sundararajan-s · 2020-08-27T07:11:10Z

The following is a list of issues which need to be fixed/improved on in the parser for MathExpr

The text was updated successfully, but these errors were encountered:

siddhartha-gadgil · 2020-08-27T10:27:44Z

Does your first point mean we extend the MathExpr language (to allow contexts?)
At present tex expressions are always assigned the part of speech "proper noun". For a|b etc we need a different part of speech (I do not know if a single word type is possible here).
As you may have noticed, many cases are handled by pre-processing, ideally by merging tokens and assigning a correct part-of-speech tag.
As you go along, please give examples and precise errors for the cases; e.g. unparsed, wrongly parsed, or part wrongly parsed so the whole is unparsed.

sundararajan-s · 2020-08-27T10:46:38Z

That is one possibility. I was thinking it would be simpler to just drop the specific words. I think this will be needed in the subsequent step, the conversion from MathExpr to HoTT.
I do not think so either. There are certain simple cases for which a fix is possible, I shall experiment with those. If it does not have any issues I may temporarily add those.
Actually I did not notice much preprocessing. The preprocessing in the TeXParsed class is commented out, and besides that I did not find any preprocessing.
I shall do that. I shall edit the original issue with those.

siddhartha-gadgil · 2020-08-27T10:52:31Z

The language should be extended if and only if the meaning of the sentence cannot be expressed. Otherwise one changes the parsing.
The POS tags are modified in a few cases. I think "such that" is replaced with where. There isn't much preprocessing because there isn't much of anything specific.

sundararajan-s · 2020-08-27T11:23:36Z

In that case I don't think the language will need to be extended for that issue. However for the adverb issue will require an extension to the language.
The substitution was commented out, I shall re-enable it and see the results.

siddhartha-gadgil · 2020-08-27T12:35:52Z

If it was commented out it probably is unnecessary due to a change somewhere, either my code or the Stanford parser.

sundararajan-s · 2020-09-02T12:59:24Z

Added new sub issue regarding conjunct adjectives.

sundararajan-s · 2021-02-15T05:19:53Z

The sub-issue regarding verbs inside TeX expressions has been solved by replacing the specific TeX expression, for example, $a > b$ with "$a > b$ is true". The correct TeX expression is selected by iterating over all possible swaps and checking which one parses.

siddhartha-gadgil · 2021-02-15T10:08:33Z

Nice. So every LaTeX expression is a noun. We need rules for adding "is true", but these should be simple to some extent, and amenable to machine learning

…

On Mon, 15 Feb 2021 at 10:50, sundararajan-s ***@***.***> wrote: The sub-issue regarding verbs inside TeX expressions has been solved by replacing the specific TeX expression, for example, $a > b$ with "$a > b$ is true". The correct TeX expression is selected by iterating over all possible swaps and checking which one parses. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#289 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AA3K3JGURSBEXVX62FW65RDS7CVIXANCNFSM4QMUXKGA> .

sundararajan-s · 2021-02-16T05:08:44Z

For now, I'm doing an exhaustive search, but I do think in the future we could speed it up with some NLP methods.

sundararajan-s mentioned this issue Aug 29, 2020

Handling raised MatchExceptions in the Determiner Object and adding more adjective and adverb support #290

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Increasing range of terms parsable into MathExpr #289

Increasing range of terms parsable into MathExpr #289

sundararajan-s commented Aug 27, 2020 •

edited

Loading

siddhartha-gadgil commented Aug 27, 2020

sundararajan-s commented Aug 27, 2020

siddhartha-gadgil commented Aug 27, 2020

sundararajan-s commented Aug 27, 2020 •

edited

Loading

siddhartha-gadgil commented Aug 27, 2020

sundararajan-s commented Sep 2, 2020 •

edited

Loading

sundararajan-s commented Feb 15, 2021

siddhartha-gadgil commented Feb 15, 2021 via email

sundararajan-s commented Feb 16, 2021

Increasing range of terms parsable into MathExpr #289

Increasing range of terms parsable into MathExpr #289

Comments

sundararajan-s commented Aug 27, 2020 • edited Loading

siddhartha-gadgil commented Aug 27, 2020

sundararajan-s commented Aug 27, 2020

siddhartha-gadgil commented Aug 27, 2020

sundararajan-s commented Aug 27, 2020 • edited Loading

siddhartha-gadgil commented Aug 27, 2020

sundararajan-s commented Sep 2, 2020 • edited Loading

sundararajan-s commented Feb 15, 2021

siddhartha-gadgil commented Feb 15, 2021 via email

sundararajan-s commented Feb 16, 2021

sundararajan-s commented Aug 27, 2020 •

edited

Loading

sundararajan-s commented Aug 27, 2020 •

edited

Loading

sundararajan-s commented Sep 2, 2020 •

edited

Loading