Bugfix: Standalone dotlambda just before EOF #16240

T-Gro · 2023-11-08T13:33:07Z

This addresses #16220.

If my hypothesis is correct, the difference was caused by an end-of-file marker.
In a bigger statement (e.g. with a "let" binding), rules higher up cover for an EOF.
In a standalone parsing of just the dotlambda construct, this is not covered.

Why did it matter? Because EOF is one of the possibilities for "recover", which was meant for error handling when this feature was being implemented. But EOF does not always mean an error.

@auduchinok, @DedSec256:
I did not like special-casing EOF, because most other constructs do not need it either; it is dealt with higher up.
I would more than welcome any suggestions from you, both for the EOF handling and for deduplicating the rules here.

auduchinok

@T-Gro I see two problems with the existing rules:

1. Why do the two rules use different nonterminals for the expression, i.e. atomicExpr vs appExpr?
   atomicExpr covers the cases with high-precedence applications, so _.M() is already covered.
   appExpr allows _.M () to be parsed but adds an additional error.

   I think it's wrong because dotLambda is an atomic expression, according to the grammar, but it currently captures arbitrary non-atomic app expressions after it, breaking many assumptions about what atomic expressions are. It also makes f _.P 123 inconsistent with other nested applications like f g 123. On top of that, it may limit possible future changes in the language and parser recovery.
2. Why does the second rule have recover at all?
   It doesn't look like anything is missing there where we would expect the recovery to kick in. This extra recover (i.e. not replacing something that is missing) is indeed the source of the issue with EOF.

It seems that simply removing the second rule can fix it (it looks too wrong and interferes with the first rule), while adding this third one only buries the problem deeper.

That will allow _.P 123 to be parsed as:

appExpr
  dotLambdaExpr
    _
    .
    namedExpr
      P
  constExpr
    123

and f _.P 123 to be parsed as:

appExpr
  appExpr
    namedExpr
      f
    dotLambdaExpr
      _
      .
      namedExpr
        P
  constExpr
    123

Currently f _.P 123 is parsed as this:

appExpr
  namedExpr
    f
  dotLambdaExpr
    _
    .
    appExpr
      namedExpr
        P
      constExpr
        123
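
To make the difference between the two shapes concrete, here is a rough sketch with the shorthand written out as explicit lambdas. The `Item` record and the function `f` are hypothetical and exist only for illustration; the sketch assumes `_.P` reads as `fun x -> x.P`.

```fsharp
// Hypothetical record and function, used only to illustrate the two parse shapes.
type Item = { P: int }

let f (getter: Item -> int) (offset: int) : int =
    getter { P = 1 } + offset

// Proposed shape: `f _.P 123` groups as `(f _.P) 123`,
// so the dot lambda is simply the first argument of f.
let proposed = f (fun (x: Item) -> x.P) 123

// Current shape: the dot lambda body swallows the trailing application,
// i.e. `f (fun x -> x.P 123)`. With the Item type above this would not
// type-check, because P is an int and cannot be applied to 123.
// let current = f (fun (x: Item) -> x.P 123)

printfn "%d" proposed
```

With the proposed grouping, 123 stays an argument of f, in the same way as in f g 123.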

Side question: if the second rule is removed, do we need a special precedence at all? Having fewer of them always makes things easier down the road.

tests/service/data/SyntaxTree/DotLambda/TopLevelLet.fs

T-Gro · 2023-11-08T16:12:01Z

> Why does the second rule have recover at all?
> It doesn't look like anything is missing there where we would expect the recovery to kick in. This extra recover (i.e. not replacing something that is missing) is indeed the source of the issue with EOF.

It seems that simply removing the second rule can fix it (it looks too wrong and interferes with the first rule), while adding this third one only buries the problem deeper.

That will allow _.P 123 to be parsed as:
appExpr
  dotLambdaExpr
    _
    .
    namedExpr
      P
  constExpr
    123
and f _.P 123 to be parsed as:
appExpr
  appExpr
    namedExpr
      f
    dotLambdaExpr
      _
      .
      namedExpr
        P
  constExpr
    123
Currently f _.P 123 is parsed as this:
appExpr
  namedExpr
    f
  dotLambdaExpr
    _
    .
    appExpr
      namedExpr
        P
      constExpr
        123
It seems wrong and a bit too limiting to possible future changes in the language and parser recovery. It also doesn't seem consistent with other nested applications like f g 123.

Side question: if the second rule is removed, do we need having a special precedence at all? Having less of them always makes things easier down the road.

The second rule only exists in order to give a meaningful error message instead of a generic one, while keeping some parser recovery working.
But true, the "recover" rule is not needed there at all.

auduchinok · 2023-11-09T09:06:38Z

> The second rule only exists in order to give a meaningful error message instead of a generic one, while keeping some parser recovery working.

Could you please show an example of the generic error message when this rule is removed? I think it's very important to fix the atomic expressions being non-atomic here, and the only way I see is to remove this rule.

tests/service/data/SyntaxTree/DotLambda/UnderscoreToString.fs.bsl

...service/data/SyntaxTree/DotLambda/UnderscoreToFunctionNallWithSpaceAndUnitApplication.fs.bsl

tests/service/data/SyntaxTree/DotLambda/WithNonTupledFunctionCall.fs.bsl

T-Gro · 2023-11-09T09:24:43Z

> > The second rule only exists in order to give a meaningful error message instead of a generic one, while keeping some parser recovery working.
>
> Could you please show an example of the generic error message when this rule is removed? I think it's very important to fix the atomic expressions being non-atomic here, and the only way I see is to remove this rule.

It is now visible in the diff, and I added comments with explanations.
I see three options:

1. I move the "nonatomic" error into the typechecker.
2. I keep only one rule in the parser, but implement additional logic inside the rule to separate atomic from nonatomic.
3. I revert to two rules, one passing (atomic) and one failing (nonatomic), but avoid using 'recover' in the rule.

The second one looks best to me, WDYT?

auduchinok · 2023-11-09T10:11:50Z

@T-Gro Thanks, all the examples are good. Looking at these, I think the first option is probably the best as things stand now. A simple check in the type checker could disallow dot lambdas as function expressions in application expressions.

It would make this allowed, because _.P is the argument expression in the inner app expression:

let _ = f _.P 123

letBinding
  appExpr
    appExpr
      namedExpr
        f
      dotLambdaExpr
        _
        .
        namedExpr
          P
    constExpr
      123

And this would not be allowed, because _.P is the function expr:

let _ = _.P 123

letBinding
  appExpr
    dotLambdaExpr
      _
      .
      namedExpr
        P
    constExpr
      123
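
The check described above could be sketched roughly as follows. This is only an illustration over a toy syntax tree: the `Expr` type, the `check` function, and the error text are hypothetical, and none of them correspond to the real SynExpr cases or to the code in CheckExpressions.fs.

```fsharp
// Toy syntax tree; these are NOT the real compiler types.
type Expr =
    | Named of string
    | ConstInt of int
    | DotLambda of body: Expr
    | App of funcExpr: Expr * argExpr: Expr

// Collect one error for every application whose function position is a dot lambda.
let rec check (expr: Expr) : string list =
    match expr with
    | Named _
    | ConstInt _ -> []
    | DotLambda body -> check body
    | App (DotLambda body, argExpr) ->
        "dot lambda used as the function in an application" :: (check body @ check argExpr)
    | App (funcExpr, argExpr) -> check funcExpr @ check argExpr

// f _.P 123 as App (App (f, _.P), 123): the dot lambda is an argument, so no error.
let allowed = App (App (Named "f", DotLambda (Named "P")), ConstInt 123)

// _.P 123 as App (_.P, 123): the dot lambda is the function, so one error.
let rejected = App (DotLambda (Named "P"), ConstInt 123)

printfn "%A" (check allowed)
printfn "%A" (check rejected)
```

The point of the sketch is that the parser can produce plain nested applications, and the decision about whether a dot lambda is misused depends only on its position inside an application node.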

Can the special precedence also be removed if non-atomic expressions are no longer allowed there?

> I keep only one rule in the parser, but implement additional logic inside the rule to separate atomic from nonatomic.

That would still mean we're parsing a 'top-level' non-atomic expression inside an atomic one. I think we should avoid it; it makes the whole grammar less sound.

auduchinok

It looks good to me now, it's good to have the underlying problem fixed!

T-Gro · 2023-11-09T16:15:44Z

Agree.

I also wanted to remove the %prec and tried that, but I started to get test failures which did not repro locally.
The failures meant that _.Prop was getting parsed as an identifier with a dot access, and not as a dot lambda. I wasn't sure why, since I could not repro it.

=> I will keep that out of this PR, so that it can get in as it is.

This is ready for review and merging.

src/Compiler/Checking/CheckExpressions.fs

T-Gro added 2 commits November 8, 2023 14:25

failing test

5287481

errors removed

5f71856

T-Gro requested a review from a team as a code owner November 8, 2023 13:33

T-Gro requested a review from auduchinok November 8, 2023 13:33

T-Gro changed the title ~~Bugfix: Standalone dotlambda as the very last statement~~ Bugfix: Standalone dotlambda just before EOF Nov 8, 2023

auduchinok reviewed Nov 8, 2023

tests/service/data/SyntaxTree/DotLambda/TopLevelLet.fs

T-Gro added 3 commits November 8, 2023 17:16

move tests into expression folder

71a43db

simplify parser rules

e9e9b11

get them back

085dc50

Simplifying ruleset

e947086

T-Gro commented Nov 9, 2023

tests/service/data/SyntaxTree/DotLambda/UnderscoreToString.fs.bsl

T-Gro commented Nov 9, 2023

...service/data/SyntaxTree/DotLambda/UnderscoreToFunctionNallWithSpaceAndUnitApplication.fs.bsl

T-Gro commented Nov 9, 2023

tests/service/data/SyntaxTree/DotLambda/WithNonTupledFunctionCall.fs.bsl

T-Gro added 4 commits November 9, 2023 14:16

Validating incorrect usage in typechecking, not in parsing like before

7e96db3

Moving tests into Expression folder

6d80c16

removing explicit precedence rule for dot_lambda

dff2b37

put %prec back, tests were failing

3801f3f

auduchinok approved these changes Nov 9, 2023

abonie reviewed Nov 10, 2023

src/Compiler/Checking/CheckExpressions.fs

abonie approved these changes Nov 10, 2023

psfinaki approved these changes Nov 10, 2023

psfinaki merged commit 9a0b9bf into dotnet:main Nov 10, 2023

auduchinok mentioned this pull request Nov 21, 2023

Unexpected parsing error in shorthand lambda #16220

Closed

allisonchou mentioned this pull request Nov 21, 2023

[Automated] PRs inserted in VS build main-34321.28 #16320

Closed

brianrourkeboll mentioned this pull request Dec 21, 2023

Unexpected FS3571 shorthand lambda atomic error #16457

Closed
