Fix incorrect location data in OUTDENT nodes #4291

alangpierce · 2016-07-30T03:46:14Z

In f609036, a line was changed from
if length > 0 then (length - 1) else 0 to Math.max 0, length - 1. However,
in some cases, the length variable can be undefined. The previous code would
correctly compute lastCharacter as 0, but the new code would compute it as
NaN. This would cause trouble later on: the end location would just be the end
of the current chunk, which would be incorrect. This confuses tools that rely on
the CoffeeScript parser and need the location info to be correct.

Here's a specific case where the parser was behaving incorrectly:

a {
  b: ->
    return c d,
      if e
        f
}
g

The OUTDENT tokens after the f had an undefined length, so the NaN made it
so the end location was at the end of the file. That meant that various nodes in
the AST, like the return node, would incorrectly have an end location at the
end of the file.

To fix, I just reverted the change to that particular line.

In f609036, a line was changed from `if length > 0 then (length - 1) else 0` to `Math.max 0, length - 1`. However, in some cases, the `length` variable can be `undefined`. The previous code would correctly compute `lastCharacter` as 0, but the new code would compute it as `NaN`. This would cause trouble later on: the end location would just be the end of the current chunk, which would be incorrect. Here's a specific case where the parser was behaving incorrectly: ``` a { b: -> return c d, if e f } g ``` The OUTDENT tokens after the `f` had an undefined length, so the `NaN` made it so the end location was at the end of the file. That meant that various nodes in the AST, like the `return` node, would incorrectly have an end location at the end of the file. To fix, I just reverted the change to that particular line.

lydell · 2016-08-01T10:39:47Z

Is it possible to write a test for this?

vendethiel · 2016-08-01T15:26:47Z

Why do they have a undefined length? Shouldn't it be at least 0?

alangpierce · 2016-08-02T04:50:13Z

@vendethiel To be honest I don't know this code that well, but my impression is that length is optional for OUTDENT tokens, since outdents are sometimes more "logical" tokens that don't correspond go a direct range of source code. If you look at the outdentToken function in lexer.coffee, it takes outdentLength as the third parameter, and two of the three usages don't pass in any length. The function also has the code outdentLength and @chunk[outdentLength] in INDENTABLE_CLOSERS, suggesting that outdentLength is intentionally optional. I could potentially try to change it to always be specified, but I don't know what other consequences that might have. The simple revert certainly seems safest.

@lydell When I initially looked I thought there weren't any tests directly for the lexer, but now I see that test/location.coffee is what I want. I just added a test there.

lydell · 2016-08-04T12:45:46Z

LGTM

Backport of jashkenas#4291 . In f609036, a line was changed from `if length > 0 then (length - 1) else 0` to `Math.max 0, length - 1`. However, in some cases, the `length` variable can be `undefined`. The previous code would correctly compute `lastCharacter` as 0, but the new code would compute it as `NaN`. This would cause trouble later on: the end location would just be the end of the current chunk, which would be incorrect. Here's a specific case where the parser was behaving incorrectly: ``` a { b: -> return c d, if e f } g ``` The OUTDENT tokens after the `f` had an undefined length, so the `NaN` made it so the end location was at the end of the file. That meant that various nodes in the AST, like the `return` node, would incorrectly have an end location at the end of the file. To fix, I just reverted the change to that particular line. Also add a test that tokens have locations that are in order

…on-data Fix incorrect location data in OUTDENT nodes

alangpierce mentioned this pull request Jul 30, 2016

CoffeeScript parser gives incorrect location data in some cases decaffeinate/decaffeinate#356

Closed

Add a test that tokens have locations that are in order

feb42e5

alangpierce mentioned this pull request Aug 6, 2016

fix: properly set location data in OUTDENT nodes decaffeinate/coffeescript#2

Merged

This was referenced Aug 23, 2016

Change OUTDENT tokens to be positioned at the end of the previous token #4296

Merged

CS2 Discussion: Question: New Compiler Starting Point coffeescript6/discuss#25

Closed

lydell merged commit ec9c4d8 into jashkenas:master Sep 14, 2016

lydell added bug fixed labels Sep 14, 2016

coffeescriptbot mentioned this pull request Feb 19, 2018

CS2 Discussion: Question: New Compiler Starting Point #4923

Closed

EsrefDurna added a commit to EsrefDurna/coffeescript that referenced this pull request Nov 12, 2025

Merge pull request jashkenas#4291 from alangpierce/fix-outdent-locati…

053ebed

…on-data Fix incorrect location data in OUTDENT nodes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix incorrect location data in OUTDENT nodes #4291

Fix incorrect location data in OUTDENT nodes #4291

Uh oh!

alangpierce commented Jul 30, 2016

Uh oh!

lydell commented Aug 1, 2016

Uh oh!

vendethiel commented Aug 1, 2016

Uh oh!

alangpierce commented Aug 2, 2016

Uh oh!

lydell commented Aug 4, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix incorrect location data in OUTDENT nodes #4291

Fix incorrect location data in OUTDENT nodes #4291

Uh oh!

Conversation

alangpierce commented Jul 30, 2016

Uh oh!

lydell commented Aug 1, 2016

Uh oh!

vendethiel commented Aug 1, 2016

Uh oh!

alangpierce commented Aug 2, 2016

Uh oh!

lydell commented Aug 4, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants