-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-1459] Use local path (and not complete URL) when opening local lo... #375
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Can one of the admins verify this patch? |
|
/cc @andrewor14 |
|
Good catch. The existing code fails when the |
|
Jenkins, test this please. |
1 similar comment
|
Jenkins, test this please. |
|
Merged build triggered. |
|
Merged build started. |
|
Merged build finished. All automated tests passed. |
|
All automated tests passed. |
|
FYI: just found another issue in this same code. It does not handle the case where the default fs is not "file:", and will try to use FileOutputStream even when it's something like hdfs. I'll fix that and push the new code. |
|
Not sure if I understand. Are you saying that if we somehow configure the default file scheme to be |
|
Ok, maybe it's not a bug per se. But I think it somewhat breaks the user's expectations. Let's say that given a user's configuration, the default fs is hdfs; I'd expect that if I specify the log directory to be "/user/vanzin/logs", that would be in HDFS, not in the local fs. But with the current FileLogger code, it will assume that is a local path, since there is no scheme defined. Anyway, the current patch fixes the actual bug; if you think the above should be handled too I have the code to do it working, but it's not strictly necessary. |
|
Commit that implements what I'm suggesting, if you're interested: I haven't sent a pull request for that one. |
|
@vanzin I recently experienced what you mean on a YARN cluster myself. I've looked at your patch and I think it's a good fix for Spark on YARN. Could you submit a PR for it? It will be more convenient for me to leave my comments there. |
|
On a separate note, @pwendell this PR is ready for merge. |
|
Thanks, I've merged this. |
… lo... ...g file. Author: Marcelo Vanzin <[email protected]> Closes #375 from vanzin/event-file and squashes the following commits: f673029 [Marcelo Vanzin] [SPARK-1459] Use local path (and not complete URL) when opening local log file. (cherry picked from commit ac164b7) Signed-off-by: Patrick Wendell <[email protected]>
This is related to SPARK-1459 / PR apache#375. Without this fix, FileLogger.createLogDir() may try to create the log dir on HDFS, while createWriter() will try to open the log file on the local file system, leading to interesting errors and confusion.
This is related to SPARK-1459 / PR #375. Without this fix, FileLogger.createLogDir() may try to create the log dir on HDFS, while createWriter() will try to open the log file on the local file system, leading to interesting errors and confusion. Author: Marcelo Vanzin <[email protected]> Closes #450 from vanzin/event-file-2 and squashes the following commits: 592cdb3 [Marcelo Vanzin] Honor default fs name when initializing event logger.
This is related to SPARK-1459 / PR #375. Without this fix, FileLogger.createLogDir() may try to create the log dir on HDFS, while createWriter() will try to open the log file on the local file system, leading to interesting errors and confusion. Author: Marcelo Vanzin <[email protected]> Closes #450 from vanzin/event-file-2 and squashes the following commits: 592cdb3 [Marcelo Vanzin] Honor default fs name when initializing event logger. (cherry picked from commit dd1b7a6) Signed-off-by: Patrick Wendell <[email protected]>
… lo... ...g file. Author: Marcelo Vanzin <[email protected]> Closes apache#375 from vanzin/event-file and squashes the following commits: f673029 [Marcelo Vanzin] [SPARK-1459] Use local path (and not complete URL) when opening local log file.
This is related to SPARK-1459 / PR apache#375. Without this fix, FileLogger.createLogDir() may try to create the log dir on HDFS, while createWriter() will try to open the log file on the local file system, leading to interesting errors and confusion. Author: Marcelo Vanzin <[email protected]> Closes apache#450 from vanzin/event-file-2 and squashes the following commits: 592cdb3 [Marcelo Vanzin] Honor default fs name when initializing event logger.
Test with ./dev/scalastyle
Test with ./dev/scalastyle
…mbol is a token (apache#375) In the case where the offending symbol is a CommonToken, this PR increases the accuracy of the start and stop origin by leveraging the start and stop index information from CommonToken.
…mbol is a token (apache#375) In the case where the offending symbol is a CommonToken, this PR increases the accuracy of the start and stop origin by leveraging the start and stop index information from CommonToken.
…pache#375) This change add the job of terraform-provider-huaweicloud-acceptance-test-fusioncloud Closes-Bug: theopenlab/openlab#130
…quet (apache#375) [SPARK-47368][SQL]][3.5] Remove inferTimestampNTZ config check in ParquetRo… ### What changes were proposed in this pull request? The configuration `spark.sql.parquet.inferTimestampNTZ.enabled` is not related the parquet row converter. This PR is the remove the config check `spark.sql.parquet.inferTimestampNTZ.enabled` in the ParquetRowConverter ### Why are the changes needed? Bug fix. Otherwise reading TimestampNTZ columns may fail when `spark.sql.parquet.inferTimestampNTZ.enabled` is disabled. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? New UT ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#45492 from gengliangwang/PR_TOOL_PICK_PR_45480_BRANCH-3.5. Authored-by: Gengliang Wang <[email protected]> Signed-off-by: Gengliang Wang <[email protected]> Co-authored-by: Gengliang Wang <[email protected]>
...g file.