Skip to content

Conversation

@xinrong-meng
Copy link
Member

What changes were proposed in this pull request?

The current documentation incorrectly suggests that "plotting.sample_ratio" defaults to "plotting.max_rows". In reality, if "plotting.sample_ratio" is not explicitly set, it is derived based on the ratio of "plotting.max_rows" to the dataset size.

Why are the changes needed?

To avoid misunderstandings about how these options interact.

Does this PR introduce any user-facing change?

Doc change only.

How was this patch tested?

Existing tests.

Was this patch authored or co-authored using generative AI tooling?

No.

@xinrong-meng xinrong-meng changed the title [SPARK-49532][DOCS] Improve documentation of "plotting.sample_ratio" option [SPARK-49532][DOCS][PS] Improve documentation of "plotting.sample_ratio" option Sep 6, 2024
@HyukjinKwon
Copy link
Member

Merged to master.

@xinrong-meng
Copy link
Member Author

Thank you @HyukjinKwon !

@xinrong-meng xinrong-meng deleted the doc_impr branch September 10, 2024 08:01
IvanK-db pushed a commit to IvanK-db/spark that referenced this pull request Sep 20, 2024
…io" option

### What changes were proposed in this pull request?
The current documentation incorrectly suggests that "plotting.sample_ratio" **defaults** to "plotting.max_rows". In reality, if "plotting.sample_ratio" is not explicitly set, it is **derived** based on the ratio of "plotting.max_rows" to the dataset size.

### Why are the changes needed?
To avoid misunderstandings about how these options interact.

### Does this PR introduce _any_ user-facing change?
Doc change only.

### How was this patch tested?
Existing tests.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes apache#48013 from xinrong-meng/doc_impr.

Authored-by: Xinrong Meng <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
attilapiros pushed a commit to attilapiros/spark that referenced this pull request Oct 4, 2024
…io" option

### What changes were proposed in this pull request?
The current documentation incorrectly suggests that "plotting.sample_ratio" **defaults** to "plotting.max_rows". In reality, if "plotting.sample_ratio" is not explicitly set, it is **derived** based on the ratio of "plotting.max_rows" to the dataset size.

### Why are the changes needed?
To avoid misunderstandings about how these options interact.

### Does this PR introduce _any_ user-facing change?
Doc change only.

### How was this patch tested?
Existing tests.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes apache#48013 from xinrong-meng/doc_impr.

Authored-by: Xinrong Meng <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
himadripal pushed a commit to himadripal/spark that referenced this pull request Oct 19, 2024
…io" option

### What changes were proposed in this pull request?
The current documentation incorrectly suggests that "plotting.sample_ratio" **defaults** to "plotting.max_rows". In reality, if "plotting.sample_ratio" is not explicitly set, it is **derived** based on the ratio of "plotting.max_rows" to the dataset size.

### Why are the changes needed?
To avoid misunderstandings about how these options interact.

### Does this PR introduce _any_ user-facing change?
Doc change only.

### How was this patch tested?
Existing tests.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes apache#48013 from xinrong-meng/doc_impr.

Authored-by: Xinrong Meng <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants