@aw-was-here
Contributor

Some folks don't want to do this in JIRA.

@aw-was-here aw-was-here closed this Apr 1, 2015
mekasone pushed a commit to mekasone/hadoop that referenced this pull request Feb 19, 2017
chancez pushed a commit to chancez/hadoop that referenced this pull request Aug 20, 2019
Dockerfile.okd: Use shaded GCS jar in OKD image
Amithsha pushed a commit to Amithsha/hadoop that referenced this pull request Jan 28, 2021
HADOOP-17122: Preserving Directory Attributes in DistCp with Atomic C…
passaro added a commit to passaro/hadoop that referenced this pull request Nov 18, 2022
saxenapranav referenced this pull request in saxenapranav/hadoop Apr 20, 2023
Create file over blob endpoint.
singer-bin pushed a commit to singer-bin/hadoop that referenced this pull request Dec 19, 2024
TestDictionary#testZeroValues demonstrates the problem, where a page of
all null values is decoded using the DictionaryValuesReader. Because
there are no non-null values, the page values section is 0 bytes, but the
DictionaryValuesReader assumes there is at least one encoded value and
attempts to read a bit width. The test passes a byte array to
initFromPage with the offset equal to the array's length.

The fix is to detect that there are no input bytes to read. To avoid
adding validity checks to the read path, this sets the internal decoder
to one that will throw an exception if any reads are attempted.
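
A minimal sketch of the approach described above, not the actual patch. All
class names here (IdDecoder, EmptyPageDecoder, ByteIdDecoder,
SketchDictionaryReader) are hypothetical stand-ins, the one-byte-per-id
decoding is a simplification, and the unchecked exception type is an
assumption. Only the shape of the check in initFromPage mirrors the fix.

```java
// Hypothetical stand-ins for illustration; these are not the real Parquet classes.
interface IdDecoder {
    int readInt();
}

/** Installed when the page has no value bytes: any read is a bug, so fail loudly. */
final class EmptyPageDecoder implements IdDecoder {
    @Override
    public int readInt() {
        throw new IllegalStateException("Attempted to read from a page with no values");
    }
}

/** Simplified decoder: one byte per dictionary id (an assumption made for brevity). */
final class ByteIdDecoder implements IdDecoder {
    private final byte[] page;
    private int pos;

    ByteIdDecoder(byte[] page, int start) {
        this.page = page;
        this.pos = start;
    }

    @Override
    public int readInt() {
        return page[pos++] & 0xFF;
    }
}

final class SketchDictionaryReader {
    private IdDecoder decoder;
    private int bitWidth; // recorded only; the simplified decoder does not use it

    void initFromPage(byte[] page, int offset) {
        if (offset >= page.length) {
            // No input bytes to read: skip the bit-width read entirely and
            // install a decoder that throws if a read is ever attempted.
            decoder = new EmptyPageDecoder();
            return;
        }
        bitWidth = page[offset] & 0xFF; // first byte of the values section
        decoder = new ByteIdDecoder(page, offset + 1);
    }

    // The read path carries no validity check; the decoder choice handles it.
    int readValueDictionaryId() {
        return decoder.readInt();
    }
}
```

Calling initFromPage with the offset equal to the array's length, as the test
does, succeeds in this sketch, and only a later read attempt throws.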

Author: Ryan Blue <[email protected]>

Closes apache#18 from rdblue/PARQUET-18-fix-nulls-with-dictionary and squashes the following commits:

0711766 [Ryan Blue] PARQUET-18: Fix all-null value pages with dict encoding.