Skip to content

Conversation

@divyam
Copy link

@divyam divyam commented Dec 3, 2014

No description provided.

cnauroth and others added 30 commits January 12, 2015 10:18
…container statuses on heartbeat. Contributed by Chengbing Liu
…dows using Cygwin. Contributed by Chris Nauroth.
…tead of availableResource for maxAllocation. (adhoot via rkanter)
…n and yarn-default.xml. (rchiang via rkanter)
…Cache#testEviction (Sangjin Lee via Colin P. McCabe)
…er incorrectly (Jonathan Mace via Colin P. McCabe)
…rScheduler.allocate call. (kasha via rkanter)
distributed cache with enabling wired encryption at the same time.
Contributed by Junping Du.
…uler when activating applications. Contributed by Craig Welch
…onID() for logging AttemptId in RMContainerAllocator.java (Contributed by Leitao Guo)
oza and others added 23 commits February 6, 2015 13:45
…istoryClientService are uniform when application-attempt is not found. Contributed by Zhijie Shen.
…line server not-fatal. Contributed by Jonathan Eagles
…runcate tests such as truncate with HA setup, negative tests, truncate with other operations and multiple truncates.
@divyam divyam closed this Feb 10, 2015
mekasone pushed a commit to mekasone/hadoop that referenced this pull request Feb 19, 2017
Add extra check for Marathon-lb.
chancez pushed a commit to chancez/hadoop that referenced this pull request Jul 26, 2019
Dockerfile: Add OKD Dockerfile based on UBI
shanthoosh pushed a commit to shanthoosh/hadoop that referenced this pull request Oct 15, 2019
containsValue method should invoke map.containsValue not map.containsKey

Author: michaelwong <[email protected]>

Reviewers: Jagadish <[email protected]>

Closes apache#12 from jwongo/master
singer-bin pushed a commit to singer-bin/hadoop that referenced this pull request Dec 19, 2024
This patch adds the ability to use column index based access to parquet files in pig, which allows for rename capability similar to other file formats.  This is achieved by using the parametrized loader with an alternate schema.

Example:
# File Schema: {c1:int, c2:float, c3:chararray}
p = LOAD '/data/parquet/' USING parquet.pig.ParquetLoader('n1:int, n2:float, n3:chararray', 'true');

In this example, the names from the requested schema will be translated to the column positions from the file and will produce tuples based on the index position.

Two test cases are included that exercise index based access for both full file reads and column projected reads.

Note:  This patch also disables the enforcer plugin on the pig project per discussion at the parquet meetup.  The justification for this is that the enforcer is too strict for internal classes and results in dead code because duplicating methods is required to add parameters where there is only one usage of the constructor/method.  The interface for the pig loader is imposed by LoadFunc and StoreFunc by the pig project and the implementations internals should not be used directly.

Author: Daniel Weeks <[email protected]>

Closes apache#12 from dcw-netflix/column-index-access and squashes the following commits:

1b5c5cf [Daniel Weeks] Refactored based on rewview comments
12b53c1 [Daniel Weeks] Fixed some formatting and the missing filter method sig
e5553f1 [Daniel Weeks] Adding back default constructor to satisfy other project requirements
69d21e0 [Daniel Weeks] Merge branch 'master' into column-index-access
f725c6f [Daniel Weeks] Removed enforcer for pig support
d182dc6 [Daniel Weeks] Introduces column index access
1c3c0c7 [Daniel Weeks] Fixed test with strict checking off
f3cb495 [Daniel Weeks] Added type persuasion for primitive types with a flag to control strict type checking for conflicting schemas, which is strict by default.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.