Skip to content

Conversation

@openinx
Copy link
Member

@openinx openinx commented Jun 4, 2019

…OutputStream is causing lots of heap allocation in HBase when using short-circut read

…OutputStream is causing lots of heap allocation in HBase when using short-circut read
@hadoop-yetus
Copy link

💔 -1 overall

Vote Subsystem Runtime Comment
0 reexec 40 Docker mode activated.
_ Prechecks _
+1 dupname 0 No case conflicting files found.
+1 @author 0 The patch does not contain any @author tags.
-1 test4tests 0 The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ trunk Compile Tests _
+1 mvninstall 1134 trunk passed
+1 compile 42 trunk passed
+1 checkstyle 21 trunk passed
+1 mvnsite 46 trunk passed
+1 shadedclient 174 branch has no errors when building and testing our client artifacts.
+1 javadoc 28 trunk passed
0 spotbugs 128 Used deprecated FindBugs config; considering switching to SpotBugs.
+1 findbugs 126 trunk passed
_ Patch Compile Tests _
+1 mvninstall 41 the patch passed
+1 compile 37 the patch passed
+1 javac 37 the patch passed
-0 checkstyle 17 hadoop-hdfs-project/hadoop-hdfs-client: The patch generated 1 new + 32 unchanged - 0 fixed = 33 total (was 32)
+1 mvnsite 40 the patch passed
+1 whitespace 0 The patch has no whitespace issues.
+1 shadedclient 811 patch has no errors when building and testing our client artifacts.
+1 javadoc 26 the patch passed
+1 findbugs 150 the patch passed
_ Other Tests _
+1 unit 119 hadoop-hdfs-client in the patch passed.
+1 asflicense 26 The patch does not generate ASF License warnings.
2995
Subsystem Report/Notes
Docker Client=17.05.0-ce Server=17.05.0-ce base: https://builds.apache.org/job/hadoop-multibranch/job/PR-899/1/artifact/out/Dockerfile
GITHUB PR #899
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle
uname Linux 3ad98fe9d044 4.4.0-141-generic #167~14.04.1-Ubuntu SMP Mon Dec 10 13:20:24 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality personality/hadoop.sh
git revision trunk / 606061a
Default Java 1.8.0_212
checkstyle https://builds.apache.org/job/hadoop-multibranch/job/PR-899/1/artifact/out/diff-checkstyle-hadoop-hdfs-project_hadoop-hdfs-client.txt
Test Results https://builds.apache.org/job/hadoop-multibranch/job/PR-899/1/testReport/
Max. process+thread count 305 (vs. ulimit of 5500)
modules C: hadoop-hdfs-project/hadoop-hdfs-client U: hadoop-hdfs-project/hadoop-hdfs-client
Console output https://builds.apache.org/job/hadoop-multibranch/job/PR-899/1/console
versions git=2.7.4 maven=3.3.9 findbugs=3.1.0-RC1
Powered by Apache Yetus 0.10.0 http://yetus.apache.org

This message was automatically generated.

@toddlipcon
Copy link
Contributor

No new tests seems fine since this is just a small perf optimization. +1.

@toddlipcon toddlipcon merged commit ea3b0a1 into apache:trunk Jun 4, 2019
shanthoosh added a commit to shanthoosh/hadoop that referenced this pull request Oct 15, 2019
…izer (apache#899)

* Update ZkClient in samza-standalone to use string serialization.

Changes:

* Switch from using SerializableSerializer to ZkSerializer.
  ZkClient uses pluggable serde to serialize/deserialize user-defined types to bytes before it's written to zookeeper.
  In the current implementation, the predefined SerializableSerializer(which uses the java native serialization) provided as default by the ZkClient library is used by samza-standalone. SerializableSerializer auguments special characters  on top of the serialized data as the auxillary data. The presence of these special characters makes the parsing zookeeper data impossible  when building custom tools in languages(such as python) that does not have java native serializer. To solve this problem, this patch in introduces string serializer to serialize/deserialize data before it's written to zookeeper.
* Adds a new string serializer which implements the ZkSerializer serialization abstraction provided by the ZkClient.
* Fix usages where direct type conversion where done.

Testing:
* Added unit tests for the newly introduced serializer.
* Integration testing done with the sample standalone test jobs from samza-hello-samza.

* Address review comments.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants