
Conversation

@mgummelt

I still need to update the docs, but PTAL

yanboliang and others added 30 commits January 25, 2016 11:52
…tions

Update the user guide for RFormula feature interactions. We also document other new features, such as support for string labels in Spark 1.6.

Author: Yanbo Liang <[email protected]>

Closes apache#10222 from yanboliang/spark-11965.
Added color coding to the Executors page for Active Tasks, Failed Tasks, Completed Tasks and Task Time.

Active Tasks is shaded blue, with its range based on the percentage of total cores used.
Failed Tasks is shaded red, ranging over the first 10% of total tasks failed.
Completed Tasks is shaded green, ranging over 10% of total tasks including failed and active tasks, but only when there are active or failed tasks on that executor.
Task Time is shaded red when GC Time goes over 10% of total time, with its range directly corresponding to the percent of total time.

Author: Alex Bozarth <[email protected]>

Closes apache#10154 from ajbozarth/spark12149.
This PR brings back visualization for generated operators; they look like:

![sql](https://cloud.githubusercontent.com/assets/40902/12460920/0dc7956a-bf6b-11e5-9c3f-8389f452526e.png)

![stage](https://cloud.githubusercontent.com/assets/40902/12460923/11806ac4-bf6b-11e5-9c72-e84a62c5ea93.png)

Note: SQL metrics are not supported right now because they are very slow; they will be supported once we have batch mode.

Author: Davies Liu <[email protected]>

Closes apache#10828 from davies/viz_codegen.
… of Partitioning Columns

When users use `partitionBy` and `bucketBy` at the same time, some bucketing columns might be part of the partitioning columns. For example,
```scala
df.write
  .format(source)
  .partitionBy("i")
  .bucketBy(8, "i", "k")
  .saveAsTable("bucketed_table")
```
However, in the above case, adding column `i` to `bucketBy` is useless; it just wastes extra CPU when reading or writing bucketed tables. Thus, like Hive, we can raise an exception and let users make the change.
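
A minimal sketch of the corrected call, assuming the same `df` and `source` as above: drop `i` from `bucketBy`, since it is already a partitioning column.
```scala
// Hedged sketch: "i" is already a partitioning column, so bucket only by "k".
df.write
  .format(source)
  .partitionBy("i")
  .bucketBy(8, "k")
  .saveAsTable("bucketed_table")
```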

Also added a test case for checking whether the information of `sortBy` and `bucketBy` columns is correctly saved in the metastore table.

Could you check if my understanding is correct? cloud-fan rxin marmbrus Thanks!

Author: gatorsmile <[email protected]>

Closes apache#10891 from gatorsmile/commonKeysInPartitionByBucketBy.
```PCAModel``` can output ```explainedVariance``` on the Python side.
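
For context, a minimal Scala sketch of the accessor being mirrored to Python; the DataFrame `df` and its "features" column are assumptions for illustration, not from this PR.
```scala
import org.apache.spark.ml.feature.PCA

// Hedged sketch: fit a PCA model and read the per-component explained variance.
val model = new PCA()
  .setInputCol("features")
  .setOutputCol("pcaFeatures")
  .setK(2)
  .fit(df)
println(model.explainedVariance) // vector of variance explained by each component
```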

cc mengxr srowen

Author: Yanbo Liang <[email protected]>

Closes apache#10830 from yanboliang/spark-12905.
This PR adds serialization support for `CountMinSketch`.

A version number is added to version the serialized binary format.
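
A minimal round-trip sketch, assuming the public `CountMinSketch` API in the sketch module (parameter values are illustrative):
```scala
import java.io.{ByteArrayInputStream, ByteArrayOutputStream}
import org.apache.spark.util.sketch.CountMinSketch

// Hedged sketch: serialize a sketch (the binary format carries a version
// number) and read it back.
val cms = CountMinSketch.create(10, 1000, 42) // depth, width, seed (illustrative)
cms.add("item")
val out = new ByteArrayOutputStream()
cms.writeTo(out)
val restored = CountMinSketch.readFrom(new ByteArrayInputStream(out.toByteArray))
assert(restored.estimateCount("item") == cms.estimateCount("item"))
```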

Author: Cheng Lian <[email protected]>

Closes apache#10893 from liancheng/cms-serialization.
As we begin to use the unsafe row writing framework (`BufferHolder` and `UnsafeRowWriter`) in more and more places (`UnsafeProjection`, `UnsafeRowParquetRecordReader`, `GenerateColumnAccessor`, etc.), we should document it better and make it easier to use.

This PR abstracts the technique used in `UnsafeRowParquetRecordReader`: avoid unnecessary operations as much as possible. For example, do not always point the row to the buffer at the end; we only need to update the size of the row. If all fields are of primitive type, we can even skip updating the row size. We can then apply this technique to more places easily.

A local benchmark shows `UnsafeProjection` is up to 1.7x faster after this PR:
**old version**
```
Intel(R) Core(TM) i7-4960HQ CPU  2.60GHz
unsafe projection:                 Avg Time(ms)    Avg Rate(M/s)  Relative Rate
-------------------------------------------------------------------------------
single long                             2616.04           102.61         1.00 X
single nullable long                    3032.54            88.52         0.86 X
primitive types                         9121.05            29.43         0.29 X
nullable primitive types               12410.60            21.63         0.21 X
```

**new version**
```
Intel(R) Core(TM) i7-4960HQ CPU  2.60GHz
unsafe projection:                 Avg Time(ms)    Avg Rate(M/s)  Relative Rate
-------------------------------------------------------------------------------
single long                             1533.34           175.07         1.00 X
single nullable long                    2306.73           116.37         0.66 X
primitive types                         8403.93            31.94         0.18 X
nullable primitive types               12448.39            21.56         0.12 X
```

For a single non-nullable long (the best case), we get about a 1.7x speedup. Even when it's nullable, we still get a 1.3x speedup. For the other cases the boost is smaller, as the saved operations account for only a small proportion of the whole process. The benchmark code is included in this PR.

Author: Wenchen Fan <[email protected]>

Closes apache#10809 from cloud-fan/unsafe-projection.
This PR adds an initial implementation of a bloom filter in the newly added sketch module. The implementation is based on the [`BloomFilter` class in Guava](https://code.google.com/p/guava-libraries/source/browse/guava/src/com/google/common/hash/BloomFilter.java).

Some differences from the design doc:

* Expose `bitSize` instead of `sizeInBytes` to the user.
* Always require the `expectedInsertions` parameter when creating a bloom filter (see the sketch below).
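
A minimal usage sketch reflecting the two points above (values are illustrative):
```scala
import org.apache.spark.util.sketch.BloomFilter

// Hedged sketch: create requires expectedInsertions; bitSize is exposed.
val bf = BloomFilter.create(1000L) // expectedInsertions
bf.put("spark")
assert(bf.mightContain("spark")) // no false negatives
// mightContain may also return true for items never inserted (false positives)
println(bf.bitSize())
```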

Author: Wenchen Fan <[email protected]>

Closes apache#10883 from cloud-fan/bloom-filter.
liancheng please take a look

Author: tedyu <[email protected]>

Closes apache#10906 from tedyu/master.
…izer

Add Python API for ml.feature.QuantileDiscretizer.
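
For reference, a small sketch of the Scala API that the new Python wrapper mirrors; the column names, bucket count, and DataFrame `df` are illustrative assumptions.
```scala
import org.apache.spark.ml.feature.QuantileDiscretizer

// Hedged sketch: discretize a continuous column into 3 quantile-based buckets.
val discretizer = new QuantileDiscretizer()
  .setInputCol("hour")
  .setOutputCol("bucket")
  .setNumBuckets(3)
val bucketed = discretizer.fit(df).transform(df) // fit returns a Bucketizer
```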

One open question: do we want to re-use the Java model, create a new model, or use a different wrapper around the Java model?
cc brkyvz & mengxr

Author: Holden Karau <[email protected]>

Closes apache#10085 from holdenk/SPARK-11937-SPARK-11922-Python-API-for-ml.feature.QuantileDiscretizer.
https://issues.apache.org/jira/browse/SPARK-12834

We use `SerDe.dumps()` to serialize `JavaArray` and `JavaList` in `PythonMLLibAPI`, then deserialize them with `PickleSerializer` on the Python side. However, there is no need to transform them in such an inefficient way. Instead, we can use type conversion, e.g. `list(JavaArray)` or `list(JavaList)`. What's more, there is an issue with Ser/De of Scala Array, as I noted in https://issues.apache.org/jira/browse/SPARK-12780

Author: Xusen Yin <[email protected]>

Closes apache#10772 from yinxusen/SPARK-12834.
…in PySpark for now

I saw several failures from recent PR builds, e.g., https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50015/consoleFull. This PR marks the test as ignored and we will fix the flakiness in SPARK-10086.

gliptak Do you know why the test failure didn't show up in the Jenkins "Test Result"?

cc: jkbradley

Author: Xiangrui Meng <[email protected]>

Closes apache#10909 from mengxr/SPARK-10086.
This pull request simply fixes a few minor coding style issues in csv, as I was reviewing the change post-hoc.

Author: Reynold Xin <[email protected]>

Closes apache#10919 from rxin/csv-minor.
This PR adds serialization support for BloomFilter.

A version number is added to version the serialized binary format.

Author: Wenchen Fan <[email protected]>

Closes apache#10920 from cloud-fan/bloom-filter.
JIRA: https://issues.apache.org/jira/browse/SPARK-12961

To prevent a memory leak in snappy-java, just call the method once and cache the result. After the library releases a new version, we can remove this object.
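
A hedged sketch of the caching pattern described; the object and names are illustrative, not the actual Spark code.
```scala
import org.xerial.snappy.Snappy

// Hedged sketch: call the leak-prone native method once and cache the result.
object SnappyVersionHolder {
  lazy val version: String = Snappy.getNativeLibraryVersion // evaluated once, then cached
}
```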

JoshRosen

Author: Liang-Chi Hsieh <[email protected]>

Closes apache#10875 from viirya/prevent-snappy-memory-leak.
…s inconsistent with Scala's Iterator->Iterator

Fix Java function API methods for flatMap and mapPartitions to require producing only an Iterator, not Iterable. Also fix DStream.flatMap to require a function producing TraversableOnce only, not Traversable.
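
In Scala the contract has always been Iterator => Iterator; a small sketch of that shape, which the Java API now matches (a SparkContext `sc` is assumed):
```scala
// Hedged sketch: mapPartitions consumes and produces an Iterator, so results
// can stream lazily instead of being materialized as an Iterable.
val doubled = sc.parallelize(1 to 4)
  .mapPartitions((iter: Iterator[Int]) => iter.map(_ * 2))
```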

CC rxin pwendell for API change; tdas since it also touches streaming.

Author: Sean Owen <[email protected]>

Closes apache#10413 from srowen/SPARK-3369.
Call System.exit explicitly to make sure non-daemon user threads terminate. Without this, user applications might live forever if the cluster manager does not appropriately kill them. E.g., YARN had this bug: HADOOP-12441.

Author: zhuol <[email protected]>

Closes apache#9946 from zhuoliu/10911.
… hive metadata format

This PR adds a new table option (`skip_hive_metadata`) that'd allow the user to skip storing the table metadata in Hive metadata format. While this could be useful in general, the specific use-case for this change is that Hive doesn't handle wide schemas well (see https://issues.apache.org/jira/browse/SPARK-12682 and https://issues.apache.org/jira/browse/SPARK-6024), which in turn prevents such tables from being queried in SparkSQL.
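
A hedged sketch of setting the new option through the generic options mechanism; the option name is from this PR, while the format, table name, and `df` are illustrative assumptions.
```scala
// Hedged sketch: opt out of storing the schema in Hive-compatible form for a
// very wide table.
df.write
  .format("parquet")
  .option("skip_hive_metadata", "true")
  .saveAsTable("wide_table")
```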

Author: Sameer Agarwal <[email protected]>

Closes apache#10826 from sameeragarwal/skip-hive-metadata.
…uctions for streaming-akka project

Since `actorStream` is an external project, we should add the linking and deploying instructions for it.

A follow-up PR of apache#10744

Author: Shixiong Zhu <[email protected]>

Closes apache#10856 from zsxwing/akka-link-instruction.
…r in dev/run-tests

This patch improves our `dev/run-tests` script to test modules in a topologically-sorted order based on modules' dependencies. This will help to ensure that bugs in upstream projects are not misattributed to downstream projects because those projects' tests were the first ones to exhibit the failure.

Topological sorting is also useful for shortening the feedback loop when testing pull requests: if I make a change in SQL then the SQL tests should run before MLlib, not after.

In addition, this patch also updates our test module definitions to split `sql` into `catalyst`, `sql`, and `hive` in order to allow more tests to be skipped when changing only `hive/` files.

Author: Josh Rosen <[email protected]>

Closes apache#10885 from JoshRosen/SPARK-8725.
Otherwise the `^` character is always marked as an error in IntelliJ, since it represents an unclosed superscript markup tag.

Author: Cheng Lian <[email protected]>

Closes apache#10926 from liancheng/agg-doc-fix.
The environment variable ADD_FILES was created for adding Python files to the Spark context to be distributed to executors (SPARK-865); it is deprecated now. Users are encouraged to use --py-files for adding Python files.

Author: Jeff Zhang <[email protected]>

Closes apache#10913 from zjffdu/SPARK-12993.
The current Python ML params require cut-and-pasting the param setup and description between the class & ```__init__``` methods. Remove this possible source of errors & simplify the use of custom params by adding a ```_copy_new_parent``` method to Param, so as to avoid cut-and-pasting (and cut-and-pasting at different indentation levels, urgh).

Author: Holden Karau <[email protected]>

Closes apache#10216 from holdenk/SPARK-10509-excessive-param-boiler-plate-code.
Right now RpcEndpointRef.ask may throw an exception in some corner cases, such as calling ask after stopping RpcEnv. It's better to avoid throwing exceptions from RpcEndpointRef.ask; we can send the exception to the future returned by `ask`.
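
A hedged, generic sketch of the pattern (not the actual RpcEndpointRef code): route failures into the returned Future instead of throwing.
```scala
import scala.concurrent.{Future, Promise}
import scala.util.control.NonFatal

// Hedged sketch: callers observe errors through the Future, never via a throw.
def safeAsk[T](send: () => T): Future[T] = {
  val promise = Promise[T]()
  try promise.success(send())
  catch { case NonFatal(e) => promise.failure(e) } // deliver the error via the Future
  promise.future
}
```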

Author: Shixiong Zhu <[email protected]>

Closes apache#10568 from zsxwing/send-ask-fail.
… Add LibSVMOutputWriter

The behavior of LibSVMRelation is not changed, except for adding LibSVMOutputWriter:
* Partitioning is still not supported
* Multiple input paths are not supported

Author: Jeff Zhang <[email protected]>

Closes apache#9595 from zjffdu/SPARK-11622.
This patch adds support for complex types in ColumnarBatch. ColumnarBatch supports structs and arrays. There is a simple mapping from the richer Catalyst types to these two. Strings are treated as an array of bytes.

ColumnarBatch will contain a column for each node of the schema. Non-complex schemas consist of just leaf nodes. Structs represent an internal node with one child for each field. Arrays are internal nodes with one child. Structs just contain nullability. Arrays contain offsets and lengths into the child array. This structure can handle arbitrary nesting. It has the key property that we stay columnar throughout and that primitive types are only stored in the leaf nodes and contiguous across rows. For example, if the schema is
```
array<array<int>>
```
There are three columns in the schema. The internal nodes each have one child. The leaf node contains all the int data stored consecutively.

As part of this, this patch adds append APIs in addition to the Put APIs (e.g. putLong(rowid, v)
vs appendLong(v)). These APIs are necessary when the batch contains variable length elements.
The vectors are not fixed length and will grow as necessary. This should make the usage a lot
simpler for the writer.
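
A toy, hedged model of the decomposition above for `array<array<int>>` (not the actual ColumnarBatch classes): two internal array nodes carrying offsets and lengths, and one contiguous leaf of ints.
```scala
// Hedged toy model: internal array nodes store offsets/lengths into their
// child; the leaf stores all ints contiguously.
case class ArrayNode(offsets: Array[Int], lengths: Array[Int])

val leafInts = Array(1, 2, 3, 4, 5)                // contiguous leaf data
val inner    = ArrayNode(Array(0, 2), Array(2, 3)) // arrays [1,2] and [3,4,5]
val outer    = ArrayNode(Array(0), Array(2))       // one row: [[1,2], [3,4,5]]
```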

Author: Nong Li <[email protected]>

Closes apache#10820 from nongli/spark-12854.
iyounus and others added 22 commits February 2, 2016 20:38
Fixed the bug in linear regression training for the case when the target variable is constant. The two cases, `fitIntercept=true` and `fitIntercept=false`, should be treated differently.

Author: Imran Younus <[email protected]>

Closes apache#10702 from iyounus/SPARK-12732_bug_fix_in_linear_regression_train.
`rpcEnv.awaitTermination()` was not added in apache#10854 because some Streaming Python tests hung forever.

This patch fixes the hang and adds rpcEnv.awaitTermination() back to SparkEnv.

Previously, the Streaming Kafka Python tests shut down the ZooKeeper server before stopping StreamingContext. Then, when stopping StreamingContext, KafkaReceiver may hang due to https://issues.apache.org/jira/browse/KAFKA-601; hence, some threads of RpcEnv's Dispatcher cannot exit and rpcEnv.awaitTermination hangs. The patch just changed the shutdown order to fix it.

Author: Shixiong Zhu <[email protected]>

Closes apache#11031 from zsxwing/awaitTermination.
1. Try to avoid the suffix (unique id).
2. Remove the comment if there is no code generated.
3. Re-arrange the order of functions.
4. Drop the new line for inlined blocks.

Author: Davies Liu <[email protected]>

Closes apache#11032 from davies/better_suffix.
…kSQL

Based on the semantics of a query, we can derive a number of data constraints on the output of each (logical or physical) operator. For instance, if a filter defines `'a > 10`, we know that the output data of this filter satisfies 2 constraints:

1. `'a > 10`
2. `isNotNull('a)`

This PR proposes a possible way of keeping track of these constraints and propagating them in the logical plan, which can then help us build more advanced optimizations (such as pruning redundant filters, optimizing joins, among others). We define constraints as a set of (implicitly conjunctive) expressions. For example, if a filter operator has constraints = `Set('a > 10, 'b < 100)`, it's implied that the outputs satisfy both individual constraints (i.e., `'a > 10` AND `'b < 100`).
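
A self-contained, hedged sketch of the core idea over a toy expression tree (not Catalyst's classes): a filter's condition implies both itself and `isNotNull` for every attribute it references.
```scala
// Hedged toy model of constraint derivation for a Filter operator.
sealed trait Expr
case class Attr(name: String) extends Expr
case class Lit(value: Any) extends Expr
case class GreaterThan(left: Expr, right: Expr) extends Expr
case class IsNotNull(child: Expr) extends Expr

// Collect every attribute referenced by an expression.
def attrs(e: Expr): Set[Attr] = e match {
  case a: Attr           => Set(a)
  case GreaterThan(l, r) => attrs(l) ++ attrs(r)
  case IsNotNull(c)      => attrs(c)
  case _: Lit            => Set.empty
}

// Constraints of Filter(cond): the condition itself plus isNotNull per attribute.
def filterConstraints(cond: Expr): Set[Expr] =
  Set(cond) ++ attrs(cond).map(IsNotNull(_))

// filterConstraints(GreaterThan(Attr("a"), Lit(10)))
//   == Set('a > 10, isNotNull('a))
```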

Design Document: https://docs.google.com/a/databricks.com/document/d/1WQRgDurUBV9Y6CWOBS75PQIqJwT-6WftVa18xzm7nCo/edit?usp=sharing

Author: Sameer Agarwal <[email protected]>

Closes apache#10844 from sameeragarwal/constraints.
…uration columns

I have clearly prefixed the two 'Duration' columns in the 'Details of Batch' Streaming tab as 'Output Op Duration' and 'Job Duration'.

Author: Mario Briggs <[email protected]>
Author: mariobriggs <[email protected]>

Closes apache#11022 from mariobriggs/spark-12739.
A row from the stream side could match multiple rows on the build side; the loop over these matched rows should not be interrupted when emitting a row, so we buffer the output rows in a linked list and check the termination condition in the producer loop (for example, Range or Aggregate).

Author: Davies Liu <[email protected]>

Closes apache#10989 from davies/gen_join.
The ```SparkSqlLexer``` currently swallows characters which have not been defined in the grammar. This causes problems with SQL commands, such as: ```add jar file:///tmp/ab/TestUDTF.jar```. In this example the `````` is swallowed.

This PR adds an extra Lexer rule to handle such input, and makes a tiny modification to the ```ASTNode```.

cc davies liancheng

Author: Herman van Hovell <[email protected]>

Closes apache#11052 from hvanhovell/SPARK-13157.
…ation web UI

Added a Cores column in the Executors UI

Author: Alex Bozarth <[email protected]>

Closes apache#11039 from ajbozarth/spark3611.
They seem redundant and we can simply use DataFrameReader/Writer. The new usage looks like:

```scala
val df = sqlContext.read.stream("...")
val handle = df.write.stream("...")
handle.stop()
```

Author: Reynold Xin <[email protected]>

Closes apache#11062 from rxin/SPARK-13166.
Best time is more stable than average time; also added a column for nanoseconds per row (which could be used to estimate the contribution of each component in a query).

Having best time and average time together gives more information (we can see a kind of variance).

Rate, time per row, and relative are all calculated using best time.

The result looks like this:
```
Intel(R) Core(TM) i7-4558U CPU  2.80GHz
rang/filter/sum:                    Best/Avg Time(ms)    Rate(M/s)   Per Row(ns)   Relative
-------------------------------------------------------------------------------------------
rang/filter/sum codegen=false          14332 / 16646         36.0          27.8       1.0X
rang/filter/sum codegen=true              845 /  940        620.0           1.6      17.0X
```

Author: Davies Liu <[email protected]>

Closes apache#11018 from davies/gen_bench.
Make an internal non-deprecated version of incBytesRead and incRecordsRead so we don't have unnecessary deprecation warnings in our build.

Right now incBytesRead and incRecordsRead are marked as deprecated and for internal use only. We should make private[spark] versions which are not deprecated and switch to those internally, so as to not clutter up the warning messages when building.
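
A hedged sketch of the shim pattern (names simplified; the `package` line exists only so `private[spark]` compiles):
```scala
package org.apache.spark

// Hedged sketch: the deprecated public method delegates to a non-deprecated
// private[spark] method, so internal callers avoid deprecation warnings.
class InputMetricsSketch {
  private var bytesRead: Long = 0L

  @deprecated("use the internal method", "2.0.0")
  def incBytesRead(v: Long): Unit = internalIncBytesRead(v)

  private[spark] def internalIncBytesRead(v: Long): Unit = {
    bytesRead += v
  }
}
```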

cc andrewor14 who did the initial deprecation

Author: Holden Karau <[email protected]>

Closes apache#11056 from holdenk/SPARK-13152-fix-task-metrics-deprecation-warnings.
This is a step towards consolidating `SQLContext` and `HiveContext`.

This patch extends the existing Catalog API added in apache#10982 to include methods for handling table partitions. In particular, a partition is identified by `PartitionSpec`, which is just a `Map[String, String]`. The Catalog is still not used by anything yet, but its API is now more or less complete and an implementation is fully tested.

About 200 lines are test code.

Author: Andrew Or <[email protected]>

Closes apache#11069 from andrewor14/catalog.
Minor fix for the API link in the ml OneVsRest guide.

Author: Yuhao Yang <[email protected]>

Closes apache#11068 from hhbyyh/onevsrestDoc.
…age number

JIRA: https://issues.apache.org/jira/browse/SPARK-13113

As we shift bits right, it looks like the bitwise AND operation is unnecessary.

Author: Liang-Chi Hsieh <[email protected]>

Closes apache#11002 from viirya/improve-decodepagenumber.
This is a small addendum to apache#10762 to make the code more robust against future changes.

Author: Reynold Xin <[email protected]>

Closes apache#11070 from rxin/SPARK-12828-natural-join.
In the current implementation the Mesos coarse scheduler does not wait for the Mesos tasks to complete before ending the driver. This causes a race where the task has to finish cleaning up before the Mesos driver terminates it with a SIGINT (and a SIGKILL after 3 seconds if the SIGINT doesn't work).

This PR causes the Mesos coarse scheduler to wait for the Mesos tasks to finish (with a timeout defined by `spark.mesos.coarse.shutdown.ms`).

This PR also fixes a regression caused by [SPARK-10987] whereby submitting a shutdown causes a race between the local shutdown procedure and the notification of the scheduler driver disconnection. If the scheduler driver disconnection wins the race, the coarse executor incorrectly exits with status 1 (instead of the proper status 0).

With this patch the Mesos coarse scheduler terminates properly, the executors clean up, and the tasks are reported as `FINISHED` in the Mesos console (as opposed to `KILLED` in < 1.6 or `FAILED` in 1.6 and later).

Author: Charles Allen <[email protected]>

Closes apache#10319 from drcrallen/SPARK-12330.
Building with Scala 2.11 results in the warning "trait SynchronizedBuffer in package mutable is deprecated: Synchronization via traits is deprecated as it is inherently unreliable. Consider java.util.concurrent.ConcurrentLinkedQueue as an alternative." Investigation shows we are already using ConcurrentLinkedQueue in other locations, so switch our uses of SynchronizedBuffer to ConcurrentLinkedQueue.
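
A hedged sketch of the replacement (illustrative, not a Spark code excerpt):
```scala
import java.util.concurrent.ConcurrentLinkedQueue
import scala.collection.JavaConverters._

// Hedged sketch: a lock-free queue instead of the deprecated
// `ArrayBuffer with SynchronizedBuffer` mixin.
val events = new ConcurrentLinkedQueue[String]()
events.add("stageCompleted") // thread-safe append
events.add("taskEnd")
val snapshot = events.asScala.toList // copy out for assertions
```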

Author: Holden Karau <[email protected]>

Closes apache#11059 from holdenk/SPARK-13164-replace-deprecated-synchronized-buffer-in-core.
Currently the Master always sets an application's initial executor limit to infinity. If the user specified `spark.dynamicAllocation.initialExecutors`, the config would not take effect. This is similar to apache#11047 but for standalone mode.
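
A hedged sketch of the configuration that now takes effect in standalone mode (values are illustrative):
```scala
import org.apache.spark.SparkConf

// Hedged sketch: with this change the Master honors the initial executor
// count instead of defaulting the limit to infinity.
val conf = new SparkConf()
  .set("spark.dynamicAllocation.enabled", "true")
  .set("spark.dynamicAllocation.initialExecutors", "4")
```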

Author: Andrew Or <[email protected]>

Closes apache#11054 from andrewor14/standalone-da-initial.
These were ignored because they are incorrectly written; they don't actually trigger stage retries, which is what the tests are testing. These tests are now rewritten to induce stage retries through fetch failures.

Note: there were 2 tests before and now there's only 1. What happened? It turns out that the case where we only resubmit a subset of the original missing partitions is very difficult to simulate in tests without potentially introducing flakiness. This is because the `DAGScheduler` removes all map outputs associated with a given executor when this happens, and we would need multiple executors to trigger this case, and sometimes the scheduler still removes map outputs from all executors.

Author: Andrew Or <[email protected]>

Closes apache#10969 from andrewor14/unignore-accum-test.
This commit exists to close the following pull requests on Github:

Closes apache#7971 (requested by yhuai)
Closes apache#8539 (requested by srowen)
Closes apache#8746 (requested by yhuai)
Closes apache#9288 (requested by andrewor14)
Closes apache#9321 (requested by andrewor14)
Closes apache#9935 (requested by JoshRosen)
Closes apache#10442 (requested by andrewor14)
Closes apache#10585 (requested by srowen)
Closes apache#10785 (requested by srowen)
Closes apache#10832 (requested by andrewor14)
Closes apache#10941 (requested by marmbrus)
Closes apache#11024 (requested by andrewor14)
Spark SQL should collapse adjacent `Repartition` operators and only keep the last one.
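
A hedged sketch of what the new rule buys (assuming a DataFrame `df` is in scope): adjacent repartitions collapse so only the last one determines the shuffle.
```scala
// Hedged sketch: after collapsing, this plan is equivalent to df.repartition(20);
// the intermediate repartition(10) no longer triggers its own shuffle.
val collapsed = df.repartition(10).repartition(20)
```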

Author: Josh Rosen <[email protected]>

Closes apache#11064 from JoshRosen/collapse-repartition.
@mgummelt force-pushed the executor_sizing branch 4 times, most recently from b587f8f to 7e3f39d on February 5, 2016 19:15
Support spark.executor.cores on Mesos.
@mgummelt closed this Feb 12, 2016
@mgummelt deleted the executor_sizing branch February 12, 2016 19:31