Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
879 commits
Select commit Hold shift + click to select a range
637a6c2
[SPARK-24663][STREAMING][TESTS] StreamingContextSuite: Wait until slo…
HeartSaVioR Sep 11, 2019
339b0f2
[SPARK-29045][SQL][TESTS] Drop table to avoid test failure in SQLMetr…
LantaoJin Sep 12, 2019
ac10d73
[SPARK-29079][INFRA] Enable GitHub Action on PR
dongjoon-hyun Sep 13, 2019
58ad3e6
[SPARK-26989][CORE][TEST][2.4] DAGSchedulerSuite: ensure listeners ar…
HeartSaVioR Sep 15, 2019
21649e3
[SPARK-27122][CORE][2.4] Jetty classes must not be return via getters…
ajithme Sep 15, 2019
b41795a
[SPARK-29087][CORE][STREAMING] Use DelegatingServletContextHandler to…
dongjoon-hyun Sep 15, 2019
1c57da3
[SPARK-25277][YARN] YARN applicationMaster metrics should not registe…
LucaCanali Dec 13, 2018
68e29ba
[SPARK-29046][SQL][2.4] Fix NPE in SQLConf.get when active SparkConte…
HeartSaVioR Sep 17, 2019
4dedd39
[SPARK-26713][CORE][2.4] Interrupt pipe IO threads in PipedRDD when t…
advancedxy Sep 18, 2019
00589bd
[SPARK-29104][CORE][TESTS] Fix PipedRDDSuite to use `eventually` to c…
dongjoon-hyun Sep 17, 2019
cc0f659
[SPARK-29124][CORE] Use MurmurHash3 `bytesHash(data, seed)` instead o…
dongjoon-hyun Sep 18, 2019
89a065d
[MINOR][SS][DOCS] Adapt multiple watermark policy comment to the reality
bartosz25 Sep 18, 2019
efcca57
[SPARK-29042][CORE][BRANCH-2.4] Sampling-based RDD with unordered inp…
viirya Sep 18, 2019
0770037
[SPARK-28616][INFRA] Improve merge-spark-pr script to warn WIP PRs an…
dongjoon-hyun Aug 5, 2019
60600c8
[SPARK-28857][INFRA] Clean up the comments of PR template during merging
dongjoon-hyun Aug 23, 2019
f146853
[SPARK-28683][BUILD][2.4] Upgrade Scala to 2.12.10
wangyum Sep 19, 2019
92189f2
[SPARK-28683][BUILD][FOLLOW-UP][2.4] Fix javadoc generation issue aft…
wangyum Sep 19, 2019
0e63603
[SPARK-29159][BUILD] Increase ReservedCodeCacheSize to 1G
dongjoon-hyun Sep 19, 2019
267d318
[SPARK-29165][SQL][TEST] Set log level of log generated code as ERROR…
HeartSaVioR Sep 19, 2019
7ea3195
[SPARK-29101][SQL][2.4] Fix count API for csv file when DROPMALFORMED…
sandeep-katta Sep 19, 2019
71b0562
[MINOR][BUILD][2.4] Fix 4 misc build warnings
srowen Sep 20, 2019
026e789
[SPARK-26003][SQL][2.4] Improve SQLAppStatusListener.aggregateMetrics…
mgaido91 Sep 20, 2019
0d26cfc
[SPARK-27460][TESTS][2.4] Running slowest test suites in their own fo…
gengliangwang Sep 20, 2019
25d4b3a
[MINOR][INFRA] Use java-version instead of version for GitHub Action
wangyum Sep 20, 2019
e21c52b
[SPARK-29199][INFRA] Add linters and license/dependency checkers to G…
dongjoon-hyun Sep 21, 2019
1b939ea
[SPARK-19147][CORE] Gracefully handle error in task after executor is…
Sep 21, 2019
c835ccb
[CORE][MINOR] Correct a log message in DAGScheduler
Sep 22, 2019
5fbb65d
[SPARK-29201][INFRA][2.4] Add Hadoop 2.6 combination to GitHub Action
dongjoon-hyun Sep 22, 2019
56cf17e
[SPARK-29177][CORE] fix zombie tasks after stage abort
adrian-wang Sep 23, 2019
bc78f98
[SPARK-29053][WEBUI][2.4] Sort does not work on some columns
amanomer Sep 23, 2019
866f763
fix compilation
cloud-fan Sep 23, 2019
328c4ec
[SPARK-25903][CORE] TimerTask should be synchronized on ContextBarrie…
viirya Sep 23, 2019
03079cd
[SPARK-28599][SQL][2.4] Fix `Duration` column sorting for ThriftServe…
wangyum Sep 23, 2019
05a32ca
[SPARK-28678][DOC] Specify that array indices start at 1 for function…
sheepstop Sep 24, 2019
5267e6e
[SPARK-29229][SQL] Change the additional remote repository in Isolate…
xuanyuanking Sep 24, 2019
e052cd5
[SPARK-23197][STREAMING][TESTS][2.4] Fix ReceiverSuite."receiver_life…
HeartSaVioR Sep 25, 2019
64bb083
[SPARK-29203][SQL][TESTS][2.4] Reduce shuffle partitions in SQLQueryT…
wangyum Sep 26, 2019
3dbe065
[SPARK-29213][SQL] Generate extra IsNotNull predicate in FilterExec
wangshuo128 Sep 27, 2019
361b605
[SPARK-29240][PYTHON] Pass Py4J column instance to support PySpark co…
HyukjinKwon Sep 27, 2019
99e503c
[SPARK-29263][SCHEDULER] Update `availableSlots` in `resourceOffers()…
juliuszsompolski Sep 27, 2019
9ae7393
[SPARK-29263][CORE][TEST][FOLLOWUP][2.4] Fix build failure of `TaskSc…
jiangxb1987 Sep 27, 2019
e12398c
[SPARK-29247][SQL] Redact sensitive information in when construct Hiv…
AngersZhuuuu Sep 29, 2019
7ea4b9f
[SPARK-29186][SQL] AliasIdentifier should be converted to Json in pre…
viirya Sep 30, 2019
332f9da
[SPARK-29186][SQL][2.4][FOLLOWUP] AliasIdentifier should be converted…
viirya Sep 30, 2019
3173439
[SPARK-29055][CORE] Update driver/executors' storage memory when bloc…
HeartSaVioR Oct 1, 2019
9cf7ea6
[SPARK-29244][CORE] Prevent freed page in BytesToBytesMap free again
viirya Oct 1, 2019
66c1d50
[SPARK-29244][CORE][FOLLOWUP] Fix compilation
dongjoon-hyun Oct 1, 2019
fd01c9e
[SPARK-29244][CORE][FOLLOWUP] Fix java lint error due to line length
dongjoon-hyun Oct 1, 2019
1560f6f
[SPARK-29203][TESTS][MINOR][FOLLOW UP] Add access modifier for sparkC…
xuanyuanking Oct 4, 2019
5992e29
[SPARK-29286][PYTHON][TESTS] Uses UTF-8 with 'replace' on errors at P…
HyukjinKwon Oct 4, 2019
daa1749
[SPARK-25753][CORE][2.4] Fix reading small files via BinaryFileRDD
10110346 Oct 4, 2019
008ee63
[SPARK-28938][K8S][2.4] Move to supported OpenJDK docker image for Ku…
viirya Oct 7, 2019
04b3e0e
[MINOR][BUILD] Fix an incorrect path in license file
beliefer Oct 8, 2019
4f46e8f
[SPARK-28917][CORE] Synchronize access to RDD mutable state
squito Oct 8, 2019
80cded3
[SPARK-29410][BUILD] Update commons-beanutils to 1.9.4
peter-toth Oct 12, 2019
b2f96a5
[SPARK-29445][CORE] Bump netty-all from 4.1.39.Final to 4.1.42.Final
Fokko Oct 12, 2019
90139f6
[SPARK-27259][CORE] Allow setting -1 as length for FileBlock
prasha2 Oct 16, 2019
65c0a78
[SPARK-27812][K8S][2.4] Bump K8S client version to 4.6.1
igorcalabria Oct 18, 2019
4d476ed
[SPARK-29494][SQL] Fix for ArrayOutofBoundsException while converting…
rahulsmahadev Oct 18, 2019
b094774
Revert "[SPARK-29494][SQL] Fix for ArrayOutofBoundsException while co…
zsxwing Oct 18, 2019
3d334ac
[SPARK-29494][SQL][2.4] Fix for ArrayOutofBoundsException while conve…
rahulsmahadev Oct 19, 2019
c0101de
[SPARK-28963][BUILD] Fall back to archive.apache.org in build/mvn for…
srowen Sep 4, 2019
92b9706
[SPARK-29556][CORE] Avoid putting request path in error response in E…
srowen Oct 22, 2019
7c9bdd7
[SPARK-29560][BUILD] Add typesafe bintray repo for sbt-mima-plugin
dongjoon-hyun Oct 22, 2019
b1ba6fa
[SPARK-21492][SQL][2.4] Fix memory leak in SortMergeJoin
xuanyuanking Oct 23, 2019
9838df2
[SPARK-21492][SQL][FOLLOW UP] Reimplement UnsafeExternalRowSorter in …
xuanyuanking Oct 24, 2019
ac72b0e
[SPARK-21287][SQL] Remove requirement of fetch_size>=0 from JDBCOptions
fuwhu Oct 24, 2019
be323d2
[SPARK-29530][SQL][2.4] Make SQLConf in SQL parse process thread safe
AngersZhuuuu Oct 25, 2019
f42a40e
[SPARK-29498][SQL][2.4] CatalogTable to HiveTable should not change t…
wangyum Oct 25, 2019
e180668
[SPARK-26985][CORE] Fix "access only some column of the all of column…
vibhutisawant Oct 29, 2019
fc7d918
[SPARK-28938][K8S][2.4][FOLLOWUP] Use `/usr/bin/tini` instead of `/sb…
Oct 29, 2019
4583d14
[SPARK-29637][CORE] Add description to Job SHS web API
gaborgsomogyi Oct 29, 2019
21580d9
[SPARK-29647][TESTS][2.4] Use Python 3.7 in GitHub Action to recover …
dongjoon-hyun Oct 30, 2019
5248651
[SPARK-29604][SQL][2.4] Force initialize SessionState before initiali…
HeartSaVioR Oct 30, 2019
042116c
[SPARK-29666][BUILD] Fix the publish release failure under dry-run mode
jiangxb1987 Oct 30, 2019
9b17a68
[SPARK-29651][SQL][2.4] Fix parsing of interval seconds fraction
MaxGekk Nov 1, 2019
3aad111
[SPARK-29677][DSTREAMS] amazon-kinesis-client 1.12.0
etspaceman Nov 2, 2019
4587ad6
[SPARK-24152][R][TESTS] Disable check-cran from run-tests.sh
dongjoon-hyun Nov 3, 2019
4d2ed47
Revert "[SPARK-24152][R][TESTS] Disable check-cran from run-tests.sh"
viirya Nov 3, 2019
8180364
[MINOR][DOCS][2.4] Fix pyspark documentation
mstill3 Nov 3, 2019
19e55c3
[SPARK-29743][SQL] sample should set needCopyResult to true if its ch…
cloud-fan Nov 4, 2019
0e558be
[MINOR][INFRA] Change the Github Actions build command to `mvn install`
wangyum Nov 6, 2019
465a596
[SPARK-29796][SQL][TESTS] `HiveExternalCatalogVersionsSuite` should i…
dongjoon-hyun Nov 7, 2019
e59370a
[SPARK-29781][BUILD][2.4] Override SBT Jackson dependency like Maven
dongjoon-hyun Nov 7, 2019
bef7c0f
[SPARK-29790][DOC] Note required port for Kube API
Tapped Nov 8, 2019
f42e741
[SPARK-29820][INFRA] Use GitHub Action Cache for `./.m2/repository/[c…
dongjoon-hyun Nov 10, 2019
a7b5746
[SPARK-29820][INFRA][FOLLOWUP][2.4] Use scala version instead of java…
dongjoon-hyun Nov 10, 2019
2cc56e0
[SPARK-28939][SQL][2.4] Propagate SQLConf for plans executed by toRdd
mgaido91 Nov 10, 2019
7459353
[SPARK-29850][SQL] sort-merge-join an empty table should not memory leak
cloud-fan Nov 12, 2019
e9df8b6
[SPARK-29875][PYTHON][SQL][2.4] Avoid to use deprecated pyarrow.open_…
HyukjinKwon Nov 13, 2019
7bdc76f
[SPARK-29682][SQL] Resolve conflicting attributes in Expand correctly
imback82 Nov 14, 2019
1cff055
[SPARK-26499][SQL][2.4] JdbcUtils.makeGetter does not handle ByteType
shivsood Nov 15, 2019
9c7e8be
[SPARK-29904][SQL][2.4] Parse timestamps in microsecond precision by …
MaxGekk Nov 15, 2019
eda6360
[SPARK-29644][SQL][2.4] Corrected ShortType and ByteType mapping to S…
shivsood Nov 17, 2019
ee6693e
[SPARK-29932][R][TESTS] lint-r should do non-zero exit in case of errors
dongjoon-hyun Nov 17, 2019
b4e7e50
[SPARK-29936][R][2.4] Fix SparkR lint errors and add lint-r GitHub Ac…
dongjoon-hyun Nov 18, 2019
7f2c88d
[MINOR][TESTS] Ignore GitHub Action and AppVeyor file changes in testing
dongjoon-hyun Nov 16, 2019
568fa69
Revert "[SPARK-29644][SQL][2.4] Corrected ShortType and ByteType mapp…
dongjoon-hyun Nov 18, 2019
dc2abe5
[SPARK-29918][SQL] RecordBinaryComparator should check endianness whe…
WangGuangxin Nov 19, 2019
47cb1f3
[SPARK-29949][SQL][2.4] Fix formatting of timestamps by JSON/CSV data…
MaxGekk Nov 19, 2019
1a26c8e
[SPARK-29964][BUILD] lintr github workflows failed due to buggy GnuPG
viirya Nov 19, 2019
a936522
[SPARK-29758][SQL][2.4] Fix truncation of requested string fields in …
MaxGekk Nov 20, 2019
12ec338
[SPARK-27558][CORE] Gracefully cleanup task when it fails with OOM ex…
Nov 21, 2019
eb87e53
[SPARK-29970][WEBUI] Preserver open/close state of Timelineview
sarutak Nov 24, 2019
6880ccd
[MINOR][INFRA] Use GitHub Action Cache for `build`
dongjoon-hyun Nov 24, 2019
94ddc2a
[SPARK-29971][CORE][2.4] Fix buffer leaks in `TransportFrameDecoder/T…
normanmaurer Nov 25, 2019
0dc22df
[SPARK-30030][INFRA] Use RegexChecker instead of TokenChecker to chec…
dongjoon-hyun Nov 25, 2019
00b61e3
Revert "[SPARK-28152][SQL][2.4] Mapped ShortType to SMALLINT and Floa…
HyukjinKwon Nov 29, 2019
cc86b72
Revert "Revert "[SPARK-28152][SQL][2.4] Mapped ShortType to SMALLINT …
HyukjinKwon Nov 29, 2019
76576b6
[SPARK-30050][SQL] analyze table and rename table should not erase hi…
viirya Dec 2, 2019
663441f
[SPARK-30082][SQL][2.4] Do not replace Zeros when replacing NaNs
johnhany97 Dec 4, 2019
3b64e2f
[SPARK-30129][CORE][2.4] Set client's id in TransportClient after suc…
Dec 5, 2019
6dff114
[SPARK-24666][ML] Fix infinity vectors produced by Word2Vec when numI…
viirya Dec 6, 2019
89d0f7c
[SPARK-30163][INFRA] Use Google Maven mirror in GitHub Action
dongjoon-hyun Dec 7, 2019
0884766
[SPARK-30163][INFRA][FOLLOWUP] Make `.m2` directory for cold start wi…
dongjoon-hyun Dec 7, 2019
74d8cf1
[SPARK-29152][2.4][CORE] Executor Plugin shutdown when dynamic alloca…
iRakson Dec 11, 2019
b0d5eb0
[SPARK-30198][CORE] BytesToBytesMap does not grow internal long array…
viirya Dec 11, 2019
f4e95ca
[MINOR][SS][DOC] Fix the ss-kafka doc for availability of 'minPartiti…
HeartSaVioR Dec 11, 2019
c411269
[SPARK-30238][SQL][2.4] hive partition pruning can only support strin…
cloud-fan Dec 13, 2019
f0d6989
[SPARK-30263][CORE] Don't log potentially sensitive value of non-Spar…
srowen Dec 14, 2019
0bf0e44
Revert "[SPARK-29152][2.4][CORE] Executor Plugin shutdown when dynami…
dongjoon-hyun Dec 15, 2019
cd6a0c4
[MINOR][DOCS] Fix documentation for slide function
bboutkov Dec 16, 2019
6d90298
[SPARK-25392][CORE][WEBUI] Prevent error page when accessing pools pa…
shahidki31 Dec 16, 2019
185ab4b
[SPARK-30274][CORE] Avoid BytesToBytesMap lookup hang forever when ho…
viirya Dec 17, 2019
bbeb191
[SPARK-30236][SQL][DOCS][2.4] Clarify date and time patterns supporte…
johnhany97 Dec 19, 2019
07caebf
[SPARK-17398][SQL] Fix ClassCastException when querying partitioned J…
wypoon Dec 20, 2019
871aacc
[SPARK-29918][SQL][FOLLOWUP][TEST] Fix arrayOffset in `RecordBinaryCo…
jiangxb1987 Dec 19, 2019
d10e414
[SPARK-30318][CORE] Upgrade jetty to 9.3.27.v20190418
sandeep-katta Dec 21, 2019
bfa851c
[SPARK-30269][SQL][2.4] Should use old partition stats to decide whet…
wzhfy Dec 23, 2019
db32408
[SPARK-30333][CORE][BUILD] Upgrade jackson-databind to 2.6.7.3
sandeep-katta Dec 24, 2019
03cea11
[SPARK-30339][SQL][2.4] Avoid to fail twice in function lookup
wzhfy Dec 31, 2019
f989a35
[SPARK-26560][SQL][2.4] Spark should be able to run Hive UDF using ja…
HeartSaVioR Jan 2, 2020
16f8fae
[SPARK-30285][CORE] Fix deadlock between LiveListenerBus#stop and Asy…
wangshuo128 Jan 3, 2020
f935e4d
[SPARK-30225][CORE] Correct read() behavior past EOF in NioBufferedFi…
Jan 3, 2020
d1fa3ed
[SPARK-30450][2.4][INFRA] Exclude .git folder for python linter
ericfchang Jan 8, 2020
e52ae4e
[SPARK-30450][INFRA][FOLLOWUP][2.4] Fix git folder regex for windows …
ericfchang Jan 9, 2020
6ac3659
[SPARK-30410][SQL][2.4] Calculating size of table with large number o…
wzhfy Jan 9, 2020
0a5757e
[SPARK-30447][SQL][2.4] Constant propagation nullability issue
peter-toth Jan 10, 2020
3b029d9
[SPARK-30489][BUILD] Make build delete pyspark.zip file properly
jeff303 Jan 11, 2020
9b53480
[SPARK-30312][SQL][2.4] Preserve path permission and acl when truncat…
viirya Jan 11, 2020
d84a96c
[SPARK-30312][SQL][FOLLOWUP] Use inequality check instead to be robust
viirya Jan 11, 2020
b249634
[SPARK-30478][CORE][DOCS] Fix Memory Package documentation
sddyljsx Jan 12, 2020
d1b527f
[SPARK-30458][WEBUI] Fix Wrong Executor Computing Time in Time Line o…
sddyljsx Jan 12, 2020
69de7f3
[SPARK-28152][SQL][FOLLOWUP] Add a legacy conf for old MsSqlServerDia…
dongjoon-hyun Jan 13, 2020
33bd2be
Preparing Spark release v2.4.5-rc1
dongjoon-hyun Jan 13, 2020
b149046
Preparing development version 2.4.6-SNAPSHOT
dongjoon-hyun Jan 13, 2020
390917b
[SPARK-30246][CORE] OneForOneStreamManager might leak memory in conne…
hensg Jan 15, 2020
036568d
[SPARK-30325][CORE][2.4] markPartitionCompleted cause task status inc…
Jan 16, 2020
830a4ec
[SPARK-30312][SQL][FOLLOWUP] Rename conf by adding `.enabled`
viirya Jan 16, 2020
47a73b2
[SPARK-30491][INFRA][2.4] Enable dependency audit files to tell depen…
mengCareers Jan 16, 2020
94b5d3f
Revert "[SPARK-27868][CORE] Better default value and documentation fo…
dongjoon-hyun Jan 16, 2020
d6261a1
[SPARK-29450][SS][2.4] Measure the number of output rows for streamin…
HeartSaVioR Jan 16, 2020
60a908e
[SPARK-29708][SQL][2.4] Correct aggregated values when grouping sets …
maropu Jan 16, 2020
ac54156
[SPARK-30534][INFRA] Use mvn in `dev/scalastyle`
dongjoon-hyun Jan 17, 2020
7d56dc7
[SPARK-30333][BUILD][FOLLOWUP][2.4] Update sbt build together
dongjoon-hyun Jan 17, 2020
4df509f
[SPARK-30312][DOCS][FOLLOWUP] Add a migration guide
dongjoon-hyun Jan 17, 2020
3ff5021
[SPARK-28152][DOCS][FOLLOWUP] Add a migration guide for MsSQLServer J…
dongjoon-hyun Jan 18, 2020
9da2bc4
[MINOR][DOCS] Remove note about -T for parallel build
srowen Jan 18, 2020
fbd95b1
[MINOR][HIVE] Pick up HIVE-22708 HTTP transport fix
srowen Jan 18, 2020
ebaa6fe
[SPARK-30572][BUILD] Add a fallback Maven repository
dongjoon-hyun Jan 20, 2020
90dea83
Revert "[SPARK-30534][INFRA] Use mvn in `dev/scalastyle`"
HyukjinKwon Jan 22, 2020
0300d4b
[SPARK-30553][DOCS] fix structured-streaming java example error
Jan 22, 2020
c6b02cf
[SPARK-30601][BUILD][2.4] Add a Google Maven Central as a primary rep…
HyukjinKwon Jan 23, 2020
2fc562c
[SPARK-30556][SQL][2.4] Copy sparkContext.localproperties to child th…
ajithme Jan 23, 2020
d7be535
[SPARK-30630][ML][2.4] Deprecate numTrees in GBT in 2.4.5
huaxingao Jan 24, 2020
1b3ddcf
[SPARK-30645][SPARKR][TESTS][WINDOWS] Move Unicode test data to exter…
zero323 Jan 26, 2020
81ea5a4
[SPARK-29777][FOLLOW-UP][SPARKR] Remove no longer valid test for recu…
zero323 Jan 26, 2020
5f1cb2f
Revert "[SPARK-29777][FOLLOW-UP][SPARKR] Remove no longer valid test …
HyukjinKwon Jan 26, 2020
ad9f578
[SPARK-30633][SQL] Append L to seed when type is LongType
patrickcording Jan 27, 2020
6c29070
[SPARK-23435][2.4][SPARKR][TESTS] Update testthat to >= 2.0.0
zero323 Jan 29, 2020
12f4492
[SPARK-30512] Added a dedicated boss event loop group
otterc Jan 29, 2020
c7c9f9e
[SPARK-30310][CORE][2.4] Resolve missing match case in SparkUncaughtE…
tinhto-000 Jan 29, 2020
b93f250
[SPARK-29367][DOC][2.4] Add compatibility note for Arrow 0.15.0 to SQ…
BryanCutler Jan 29, 2020
eeef0e7
[SPARK-29578][TESTS][2.4] Add "8634" as another skipped day for Kwaja…
srowen Jan 30, 2020
4c3c1d6
[SPARK-29890][SQL][2.4] DataFrameNaFunctions.fill should handle dupli…
imback82 Jan 31, 2020
c7c2bda
[SPARK-30065][SQL][2.4] DataFrameNaFunctions.drop should handle dupli…
imback82 Jan 31, 2020
cb4a736
[SPARK-30704][INFRA] Use jekyll-redirect-from 0.15.0 instead of the l…
dongjoon-hyun Feb 2, 2020
cee4ecb
Preparing Spark release v2.4.5-rc2
dongjoon-hyun Feb 2, 2020
9bf11ed
Preparing development version 2.4.6-SNAPSHOT
dongjoon-hyun Feb 2, 2020
f674d88
[SPARK-30733][R][HOTFIX] Fix SparkR tests per testthat and R version …
HyukjinKwon Feb 5, 2020
40062e3
[SPARK-30737][SPARK-27262][R][BUILD] Reenable CRAN check with UTF-8 e…
HyukjinKwon Feb 6, 2020
e627b0a
[MINOR][SQL][DOCS][2.4] Fix the timestamp pattern in the example for …
MaxGekk Feb 6, 2020
cf9f955
[SPARK-30797][SQL] Set tradition user/group/other permission to ACL e…
viirya Feb 12, 2020
0fa72a0
[PYSPARK][DOCS][MINOR] Changed `:func:` to `:attr:` Sphinx roles, fix…
DavidToneian Feb 14, 2020
a312b85
[SPARK-30823][PYTHON][DOCS] Set `%PYTHONPATH%` when building PySpark …
DavidToneian Feb 14, 2020
6294cb8
[SPARK-30826][SQL] Respect reference case in `StringStartsWith` pushe…
MaxGekk Feb 15, 2020
c8f9ce8
[SPARK-30834][DOCS][PYTHON][2.4] Add note for recommended pandas and …
BryanCutler Feb 17, 2020
fecc7d5
[SPARK-30793][SQL][2.4] Fix truncations of timestamps before the epoc…
MaxGekk Feb 18, 2020
86e01c0
[SPARK-30857][SQL][2.4] Fix truncations of timestamps before the epoc…
MaxGekk Feb 18, 2020
7285eea
[SPARK-30811][SQL][2.4] CTE should not cause stack overflow when it r…
hvanhovell Feb 18, 2020
6e0c116
[SPARK-30731] Update deprecated Mkdocs option
nchammas Feb 19, 2020
c80b79f
[SPARK-30763][SQL][2.4] Fix java.lang.IndexOutOfBoundsException No gr…
beliefer Feb 19, 2020
8597a56
[SPARK-30556][SQL][BACKPORT-2.4] Reset the status changed in SQLExecu…
xuanyuanking Feb 19, 2020
1b793ed
[SPARK-30907][DOCS] Revise the doc of spark.ui.retainedTasks
gengliangwang Feb 21, 2020
8fdd039
[MINOR][DOCS] Fix ForEachWriter Java example
roland-ondeviceresearch Feb 22, 2020
b302caf
[SPARK-30944][BUILD] Update URL for Google Cloud Storage mirror of Ma…
JoshRosen Feb 25, 2020
56fa200
[SPARK-30759][SQL][3.0] Fix cache initialization in StringRegexExpres…
MaxGekk Feb 26, 2020
b0a2c17
[MINOR][BUILD] Fix make-distribution.sh to show usage without 'echo' cmd
yaooqinn Feb 26, 2020
a549a07
[SPARK-23435][INFRA][FOLLOW-UP] Remove unnecessary dependency in AppV…
HyukjinKwon Feb 27, 2020
2749043
[MINOR][ML] Fix confusing error message in VectorAssembler
Saluev Feb 27, 2020
7574d99
[SPARK-30968][BUILD] Upgrade aws-java-sdk-sts to 1.11.655
dongjoon-hyun Feb 28, 2020
ff5ba49
[SPARK-30970][K8S][CORE][2.4] Fix NPE while resolving k8s master url
yaooqinn Feb 28, 2020
0d1664c
[SPARK-29419][SQL] Fix Encoder thread-safety bug in createDataset(Seq)
JoshRosen Mar 2, 2020
cd8f86a
[SPARK-30813][ML] Fix Matrices.sprand comments
xwu-intel Mar 2, 2020
0b71b4d
[SPARK-31003][TESTS] Fix incorrect uses of assume() in tests
JoshRosen Mar 2, 2020
7216099
[SPARK-30993][SQL][2.4] Use its sql type for UDT when checking the ty…
HeartSaVioR Mar 3, 2020
f4c8c48
[SPARK-30998][SQL][2.4] ClassCastException when a generator having ne…
maropu Mar 3, 2020
0ea91da
[MINOR][DOCS] ForeachBatch java example fix
roland-ondeviceresearch Mar 3, 2020
1c17ede
[MINOR][CORE] Expose the alias -c flag of --conf for spark-submit
yaooqinn Mar 5, 2020
7c237cc
[MINOR][SQL] Remove an ignored test from JsonSuite
MaxGekk Mar 6, 2020
f378c7f
[SPARK-30941][PYSPARK] Add a note to asDict to document its behavior …
viirya Mar 9, 2020
8e1021d
[SPARK-31095][BUILD][2.4] Upgrade netty-all to 4.1.47.Final
dongjoon-hyun Mar 11, 2020
c017422
[SPARK-29295][SQL][2.4] Insert overwrite to Hive external table parti…
viirya Mar 12, 2020
e6bcaaa
[SPARK-31130][BUILD] Use the same version of `commons-io` in SBT
dongjoon-hyun Mar 12, 2020
51ccb6f
[SPARK-31144][SQL][2.4] Wrap Error with QueryExecutionException to no…
zsxwing Mar 13, 2020
26ad3fe
[SPARK-31163][SQL] TruncateTableCommand with acl/permission should ha…
Ngone51 Mar 16, 2020
6a60c66
[MINOR][SQL] Update the DataFrameWriter.bucketBy comment
maropu Mar 17, 2020
73cc8b5
[SPARK-31164][SQL][2.4] Inconsistent rdd and output partitioning for …
wzhfy Mar 17, 2020
244405f
[SPARK-26293][SQL][2.4] Cast exception when having python udf in subq…
cloud-fan Mar 20, 2020
223b9fb
[SPARK-31231][BUILD] Explicitly setuptools version as 46.0.0 in pip p…
HyukjinKwon Mar 24, 2020
e37f664
Revert "[SPARK-31231][BUILD] Explicitly setuptools version as 46.0.0 …
dongjoon-hyun Mar 24, 2020
4381ad5
[SPARK-30494][SQL][2.4] Fix cached data leakage during replacing an e…
LantaoJin Mar 24, 2020
916a25a
[SPARK-31258][BUILD] Pin the avro version in SBT
yaooqinn Mar 26, 2020
844f207
[SPARK-31231][BUILD][FOLLOW-UP] Set the upper bound (before 46.1.0) f…
HyukjinKwon Mar 26, 2020
4217f75
Revert "[SPARK-31258][BUILD] Pin the avro version in SBT"
dongjoon-hyun Mar 26, 2020
801d6a9
[SPARK-31261][SQL] Avoid npe when reading bad csv input with `columnN…
wzhfy Mar 29, 2020
f05ac28
[SPARK-31293][DSTREAMS][KINESIS][DOC] Fix wrong examples and help mes…
sekikn Mar 29, 2020
4add8ad
[SPARK-31101][BUILD][2.4] Upgrade Janino to 3.0.16
HeartSaVioR Mar 30, 2020
e226f68
[SPARK-31306][DOCS] update rand() function documentation to indicate …
Smeb Mar 31, 2020
22e0a5a
[SPARK-31312][SQL][2.4] Cache Class instance for the UDF instance in …
HeartSaVioR Apr 1, 2020
aa9701b
[SPARK-31231][BUILD] Unset setuptools version in pip packaging test
HyukjinKwon Apr 3, 2020
fd6b0bc
[SPARK-25102][SQL][2.4] Write Spark version to ORC/Parquet file metadata
dongjoon-hyun Apr 7, 2020
1a2c233
[SPARK-31327][SQL][2.4] Write Spark version into Avro file metadata
cloud-fan Apr 8, 2020
9657575
[SPARK-31382][BUILD] Show a better error message for different python…
HyukjinKwon Apr 9, 2020
26d5e8f
[SPARK-31422][CORE] Fix NPE when BlockManagerSource is used after Blo…
dongjoon-hyun Apr 11, 2020
1c65b1d
[SPARK-31422][CORE][FOLLOWUP] Fix a test case compilation error
dongjoon-hyun Apr 11, 2020
775e958
[SPARK-31420][WEBUI][2.4] Infinite timeline redraw in job details page
sarutak Apr 15, 2020
49abdc4
[SPARK-31186][PYSPARK][SQL][2.4] toPandas should not fail on duplicat…
viirya Apr 15, 2020
d34590c
[SPARK-31441][PYSPARK][SQL][2.4] Support duplicated column names for …
ueshin Apr 15, 2020
ea75c15
[SPARK-31420][WEBUI][FOLLOWUP] Make locale of timeline-view 'en'
sarutak Apr 16, 2020
9416b7c
Apply appropriate RPC handler to receive, receiveStream when auth ena…
srowen Apr 17, 2020
1af3b48
[SPARK-31234][SQL][2.4] ResetCommand should not affect static SQL Con…
gatorsmile Apr 19, 2020
43cc620
[SPARK-31256][SQL] DataFrameNaFunctions.drop should work for nested c…
imback82 Apr 20, 2020
5183984
[SPARK-31503][SQL][2.4] fix the SQL string of the TRIM functions
cloud-fan Apr 22, 2020
2fefb60
[SPARK-30199][DSTREAM] Recover `spark.(ui|blockManager).port` from ch…
aaruna Apr 24, 2020
a2a0c52
[SPARK-31532][SQL] Builder should not propagate static sql configs to…
yaooqinn Apr 24, 2020
0477f21
[SPARK-31532][SPARK-31234][SQL][2.4][FOLLOWUP] Use lowercases for GLO…
maropu Apr 25, 2020
5e6bcca
[SPARK-31563][SQL] Fix failure of InSet.sql for collections of Cataly…
MaxGekk Apr 25, 2020
1cd8167
catalogString distinguish Vectors in ml and mllib
TJX2014 Apr 26, 2020
e7834c4
improve code style and import fix to mantain as former
TJX2014 Apr 26, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
104 changes: 104 additions & 0 deletions .github/workflows/branch-2.4.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,104 @@
name: branch-2.4

on:
push:
branches:
- branch-2.4
pull_request:
branches:
- branch-2.4

jobs:
build:

runs-on: ubuntu-latest
strategy:
matrix:
scala: [ '2.11', '2.12' ]
hadoop: [ 'hadoop-2.6', 'hadoop-2.7' ]
name: Build Spark with Scala ${{ matrix.scala }} / Hadoop ${{ matrix.hadoop }}

steps:
- uses: actions/checkout@master
# We split caches because GitHub Action Cache has a 400MB-size limit.
- uses: actions/cache@v1
with:
path: build
key: build-${{ hashFiles('**/pom.xml') }}
restore-keys: |
build-
- uses: actions/cache@v1
with:
path: ~/.m2/repository/com
key: ${{ matrix.scala }}-${{ matrix.hadoop }}-maven-com-${{ hashFiles('**/pom.xml') }}
restore-keys: |
${{ matrix.scala }}-${{ matrix.hadoop }}-maven-com-
- uses: actions/cache@v1
with:
path: ~/.m2/repository/org
key: ${{ matrix.scala }}-${{ matrix.hadoop }}-maven-org-${{ hashFiles('**/pom.xml') }}
restore-keys: |
${{ matrix.scala }}-${{ matrix.hadoop }}-maven-org-
- name: Set up JDK 8
uses: actions/setup-java@v1
with:
java-version: '1.8'
- name: Change to Scala ${{ matrix.scala }}
run: |
dev/change-scala-version.sh ${{ matrix.scala }}
- name: Build with Maven
run: |
export MAVEN_OPTS="-Xmx2g -XX:ReservedCodeCacheSize=512m -Dorg.slf4j.simpleLogger.defaultLogLevel=WARN"
export MAVEN_CLI_OPTS="--no-transfer-progress"
mkdir -p ~/.m2
./build/mvn $MAVEN_CLI_OPTS -DskipTests -Pyarn -Pmesos -Pkubernetes -Phive -Phive-thriftserver -Pscala-${{ matrix.scala }} -P${{ matrix.hadoop }} -Phadoop-cloud install
rm -rf ~/.m2/repository/org/apache/spark


lint:
runs-on: ubuntu-latest
name: Linters (Java/Scala/Python), licenses, dependencies
steps:
- uses: actions/checkout@master
- uses: actions/setup-java@v1
with:
java-version: '1.8'
- uses: actions/setup-python@v1
with:
python-version: '3.7'
architecture: 'x64'
- name: Scala
run: ./dev/lint-scala
- name: Java
run: ./dev/lint-java
- name: Python
run: |
pip install flake8 sphinx numpy
./dev/lint-python
- name: License
run: ./dev/check-license
- name: Dependencies
run: ./dev/test-dependencies.sh

lintr:
runs-on: ubuntu-latest
name: Linter (R)
steps:
- uses: actions/checkout@master
- uses: actions/setup-java@v1
with:
java-version: '1.8'
- name: install R
run: |
echo 'deb https://cloud.r-project.org/bin/linux/ubuntu bionic-cran35/' | sudo tee -a /etc/apt/sources.list
curl -sL "https://keyserver.ubuntu.com/pks/lookup?op=get&search=0xE298A3A825C0D65DFD57CBB651716619E084DAB9" | sudo apt-key add
sudo apt-get update
sudo apt-get install -y r-base r-base-dev libcurl4-openssl-dev
- name: install R packages
run: |
sudo Rscript -e "install.packages(c('curl', 'xml2', 'httr', 'devtools', 'testthat', 'knitr', 'rmarkdown', 'roxygen2', 'e1071', 'survival'), repos='https://cloud.r-project.org/')"
sudo Rscript -e "devtools::install_github('jimhester/[email protected]')"
- name: package and install SparkR
run: ./R/install-dev.sh
- name: lint-r
run: ./dev/lint-r
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ project/plugins/project/build.properties
project/plugins/src_managed/
project/plugins/target/
python/lib/pyspark.zip
python/.eggs/
python/deps
python/test_coverage/coverage_data
python/test_coverage/htmlcov
Expand Down
50 changes: 0 additions & 50 deletions .travis.yml

This file was deleted.

2 changes: 1 addition & 1 deletion LICENSE
Original file line number Diff line number Diff line change
Expand Up @@ -243,7 +243,7 @@ MIT License
core/src/main/resources/org/apache/spark/ui/static/dagre-d3.min.js
core/src/main/resources/org/apache/spark/ui/static/*dataTables*
core/src/main/resources/org/apache/spark/ui/static/graphlib-dot.min.js
ore/src/main/resources/org/apache/spark/ui/static/jquery*
core/src/main/resources/org/apache/spark/ui/static/jquery*
core/src/main/resources/org/apache/spark/ui/static/sorttable.js
docs/js/vendor/anchor.min.js
docs/js/vendor/jquery*
Expand Down
2 changes: 1 addition & 1 deletion LICENSE-binary
Original file line number Diff line number Diff line change
Expand Up @@ -305,7 +305,6 @@ com.google.code.gson:gson
com.google.inject:guice
com.google.inject.extensions:guice-servlet
com.twitter:parquet-hadoop-bundle
commons-beanutils:commons-beanutils-core
commons-cli:commons-cli
commons-dbcp:commons-dbcp
commons-io:commons-io
Expand Down Expand Up @@ -468,6 +467,7 @@ Common Development and Distribution License (CDDL) 1.1
------------------------------------------------------

javax.annotation:javax.annotation-api https://jcp.org/en/jsr/detail?id=250
javax.el:javax.el-api https://javaee.github.io/uel-ri/
javax.servlet:javax.servlet-api https://javaee.github.io/servlet-spec/
javax.transaction:jta http://www.oracle.com/technetwork/java/index.html
javax.ws.rs:javax.ws.rs-api https://github.com/jax-rs
Expand Down
2 changes: 1 addition & 1 deletion R/pkg/.lintr
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
linters: with_defaults(line_length_linter(100), multiple_dots_linter = NULL, object_name_linter = NULL, camel_case_linter = NULL, open_curly_linter(allow_single_line = TRUE), closed_curly_linter(allow_single_line = TRUE))
linters: with_defaults(line_length_linter(100), multiple_dots_linter = NULL, object_name_linter = NULL, camel_case_linter = NULL, open_curly_linter(allow_single_line = TRUE), closed_curly_linter(allow_single_line = TRUE), object_usage_linter = NULL, cyclocomp_linter = NULL)
exclusions: list("inst/profile/general.R" = 1, "inst/profile/shell.R")
11 changes: 6 additions & 5 deletions R/pkg/DESCRIPTION
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
Package: SparkR
Type: Package
Version: 2.4.0
Title: R Frontend for Apache Spark
Description: Provides an R Frontend for Apache Spark.
Version: 2.4.6
Title: R Front End for 'Apache Spark'
Description: Provides an R Front end for 'Apache Spark' <https://spark.apache.org>.
Authors@R: c(person("Shivaram", "Venkataraman", role = c("aut", "cre"),
email = "[email protected]"),
person("Xiangrui", "Meng", role = "aut",
Expand All @@ -11,8 +11,8 @@ Authors@R: c(person("Shivaram", "Venkataraman", role = c("aut", "cre"),
email = "[email protected]"),
person(family = "The Apache Software Foundation", role = c("aut", "cph")))
License: Apache License (== 2.0)
URL: http://www.apache.org/ http://spark.apache.org/
BugReports: http://spark.apache.org/contributing.html
URL: https://www.apache.org/ https://spark.apache.org/
BugReports: https://spark.apache.org/contributing.html
SystemRequirements: Java (== 8)
Depends:
R (>= 3.0),
Expand Down Expand Up @@ -61,3 +61,4 @@ Collate:
RoxygenNote: 5.0.1
VignetteBuilder: knitr
NeedsCompilation: no
Encoding: UTF-8
1 change: 0 additions & 1 deletion R/pkg/NAMESPACE
Original file line number Diff line number Diff line change
Expand Up @@ -313,7 +313,6 @@ exportMethods("%<=>%",
"lower",
"lpad",
"ltrim",
"map_entries",
"map_from_arrays",
"map_keys",
"map_values",
Expand Down
9 changes: 4 additions & 5 deletions R/pkg/R/DataFrame.R
Original file line number Diff line number Diff line change
Expand Up @@ -503,7 +503,6 @@ setMethod("createOrReplaceTempView",
#' @param x A SparkDataFrame
#' @param tableName A character vector containing the name of the table
#'
#' @family SparkDataFrame functions
#' @seealso \link{createOrReplaceTempView}
#' @rdname registerTempTable-deprecated
#' @name registerTempTable
Expand Down Expand Up @@ -2199,7 +2198,7 @@ setMethod("mutate",

# The last column of the same name in the specific columns takes effect
deDupCols <- list()
for (i in 1:length(cols)) {
for (i in seq_len(length(cols))) {
deDupCols[[ns[[i]]]] <- alias(cols[[i]], ns[[i]])
}

Expand Down Expand Up @@ -2363,7 +2362,7 @@ setMethod("arrange",
# builds a list of columns of type Column
# example: [[1]] Column Species ASC
# [[2]] Column Petal_Length DESC
jcols <- lapply(seq_len(length(decreasing)), function(i){
jcols <- lapply(seq_len(length(decreasing)), function(i) {
if (decreasing[[i]]) {
desc(getColumn(x, by[[i]]))
} else {
Expand Down Expand Up @@ -2695,7 +2694,7 @@ genAliasesForIntersectedCols <- function(x, intersectedColNames, suffix) {
col <- getColumn(x, colName)
if (colName %in% intersectedColNames) {
newJoin <- paste(colName, suffix, sep = "")
if (newJoin %in% allColNames){
if (newJoin %in% allColNames) {
stop("The following column name: ", newJoin, " occurs more than once in the 'DataFrame'.",
"Please use different suffixes for the intersected columns.")
}
Expand Down Expand Up @@ -3394,7 +3393,7 @@ setMethod("str",
cat(paste0("'", class(object), "': ", length(names), " variables:\n"))

if (nrow(localDF) > 0) {
for (i in 1 : ncol(localDF)) {
for (i in seq_len(ncol(localDF))) {
# Get the first elements for each column

firstElements <- if (types[i] == "character") {
Expand Down
7 changes: 4 additions & 3 deletions R/pkg/R/SQLContext.R
Original file line number Diff line number Diff line change
Expand Up @@ -254,7 +254,7 @@ createDataFrame.default <- function(data, schema = NULL, samplingRatio = 1.0,
as.list(schema)
}
if (is.null(names)) {
names <- lapply(1:length(row), function(x) {
names <- lapply(seq_len(length(row)), function(x) {
paste("_", as.character(x), sep = "")
})
}
Expand All @@ -270,7 +270,7 @@ createDataFrame.default <- function(data, schema = NULL, samplingRatio = 1.0,
})

types <- lapply(row, infer_type)
fields <- lapply(1:length(row), function(i) {
fields <- lapply(seq_len(length(row)), function(i) {
structField(names[[i]], types[[i]], TRUE)
})
schema <- do.call(structType, fields)
Expand Down Expand Up @@ -655,7 +655,8 @@ loadDF <- function(x = NULL, ...) {
#'
#' @param url JDBC database url of the form \code{jdbc:subprotocol:subname}
#' @param tableName the name of the table in the external database
#' @param partitionColumn the name of a column of integral type that will be used for partitioning
#' @param partitionColumn the name of a column of numeric, date, or timestamp type
#' that will be used for partitioning.
#' @param lowerBound the minimum value of \code{partitionColumn} used to decide partition stride
#' @param upperBound the maximum value of \code{partitionColumn} used to decide partition stride
#' @param numPartitions the number of partitions, This, along with \code{lowerBound} (inclusive),
Expand Down
1 change: 0 additions & 1 deletion R/pkg/R/catalog.R
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,6 @@ createExternalTable <- function(x, ...) {
#' @param ... additional named parameters as options for the data source.
#' @return A SparkDataFrame.
#' @rdname createTable
#' @seealso \link{createExternalTable}
#' @examples
#'\dontrun{
#' sparkR.session()
Expand Down
Loading