Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1075 commits
Select commit Hold shift + click to select a range
c5e83ab
[SPARK-25496][SQL] Deprecate from_utc_timestamp and to_utc_timestamp
MaxGekk Apr 2, 2019
0b150f8
[SPARK-26224][SQL] Advice the user when creating many project on subs…
mgaido91 Apr 2, 2019
a0d807d
[SPARK-26856][PYSPARK][FOLLOWUP] Fix UT failure due to wrong patterns…
dongjoon-hyun Apr 2, 2019
d575a45
Revert "[SPARK-25496][SQL] Deprecate from_utc_timestamp and to_utc_ti…
dongjoon-hyun Apr 2, 2019
d4420b4
[SPARK-27323][CORE][SQL][STREAMING] Use Single-Abstract-Method suppor…
srowen Apr 2, 2019
57aff93
[SPARK-26998][CORE] Remove SSL configuration from executors
gaborgsomogyi Apr 2, 2019
13c5c1f
[SPARK-27180][BUILD][YARN] Fix testing issues with yarn module in Had…
wangyum Apr 2, 2019
949d712
[SPARK-27346][SQL] Loosen the newline assert condition on 'examples' …
HyukjinKwon Apr 2, 2019
d7dd59a
[SPARK-26224][SQL][PYTHON][R][FOLLOW-UP] Add notes about many project…
HyukjinKwon Apr 2, 2019
3628242
[MINOR][DSTREAMS] Add DStreamCheckpointData.cleanup warning if delete…
gaborgsomogyi Apr 2, 2019
b8b5acd
[SPARK-19712][SQL][FOLLOW-UP] Don't do partial pushdown when pushing …
dilipbiswal Apr 3, 2019
1d20d13
[SPARK-25496][SQL] Deprecate from_utc_timestamp and to_utc_timestamp
MaxGekk Apr 3, 2019
3286bff
[SPARK-27255][SQL] Report error when illegal expressions are hosted b…
dilipbiswal Apr 3, 2019
1bc6723
[SPARK-27344][SQL][TEST] Support the LocalDate and Instant classes in…
MaxGekk Apr 3, 2019
d04a737
[MINOR][DOC][SQL] Remove out-of-date doc about ORC in DataFrameReader…
viirya Apr 3, 2019
ffb362a
[SPARK-19712][SQL][FOLLOW-UP] reduce code duplication
cloud-fan Apr 3, 2019
b517636
Revert "[SPARK-27278][SQL] Optimize GetMapValue when the map is a fol…
dongjoon-hyun Apr 3, 2019
69dd44a
[SPARK-27216][CORE] Upgrade RoaringBitmap to 0.7.45 to fix Kryo unsaf…
LantaoJin Apr 4, 2019
6c4552c
[SPARK-27338][CORE] Fix deadlock in UnsafeExternalSorter.SpillableIte…
Apr 4, 2019
5c50f68
[SPARK-26811][SQL][FOLLOWUP] fix some documentation
cloud-fan Apr 4, 2019
b56e433
[SPARK-27338][CORE][FOLLOWUP] remove trailing space
cloud-fan Apr 4, 2019
1d95dea
[SPARK-27349][SQL] Dealing with TimeVars removed in Hive 2.x
wangyum Apr 4, 2019
f7bd1ab
[SPARK-26811][SQL][FOLLOWUP] some more document fixes
cloud-fan Apr 4, 2019
938d954
[SPARK-27382][SQL][TEST] Update Spark 2.4.x testing in HiveExternalCa…
dongjoon-hyun Apr 4, 2019
0e44a51
[SPARK-24345][SQL] Improve ParseError stop location when offending sy…
Apr 4, 2019
04e53d2
[SPAR-27342][SQL] Optimize Limit 0 queries
aayushmaanjain Apr 5, 2019
568db94
[SPARK-27356][SQL] File source V2: Fix the case that data columns ove…
gengliangwang Apr 5, 2019
5678e68
[SPARK-27393][SQL] Show ReusedSubquery in the plan when the subquery …
gatorsmile Apr 5, 2019
a840b99
[MINOR][DOC] Fix html tag broken in configuration.md
HeartSaVioR Apr 5, 2019
23bde44
[SPARK-27358][UI] Update jquery to 1.12.x to pick up security fixes
srowen Apr 5, 2019
982c4c8
[SPARK-27390][CORE][SQL][TEST] Fix package name mismatch
dongjoon-hyun Apr 5, 2019
39f75b4
[SPARK-27192][CORE] spark.task.cpus should be less or equal than spar…
liutang123 Apr 5, 2019
979bb90
[SPARK-26936][SQL] Fix bug of insert overwrite local dir can not crea…
beliefer Apr 5, 2019
4a5768b
[SPARK-27391][SS] Don't initialize a lazy val in ContinuousExecution …
jose-torres Apr 5, 2019
6450c59
[SPARK-26992][STS] Fix STS scheduler pool correct delivery
cxzl25 Apr 6, 2019
53e31e2
[SPARK-27399][STREAMING][KAFKA] Arrange scattered config and reduce h…
beliefer Apr 6, 2019
017919b
[SPARK-27383][SQL][TEST] Avoid using hard-coded jar names in Hive tests
wangyum Apr 6, 2019
18b36ee
[SPARK-27253][SQL][FOLLOW-UP] Add a note about parent-session configu…
HyukjinKwon Apr 8, 2019
02e9f93
[SPARK-27384][SQL] File source V2: Prune unnecessary partition columns
gengliangwang Apr 8, 2019
215609d
[SPARK-25407][SQL] Allow nested access for non-existent field for Par…
mallman Apr 8, 2019
33f3c48
[SPARK-27176][SQL] Upgrade hadoop-3's built-in Hive maven dependencie…
wangyum Apr 8, 2019
52838e7
[SPARK-13704][CORE][YARN] Reduce rack resolution time
LantaoJin Apr 8, 2019
0024173
[SPARK-27405][SQL][TEST] Restrict the range of generated random times…
MaxGekk Apr 8, 2019
d50603a
[SPARK-27271][SQL] Migrate Text to File Data Source V2
gengliangwang Apr 8, 2019
dfa2328
[SPARK-26881][MLLIB] Heuristic for tree aggregate depth
gagafunctor Apr 9, 2019
051336d
[SPARK-25496][SQL][FOLLOWUP] avoid using to_utc_timestamp
cloud-fan Apr 9, 2019
f16dfb9
[SPARK-27328][SQL] Add 'deprecated' in ExpressionDescription for exte…
HyukjinKwon Apr 9, 2019
3e4cfe9
[SPARK-27406][SQL] UnsafeArrayData serialization breaks when two mach…
Apr 9, 2019
601fac2
[SPARK-27411][SQL] DataSourceV2Strategy should not eliminate subquery
francis0407 Apr 9, 2019
5ff39cd
[SPARK-27394][WEBUI] Flush LiveEntity if necessary when receiving Spa…
zsxwing Apr 9, 2019
3db117e
[SPARK-27407][SQL] File source V2: Invalidate cache data on overwrite…
gengliangwang Apr 9, 2019
63e4bf4
[SPARK-27401][SQL] Refactoring conversion of Timestamp to/from java.s…
MaxGekk Apr 9, 2019
f62f44f
[SPARK-27387][PYTHON][TESTS] Replace sqlutils.assertPandasEqual with …
BryanCutler Apr 9, 2019
05f6b87
[SPARK-27410][MLLIB] Remove deprecated / no-op mllib.KMeans getRuns, …
srowen Apr 10, 2019
08858f6
[SPARK-27253][SQL][FOLLOW-UP] Update doc about parent-session configu…
viirya Apr 10, 2019
58674d5
[SPARK-27181][SQL] Add public transform API
rdblue Apr 10, 2019
2e90574
[SPARK-27414][SQL] make it clear that date type is timezone independent
cloud-fan Apr 10, 2019
85e5d4f
[SPARK-24872] Replace taking the $symbol with $sqlOperator in BinaryO…
httfighter Apr 10, 2019
5ea4dee
[SPARK-26012][SQL] Null and '' values should not cause dynamic partit…
eatoncys Apr 10, 2019
1470f23
[SPARK-27422][SQL] current_date() should return current date in the s…
MaxGekk Apr 10, 2019
ab8710b
[SPARK-27423][SQL] Cast DATE <-> TIMESTAMP according to the SQL standard
MaxGekk Apr 10, 2019
181d190
[MINOR][SQL] Unnecessary access to externalCatalog
OCaballero Apr 11, 2019
0745333
[SPARK-27088][SQL] Add a configuration to set log level for each batc…
chakravarthiT Apr 11, 2019
4177292
[SPARK-27435][SQL] Support schema pruning in ORC V2
gengliangwang Apr 11, 2019
d33ae2e
[SPARK-26953][CORE][TEST] Disable result checking in the test: java.l…
MaxGekk Apr 11, 2019
239082d
[SPARK-27403][SQL] Fix `updateTableStats` to update table stats alway…
sujith71955 Apr 11, 2019
43da473
[SPARK-27225][SQL] Implement join strategy hints
maryannxue Apr 11, 2019
4ec7f63
[SPARK-27404][CORE][SQL][STREAMING][YARN] Fix build warnings for 3.0:…
srowen Apr 11, 2019
5d8aee5
[SPARK-27445][SQL][TEST] Update SQLQueryTestSuite to process files en…
dilipbiswal Apr 11, 2019
94adffa
[SPARK-27270][SS] Add Kafka dynamic JAAS authentication debug possibi…
gaborgsomogyi Apr 11, 2019
bbbe54a
[SPARK-27199][SQL][FOLLOWUP] Fix bug in codegen templates in UnixTime…
rednaxelafx Apr 12, 2019
4eb694c
[SPARK-27443][SQL] Support UDF input_file_name in file source V2
gengliangwang Apr 12, 2019
0407070
[SPARK-27444][SQL] multi-select can be used in subquery
cloud-fan Apr 12, 2019
9ed60c2
[MINOR][TEST][ML] Speed up some tests of ML regression by loosening t…
srowen Apr 12, 2019
0881f64
[SPARK-27451][BUILD] Upgrade lz4-java to 1.5.1
dongjoon-hyun Apr 13, 2019
38fc8e2
[MINOR][DOCS] Fix some broken links in docs
srowen Apr 13, 2019
67bd124
[MINOR][TEST] Speed up slow tests in QuantileDiscretizerSuite
srowen Apr 13, 2019
4704af4
[SPARK-27449] Move WholeStageCodegen.limitNotReachedCond class checks…
hvanhovell Apr 14, 2019
eea3f55
[SPARK-27446][R] Use existing spark conf if available.
MrBago Apr 14, 2019
0bb716b
Revert [SPARK-23433][SPARK-25250][CORE] Later created TaskSet should …
cloud-fan Apr 14, 2019
27d625d
[SPARK-27459][SQL] Revise the exception message of schema inference f…
gengliangwang Apr 15, 2019
3ab96d7
[SPARK-27444][SQL][FOLLOWUP][MINOR][TEST] Add a test for describing m…
dilipbiswal Apr 15, 2019
d35e81f
[SPARK-27454][ML][SQL] Spark image datasource fail when encounter som…
WeichenXu123 Apr 15, 2019
c58a4fe
[SPARK-27351][SQL] Wrong outputRows estimation after AggregateEstimat…
pengbo Apr 15, 2019
a4cf1a4
[SPARK-27469][CORE] Update Commons BeanUtils to 1.9.3
srowen Apr 16, 2019
f9837d3
[SPARK-27448][SQL] File source V2 table provider should be compatible…
gengliangwang Apr 16, 2019
8718367
[SPARK-27470][PYSPARK] Update pyrolite to 4.23
srowen Apr 16, 2019
257d01a
[SPARK-27397][CORE] Take care of OpenJ9 JVM in Spark
kiszk Apr 16, 2019
88d9de2
[SPARK-27464][CORE] Added Constant instead of referring string litera…
shivusondur Apr 16, 2019
a8f20c9
[SPARK-27452][BUILD] Update zstd-jni to 1.3.8-9
dongjoon-hyun Apr 16, 2019
7c4a643
[SPARK-27467][BUILD][TEST-MAVEN] Upgrade Maven to 3.6.1
dongjoon-hyun Apr 16, 2019
b404e02
[SPARK-27476][SQL] Refactoring SchemaPruning rule to remove duplicate…
viirya Apr 16, 2019
26ed65f
[SPARK-27453] Pass partitionBy as options in DataFrameWriter
liwensun Apr 16, 2019
1bb0c8e
[SPARK-25348][SQL] Data source for binary files
WeichenXu123 Apr 16, 2019
61feb16
[SPARK-27479][BUILD] Hide API docs for org.apache.spark.util.kvstore
gatorsmile Apr 17, 2019
54b0d1e
[SPARK-27416][SQL] UnsafeMapData & UnsafeArrayData Kryo serialization …
pengbo Apr 17, 2019
e6618de
[SPARK-27430][SQL] broadcast hint should be respected for broadcast n…
cloud-fan Apr 17, 2019
e1c90d6
[SPARK-19712][SQL] Pushdown LeftSemi/LeftAnti below join
dilipbiswal Apr 17, 2019
f93460d
[SPARK-27493][BUILD] Upgrade ASM to 7.1
dongjoon-hyun Apr 18, 2019
50bdc9b
[SPARK-27423][SQL][FOLLOWUP] Minor polishes to Cast codegen templates…
rednaxelafx Apr 18, 2019
7d44ba0
[SPARK-27490][SQL] File source V2: return correct result for Dataset.…
gengliangwang Apr 18, 2019
9c238b8
[SPARK-27460][TESTS] Running slowest test suites in their own forked …
gengliangwang Apr 18, 2019
9c41bfd
[SPARK-27502][SQL][TEST] Update nested schema benchmark result for Or…
viirya Apr 18, 2019
3748b38
[SPARK-27460][TESTS][FOLLOWUP] Add HiveClientVersions to parallel tes…
gengliangwang Apr 18, 2019
e1ece6a
[SPARK-25079][PYTHON] update python3 executable to 3.6.x
shaneknapp Apr 19, 2019
8f82237
[SPARK-27501][SQL][TEST] Add test for HIVE-13083: Writing HiveDecimal…
wangyum Apr 19, 2019
163a6e2
[SPARK-27514] Skip collapsing windows with empty window expressions
yifeih Apr 19, 2019
31488e1
[SPARK-27504][SQL] File source V2: support refreshing metadata cache
gengliangwang Apr 19, 2019
16bbe0f
[SPARK-27486][CORE][TEST] Enable History server storage information t…
shahidki31 Apr 19, 2019
777b450
[SPARK-27176][FOLLOW-UP][SQL] Upgrade Hive parquet to 1.10.1 for hado…
wangyum Apr 19, 2019
d61b3bc
[SPARK-27527][SQL][DOCS] Improve descriptions of Timestamp and Date t…
MaxGekk Apr 21, 2019
4cb1cd6
[SPARK-27532][DOC] Correct the default value in the Documentation for…
shivusondur Apr 21, 2019
ad60c6d
[SPARK-27439][SQL] Use analyzed plan when explaining Dataset
viirya Apr 21, 2019
9793d9e
[SPARK-27473][SQL] Support filter push down for status fields in bina…
WeichenXu123 Apr 21, 2019
8a8643c
[SPARK-27480][SQL] Improve `EXPLAIN DESC QUERY` to show the input SQL…
dilipbiswal Apr 21, 2019
009059e
[SPARK-27496][CORE] Fatal errors should also be sent back to the sender
zsxwing Apr 22, 2019
d4a16f4
[SPARK-27419][FOLLOWUP][DOCS] Add note about spark.executor.heartbeat…
srowen Apr 22, 2019
777b797
[SPARK-27522][SQL][TEST] Test migration from INT96 to TIMESTAMP_MICRO…
MaxGekk Apr 22, 2019
d36cce1
[SPARK-27276][PYTHON][SQL] Increase minimum version of pyarrow to 0.1…
BryanCutler Apr 22, 2019
79d3bc0
[SPARK-27438][SQL] Parse strings with timestamps by to_timestamp() in…
MaxGekk Apr 22, 2019
5172190
[SPARK-27392][SQL] TestHive test tables should be placed in shared te…
ericl Apr 22, 2019
3240e52
[SPARK-27531][SQL] Improve `EXPLAIN DESC TABLE` to show the input par…
dilipbiswal Apr 22, 2019
43a73e3
[SPARK-27528][SQL] Use Parquet logical type TIMESTAMP_MICROS by default
MaxGekk Apr 23, 2019
55f26d8
[SPARK-27533][SQL][TEST] Date and timestamp CSV benchmarks
MaxGekk Apr 23, 2019
93a264d
[SPARK-27535][SQL][TEST] Date and timestamp JSON benchmarks
MaxGekk Apr 23, 2019
d9b2ce0
[SPARK-27539][SQL] Fix inaccurate aggregate outputRows estimation wit…
pengbo Apr 23, 2019
7cc15af
[SPARK-27481][BUILD] Upgrade commons-logging to 1.1.3 for hadoop-3.2
wangyum Apr 23, 2019
ecfdffc
[SPARK-27503][DSTREAM] JobGenerator thread exit for some fatal errors…
uncleGen Apr 23, 2019
00f2f31
[SPARK-27128][SQL] Migrate JSON to File Data Source V2
gengliangwang Apr 23, 2019
810be5d
[SPARK-27493][BUILD][FOLLOWUP] Upgrade ASM to 7.1 in plugins.sbt
dongjoon-hyun Apr 23, 2019
5bf5d9d
[SPARK-26970][PYTHON][ML] Add Spark ML interaction transformer to PyS…
Andrew-Crosby Apr 23, 2019
596a5ff
[MINOR][BUILD] Update genjavadoc to 0.13
srowen Apr 24, 2019
a30983d
[SPARK-27512][SQL] Avoid to replace ',' in CSV's decimal type inferen…
HyukjinKwon Apr 24, 2019
cd4a284
[SPARK-27460][FOLLOW-UP][TESTS] Fix flaky tests
gatorsmile Apr 24, 2019
b7f9830
[MINOR][TEST] switch from 2.4.1 to 2.4.2 in HiveExternalCatalogVersio…
cloud-fan Apr 25, 2019
b1c6b60
[SPARK-26729][K8S] Fix typo with default value for R image name
rvesse Apr 25, 2019
f82ed5e
[MINOR][TEST] Remove out-dated hive version in run-tests.py
wangyum Apr 25, 2019
8b86326
[SPARK-27551][SQL] Improve error message of mismatched types for CASE…
viirya Apr 25, 2019
d5dbf05
Revert "[SPARK-27439][SQL] Use analyzed plan when explaining Dataset"
dongjoon-hyun Apr 26, 2019
d2656aa
[SPARK-27494][SS] Null values don't work in Kafka source v2
uncleGen Apr 26, 2019
2234667
[SPARK-27563][SQL][TEST] automatically get the latest Spark versions …
cloud-fan Apr 26, 2019
85fd552
[SPARK-27190][SQL] add table capability for streaming
cloud-fan Apr 26, 2019
fe99305
[SPARK-27556][BUILD] Exclude com.zaxxer:HikariCP-java7 from hadoop-ya…
wangyum Apr 26, 2019
7b367bf
[SPARK-27477][BUILD] Kafka token provider should have provided depend…
koertkuipers Apr 26, 2019
6328be7
[MINOR][TEST][DOC] Execute action miss name message
uncleGen Apr 27, 2019
90085a1
[SPARK-23619][DOCS] Add output description for some generator express…
jashgala Apr 27, 2019
bde30bc
[SPARK-27467][FOLLOW-UP][BUILD] Upgrade Maven to 3.6.1 in AppVeyor an…
wangyum Apr 27, 2019
447d018
Revert "[SPARK-27467][BUILD][TEST-MAVEN] Upgrade Maven to 3.6.1"
HyukjinKwon Apr 28, 2019
d8db7db
Revert "[SPARK-27467][FOLLOW-UP][BUILD] Upgrade Maven to 3.6.1 in App…
HyukjinKwon Apr 28, 2019
20a3ef7
[SPARK-27534][SQL] Do not load `content` column in binary data source…
mengxr Apr 28, 2019
05b85eb
[SPARK-27474][CORE] avoid retrying a task failed with CommitDeniedExc…
cloud-fan Apr 29, 2019
07d07fe
[SPARK-27580][SQL] Implement `doCanonicalize` in BatchScanExec for co…
gengliangwang Apr 29, 2019
5a62295
[SPARK-27580][HOT-FIX] Fix wrong import order in FileScan.scala
wangyum Apr 29, 2019
76785cd
[SPARK-27581][SQL] DataFrame countDistinct("*") shouldn't fail with A…
viirya Apr 29, 2019
fbc7942
[SPARK-27472] add user guide for binary file data source
mengxr Apr 29, 2019
8a17d26
[SPARK-27536][CORE][ML][SQL][STREAMING] Remove most use of scala.lang…
srowen Apr 29, 2019
a6716d3
[SPARK-27571][CORE][YARN][EXAMPLES] Avoid scala.language.reflectiveCalls
srowen Apr 29, 2019
fb6b19a
[SPARK-23014][SS] Fully remove V1 memory sink.
gaborgsomogyi Apr 29, 2019
75b40a5
[SPARK-27575][CORE][YARN] Yarn file-related confs should merge new va…
HeartSaVioR Apr 29, 2019
618d6bf
[SPARK-27588] Binary file data source fails fast and doesn't attempt …
mengxr Apr 29, 2019
7432e7d
[SPARK-24935][SQL][FOLLOWUP] support INIT -> UPDATE -> MERGE -> FINIS…
cloud-fan Apr 30, 2019
25ee047
[SPARK-26936][MINOR][FOLLOWUP] Don't need the JobConf anymore, it seems
srowen Apr 30, 2019
a35043c
[SPARK-27591][SQL] Fix UnivocityParser for UserDefinedType
kalkolab Apr 30, 2019
6eca435
[SPARK-27608][BUILD][test-maven] Upgrade Surefire plugin to 3.0.0-M3
dongjoon-hyun May 1, 2019
8375103
[SPARK-27557][DOC] Add copy button to Python API docs for easier copy…
sangramga May 1, 2019
9623420
[SPARK-27276][PYTHON][DOCS][FOLLOW-UP] Update documentation about Arr…
HyukjinKwon May 1, 2019
fcc42d4
[SPARK-27493][BUILD][FOLLOWUP] Upgrade ASM to 7.1 in Maven plugins
srowen May 1, 2019
3670826
[SPARK-26921][R][DOCS] Document Arrow optimization and vectorized R APIs
HyukjinKwon May 2, 2019
2da406c
[SPARK-27618][SQL][FOLLOW-UP] Unnecessary access to externalCatalog
gatorsmile May 2, 2019
b73744a
[SPARK-27611][BUILD] Exclude jakarta.activation:jakarta.activation-ap…
liancheng May 2, 2019
253a879
[SPARK-26921][R][DOCS][FOLLOWUP] Document Arrow optimization and vect…
viirya May 2, 2019
df8aa7b
[SPARK-27606][SQL] Deprecate 'extended' field in ExpressionDescriptio…
HyukjinKwon May 2, 2019
7a8cc8e
[SPARK-27607][SQL] Improve Row.toString performance
mgaido91 May 2, 2019
3ecafb0
[SPARK-27601][BUILD] Upgrade stream-lib to 2.9.6
wangyum May 2, 2019
f950e53
[MINOR][CORE] Update taskName in PythonRunner
jiangxb1987 May 2, 2019
875e7e1
[SPARK-27620][BUILD] Upgrade jetty to 9.4.18.v20190429
wangyum May 3, 2019
375cfa3
[SPARK-27467][BUILD] Upgrade Maven to 3.6.1
dongjoon-hyun May 3, 2019
3859ca3
[SPARK-27586][SQL] Improve binary comparison: replace Scala's for-com…
May 3, 2019
5c47924
[SPARK-27612][PYTHON] Use Python's default protocol instead of highes…
HyukjinKwon May 3, 2019
9a419c3
[SPARK-24360][FOLLOW-UP][SQL] Add missing options for sql-migration-g…
wangyum May 3, 2019
6c2d351
[SPARK-27626][K8S] Fix `docker-image-tool.sh` to be robust in non-bas…
dongjoon-hyun May 3, 2019
51de86b
[SPARK-27510][CORE] Avoid Master falls into dead loop while launching…
Ngone51 May 3, 2019
4241a72
[SPARK-27621][ML] Linear Regression - validate training related param…
May 3, 2019
c66ec43
[SPARK-27555][SQL] HiveSerDe should fall back to hadoopconf if hive.d…
sandeep-katta May 4, 2019
5182aa2
[MINOR][DOCS] Correct date_trunc docs
mojodna May 4, 2019
d9bcacf
[SPARK-27629][PYSPARK] Prevent Unpickler from intervening each unpick…
viirya May 4, 2019
6001d47
[SPARK-27596][SQL] The JDBC 'query' option doesn't work for Oracle da…
dilipbiswal May 6, 2019
4b725e5
[SPARK-27439][SQL] Explainging Dataset should show correct resolved p…
viirya May 6, 2019
6ef4530
[SPARK-27579][SQL] remove BaseStreamingSource and BaseStreamingSink
cloud-fan May 6, 2019
eec1a3c
[SPARK-23299][SQL][PYSPARK] Fix __repr__ behaviour for Rows
tbcs May 6, 2019
d5308cd
[SPARK-27577][MLLIB] Correct thresholds downsampled in BinaryClassifi…
shishaochen May 7, 2019
d124ce9
[SPARK-27590][CORE] do not consider skipped tasks when scheduling spe…
cloud-fan May 7, 2019
8ef4da7
[SPARK-27610][YARN] Shade netty native libraries
amuraru May 7, 2019
614a5cc
[SPARK-27624][CORE] Fix CalenderInterval to show an empty interval co…
dongjoon-hyun May 7, 2019
2f55809
[SPARK-27294][SS] Add multi-cluster Kafka delegation token
gaborgsomogyi May 7, 2019
5e79ae3
[SPARK-23961][SPARK-27548][PYTHON] Fix error when toLocalIterator goe…
BryanCutler May 7, 2019
303ee3f
[SPARK-24252][SQL] Add TableCatalog API
rdblue May 8, 2019
83f628b
[SPARK-27253][SQL][FOLLOW-UP] Add a legacy flag to restore old sessio…
jose-torres May 8, 2019
8329e7d
[SPARK-27649][SS] Unify the way use 'spark.network.timeout'
beliefer May 8, 2019
3ea44e5
[SPARK-27639][SQL] InMemoryTableScan shows the table name on UI if po…
wangyum May 8, 2019
e63fbfc
[SPARK-25139][SPARK-18406][CORE] Avoid NonFatals to kill the Executor…
jiangxb1987 May 8, 2019
bae5baa
[SPARK-27642][SS] make v1 offset extends v2 offset
cloud-fan May 8, 2019
09422f5
[MINOR][DOCS] Fix invalid documentation for StreamingQueryManager Class
asaf400 May 8, 2019
57450ed
[MINOR][SS] Rename `secondLatestBatchId` to `secondLatestOffsets`
beliefer May 8, 2019
78a403f
[SPARK-27627][SQL] Make option "pathGlobFilter" as a general option f…
gengliangwang May 8, 2019
0969d7a
[SPARK-27207][SQL] : Ensure aggregate buffers are initialized again f…
May 9, 2019
b5ffec1
[SPARK-27563][FOLLOWUP] Fix to download new release from `dist.apache…
wangyum May 9, 2019
9b3211a
[SPARK-27540][MLLIB] Add 'meanAveragePrecision_at_k' metric to Rankin…
qb-tarushg May 9, 2019
fbb56f2
[SPARK-27636][MLLIB] Remove cached RDD blocks after PIC execution
shahidki31 May 9, 2019
cfe236f
[MINOR][DOCS] Make Spark's description consistent in docs with websites
rxin May 10, 2019
78748b5
[SPARK-27625][SQL] ScalaReflection support for annotated types
mgaido91 May 10, 2019
80de449
[MINOR][TEST] Fix schema mismatch error
ericl May 10, 2019
3442fca
[SPARK-27672][SQL] Add `since` info to string expressions
HyukjinKwon May 10, 2019
c71f217
[SPARK-27673][SQL] Add `since` info to random, regex, null expressions
HyukjinKwon May 10, 2019
fa5dc0a
[SPARK-26632][CORE] Separate Thread Configurations of Driver and Exec…
jiafuzha May 10, 2019
bcd3b61
[SPARK-27347][MESOS] Fix supervised driver retry logic for outdated t…
samvantran May 10, 2019
dbb8143
[MINOR][SS][DOC] Added missing config `maxFileAge` in file streaming …
May 10, 2019
9ff77b1
[SPARK-27675][SQL] do not use MutableColumnarRow in ColumnarBatch
cloud-fan May 12, 2019
5a8aad0
[SPARK-27343][KAFKA][SS] Avoid hardcoded for spark-sql-kafka-0-10
May 12, 2019
be6d39c
[SPARK-27668][SQL] File source V2: support reporting statistics
gengliangwang May 13, 2019
2bc42ad
[MINOR][REPL] Remove dead code of Spark Repl in Scala 2.11
gatorsmile May 13, 2019
126310c
[SPARK-26601][SQL] Make broadcast-exchange thread pool configurable
caneGuy May 13, 2019
d169b0a
[SPARK-27653][SQL] Add max_by() and min_by() SQL aggregate functions
viirya May 13, 2019
f3ddd6f
[SPARK-27402][SQL][TEST-HADOOP3.2][TEST-MAVEN] Fix hadoop-3.2 test is…
wangyum May 13, 2019
8b0bdaa
[SPARK-27671][SQL] Fix error when casting from a nested null in a struct
viirya May 13, 2019
66f5a42
[SPARK-27638][SQL] Cast string to date/timestamp in binary comparison…
May 14, 2019
db2e3c4
[SPARK-27024] Executor interface for cluster managers to support GPU …
tgravescs May 14, 2019
695dbe2
[SPARK-25719][UI] : Search functionality in datatables in stages page…
May 14, 2019
a10608c
[SPARK-27680][CORE][SQL][GRAPHX] Remove usage of Traversable
srowen May 14, 2019
fee695d
[SPARK-27690][SQL] Remove materialized views first in `HiveClientImpl…
wangyum May 14, 2019
2da5b21
[SPARK-24923][SQL] Implement v2 CreateTableAsSelect
rdblue May 15, 2019
7dd2dd5
[MINOR][SS] Remove duplicate 'add' in comment of `StructuredSessioniz…
beliefer May 15, 2019
fd9acf2
[SPARK-27713][SQL] Move org.apache.spark.sql.execution.* in catalyst …
May 15, 2019
bfb3ffe
[SPARK-27682][CORE][GRAPHX][MLLIB] Replace use of collections and met…
srowen May 15, 2019
d14e2d7
[SPARK-27678][UI] Allow user impersonation in the UI.
May 15, 2019
efa3035
[SPARK-27687][SS] Rename Kafka consumer cache capacity conf and docum…
gaborgsomogyi May 15, 2019
0bba5cf
[SPARK-20774][SPARK-27036][SQL] Cancel the running broadcast executio…
jiangxb1987 May 15, 2019
02c3369
[SPARK-27354][SQL] Move incompatible code from the hive-thriftserver …
wangyum May 15, 2019
3e30a98
[SPARK-27674][SQL] the hint should not be dropped after cache lookup
cloud-fan May 15, 2019
c6a45e6
[SPARK-27722][SQL] removed the unsed "UnsafeKeyValueSorter" file.
shivusondur May 16, 2019
6a317c8
[SPARK-27735][SS] Parsing interval string should be case-insensitive …
zsxwing May 16, 2019
fc5bd6d
[SPARK-27576][SQL] table capability to skip the output column resolution
cloud-fan May 16, 2019
9e0d8c6
[SPARK-27752][CORE] Upgrade lz4-java from 1.5.1 to 1.6.0
kiszk May 17, 2019
e39e97b
[SPARK-27699][SQL] Partially push down disjunctive predicated in Parq…
gengliangwang May 17, 2019
141a3bf
[SPARK-27755][BUILD] Update zstd-jni to 1.4.0-1
dongjoon-hyun May 17, 2019
9bca99b
[SPARK-27552][SQL] The configuration `hive.exec.stagingdir` is invali…
10110346 May 17, 2019
e354042
[SPARK-24985][SQL] Fix OOM in Full Outer Join in presence of data skew.
Aug 21, 2018
824d357
Merge branch 'SPARK-24985' of github.com:sujithjay/spark into SPARK-2…
sujithjay May 18, 2019
d47e788
[SPARK-24985][SQL] Fix OOM in Full Outer Join in presence of data skew.
Aug 21, 2018
b435303
Merge branch 'SPARK-24985' of github.com:sujithjay/spark into SPARK-2…
sujithjay May 18, 2019
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -94,3 +94,6 @@ spark-warehouse/
*.Rproj.*

.Rproj.user

# For SBT
.jvmopts
4 changes: 2 additions & 2 deletions LICENSE
Original file line number Diff line number Diff line change
Expand Up @@ -222,7 +222,7 @@ Python Software Foundation License
----------------------------------

pyspark/heapq3.py

python/docs/_static/copybutton.js

BSD 3-Clause
------------
Expand Down Expand Up @@ -258,4 +258,4 @@ data/mllib/images/kittens/29.5.a_b_EGDP022204.jpg
data/mllib/images/kittens/54893.jpg
data/mllib/images/kittens/DP153539.jpg
data/mllib/images/kittens/DP802813.jpg
data/mllib/images/multi-channel/chr30.4.184.jpg
data/mllib/images/multi-channel/chr30.4.184.jpg
62 changes: 33 additions & 29 deletions LICENSE-binary
Original file line number Diff line number Diff line change
Expand Up @@ -209,34 +209,34 @@ org.apache.zookeeper:zookeeper
oro:oro
commons-configuration:commons-configuration
commons-digester:commons-digester
com.chuusai:shapeless_2.11
com.chuusai:shapeless_2.12
com.googlecode.javaewah:JavaEWAH
com.twitter:chill-java
com.twitter:chill_2.11
com.twitter:chill_2.12
com.univocity:univocity-parsers
javax.jdo:jdo-api
joda-time:joda-time
net.sf.opencsv:opencsv
org.apache.derby:derby
org.objenesis:objenesis
org.roaringbitmap:RoaringBitmap
org.scalanlp:breeze-macros_2.11
org.scalanlp:breeze_2.11
org.typelevel:macro-compat_2.11
org.scalanlp:breeze-macros_2.12
org.scalanlp:breeze_2.12
org.typelevel:macro-compat_2.12
org.yaml:snakeyaml
org.apache.xbean:xbean-asm5-shaded
com.squareup.okhttp3:logging-interceptor
com.squareup.okhttp3:okhttp
com.squareup.okio:okio
org.apache.spark:spark-catalyst_2.11
org.apache.spark:spark-kvstore_2.11
org.apache.spark:spark-launcher_2.11
org.apache.spark:spark-mllib-local_2.11
org.apache.spark:spark-network-common_2.11
org.apache.spark:spark-network-shuffle_2.11
org.apache.spark:spark-sketch_2.11
org.apache.spark:spark-tags_2.11
org.apache.spark:spark-unsafe_2.11
org.apache.spark:spark-catalyst_2.12
org.apache.spark:spark-kvstore_2.12
org.apache.spark:spark-launcher_2.12
org.apache.spark:spark-mllib-local_2.12
org.apache.spark:spark-network-common_2.12
org.apache.spark:spark-network-shuffle_2.12
org.apache.spark:spark-sketch_2.12
org.apache.spark:spark-tags_2.12
org.apache.spark:spark-unsafe_2.12
commons-httpclient:commons-httpclient
com.vlkan:flatbuffers
com.ning:compress-lzf
Expand All @@ -260,9 +260,6 @@ net.sf.supercsv:super-csv
org.apache.arrow:arrow-format
org.apache.arrow:arrow-memory
org.apache.arrow:arrow-vector
org.apache.calcite:calcite-avatica
org.apache.calcite:calcite-core
org.apache.calcite:calcite-linq4j
org.apache.commons:commons-crypto
org.apache.commons:commons-lang3
org.apache.hadoop:hadoop-annotations
Expand All @@ -287,25 +284,24 @@ org.apache.orc:orc-mapreduce
org.mortbay.jetty:jetty
org.mortbay.jetty:jetty-util
com.jolbox:bonecp
org.json4s:json4s-ast_2.11
org.json4s:json4s-core_2.11
org.json4s:json4s-jackson_2.11
org.json4s:json4s-scalap_2.11
org.json4s:json4s-ast_2.12
org.json4s:json4s-core_2.12
org.json4s:json4s-jackson_2.12
org.json4s:json4s-scalap_2.12
com.carrotsearch:hppc
com.fasterxml.jackson.core:jackson-annotations
com.fasterxml.jackson.core:jackson-core
com.fasterxml.jackson.core:jackson-databind
com.fasterxml.jackson.dataformat:jackson-dataformat-yaml
com.fasterxml.jackson.module:jackson-module-jaxb-annotations
com.fasterxml.jackson.module:jackson-module-paranamer
com.fasterxml.jackson.module:jackson-module-scala_2.11
com.fasterxml.jackson.module:jackson-module-scala_2.12
com.github.mifmif:generex
com.google.code.findbugs:jsr305
com.google.code.gson:gson
com.google.inject:guice
com.google.inject.extensions:guice-servlet
com.twitter:parquet-hadoop-bundle
commons-beanutils:commons-beanutils-core
commons-cli:commons-cli
commons-dbcp:commons-dbcp
commons-io:commons-io
Expand Down Expand Up @@ -415,8 +411,8 @@ com.thoughtworks.paranamer:paranamer
org.scala-lang:scala-compiler
org.scala-lang:scala-library
org.scala-lang:scala-reflect
org.scala-lang.modules:scala-parser-combinators_2.11
org.scala-lang.modules:scala-xml_2.11
org.scala-lang.modules:scala-parser-combinators_2.12
org.scala-lang.modules:scala-xml_2.12
org.fusesource.leveldbjni:leveldbjni-all
net.sourceforge.f2j:arpack_combined_all
xmlenc:xmlenc
Expand All @@ -437,15 +433,15 @@ is distributed under the 3-Clause BSD license.
MIT License
-----------

org.spire-math:spire-macros_2.11
org.spire-math:spire_2.11
org.typelevel:machinist_2.11
org.spire-math:spire-macros_2.12
org.spire-math:spire_2.12
org.typelevel:machinist_2.12
net.razorvine:pyrolite
org.slf4j:jcl-over-slf4j
org.slf4j:jul-to-slf4j
org.slf4j:slf4j-api
org.slf4j:slf4j-log4j12
com.github.scopt:scopt_2.11
com.github.scopt:scopt_2.12

core/src/main/resources/org/apache/spark/ui/static/dagre-d3.min.js
core/src/main/resources/org/apache/spark/ui/static/*dataTables*
Expand Down Expand Up @@ -487,6 +483,14 @@ org.glassfish.jersey.core:jersey-server
org.glassfish.jersey.media:jersey-media-jaxb


Eclipse Distribution License (EDL) 1.0
--------------------------------------

org.glassfish.jaxb:jaxb-runtime
jakarta.xml.bind:jakarta.xml.bind-api
com.sun.istack:istack-commons-runtime


Mozilla Public License (MPL) 1.1
--------------------------------

Expand Down
9 changes: 0 additions & 9 deletions NOTICE-binary
Original file line number Diff line number Diff line change
Expand Up @@ -792,15 +792,6 @@ Copyright 2005-2006 The Apache Software Foundation
Apache Jakarta HttpClient
Copyright 1999-2007 The Apache Software Foundation

Calcite Avatica
Copyright 2012-2015 The Apache Software Foundation

Calcite Core
Copyright 2012-2015 The Apache Software Foundation

Calcite Linq4j
Copyright 2012-2015 The Apache Software Foundation

Apache HttpClient
Copyright 1999-2017 The Apache Software Foundation

Expand Down
18 changes: 18 additions & 0 deletions R/CRAN_RELEASE.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,21 @@
---
license: |
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
this work for additional information regarding copyright ownership.
The ASF licenses this file to You under the Apache License, Version 2.0
(the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
---

# SparkR CRAN Release

To release SparkR as a package to CRAN, we would use the `devtools` package. Please work with the
Expand Down
18 changes: 18 additions & 0 deletions R/DOCUMENTATION.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,21 @@
---
license: |
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
this work for additional information regarding copyright ownership.
The ASF licenses this file to You under the Apache License, Version 2.0
(the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
---

# SparkR Documentation

SparkR documentation is generated by using in-source comments and annotated by using
Expand Down
10 changes: 1 addition & 9 deletions R/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -39,15 +39,7 @@ To set other options like driver memory, executor memory etc. you can pass in th

#### Using SparkR from RStudio

If you wish to use SparkR from RStudio or other R frontends you will need to set some environment variables which point SparkR to your Spark installation. For example
```R
# Set this to where Spark is installed
Sys.setenv(SPARK_HOME="/Users/username/spark")
# This line loads SparkR from the installed directory
.libPaths(c(file.path(Sys.getenv("SPARK_HOME"), "R", "lib"), .libPaths()))
library(SparkR)
sparkR.session()
```
If you wish to use SparkR from RStudio, please refer [SparkR documentation](https://spark.apache.org/docs/latest/sparkr.html#starting-up-from-rstudio).

#### Making changes to SparkR

Expand Down
18 changes: 18 additions & 0 deletions R/WINDOWS.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,21 @@
---
license: |
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
this work for additional information regarding copyright ownership.
The ASF licenses this file to You under the Apache License, Version 2.0
(the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
---

## Building SparkR on Windows

To build SparkR on Windows, the following steps are required
Expand Down
8 changes: 4 additions & 4 deletions R/pkg/DESCRIPTION
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
Package: SparkR
Type: Package
Version: 3.0.0
Title: R Frontend for Apache Spark
Description: Provides an R Frontend for Apache Spark.
Title: R Front End for 'Apache Spark'
Description: Provides an R Front end for 'Apache Spark' <https://spark.apache.org>.
Authors@R: c(person("Shivaram", "Venkataraman", role = c("aut", "cre"),
email = "[email protected]"),
person("Xiangrui", "Meng", role = "aut",
Expand All @@ -11,8 +11,8 @@ Authors@R: c(person("Shivaram", "Venkataraman", role = c("aut", "cre"),
email = "[email protected]"),
person(family = "The Apache Software Foundation", role = c("aut", "cph")))
License: Apache License (== 2.0)
URL: http://www.apache.org/ http://spark.apache.org/
BugReports: http://spark.apache.org/contributing.html
URL: https://www.apache.org/ https://spark.apache.org/
BugReports: https://spark.apache.org/contributing.html
SystemRequirements: Java (== 8)
Depends:
R (>= 3.1),
Expand Down
8 changes: 7 additions & 1 deletion R/pkg/NAMESPACE
Original file line number Diff line number Diff line change
Expand Up @@ -67,7 +67,8 @@ exportMethods("glm",
"spark.fpGrowth",
"spark.freqItemsets",
"spark.associationRules",
"spark.findFrequentSequentialPatterns")
"spark.findFrequentSequentialPatterns",
"spark.assignClusters")

# Job group lifecycle management methods
export("setJobGroup",
Expand Down Expand Up @@ -311,8 +312,10 @@ exportMethods("%<=>%",
"lower",
"lpad",
"ltrim",
"map_concat",
"map_entries",
"map_from_arrays",
"map_from_entries",
"map_keys",
"map_values",
"max",
Expand Down Expand Up @@ -351,6 +354,8 @@ exportMethods("%<=>%",
"row_number",
"rpad",
"rtrim",
"schema_of_csv",
"schema_of_json",
"second",
"sha1",
"sha2",
Expand Down Expand Up @@ -403,6 +408,7 @@ exportMethods("%<=>%",
"weekofyear",
"when",
"window",
"xxhash64",
"year")

exportClasses("GroupedData")
Expand Down
Loading