Skip to content

Conversation

@yanboliang
Copy link
Contributor

Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame, include:

@mengxr

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

StringIndexer will cast label column to String automatically and then index, so we don't need this code snippet.

@SparkQA
Copy link

SparkQA commented Nov 13, 2015

Test build #45858 has finished for PR 9690 at commit 83a0b2c.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for catching this!

asfgit pushed a commit that referenced this pull request Nov 13, 2015
…dLibSVMFile to load DataFrame

Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame, include:
* Use libSVM data source for all example codes under examples/ml, and remove unused import.
* Use libSVM data source for user guides under ml-*** which were omitted by #8697.
* Fix bug: We should use ```sqlContext.read().format("libsvm").load(path)``` at Java side, but the API doc and user guides misuse as ```sqlContext.read.format("libsvm").load(path)```.
* Code cleanup.

mengxr

Author: Yanbo Liang <[email protected]>

Closes #9690 from yanboliang/spark-11723.

(cherry picked from commit 99693fe)
Signed-off-by: Xiangrui Meng <[email protected]>
@mengxr
Copy link
Contributor

mengxr commented Nov 13, 2015

LGTM. Merged into master and branch-1.6. Thanks!

@asfgit asfgit closed this in 99693fe Nov 13, 2015
dskrvk pushed a commit to dskrvk/spark that referenced this pull request Nov 13, 2015
…dLibSVMFile to load DataFrame

Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame, include:
* Use libSVM data source for all example codes under examples/ml, and remove unused import.
* Use libSVM data source for user guides under ml-*** which were omitted by apache#8697.
* Fix bug: We should use ```sqlContext.read().format("libsvm").load(path)``` at Java side, but the API doc and user guides misuse as ```sqlContext.read.format("libsvm").load(path)```.
* Code cleanup.

mengxr

Author: Yanbo Liang <[email protected]>

Closes apache#9690 from yanboliang/spark-11723.
@yanboliang yanboliang deleted the spark-11723 branch November 16, 2015 02:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants