Skip to content

Conversation

@ganonp
Copy link
Contributor

@ganonp ganonp commented Dec 14, 2014

Wanted to customize the private minCount variable in the Word2Vec class. Added
a method to do so.

Wanted to customize the minCount variable in the Word2Vec class. Added
a method to do so.
@AmplabJenkins
Copy link

Can one of the admins verify this patch?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

2-space indentation

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be formatted as:

  /**
   * The minimum number of times a token must occur in the training corpus to be
   * included in the word2vec model (default: 5).
   */

@ganonp
Copy link
Contributor Author

ganonp commented Dec 18, 2014

Sorry (first time contributing), do you mean I should use 4 spaces instead of tab per the convention? When I do this, my additions appear out of alignment with the rest of the code...

@jkbradley
Copy link
Member

@ganonp No problem (but we are pretty strict about style). When in doubt, check out other code in the project for example. Also, I'd recommend checking out these resources:

@jkbradley
Copy link
Member

Oh, and yes, I meant to use 4 spaces inside the function.

@ganonp
Copy link
Contributor Author

ganonp commented Dec 18, 2014

O wow, I just didn't see that the function and everything inside was lining up... Hurts to look at.

Thanks for those links and your patience. Spark now makes up about 70% of my job, so I'll definitely be contributing more.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry, I just noticed that the other options are grouped at the top of the Word2Vec class. Would you mind moving this there? Thanks a lot!

@jkbradley
Copy link
Member

Sounds good!

Moved mincount variable to top and removed its javadoc and moved setMinCount below other set methods.
@JoshRosen
Copy link
Contributor

Bumping this, since it looks like it might be good to go. Looks like there's a new change here related to making norm public, which might want additional review (I don't know anything about MLlib; just trying to help identify stuff that looks good-to-go to help drain the PR queue before the holidays).

@ganonp
Copy link
Contributor Author

ganonp commented Dec 24, 2014

Sorry I didn't mean to commit that norm method for this pull request. That said, I think it makes sense for norm to be public or at least a d=2 version of norm.

@mengxr
Copy link
Contributor

mengxr commented Dec 29, 2014

@ganonp Could you update the branch and remove the last commit?

@mengxr
Copy link
Contributor

mengxr commented Dec 29, 2014

ok to test

@SparkQA
Copy link

SparkQA commented Dec 29, 2014

Test build #24865 has started for PR 3693 at commit ad534f2.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Dec 29, 2014

Test build #24865 has finished for PR 3693 at commit ad534f2.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24865/
Test PASSed.

@mengxr
Copy link
Contributor

mengxr commented Dec 29, 2014

LGTM (including the change to norm). Merged into master. Thanks!

@asfgit asfgit closed this in 343db39 Dec 29, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants