@ontarionick
Contributor

@tdas @koeninger

This updates the Spark Streaming + Kafka Integration Guide doc with a working method to access the offsets of a KafkaRDD through Python.

@AmplabJenkins

Can one of the admins verify this patch?

Contributor

Can't we simply use `offsetRanges = rdd.offsetRanges()`?

Contributor Author

I tried that and couldn't get it working. It seems the assignment creates a new object inside the function instead of updating the existing one.

However, if I add `global offsetRanges` before the assignment, it works fine. I'll push that change.
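The behaviour described above can be reproduced without Spark. The following is a minimal sketch, not the code from the patch: `FakeKafkaRDD`, `OffsetRange`, and both `store_*` functions are hypothetical stand-ins, and only the `offsetRanges()` method name is taken from this thread. It shows that a plain assignment inside a function binds a new local name, while `global` makes the same assignment update the module-level variable.

```python
from collections import namedtuple

# Hypothetical stand-ins so the sketch runs without Spark or Kafka.
OffsetRange = namedtuple(
    "OffsetRange", ["topic", "partition", "fromOffset", "untilOffset"]
)


class FakeKafkaRDD:
    """Mimics only the offsetRanges() method discussed in this thread."""

    def __init__(self, ranges):
        self._ranges = ranges

    def offsetRanges(self):
        return list(self._ranges)


offsetRanges = []  # module-level holder shared between callbacks


def store_without_global(rdd):
    # Plain assignment binds a new local name; the module-level list is untouched.
    offsetRanges = rdd.offsetRanges()
    return rdd


def store_with_global(rdd):
    # `global` makes the assignment rebind the module-level name.
    global offsetRanges
    offsetRanges = rdd.offsetRanges()
    return rdd


rdd = FakeKafkaRDD([OffsetRange("events", 0, 10, 25)])

store_without_global(rdd)
after_local = list(offsetRanges)   # snapshot after the local-only assignment

store_with_global(rdd)
after_global = list(offsetRanges)  # snapshot after the global assignment
```

In a real streaming job, a function shaped like `store_with_global` would presumably be applied to each RDD of the direct Kafka stream so that a later step can read the stored ranges; that wiring is outside this sketch.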

@ontarionick
Contributor Author

Hey @tdas, any new thoughts on this? Sorry, I forgot to notify you that I had made your suggested changes.

@tdas
Contributor

tdas commented Nov 11, 2015

LGTM. Merging this to master and 1.6. Thanks! :)

asfgit pushed a commit that referenced this pull request Nov 11, 2015
…t the offset ranges for a KafkaRDD

tdas koeninger

This updates the Spark Streaming + Kafka Integration Guide doc with a working method to access the offsets of a `KafkaRDD` through Python.

Author: Nick Evans <[email protected]>

Closes #9289 from manygrams/update_kafka_direct_python_docs.

(cherry picked from commit dd77e27)
Signed-off-by: Tathagata Das <[email protected]>
@asfgit asfgit closed this in dd77e27 Nov 11, 2015
@ontarionick ontarionick deleted the update_kafka_direct_python_docs branch November 11, 2015 21:34
dskrvk pushed a commit to dskrvk/spark that referenced this pull request Nov 13, 2015
…t the offset ranges for a KafkaRDD
