asadurski

This PR proposes increasing the number of retries and the retry time window for HCFCrawlManager.
This would help during short API outages (the API is usually back up within 10 minutes); currently the manager dies after 3 retries in 60 s.
Since the managers are supposed to run for days, they should survive occasional API downtime.
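
To illustrate the idea, here is a minimal sketch of a retry loop bounded by both an attempt count and a total time window, using only the standard library. It is not the HCFCrawlManager code: the helper name `call_with_retries`, its parameters, and the default values are hypothetical, chosen only so that a 10-minute outage fits inside the window.

```python
import time


def call_with_retries(func, max_attempts=10, window_seconds=1200.0,
                      wait_seconds=120.0, sleep=time.sleep,
                      clock=time.monotonic):
    """Call func() until it succeeds, attempts run out, or the window elapses.

    All names and defaults here are illustrative assumptions.
    """
    start = clock()
    attempt = 0
    while True:
        attempt += 1
        try:
            return func()
        except Exception:
            elapsed = clock() - start
            # Give up (re-raising the last error) once either limit is hit:
            # no attempts left, or the next wait would overrun the window.
            if attempt >= max_attempts or elapsed + wait_seconds > window_seconds:
                raise
            sleep(wait_seconds)
```

With limits such as 3 attempts in 60 s, any outage longer than a minute ends the loop; widening both limits lets the caller outlast a short outage at the cost of reporting a permanent failure later.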


kalessin left a comment

This PR includes a commit that doesn't belong to the proposed change and conflicts with master. Can you clean it up?

@@ -40,7 +40,7 @@ def flush(self, slot=None):
 slot_obj = self._frontier.get(slot)
 slot_obj.flush()
 self._links_to_flush_count[slot] = 0
-LOG.info('Flushed %d link(s) to slot %s', slot)
+LOG.info('Flushed %d link(s) to slot %s', n_links_to_flush, slot)

This fix doesn't apply anymore.
