Send Kafka a TERM signal at pod stop and wait for shutdown #207

solsson · 2018-09-29T12:32:46Z

Fixes #206.

Based on https://github.com/apache/kafka/blob/trunk/bin/kafka-server-stop.sh and https://github.com/apache/kafka/blob/trunk/bin/zookeeper-server-stop.sh. Those scripts send the signal but don't wait for shutdown to complete.

Got the wait loop from https://stackoverflow.com/questions/17894720/kill-a-process-and-wait-for-the-process-to-exit
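For readers who want the idea without opening the links, here is a minimal sketch of a "send TERM, then wait" loop. It is not the script from this PR; the function name `term_and_wait` is hypothetical, and the PID is assumed to be known to the caller and signalable by the current user.

```shell
#!/bin/sh
# Hypothetical helper: ask a process to terminate, then block until it is gone.
term_and_wait() {
  # If the signal cannot be delivered, the process is already gone
  # (or not ours to signal), so there is nothing to wait for.
  kill -s TERM "$1" 2>/dev/null || return 0

  # kill -0 sends no signal; it only reports whether the PID still exists.
  while kill -0 "$1" 2>/dev/null; do
    sleep 1
  done
}
```

The loop has no timeout of its own. In a Kubernetes preStop hook that is usually acceptable, because the pod's termination grace period bounds how long the hook can run before the container is killed.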

Currently I've only tested the PR locally with little load. @stigok Can you confirm that you no longer get corrupted indices?

This reverts commit c60c28d.

solsson · 2018-09-29T12:34:57Z

The last log entry I see is INFO [KafkaServer id=0] shut down completed (kafka.server.KafkaServer)
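One way to check for that final log entry mechanically is sketched below. The function name `saw_clean_shutdown` is hypothetical, and it assumes the broker log is available as text on stdin.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the log on stdin contains the
# "shut down completed" entry quoted above.
saw_clean_shutdown() {
  # -F matches the string literally, so the dots and parentheses
  # are not treated as regex syntax; -q suppresses output.
  grep -qF 'shut down completed (kafka.server.KafkaServer)'
}
```

For example, assuming a pod named `kafka-0` (an assumption, adjust to your setup), `kubectl logs kafka-0 --previous | saw_clean_shutdown` would exit 0 if the previous container instance logged a completed shutdown.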

solsson · 2018-09-29T12:38:36Z

Maybe Zookeeper doesn't need controlled shutdown. I see no effect in logs of invoking the script.

stigok · 2018-09-30T13:03:47Z

I'm unable to reproduce the bad indices. I don't know how I ended up with them in the first place. We've been having a lot of pod restarts and failed probes running in AKS, so it could've been caused by a lot of different factors.

stigok · 2018-09-30T13:05:06Z

But this PR is certainly a step in the right direction 👍

stigok · 2018-11-18T16:55:30Z

I had bad indexes again after my disks filled up. Maybe that is a "good way" to simulate broken indexes:

1. Configure log.retention.bytes to a value greater than the available disk space
2. Produce enough messages to fill the disk
3. Watch Kafka die
4. Expand the disk and expect to see bad indexes

solsson added 5 commits September 29, 2018 13:51

Adds the ps command as a layer atop the existing kafka image

c60c28d

But the script doesn't wait for termination

8fb8dfa

Waits for termination before exiting hook

af5a8bf

Zookeeper's shutdown is identical in the kafka dist too

e282aec

Turns out we didn't need the ps command anyway

72ca76d

This reverts commit c60c28d.

solsson merged commit 198666d into master Nov 18, 2018
