Activate operators that may want to shut down #488

frankmcsherry · 2022-11-28T14:49:26Z

Operator shutdown was previously pretty loose, and only in response to operator activation. However, the conditions for shutdown can change without prompting an activation if e.g. a frontier becomes empty or a final capability is dropped. This meant that operators that should be shut down would instead linger until the dataflow itself is shut down.

This PR adds that test as progress information is pushed to operators, in order to better clean up operators mid-dataflow.

NB: Failing to shut down an operator should not have resulted in non-termination, unless operators were relying on dropping their state to signal something of consequence outward. All progress information would still be correct, and all downstream operators would receive correct frontiers.

lluki · 2022-11-28T15:32:54Z

Unfortunately it doesnt fix the operator leak of running TPC-H loadgen + materialized view with query 14. This is the situation after drop materialized view q14; and waiting 30s:

and this is mz_dataflow_operator_dataflows:

materialize=> set database to tpch;
SET
materialize=> drop materialized view q14;
DROP MATERIALIZED VIEW
materialize=> select * from mz_internal.mz_dataflow_operator_dataflows;
 id  |                   name                   | worker_id | dataflow_id |   dataflow_name   
-----+------------------------------------------+-----------+-------------+-------------------
 333 | Map                                      | 0         | 188         | Dataflow: 2.6.q14
 329 | FlatMap                                  | 0         | 188         | Dataflow: 2.6.q14
 331 | Exchange                                 | 0         | 188         | Dataflow: 2.6.q14
 326 | InspectBatch                             | 0         | 188         | Dataflow: 2.6.q14
 338 | InspectBatch                             | 0         | 188         | Dataflow: 2.6.q14
 345 | Dataflow: 2.6.q14                        | 0         | 188         | Dataflow: 2.6.q14
 188 | Dataflow: 2.6.q14                        | 0         | 188         | Dataflow: 2.6.q14
 328 | persist_sink u11 write_batches           | 0         | 188         | Dataflow: 2.6.q14
 340 | persist_sink u11 append_batches          | 0         | 188         | Dataflow: 2.6.q14
 323 | persist_sink u11 mint_batch_descriptions | 0         | 188         | Dataflow: 2.6.q14
(10 rows)

materialize=> show materialized views;
 name | cluster 
------+---------
(0 rows)

materialize=>

frankmcsherry · 2022-11-28T15:35:20Z

Ah, yes this wasn't meant to fix that for certain. It does fix a leak for e.g. simple.rs, and I'm happy to crack open the same example in MZ (the one based on generate_series(0, large)).

lluki · 2022-11-28T15:53:28Z

List of operators during various stages of the TPC-H run: ops.txt

frankmcsherry · 2022-11-28T15:55:20Z

Great, that appears to have the desired outcome, as we think that the remaining operators are the ones that haven't shut down, for whatever reason.

antiguru

This seems fine, as explained in the office hours!

One thing to keep in mind is that we could restrict the nodes added to maybe_shutdown to operators that have no inputs or the notify bit turned off.

Activate operators that may want to shut down

6eba6a6

frankmcsherry requested a review from antiguru November 28, 2022 14:49

antiguru approved these changes Nov 28, 2022

View reviewed changes

frankmcsherry merged commit 1cf7b4a into TimelyDataflow:master Jan 9, 2023

github-actions bot mentioned this pull request Oct 29, 2024

chore: release #594

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Activate operators that may want to shut down #488

Activate operators that may want to shut down #488

Uh oh!

frankmcsherry commented Nov 28, 2022

Uh oh!

lluki commented Nov 28, 2022

Uh oh!

frankmcsherry commented Nov 28, 2022

Uh oh!

lluki commented Nov 28, 2022

Uh oh!

frankmcsherry commented Nov 28, 2022

Uh oh!

antiguru left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Activate operators that may want to shut down #488

Activate operators that may want to shut down #488

Uh oh!

Conversation

frankmcsherry commented Nov 28, 2022

Uh oh!

lluki commented Nov 28, 2022

Uh oh!

frankmcsherry commented Nov 28, 2022

Uh oh!

lluki commented Nov 28, 2022

Uh oh!

frankmcsherry commented Nov 28, 2022

Uh oh!

antiguru left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants