skip pruned TUF repos when creating artifact config #9109
Conversation
I've got some minor suggestions but this looks good!
I think I'd probably rebase this one on the other one rather than combining them, at least for the purposes of review and approval. I don't feel strongly between those two options. For what it's worth, I already made a branch that starts from this one and merges in the branch from the other one. There was a tiny conflict to resolve. I could push that here if you want (and then you'd want to actually change the base branch in GitHub). (My branch is called dap/testing/pruning; see my next comment.)
nexus/test-utils/src/background.rs (outdated)
    );
};

dbg!(&last_result_completed);
not sure if you meant to leave these here
Debatable! I'd love some nicer error handling here, because when there's an {"error":"blah blah blah"} it panics without showing the message at all. This is halfway to that, and is only printed to the console if a test calls these replication-related helpers and the background tasks fail.
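As a rough sketch of what nicer handling could look like (assuming, hypothetically, that the completed-task status carries a serde_json::Value details field with an "error" key on failure; this is not the actual helper code):

// Sketch: surface the background task's error message instead of letting a
// later step panic without it. The `details` field and its "error" key are
// assumptions about the status payload's shape.
if let Some(error) = last_result_completed
    .details
    .get("error")
    .and_then(|v| v.as_str())
{
    panic!("background task failed: {error}");
}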
    opctx: &OpContext,
    repo_id: TufRepoUuid,
) -> ListResultVec<TufArtifact> {
    opctx.authorize(authz::Action::Read, &authz::FLEET).await?;
I'd do two things to try to avoid people accidentally using this in API endpoints (since we're making multiple queries here):
- add a call to opctx.check_complex_operations_allowed()?;
- add _batched() to the name to convey that (we do this in a few other places in the datastore)
Really, I'd be tempted to apply this to artifacts_for_repo, but that may break some callers that might currently be using it from the API. Those should probably be made paginated, but we can do that when the dust settles on these APIs.
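For concreteness, a minimal sketch of the suggested shape, assuming a hypothetical tuf_list_artifacts_for_repo_batched name (not the final code):

// Sketch: a batched datastore method guarded against accidental use from
// API endpoints, which shouldn't issue multiple queries per call.
pub async fn tuf_list_artifacts_for_repo_batched(
    &self,
    opctx: &OpContext,
    repo_id: TufRepoUuid,
) -> ListResultVec<TufArtifact> {
    opctx.authorize(authz::Action::Read, &authz::FLEET).await?;
    // Reject latency-sensitive callers, since this method issues
    // multiple queries while paginating through batches.
    opctx.check_complex_operations_allowed()?;
    // ... paginate through the repo's artifacts in batches ...
    unimplemented!()
}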
After #9106 lands we may be able to refactor things a little and apply this to artifacts_for_repo, since it removes the list of artifacts from the public APIs.
.inner_join(tuf_artifact_dsl::tuf_artifact.on(
    tuf_artifact_dsl::id.eq(tuf_repo_artifact_dsl::tuf_artifact_id),
))
For our use case, I think we don't need this join. We only need the artifact ids. It might be nice to have a version of this that just does that.
I think I'm going to revert artifacts_for_repo and wait to possibly paginate it during a refactor, particularly if we add check_complex_operations_allowed there. I'll write the query that simply returns the artifact IDs for a repo in the function we add instead.
No wait, we do need the join here, because we ultimately use the artifact sha256sums in the replication task.
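For reference, the ids-only variant discussed above might have looked roughly like this (a sketch; the tuf_repo_id column name and the connection plumbing are assumptions), before it became clear that the sha256 sums are needed:

// Ids-only sketch: skips the join with tuf_artifact, so the sha256 sums
// needed by the replication task would not be available.
let artifact_ids: Vec<Uuid> = tuf_repo_artifact_dsl::tuf_repo_artifact
    .filter(tuf_repo_artifact_dsl::tuf_repo_id.eq(repo_id.into_untyped_uuid()))
    .select(tuf_repo_artifact_dsl::tuf_artifact_id)
    .load_async(&*conn)
    .await?;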
Force-pushed from 9f685b4 to fdbed8b.
{
    let generation_now =
        self.datastore.tuf_get_generation(opctx).await?;
    ensure!(
        generation == generation_now,
        "generation changed from {generation} \
        to {generation_now}, bailing"
    );
}
Making sure I understand: we have to do this check because if the generation changed, the config we'd build from repos would be inconsistent with other Nexuses building a config at generation_now, right? This seems pretty critical and maybe easy to miss - maybe worth a comment? (My very first reaction reading this was "why do we need to check this? if a new repo has been pruned or added that's fine and we'll just pick it up the next time we run")
Alternatively: should tuf_list_repos_unpruned_batched() take a generation argument and fail if it changes at any point during the listing?
Initially I had this implemented so that we read the generation and then paginated through the repositories in a single transaction, but I replaced it with the datastore methods from #9107. The comment for tuf_list_repos_unpruned_batched() reads:
/// Since this involves pagination, this is not a consistent snapshot.
/// Consider using `tuf_get_generation()` before calling this function and
/// then making any subsequent queries conditional on the generation not
/// having changed.
So the second check is my interpretation of making subsequent queries conditional on the generation not having changed. I will add a comment to this effect.
Hah, yeah, one conditional check after does seem better than reasserting the condition as we page through a table. Thanks.
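Putting the pieces together, the pattern under discussion looks roughly like this (a sketch using the names from this PR, with error handling and signatures simplified):

// Snapshot the generation, do the batched (non-snapshot) listing, then
// bail if the generation moved underneath us, per the doc comment above.
let generation = datastore.tuf_get_generation(opctx).await?;
let repos = datastore.tuf_list_repos_unpruned_batched(opctx).await?;
let generation_now = datastore.tuf_get_generation(opctx).await?;
ensure!(
    generation == generation_now,
    "generation changed from {generation} to {generation_now}, bailing"
);
// The artifact config can now be built from `repos` at `generation`,
// consistent with any other Nexus that observed the same generation.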
sled-agent/src/sim/artifact_store.rs (outdated)
let mut watcher = self.storage.delete_done_rx.clone();
watcher.mark_unchanged();
watcher
Nit - I think we could return self.storage.delete_done_tx.subscribe() instead? Then we wouldn't need the mark_unchanged(), and maybe could also drop the delete_done_rx field entirely?
Oh, yes, we could do that. I forgot about the subscribe() method.
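For the record, tokio's watch channel makes the two approaches equivalent: a receiver created via Sender::subscribe() starts with the current value already marked as seen. A standalone demo (hypothetical names, not the artifact-store code):

use tokio::sync::watch;

// Demonstrates the suggestion above: `subscribe()` yields a receiver that
// has already seen the current value, so no `mark_unchanged()` is needed
// and no receiver field has to be stored for cloning.
#[tokio::main]
async fn main() {
    let (tx, _keepalive_rx) = watch::channel(0u32);

    // Equivalent to cloning a stored receiver and calling mark_unchanged():
    let mut watcher = tx.subscribe();

    tx.send(1).unwrap();
    // Only changes made after subscribing wake this receiver.
    watcher.changed().await.unwrap();
    assert_eq!(*watcher.borrow(), 1);
}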
Part of #7135. Related to (and should probably be rebased on or merged into) #9107.
No explicit tests yet, but it seems to work as expected with the existing tests.
Also modifies the artifacts_for_repo function to be paginated, as the number of artifacts per repo has grown and is expected to continue to grow.