HTTP replication #537

psarna · 2023-07-20T10:14:34Z

This draft adds an experimental endpoint which serves frames for replication purposes. It's capabilities are:

Metadata is available at /hello endpoint
Frames are available at /frames endpoint

And that's about it. For simplicity, when frames can't be served from a FrameStream and instead need to be loaded from a snapshot file, the whole contents of the snapshot is sent in a single request. That's a limitation, but it's also why the endpoint is considered experimental. For the same reason, it is right now implemented as a separate service running on a dedicated port (e.g. 8081), so that it's isolated from user workloads.

This endpoint also does not account any reads in the metrics, and it definitely should, as it's very read-heavy. We should also consider having a separate auth path for it, because its main purpose is for being used by replicas (possibly embedded), not for regular direct access to frames.

Testing setup for sqld:

cargo run -- --http-replication-listen-addr 127.0.0.1:8081

curl localhost:8081/hello
curl -d '{"next_offset": 1}' localhost:8081/frames

HTTP replication is also used in this experimental project, and in particular it can be tested via cargo run --example replica from libsql/crates/core source directory.

It still does not perform any kind of handshake, just asks for frames starting N. Tested with: $ cargo run -- --http-replication-listen-addr 127.0.0.1:8081 $ curl -d '{"next_offset": 0}' -v localhost:8081/frames

Serves the same purpose as gRPC hello and provides: - generation id - generation start index - database id

An arbitrary limit to make sure we do not overload memory.

With gRPC replication, it was reasonable to assume that there are listeners to the max frame number notifier, but with HTTP it's not necessarily the case. Since watch::send() fails if there are no receivers, we hereby switch to send_replace(), which successfully updates the value even if there are no active receivers.

The HTTP replication will now react to SnapshotRequired error by just sending all frames from a snapshot to the user. That's prone to overcommitting memory, but better than giving up. This change should be followed up by streaming the snapshot frames in multiple smaller bits.

MarinPostma · 2023-07-21T09:58:04Z

sqld/src/replication/http.rs

+    match (req.method(), req.uri().path()) {
+        (&Method::GET, "/hello") => handle_hello(logger).await,
+        (&Method::POST, "/frames") => handle_query(req, auth, logger).await,
+        _ => Ok(Response::builder().status(404).body(Body::empty()).unwrap()),
+    }


we slowly started migrating to Axum (see the admin API). I don't want to slow you down on other stuff you need to do, but if you have some time, could you use Axum instead, so we don't have to port that later 🙏

sure, I'll need to bootstrap myself with Axum first, but if we already have a precedent in the admin API, I'll start by reading that

k, done, with one minor quirk: Axum is a little overeager in refusing plaintext HTTP requests when we except a JSON (lack of proper Content-Type), so I decided to be a little more lenient and just parse the string at runtime in case it's a valid json after all.

MarinPostma · 2023-07-21T10:03:05Z

sqld/src/replication/http.rs

+    for _ in 0..MAX_FRAMES_IN_SINGLE_RESPONSE {
+        use futures::StreamExt;
+
+        match frame_stream.next().await {
+            Some(Ok(frame)) => {
+                tracing::trace!("Read frame {}", frame_stream.current_frame_no);
+                frames.push(frame);
+            }
+            Some(Err(LogReadError::SnapshotRequired)) => {
+                drop(frame_stream);
+                if frames.is_empty() {
+                    tracing::debug!("Snapshot required, switching to snapshot mode");
+                    frames = load_snapshot(logger, next_offset)?;
+                } else {
+                    tracing::debug!("Snapshot required, but some frames were read - returning.");
+                }
+                break;
+            }
+            Some(Err(e)) => {
+                tracing::error!("Error reading frame: {}", e);
+                return Ok(Response::builder()
+                    .status(hyper::StatusCode::INTERNAL_SERVER_ERROR)
+                    .body(Body::empty())
+                    .unwrap());
+            }
+            None => break,
+        }
+
+        if frame_stream.max_available_frame_no <= frame_stream.current_frame_no {
+            break;
+        }
+    }
+
+    if frames.is_empty() {
+        return Ok(Response::builder()
+            .status(hyper::StatusCode::NO_CONTENT)
+            .body(Body::empty())
+            .unwrap());
+    }
+
+    Ok(Response::builder()
+        .status(hyper::StatusCode::OK)
+        .body(Body::from(serde_json::to_string(&frames)?))
+        .unwrap())
+}


wdyt about streaming frames instead? https://docs.rs/hyper/latest/hyper/body/struct.Body.html

ah yeah, makes perfect sense, will do

@MarinPostma I'm inclined to stall returning a stream here, because of the current convoluted logic when a snapshot is needed - then, we either return whatever frames we have so far, or just ship the whole snapshot, and that whole magic logic would need to be wrapped in a single struct that implements futures::Stream. Doable of course, but perhaps in interation 2?

Following the example in admin_api.rs

psarna added 2 commits July 20, 2023 11:41

replication: add HTTP endpoint skeleton

047e744

replication: add HTTP implementation for /frames endpoint

58e8dfa

It still does not perform any kind of handshake, just asks for frames starting N. Tested with: $ cargo run -- --http-replication-listen-addr 127.0.0.1:8081 $ curl -d '{"next_offset": 0}' -v localhost:8081/frames

psarna force-pushed the http_replication branch from 8da8af7 to 58e8dfa Compare July 20, 2023 10:19

replication: add /hello endpoint

fdaa3fd

Serves the same purpose as gRPC hello and provides: - generation id - generation start index - database id

psarna force-pushed the http_replication branch from a299d08 to fdaa3fd Compare July 20, 2023 11:11

psarna marked this pull request as ready for review July 20, 2023 11:12

psarna added 3 commits July 20, 2023 14:11

replication: limit HTTP response in frames to 256

f0b0e5e

An arbitrary limit to make sure we do not overload memory.

psarna requested review from MarinPostma and penberg July 21, 2023 09:19

MarinPostma reviewed Jul 21, 2023

View reviewed changes

replication: migrate HTTP to Axum

3aba36f

Following the example in admin_api.rs

MarinPostma approved these changes Jul 21, 2023

View reviewed changes

penberg added this pull request to the merge queue Jul 24, 2023

Merged via the queue into libsql:main with commit f50803f Jul 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

HTTP replication #537

HTTP replication #537

Uh oh!

psarna commented Jul 20, 2023 •

edited

Loading

Uh oh!

MarinPostma Jul 21, 2023

Uh oh!

psarna Jul 21, 2023

Uh oh!

psarna Jul 21, 2023

Uh oh!

MarinPostma Jul 21, 2023

Uh oh!

psarna Jul 21, 2023

Uh oh!

psarna Jul 21, 2023

Uh oh!

Uh oh!

HTTP replication #537

HTTP replication #537

Uh oh!

Conversation

psarna commented Jul 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MarinPostma Jul 21, 2023

Choose a reason for hiding this comment

Uh oh!

psarna Jul 21, 2023

Choose a reason for hiding this comment

Uh oh!

psarna Jul 21, 2023

Choose a reason for hiding this comment

Uh oh!

MarinPostma Jul 21, 2023

Choose a reason for hiding this comment

Uh oh!

psarna Jul 21, 2023

Choose a reason for hiding this comment

Uh oh!

psarna Jul 21, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

psarna commented Jul 20, 2023 •

edited

Loading