OpenTelemetry support #2434

bruno-garcia · 2023-06-21T01:00:24Z

PR from work done by Matt

TODO

Update the GH discussion: Sentry + OpenTelemetry sentry#40712

Not planned

GenerateSentryErrorsFromOtelSpan is Blocked by Record Exception.Data when calling RecordException open-telemetry/opentelemetry-dotnet#2439 (comment).

This is something only our Javascript SDK has. None of the other Sentry SDKs support it.

It's not critical as we already capture errors through the normal exception handlers. It would only be an issue if a particular Otel instrumentation decided to not let exceptions bubble up, and only to report them as Otel events.

github-actions · 2023-06-21T01:10:56Z

	Messages
📖	Do not forget to update Sentry-docs with your feature once the pull request gets approved.

Generated by 🚫 dangerJS against 84d5fb7

jamescrosswell · 2023-06-22T01:36:38Z

@bruno-garcia what did you mean by "Missing a sample project (that is already in the solution filters).."? I assume you saw Samples.AspNetCore.OpenTelemetry/Sentry.Samples.AspNetCore.OpenTelemetry.csproj.

…ntry-dotnet into feat/open-telemetry

jamescrosswell · 2023-06-22T11:06:39Z

I noticed some TryGetBaggageHeader and TryGetSentryTraceHeader methods in Sentry.AspNet.HttpContextExtensions.

I also noticed almost the same in SentryTracingMiddleware in the AspNetCore project.

They're slightly different because they're targeting different HttpContext abstractions.

        var value = context.Request.Headers.Get(BaggageHeader.HttpHeaderName); // AspNet
        var value = context.Request.Headers.GetValueOrDefault(BaggageHeader.HttpHeaderName); // AspNetCore

It turns out, we also need something very similar in the SentryPropagator for Otel.

A couple of questions:

Are we trying to target AspNet and AspNetCore with our Open Telemetry efforts?
If we're not, is it OK for the Sentry.OpenTelemetry project to take a reference to Sentry.AspNetCore to use those extensions?
Those extensions do use SentryOptions.LogDebug. I'm not sure if we can inject SentryOptions into the SentryPropagator. Maybe all moot in that case. It just makes me itchy to use copy/paste as a form of code reuse 🙈

bruno-garcia · 2023-06-22T14:37:08Z

If we're not, is it OK for the Sentry.OpenTelemetry project to take a reference to Sentry.AspNetCore to use those extensions?

I'd say rather not take a dependency on AspNetCore. Any use of OTel outside ASP.NET would have that dep and also would add another dependency to deal with versioning and breaking changes.

Those two methods seem quite small. I'd say we could be pragmatic and copy them over but if I understand correctly the goal is to get the AspNetCore HTTP context object so we can read from it? If that's the case we need to consider introducing a new package like: Sentry.OpenTelemetry.AspNetCore instead. Open has:
https://www.nuget.org/packages/OpenTelemetry.Instrumentation.AspNetCore/

That does take dep on OpenTelemetry and AspNetCore:

Are we trying to target AspNet and AspNetCore with our Open Telemetry efforts?

In any regard better to break the work up in parts, if we can do core bits first, we can follow up with AspNetCore after (we can leave AspNet out for the time being, possibly forever)

Those extensions do use SentryOptions.LogDebug. I'm not sure if we can inject SentryOptions into the SentryPropagator. Maybe all moot in that case. It just makes me itchy to use copy/paste as a form of code reuse

We've been passing SentryOptions all over the place. Often all we need is the IDiagnosticLogger but giving the option allows us to change the logger instance mid-flight for example, and start logging (e.g: going from null to a ConsoleLogger). If passing this instance is done automagically by us under the hood i don't see why not? If we expect the user to pass it, that becomes a problem because the API sounds poor/leaky

bruno-garcia · 2023-06-22T14:42:38Z

@bruno-garcia what did you mean by "Missing a sample project (that is already in the solution filters).."? I assume you saw Samples.AspNetCore.OpenTelemetry/Sentry.Samples.AspNetCore.OpenTelemetry.csproj.

Oh, right! I'll check on why it's erroring out on the solution filters then

jamescrosswell · 2023-06-22T22:10:17Z

In any regard better to break the work up in parts, if we can do core bits first, we can follow up with AspNetCore after (we can leave AspNet out for the time being, possibly forever)

I think we could support Otel span processors without any dependency on HttpContext.

However, propagation has an inherent dependency on the concept of "Http Headers", since that's how the trace id gets passed around (at least in Sentry's code - I'm not sure how it gets passed into things like DB servers). So I don't think we can include a solution for propagation without a dependency on Microsoft.AspNetCore.Http.Abstractions and/or System.Web (for AspNet).

Options for that:

We put it all in a Sentry.OpenTelemetry package with directives to select the right dependencies, based on the target framework. This would be a bit of a bummer for people wanting to use Otel in non web based projects though
We pull it out to separate Sentry.OpenTelemetry.Propagation.AspNet and Sentry.OpenTelemetry.Propagation.AspNetCore assemblies
Something else?

src/Sentry.OpenTelemetry/SentryPropagator.cs

…ntry-dotnet into feat/open-telemetry

jamescrosswell · 2023-06-26T23:55:24Z

The Dynamic Sampling Context Problem

This is the bit that Matt was wrestling with when I started working on the PR, so probably worth documenting, as it's really not that obvious. The following is an explanation of what's going on in the Java implementation. We may have to get creative to solve in .NET as we don't have the same building blocks to work with that they did in Java.

Java implementation

Issue: Java SDK basic OTEL Support sentry-java#2327
Pull requests:

Extracting

SentryPropagator.extract

    Baggage baggage = Baggage.fromHeader(baggageString);
    modifiedContext = modifiedContext.with(SentryOtelKeys.SENTRY_BAGGAGE_KEY, baggage);

.. which indirectly calls this overload,
the only important detail of which is that includeThirdPartyValues==false so basically we just extract any values from
the baggage header that are prefixed with sentry-,

public static Baggage fromHeader(
      final @Nullable String headerValue,
      final boolean includeThirdPartyValues,
      final @NotNull ILogger logger)

All the sentry-* values from the baggage header get stuffed back in the context (as can be seen from the very first
code snippet above) using modifiedContext.with which doesn't exist in the .NET OTEL SDK:

return value.storeInContext(this);

And that puts the data in something implementing ContextStorage... The concrete class is probably not relevant. It'll be stored somewhere so it can be used later. The key question then is where does this DSC get used later?

SentryPropagator.inject below gets everything it needs for DSC injection from the Transaction associated with the
Sentry Span... And it puts all of those things directly in the baggage header of the carrier (e.g. an HttpRequest
header). So the stuff in ContextStorage is not used for injection.

Where it does appear to be used is in SpanProcessor.onStart, which calls SentrySpanProcessor.getTraceData

baggage = parentContext.get(SentryOtelKeys.SENTRY_BAGGAGE_KEY);

When creating a new Sentry Span representing a Transaction, if that information is available then it's used to create
the new Transaction

  TransactionContext.fromSentryTrace(
      transactionName,
      transactionNameSource,
      op,
      traceData.getSentryTraceHeader(),
      traceData.getBaggage(),
      spanId);

Injecting DSC

SentryPropagator.inject

    final @Nullable BaggageHeader baggageHeader = sentrySpan.toBaggageHeader(Collections.emptyList());
    if (baggageHeader != null) {
      setter.set(carrier, baggageHeader.getName(), baggageHeader.getValue());
    }

.. which eventually translates into:

SentryTracer.updateBaggageValues

    baggage.setValuesFromTransaction(
        this, userAtomicReference.get(), hub.getOptions(), this.getSamplingDecision());

... and finally into:

baggage.setValuesFromTransaction

    
    public void setValuesFromTransaction(
      final @NotNull ITransaction transaction,
      final @Nullable User user,
      final @NotNull SentryOptions sentryOptions,
      final @Nullable TracesSamplingDecision samplingDecision
      ) 
    {
        setTraceId(transaction.getSpanContext().getTraceId().toString());
        setPublicKey(new Dsn(sentryOptions.getDsn()).getPublicKey());
        setRelease(sentryOptions.getRelease());
        setEnvironment(sentryOptions.getEnvironment());
        setUserSegment(user != null ? getSegment(user) : null);
        setTransaction(isHighQualityTransactionName(transaction.getTransactionNameSource()) ? transaction.getName() : null);
        setSampleRate(sampleRateToString(sampleRate(samplingDecision)));
    }

The Dynamic Sampling Context Solution

In the .NET SDK the equivalent of the Context object that is being passed around in Java is PropagationContext which
only contains two properties:

public ActivityContext ActivityContext { get; }
public Baggage Baggage { get; }

It turns out those properties are sufficient for our purposes. Some of the details required to create a DSC come from
baggage. The Java implementation also uses SentryTraceHeader.TraceId and
SentryTraceHeader.SpanId... which we extract/store in the PropagationContext.ActivityContext.

So we already have everything we need. We simply need to leverage these in SentrySpanProcessor.OnStart when creating Spans for Transactions.

…ntry

jamescrosswell · 2023-06-27T11:55:43Z

src/Sentry.OpenTelemetry/SentryPropagator.cs

+
+        var activityContext = new ActivityContext(
+            sentryTraceHeader.TraceId.AsActivityTraceId(),
+            sentryTraceHeader.SpanId.AsActivitySpanId(),


This is possibly incorrect. The sentryTraceHeader.SpanId would be the Id of the upstream span right? Should we be using this to set the SpanTracer.ParentSpanId orTransactionTracer.ParentSpanId in the SentrySpanProcessor instead? And then we'd set the ActivityContext.SpanId here to ActivityId.GetRandom()?

It seems to be what the Java code is doing though...

final SpanId spanId = new SpanId(traceData.getSpanId());

jamescrosswell · 2023-06-27T12:00:11Z

@bruno-garcia / @bitsandfoxes I think this is "first draft complete". Keen to get a proper review on this.

Also, let me know if you can think of a more comprehensive way of integration testing this.

Currently I'm using the following Python script, which makes a call to the hello endpoint of Sentry.Samples.OpenTelemetry.AspNetCore:

import requests
import sentry_sdk
#
sentry_sdk.init(
    dsn="yourDsnHere",
    traces_sample_rate=1.0,
    debug=True
)

def eat_slice(slice):
    with sentry_sdk.start_span(description="Eat Slice") as span:
        print(f'span.trace_id: {span.trace_id}')
        print(f'span.span_id: {span.span_id}')
        span.set_tag("slice.topping", slice)
        response = requests.get('http://localhost:5092/hello')
        print(response.status_code)

def eat_pizza():
    while True:
        user_input = input("What kind of pizza do you want to eat?")
        if user_input.lower() == "quit":
            break
        else:
            with sentry_sdk.start_transaction(op="task", name="Eat Pizza") as transaction:
                print(f'transaction.trace_id: {transaction.trace_id}')
                eat_slice(user_input)

eat_pizza()

The /hello endpoint then makes an HTTP request of it's own to /echo... so that tests:

An outbound request from Python, with sentry-trace and baggage headers set
An inbound call to /hello where trace information gets pulled from the setnry-trace and baggage headers (tests Propagator.Extract)
Appropriate transaction span gets created by the .NET SDK (tests SpanProcessor for root spans)
An outbound call to /echo where trace information gets propagated via the setnry-trace and baggage headers (tests Propagator.Inject)
A couple of non-transaction spans get created by the .NET SDK (tests SpanProcessor for non root spans)

All of that then shows up in Sentry looks something like this:

Maybe instead of calling it's own /echo endpoint, the Python app could be a Flask server with an echo endpoint that the ASPNETCORE app calls back... that would guarantee the trace data was being propagated out of process?

jamescrosswell · 2023-06-28T01:40:37Z

@bruno-garcia suggested creating a new PR for this so that I can get a review from him (rather than the other way around)... I'll close this PR and create a new one with all the relevant history/context then.

mattjohnsonpint added 8 commits June 14, 2023 21:47

Initial OTel implementation and ASP.NET Core sample

cd109ae

.

5fe897e

.

5e05c03

Add Instrumenter checks

869cba5

Refactoring

0b126a2

Start Sentry Propagagator

47eca60

Update sample to use propagator

9ca8265

Update otel to 1.5.0

a1095c1

bruno-garcia requested review from SeanFeldman, bitsandfoxes and jamescrosswell as code owners June 21, 2023 01:00

Merge branch 'main' into feat/open-telemetry

5fbe3b7

jamescrosswell linked an issue Jun 21, 2023 that may be closed by this pull request

OpenTelemetry Support #2066

Closed

jamescrosswell removed a link to an issue Jun 21, 2023

OpenTelemetry Support #2066

Closed

jamescrosswell added 3 commits June 22, 2023 16:52

Added initial tests for SentrySpanProcessorTests

278ca85

Merge branch 'feat/open-telemetry' of https://github.com/getsentry/se…

5e818a1

…ntry-dotnet into feat/open-telemetry

Added Sentry.Samples.AspNetCore.OpenTelemetry.csproj to Sentry.Full.sln

6fc599f

First hypothesis for SentryPropagator.Extract - sharing for discussion

5b0de89

jamescrosswell reviewed Jun 22, 2023

View reviewed changes

src/Sentry.OpenTelemetry/SentryPropagator.cs Outdated Show resolved Hide resolved

bruno-garcia and others added 5 commits June 22, 2023 19:27

add otel to full sln

2a71d6f

remove mobile tfms from otel tests

8e0907a

Removed dependency on AspNetCore package

ed02304

Merge branch 'feat/open-telemetry' of https://github.com/getsentry/se…

1a0f6f1

…ntry-dotnet into feat/open-telemetry

Updated CHANGELOG

b9e4906

Implemented SentryPropagator.Inject

e3c8191

jamescrosswell changed the title ~~feat: OpenTelemetry span processor~~ OpenTelemetry support Jun 26, 2023

jamescrosswell added 12 commits June 26, 2023 16:34

Added tests for SentryPropagator

930647d

Merge branch 'main' into feat/open-telemetry

5155436

Renamed Open Telemetry sample project

0f2da04

Added unit tests for IsSentryRequest

ab8cdc8

Added tests for SentrySpanProcessor.OnEnd

96e842f

Merge branch 'feat/open-telemetry' of https://github.com/getsentry/se…

97ae522

…ntry-dotnet into feat/open-telemetry

Added OpenTelemetry projects to solution filters

dc7e703

Removed unnecessary references from Sentry.OpenTelemetry.Tests project

c8cbccf

Fixed broken Sentry.Full.sln... hopefully

53c5cbf

Update Sentry-CI-Build-Linux.slnf

e51b8d8

Update Sentry-CI-Build-Windows.slnf

2d2b713

Update Sentry-CI-Build-macOS.slnf

c222272

smeubank linked an issue Jun 26, 2023 that may be closed by this pull request

OpenTelemetry Support #2066

Closed

Added tests to ensure we're not double instrumenting

5c64a5d

jamescrosswell self-assigned this Jun 26, 2023

jamescrosswell added 2 commits June 27, 2023 14:45

Inject DynamicSamplingContext in SentrySpanProcessor.OnStart

1ac3c55

Fixed multiple transaction roots problem when showing traces on getse…

a1bd3c3

…ntry

jamescrosswell reviewed Jun 27, 2023

View reviewed changes

Integrated some tweaks from Matt's stashed work

84d5fb7

jamescrosswell closed this Jun 28, 2023

jamescrosswell removed a link to an issue Jun 28, 2023

OpenTelemetry Support #2066

Closed

Uh oh!

OpenTelemetry support #2434

OpenTelemetry support #2434

Uh oh!

Conversation

bruno-garcia commented Jun 21, 2023 • edited by jamescrosswell Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TODO

Not planned

Uh oh!

github-actions bot commented Jun 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jamescrosswell commented Jun 22, 2023

Uh oh!

jamescrosswell commented Jun 22, 2023

Uh oh!

bruno-garcia commented Jun 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bruno-garcia commented Jun 22, 2023

Uh oh!

jamescrosswell commented Jun 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

jamescrosswell commented Jun 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

The Dynamic Sampling Context Problem

Java implementation

Extracting

Injecting DSC

The Dynamic Sampling Context Solution

Uh oh!

jamescrosswell Jun 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jamescrosswell Jun 27, 2023

Choose a reason for hiding this comment

Uh oh!

jamescrosswell commented Jun 27, 2023

Uh oh!

jamescrosswell commented Jun 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

bruno-garcia commented Jun 21, 2023 •

edited by jamescrosswell

Loading

github-actions bot commented Jun 21, 2023 •

edited

Loading

bruno-garcia commented Jun 22, 2023 •

edited

Loading

jamescrosswell commented Jun 22, 2023 •

edited

Loading

jamescrosswell commented Jun 26, 2023 •

edited

Loading

jamescrosswell Jun 27, 2023 •

edited

Loading