Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce sampling package as reference implementation for OTEP 235 #29720

Merged
merged 42 commits into from
Jan 31, 2024
Merged
Changes from 1 commit
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
2741a32
Add comments, README, metadata.yaml relative to #24811
jmacd Dec 8, 2023
91f6909
add new package
jmacd Dec 8, 2023
2834241
crosslink
jmacd Dec 8, 2023
12530ec
doc
jmacd Dec 8, 2023
4a83262
more place
jmacd Dec 8, 2023
b4c9b78
Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…
jmacd Dec 11, 2023
53ac105
changelog
jmacd Dec 11, 2023
910beac
no change here
jmacd Dec 11, 2023
5997a19
more linty
jmacd Dec 11, 2023
2e9ddeb
lintier than thou
jmacd Dec 11, 2023
1909040
Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…
jmacd Dec 11, 2023
3de01cc
fix that
jmacd Dec 11, 2023
1307b1d
hmm
jmacd Dec 11, 2023
28ea1e4
delint
jmacd Dec 11, 2023
5661ced
do what it says
jmacd Dec 11, 2023
95ac7f0
Comment on why must() and mustNot() are here
jmacd Dec 19, 2023
47ddb07
more comments
jmacd Dec 19, 2023
0042984
more comment
jmacd Dec 19, 2023
b9513d8
un-export a few things
jmacd Dec 20, 2023
b33ceeb
document the API
jmacd Dec 20, 2023
0feb399
typos
jmacd Jan 10, 2024
9f4d577
Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…
jmacd Jan 10, 2024
6204743
update pdata dep
jmacd Jan 10, 2024
ebc99ef
several testable examples
jmacd Jan 23, 2024
0302872
Apply suggestions from code review
jmacd Jan 23, 2024
be84226
Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…
jmacd Jan 25, 2024
efb3992
move probability test
jmacd Jan 25, 2024
c76b345
small prob examples
jmacd Jan 25, 2024
dbb04d2
improve probability testing; add comments to the example tests for re…
jmacd Jan 25, 2024
8fc6004
more testing
jmacd Jan 26, 2024
554c351
doc comments
jmacd Jan 26, 2024
c8c196d
Add higher-level example in doc comment
jmacd Jan 26, 2024
e294124
test for no-error, fix test as called out by jpk
jmacd Jan 26, 2024
f48df39
Apply suggestions from code review
jmacd Jan 26, 2024
637a787
lcAlphanum
jmacd Jan 26, 2024
f4e4d7a
Apply suggestions from code review
jmacd Jan 26, 2024
c7c796b
Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…
jmacd Jan 26, 2024
c565f3c
tidy
jmacd Jan 26, 2024
231c3e4
Update pkg/sampling/doc.go
jmacd Jan 30, 2024
896f4b8
Remove most of the Has methods
jmacd Jan 30, 2024
ba64e94
Merge branch 'jmacd/pkgsampl' of github.com:jmacd/opentelemetry-colle…
jmacd Jan 30, 2024
a3844ca
update/tidy
jmacd Jan 30, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
doc comments
  • Loading branch information
jmacd committed Jan 26, 2024
commit 554c3516cdc790a32afb2b9e905baa1f20273263
184 changes: 29 additions & 155 deletions pkg/sampling/doc.go
Original file line number Diff line number Diff line change
@@ -1,160 +1,34 @@
// Copyright The OpenTelemetry Authors
// SPDX-License-Identifier: Apache-2.0

// ### TraceState handling types
// # TraceState representation
//
// The TraceState handling routines in this package preserve unrecognized
// key values which may be present in the OpenTelemetry `tracestate` header.

// - `Threshold`: Represents an exact sampling probability.
// - `Randomness`: Randomness used for sampling decisions.
// - `Probability`: a float64 in the range `[MinSamplingProbability, 1.0]`.

// #### W3C TraceState

// A W3C `tracestate` header can be parsed to extract sampling parameters
// and pass through unrecognized fields.

// - `NewW3CTraceState`: Parse a W3C `tracestate` header with syntax like
// `k1=v1,v2=v2`.
// - `W3CTraceState.Serialize`: After modifying a TraceState with a
// sampling update, this produces a new string encoding to be placed
// back into the span.

// The `W3CTraceState` accessor methods are:

// - `W3CTraceState.HasOTelValue`: Returns true when the `tracestate`
// contains an `ot` field representing the OpenTelemetry trace state.
// Callers may opt to take a fast path when the OpenTelemetry
// tracestate is empty.
// - `W3CTraceState.OTelValue`: Returns a mutable `*OpenTelemetryTraceState`
// belonging to this object.

// `W3CTraceState` objects also support the common TraceState members.

// #### OpenTelemetry TraceState

// An OpenTelemetry `tracestate` field is extracted using the `ot` field
// of the W3C TraceState and is parsed to extract sampling parameters and
// pass through unrecognized fields.

// - `NewOpenTelemetryTraceState`: Parse an OpenTelemetry `tracestate` header with
// syntax like `k1:v1;v2:v2`.
// - `OpenTelemetryTraceState.Serialize`: After modifying a TraceState with a
// sampling update, this produces a new `ot` value.

// There are `OpenTelemetryTraceState` accessor methods associated with `Threshold` and
// `Randomness` discussed below. `OpenTelemetryTraceState` objects also support
// the common TraceState members.

// #### Common TraceState members

// Both W3C and OpenTelemety TraceState values allow additional,
// syntactically valid fields to be stored and re-serialized, that
// effectively pass through samplers as a result.

// - `HasAnyValue`: Returns true when the `tracestate` is not empty.
// Callers may opt to take a fast path when a tracestate is empty.
// - `HasExtraValues`: Returns true when there are unrecognized values
// contained in a TraceState for manual inspection.
// - `ExtraValues`: Returns a slice of `KV` corresponding with additional
// values. This may be by callers to inspect arbitrary fields, such as
// the legacy `p` value representing power-of-two sampling probability.
// - `KV`: Represents unrecognized key-value pairs from in a TraceState
// object.

// #### Sampling Threshold

// The `Threshold` type represents a parsed T-value. Samplers that use a
// fixed probability can store these to make `ShouldSample` an
// inexpensive operation. Samplers that use dynamic probability can
// compute them on the fly. The constructors are:

// - `TValueToThreshold`: Parse an encoded `Threshold` from an encoded
// OpenTelemetry TraceState `tv` field.
// - `ProbabilityToThreshold`: Translate a `float64` probability into a
// `Threshold` with up to 14 hex-digits of precision.
// - `ProbabilityToThresholdWithPrecision`: Translate a `float64`
// probability into a `Threshold` with a limited number of hex-digits
// of precision.

// The `Threshold` methods are:

// - `Threshold.TValue`: Encode the T-value corresponding with `Threshold`.
// - `Threshold.ShouldSample`: Make a sampling decision for a given randomness object.
// - `Threshold.Probability`: Convert a `Threshold` to an exact probability.
// - `ThresholdGreater` and `ThresholdLessThan`: Threshold objects are
// inversely proportional with probability (i.e., smaller threshold
// values indicate larger sampling probabilities). These comparison
// functions allow a sampler to detect when sampling will be
// inconsistent or have no change, depending on the application. If
// two Thresholds are neither less-than or greater to each other, they
// are necessarily equal.

// The `OpenTelemetryTraceState` accessor methods are:

// - `OpenTelemetryTraceState.HasTValue`: Returns true when the tracestate contains a T-value,
// indicating that `TValueThreshold` is valid.
// - `OpenTelemetryTraceState.SetTValue`: Updates the T-value, unconditioanlly modifying
// information about probability sampling.
// - `OpenTelemetryTraceState.ClearTValue`: Allows the sampler to clear an inconsistent T-value.
// - `OpenTelemetryTraceState.TValueThreshold`: This represents a sampling threshold, primarily
// supports the `ShouldSample(Randomness)` interface.
// - `OpenTelemetryTraceState.UpdateTValueWithSampling`: Performs a logically-consistent sampling
// update, which modifies T-value but does not permit raising the sampling
// probability. When successful, this results in an change in `AdjustedCount()`.
// - `OpenTelemetryTraceState.AdjustedCount`: If T-value is present, this returns the inverse
// sampling probability, which should be interpreted as an effective
// count for the item. If T-value is not present, this returns 0.

// #### Sampling Randomness

// The `Randomness` type represents a parsed R-value. Samplers may
// interpret randomness from two sources. The constructors are:

// - `RValueToRandomness`: Parse an encoded Randomness value from the
// OpenTelemetry TraceState `rv` field. This should be used when the
// TraceState contains an `rv` field.
// - `TraceIDToRandomness`: Extract a Randomness value from a TraceID.
// This should be used to make a sampling decision when there is
// otherwise no randomness available.

// The `Randomness` methods are:

// - `Randomness.RValue`: Encode a `rv` field value from `Randomness`.

// The `OpenTelemetryTraceState` accessor methods are:

// - `OpenTelemetryTraceState.RValueRandomness`: Returns the pre-parsed `Randomness` value
// corresponding with the R-value present in the TraceState. When this
// field is present, it will be used instead of TraceID randomness.
// - `OpenTelemetryTraceState.HasRValue`: Returns true when the tracestate contains an explicit
// R-value, in which case `RValueRandomness()` returns a valid result.
// - `OpenTelemetryTraceState.SetRValue`: Modifies the TraceState R-value, which allows a sampler
// to make randomness explicit.
// - `OpenTelemetryTraceState.ClearRValue`: Allows the sampler to clear an inconsistent R-value.

// #### Error values

// - `ErrTraceStateSize`: This is returned when parsing a TraceState with an over-size-limit field.
// - `ErrTValueEmpty`: Returned when T-value has size 0.
// - `ErrTValueSize`: Returned when T-value has size above 14 hex digits.
// - `ErrRValueSize`: Returned when R-value has size not equal to 14 hex digits.
// - `ErrInconsistentSampling`: Indicates an attempt to raise effective sampling probability.
// - `ErrProbabilityRange`: Sampling probability greater than 100% or less than `MinSamplingProbability`.
// - `ErrPrecisionUnderflow`: Sampling precision is too low for the stated probability.

// #### Exported values and constants

// - `AlwaysSampleThreshold`: Represents 100% sampling, which is T-value 0.
// - `MaxAdjustedCount`: Equals 2 to the power 56.
// - `MinSamplingProbability`: Equals `1/MaxAdjustedCount` (i.e., `0x1p-56`).
// - `NumHexDigits`: Equals 14, the number of hex digits required to
// encode 56 bits. This is the maximum supported precision, the maximum
// length of a T-value, and the expected length of an R-value.

package sampling // import "github.com/open-telemetry/opentelemetry-collector-contrib/pkg/sampling"

// Note that the following does not work for non-component packages:
// A [W3CTraceState] object parses and stores the OpenTelemetry
// tracestate field and any other fields that are present in the
// W3C tracestate header, part of the [W3C tracecontext specification].
//
// An [OpenTelemetryTraceState] object parses and stores fields of
// the OpenTelemetry-specific tracestate field, including those recognized
// for probability sampling and any other fields that are present. The
// syntax of the OpenTelemetry field is specified in [Tracestate handling].
//
// The probability sampling-specific fields used here are specified in
// [OTEP 235]. The principle, named fields are:
jmacd marked this conversation as resolved.
Show resolved Hide resolved
//
// //go:generate mdatagen metadata.yaml
// - T-value: The sampling rejection threshold, expresses a 56-bit
// hexadecimal number of traces that will be rejected by sampling.
// - R-value: The sampling randomness value can be implicit in a TraceID,
// otherwise it is explicitly encoded as an R-value.
//
// # Low-level types
//
// The three key data types implemented in this package represent sampling
// decisions.
//
// - [Threshold]: Represents an exact sampling probability.
// - [Randomness]: Randomness used for sampling decisions.
// - [Threshold.Probability]: a float64 in the range [MinSamplingProbability, 1.0].
//
// [W3C tracecontext specification]: https://www.w3.org/TR/trace-context/#tracestate-header
// [Tracestate handling]: https://opentelemetry.io/docs/specs/otel/trace/tracestate-handling/
package sampling // import "github.com/open-telemetry/opentelemetry-collector-contrib/pkg/sampling"
Loading