Skip to content

Latest commit

 

History

History
161 lines (120 loc) · 14.5 KB

cassandra.md

File metadata and controls

161 lines (120 loc) · 14.5 KB

Semantic Conventions for Cassandra

Status: Experimental

The Semantic Conventions for Cassandra extend and override the Database Semantic Conventions.

db.system MUST be set to "cassandra" and SHOULD be provided at span creation time.

Attributes

Attribute Type Description Examples Requirement Level Stability
db.collection.name string The name of the Cassandra table that the operation is acting upon. [1] public.users; customers Conditionally Required [2] Experimental
db.namespace string The keyspace associated with the session. [3] mykeyspace Conditionally Required If available. Experimental
db.operation.name string The name of the operation or command being executed. [4] findAndModify; HMSET; SELECT Conditionally Required [5] Experimental
db.response.status_code string Cassandra protocol error code represented as a string. [6] 102; 40020 Conditionally Required [7] Experimental
error.type string Describes a class of error the operation ended with. [8] timeout; java.net.UnknownHostException; server_certificate_invalid; 500 Conditionally Required If and only if the operation failed. Stable
server.port int Server port number. [9] 80; 8080; 443 Conditionally Required [10] Stable
db.cassandra.consistency_level string The consistency level of the query. Based on consistency values from CQL. all; each_quorum; quorum Recommended Experimental
db.cassandra.coordinator.dc string The data center of the coordinating node for a query. us-west-2 Recommended Experimental
db.cassandra.coordinator.id string The ID of the coordinating node for a query. be13faa2-8574-4d71-926d-27f16cf8a7af Recommended Experimental
db.cassandra.idempotence boolean Whether or not the query is idempotent. Recommended Experimental
db.cassandra.page_size int The fetch size used for paging, i.e. how many rows will be returned at once. 5000 Recommended Experimental
db.cassandra.speculative_execution_count int The number of times a query was speculatively executed. Not set or 0 if the query was not executed speculatively. 0; 2 Recommended Experimental
db.operation.batch.size int The number of queries included in a batch operation. [11] 2; 3; 4 Recommended Experimental
db.query.summary string Low cardinality representation of a database query text. [12] SELECT wuser_table; INSERT shipping_details SELECT orders; get user by id Recommended [13] Experimental
db.query.text string The database query being executed. [14] SELECT * FROM wuser_table where username = ?; SET mykey ? Recommended [15] Experimental
network.peer.address string Peer address of the database node where the operation was performed. [16] 10.1.2.80; /tmp/my.sock Recommended Stable
network.peer.port int Peer port number of the network connection. 65123 Recommended if and only if network.peer.address is set. Stable
server.address string Name of the database host. [17] example.com; 10.1.2.80; /tmp/my.sock Recommended Stable
db.query.parameter.<key> string A query parameter used in db.query.text, with <key> being the parameter name, and the attribute value being a string representation of the parameter value. [18] someval; 55 Opt-In Experimental

[1]: It is RECOMMENDED to capture the value as provided by the application without attempting to do any case normalization.

A single database query may involve multiple collections.

If the collection name is parsed from the query text, it SHOULD only be captured for queries that contain a single collection and it SHOULD match the value provided in the query text including any schema and database name prefix.

For batch operations, if the individual operations are known to have the same collection name then that collection name SHOULD be used.

If the operation or query involves multiple collections, db.collection.name SHOULD NOT be captured.

This attribute has stability level RELEASE CANDIDATE.

[2]: If readily available and if a database call is performed on a single collection. The collection name MAY be parsed from the query text, in which case it SHOULD be the single collection name in the query.

[3]: If a database system has multiple namespace components, they SHOULD be concatenated (potentially using database system specific conventions) from most general to most specific namespace component, and more specific namespaces SHOULD NOT be captured without the more general namespaces, to ensure that "startswith" queries for the more general namespaces will be valid. Semantic conventions for individual database systems SHOULD document what db.namespace means in the context of that system. It is RECOMMENDED to capture the value as provided by the application without attempting to do any case normalization. This attribute has stability level RELEASE CANDIDATE.

[4]: It is RECOMMENDED to capture the value as provided by the application without attempting to do any case normalization.

A single database query may involve multiple operations. If the operation name is parsed from the query text, it SHOULD only be captured for queries that contain a single operation or when the operation name describing the whole query is available by other means.

For batch operations, if the individual operations are known to have the same operation name then that operation name SHOULD be used prepended by BATCH , otherwise db.operation.name SHOULD be BATCH or some other database system specific term if more applicable.

This attribute has stability level RELEASE CANDIDATE.

[5]: If readily available and if there is a single operation name that describes the database call. The operation name MAY be parsed from the query text, in which case it SHOULD be the single operation name found in the query.

[6]: The status code returned by the database. Usually it represents an error code, but may also represent partial success, warning, or differentiate between various types of successful outcomes. Semantic conventions for individual database systems SHOULD document what db.response.status_code means in the context of that system. This attribute has stability level RELEASE CANDIDATE.

[7]: If the operation failed and status code is available.

[8]: The error.type SHOULD match the db.response.status_code returned by the database or the client library, or the canonical name of exception that occurred. When using canonical exception type name, instrumentation SHOULD do the best effort to report the most relevant type. For example, if the original exception is wrapped into a generic one, the original exception SHOULD be preferred. Instrumentations SHOULD document how error.type is populated.

[9]: When observed from the client side, and when communicating through an intermediary, server.port SHOULD represent the server port behind any intermediaries, for example proxies, if it's available.

[10]: If using a port other than the default port for this DBMS and if server.address is set.

[11]: Operations are only considered batches when they contain two or more operations, and so db.operation.batch.size SHOULD never be 1. This attribute has stability level RELEASE CANDIDATE.

[12]: db.query.summary provides static summary of the query text. It describes a class of database queries and is useful as a grouping key, especially when analyzing telemetry for database calls involving complex queries. Summary may be available to the instrumentation through instrumentation hooks or other means. If it is not available, instrumentations that support query parsing SHOULD generate a summary following Generating query summary section. This attribute has stability level RELEASE CANDIDATE.

[13]: if readily available or if instrumentation supports query summarization.

[14]: For sanitization see Sanitization of db.query.text. For batch operations, if the individual operations are known to have the same query text then that query text SHOULD be used, otherwise all of the individual query texts SHOULD be concatenated with separator ; or some other database system specific separator if more applicable. Even though parameterized query text can potentially have sensitive data, by using a parameterized query the user is giving a strong signal that any sensitive data will be passed as parameter values, and the benefit to observability of capturing the static part of the query text by default outweighs the risk. This attribute has stability level RELEASE CANDIDATE.

[15]: Non-parameterized query text SHOULD NOT be collected by default unless there is sanitization that excludes sensitive data, e.g. by redacting all literal values present in the query text. See Sanitization of db.query.text. Parameterized query text SHOULD be collected by default (the query parameter values themselves are opt-in, see db.query.parameter.<key>).

[16]: If a database operation involved multiple network calls (for example retries), the address of the last contacted node SHOULD be used.

[17]: When observed from the client side, and when communicating through an intermediary, server.address SHOULD represent the server address behind any intermediaries, for example proxies, if it's available.

[18]: Query parameters should only be captured when db.query.text is parameterized with placeholders. If a parameter has no name and instead is referenced only by index, then <key> SHOULD be the 0-based index. This attribute has stability level RELEASE CANDIDATE.

The following attributes can be important for making sampling decisions and SHOULD be provided at span creation time (if provided at all):

db.cassandra.consistency_level has the following list of well-known values. If one of them applies, then the respective value MUST be used; otherwise, a custom value MAY be used.

Value Description Stability
all all Experimental
any any Experimental
each_quorum each_quorum Experimental
local_one local_one Experimental
local_quorum local_quorum Experimental
local_serial local_serial Experimental
one one Experimental
quorum quorum Experimental
serial serial Experimental
three three Experimental
two two Experimental

error.type has the following list of well-known values. If one of them applies, then the respective value MUST be used; otherwise, a custom value MAY be used.

Value Description Stability
_OTHER A fallback error value to be used when the instrumentation doesn't define a custom value. Stable