Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Proposal for vector db semantic convention #1231

Open
wants to merge 44 commits into
base: main
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
44 commits
Select commit Hold shift + click to select a range
fa8ee30
Proposal for vector db semantic convention
ezimuel Jul 10, 2024
7068720
Merge + applied feedbacks #1231
ezimuel Jul 17, 2024
5e12a86
Removed allow_custom_values: true in db.yaml
ezimuel Jul 17, 2024
3b61784
Fixed merge
ezimuel Jul 18, 2024
828bacc
Merge branch 'main' into vector-db
ezimuel Jul 20, 2024
53d82d4
Merge remote-tracking branch 'upstream/main' into vector-db
ezimuel Aug 5, 2024
a3330ff
Updated dimension_count and similarity_metric
ezimuel Aug 5, 2024
e5ff387
Merge remote-tracking branch 'origin/vector-db' into vector-db
ezimuel Aug 5, 2024
da6649b
Merge branch 'main' into vector-db
ezimuel Aug 7, 2024
d99ec10
Fix array attribute examples (#1325)
lmolkova Aug 8, 2024
61b0f2c
Add k8s.{pod,node}.cpu.{time,usage} metrics (#1320)
ChrsMark Aug 11, 2024
ceae2ca
Db metrics pending requests (#1290)
maryliag Aug 12, 2024
6db7ec5
Fix `process.args_count` attribute (#1331)
lmolkova Aug 12, 2024
e5e0d9d
Add k8s.volume.{name,type} attributes (#1251)
ChrsMark Aug 14, 2024
ae0e066
Add tests for rego policies (#1334)
MadVikingGod Aug 14, 2024
03b67bf
add `nodejs.eventloop.time` metric (#1259)
maryliag Aug 15, 2024
93d2cbe
chore: Remove support for the event `fields` referencing/inheriting d…
MSNev Aug 18, 2024
f411554
Attempt to optimise attribute name collision checks. (#1328)
jsuereth Aug 19, 2024
daa0a14
(chore) Add dependabot config to keep tooling up to date. (#1346)
jsuereth Aug 19, 2024
bc8a63c
Fix broken docker link (#1332)
ChrsMark Aug 19, 2024
a5f8661
Bump markdownlint-cli from 0.31.0 to 0.41.0 (#1349)
dependabot[bot] Aug 19, 2024
d996cd9
Bump go.opentelemetry.io/build-tools/chloggen from 0.12.0 to 0.14.0 i…
dependabot[bot] Aug 19, 2024
a10e75f
Bump gulp from 4.0.2 to 5.0.0 (#1348)
dependabot[bot] Aug 19, 2024
fd0f2e7
Fix link anchors (#1354)
lmolkova Aug 19, 2024
1c6bd00
chore: update ids (#1352)
maryliag Aug 20, 2024
9feb74d
Removed db.vector.id and added db.record.id, renamed db.vector.field_…
ezimuel Aug 20, 2024
2357766
Merge branch 'main' into vector-db
ezimuel Aug 20, 2024
81dca47
Merge from upstream/main
ezimuel Sep 25, 2024
ff03da1
Removed db.vector.model and moved db.vector.search.similarity_metric …
ezimuel Sep 25, 2024
523bcb9
Merge branch 'main' into vector-db
ezimuel Sep 30, 2024
fd891f6
Merge remote-tracking branch 'upstream/main' into vector-db
ezimuel Nov 5, 2024
bc2ddb1
Merge remote-tracking branch 'upstream/main' into vector-db
ezimuel Nov 5, 2024
fc90f3f
Merge branch 'vector-db' of github.com:ezimuel/semantic-conventions i…
ezimuel Nov 5, 2024
24cc812
Merge remote-tracking branch 'upstream/main' into vector-db
ezimuel Nov 18, 2024
3d50dc1
Merge remote-tracking branch 'upstream/main' into vector-db
ezimuel Dec 2, 2024
06a67d4
Added docs + spans
ezimuel Dec 2, 2024
949b198
Merge remote-tracking branch 'upstream/main' into vector-db
ezimuel Dec 3, 2024
a88ac32
Update model/database/spans.yaml
ezimuel Dec 5, 2024
df19d76
Update model/database/spans.yaml
ezimuel Dec 5, 2024
94c2ca1
Update model/database/spans.yaml
ezimuel Dec 5, 2024
210ecb9
Update model/database/spans.yaml
ezimuel Dec 5, 2024
765a4a8
Update model/database/spans.yaml
ezimuel Dec 5, 2024
ccd11f7
Merge remote-tracking branch 'upstream/main' into vector-db
ezimuel Dec 5, 2024
4def570
Merge branch 'vector-db' of github.com:ezimuel/semantic-conventions i…
ezimuel Dec 5, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Attempt to optimise attribute name collision checks. (#1328)
Co-authored-by: Liudmila Molkova <limolkova@microsoft.com>
  • Loading branch information
2 people authored and ezimuel committed Aug 20, 2024
commit f41155462cf3f8019e0338908c96c9ff399f60a4
43 changes: 28 additions & 15 deletions policies/attribute_name_collisions.rego
Original file line number Diff line number Diff line change
@@ -1,25 +1,42 @@
package after_resolution

# Data structures to make checking things faster.
attribute_names := { data |
group := input.groups[_]
attr := group.attributes[_]
data := { "name": attr.name, "const_name": to_const_name(attr.name), "namespace_prefix": to_namespace_prefix(attr.name) }
}


deny[attr_registry_collision(description, name)] {
names := attr_names_except(excluded_const_collisions)
name := names[_]
const_name := to_const_name(name)
collisions:= { n | n := attr_names_except(excluded_const_collisions)[_]; n != name; to_const_name(n) == const_name }
some i
name := attribute_names[i].name
const_name := attribute_names[i].const_name
not excluded_const_collisions[name]
collisions := [other.name |
other := attribute_names[_]
other.name != name
other.const_name == const_name
not excluded_const_collisions[other.name]
]
count(collisions) > 0

# TODO (https://github.com/open-telemetry/weaver/issues/279): provide other violation properties once weaver supports it.
description := sprintf("Attribute '%s' has the same constant name '%s' as '%s'.", [name, const_name, collisions])
}

deny[attr_registry_collision(description, name)] {
names := attr_names_except(excluded_namespace_collisions)
name := names[_]

collisions:= { n | n := input.groups[_].attributes[_].name; startswith(n, to_namespace_prefix(name)) }
some i
name := attribute_names[i].name
prefix := attribute_names[i].namespace_prefix
not excluded_namespace_collisions[name]
collisions := [other.name |
other := attribute_names[_]
other.name != name
startswith(other.name, prefix)
]
count(collisions) > 0

# TODO (https://github.com/open-telemetry/weaver/issues/279): provide other violation properties once weaver supports it.
description := sprintf("Attribute '%s' name is used as a namespace in the following attributes '%s'.", [name, collisions])
description := sprintf("Attribute '%s' is used as a namespace in '%s'.", [name, collisions])
}

attr_registry_collision(description, attr_name) = violation {
Expand All @@ -40,10 +57,6 @@ to_const_name(name) = const_name {
const_name := replace(name, ".", "_")
}

attr_names_except(excluded) = names {
names := { n | n := input.groups[_].attributes[_].name } - excluded
}

# These lists contain exceptions for existing collisions that were introduced unintentionally.
# We'll have a way to specify how collision resolution happens in the schema -
# see phase 2 in https://github.com/open-telemetry/semantic-conventions/issues/1118#issuecomment-2173803006
Expand Down