Skip to content

Releases: fabric-testbed/MeasurementFramework

knit7

23 Sep 22:11
bdf50bb
Compare
Choose a tag to compare

Version used for KNIT7 Sept, 2023

Filebeats

10 Aug 18:13
18ec1e6
Compare
Choose a tag to compare

This release adds the Filebeats deployment.
Prometheus node_exporter role changed to prometheus instead of cloudalchemy.

knit6-may-2023 urllib3 fix

10 May 15:08
e70d8b6
Compare
Choose a tag to compare

This release has a fix for the urllib3 bug in docker-py. Othewise it is the same release that was used for the KNIT6 workshop in Austin, TX April 2023.

knit6-apr-2023

02 May 16:02
11129d1
Compare
Choose a tag to compare

This release was used for the KNIT6 workshop in Austin, TX April 2023.

ODC Migration

21 Mar 16:55
dad2819
Compare
Choose a tag to compare

This release mainly addresses the new FABRIC ODC Rack where the Infrastructure central monitoring systems will be running.
ODC now hosts MinIO (replacing CEPH), central metrics and Thanos compactor.
Changes to Central Metrics role, along with the additional roles for minio, compactor and ping-plus are now implemented using docker-compose instead of Python Docker SDK. This results in more compact installation roles with more autonomy from the other roles. Common tasks are still used for creating directory structures and uploading configuration files. Common files are also still used for NGINX conf files and alerts.
Ping-plus is an addition to enable pings and other Blackbox Exporter probes to be made from the UKY datacenter rack which has access to the operations network. Formerly these probes were made from the Central metrics VM.
Other changes:
Grafana updated to version 9
Thanos updated to v0.29.0
Simple index page added to metrics.fabric-testbed.net
cAdvisor is being used on new ODC VMs and is expected to replace other instances of Docker Exporter as Docker Exporter is no longer being updated.
Nginx Proxy added for Thanos web UI's for status/debuging.
passlib no longer needs to be installed with bcrypt option.
Added promtheus_src for non-head node instances of Prometheus.
Thanos Stores updated to 23 racks.
GENI roles have been removed.
Dev/test install playbooks have been updated or removed.

Ceph to Minio conversion

14 Feb 13:59
22bbd51
Compare
Choose a tag to compare

Change from CEPH to MINIO for thanos object store for the rack data.
Added GPS PTP not responding alert.

v1.0.2

08 Dec 22:33
8f02caa
Compare
Choose a tag to compare

Updated prometheus retention time and added variable for optional setting.
Fixed regression error for missing file.

knit-sept-2022

20 Sep 01:15
eae0e0d
Compare
Choose a tag to compare

Incorporates Instrumentation of Fabric Slice Code.

v1.0.0: Merge pull request #63 from fabric-testbed/dualrack_install

29 Aug 21:10
139dd56
Compare
Choose a tag to compare

Updates

Racks

First release to work on all racks.
Ipv6, Ipv4 or dual racks are all now supported.

Added heartbeat alert for all racks.
Added rack name to many of the alert messages.

v0.1.6

06 Jun 03:22
7443510
Compare
Choose a tag to compare

This release is for the June 2022 production rack deployments.

Updates

All

Alert names should no longer have spaces in the name.
Updated docker image versions.

Lowered alert temperature for HostPhysicalComponentTooHot alert to 60 C.
Added dataplane link alerts.
Added dataplane isis recording rules.

Prometheus port 9090 moved to operations network.

Rack

Ipv6 only racks are not supported.

Central Metrics

Central now has thanos exporters.
Removed unresponsive thanos stores (ipv6 racks).
Grafana now listens on ipv4 and ipv6.
Added beta portal and beta jupyterhub monitoring.

Experiment

Initial addition of instrumentizing user experiment.