Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CHAOS-5921 Revamped resilience probes #7224

Merged
merged 6 commits into from
Jul 1, 2024
Merged

CHAOS-5921 Revamped resilience probes #7224

merged 6 commits into from
Jul 1, 2024

Conversation

SmritiSatya
Copy link
Collaborator

Revamp resilience probes:

  1. Unify legacy and resilience probes.
  2. Add notes about existing customer seeing old version of probes and new customers seeing new version of probes.
  3. Remove "probe acceptance criteria" page and add its contents in other relevant locations.
  4. Remove old content.

@bot-gitexp-user
Copy link

Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://667e8ef4fc8ca00706714cfc--harness-developer.netlify.app

@SmritiSatya SmritiSatya added Content - chore/cleanup Content flag: Cleanup or edits on previously published content DO NOT MERGE PR is not ready to be merged. Review - Editorial Needs style/editorial review from docs council Review - PM Needs review from PM labels Jun 28, 2024
brian-r-fisher
brian-r-fisher previously approved these changes Jun 28, 2024
Copy link
Contributor

@brian-r-fisher brian-r-fisher left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good, added a few suggestions for you to consider

sidebar_position: 1
description: Resilience probes, their importance and types
---
This section introduces you to resilience probes, its importance, types, and generic properties.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
This section introduces you to resilience probes, its importance, types, and generic properties.
This topic introduces you to resilience probes, their importance, types, and generic properties.


## What is a resilience probe?

Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment.
Resilience probes (or probes) are pluggable health checkers defined within the chaos engine for any chaos experiment. Resilience probes do the following:

## What is a resilience probe?

Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment.
- It monitors your application's health **before**, **during** and **after** a chaos experiment.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- It monitors your application's health **before**, **during** and **after** a chaos experiment.
- Monitor your application's health **before**, **during**, and **after** a chaos experiment.


Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment.
- It monitors your application's health **before**, **during** and **after** a chaos experiment.
- It explores the behavior of a system in a chaotic or unpredictable manner.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- It explores the behavior of a system in a chaotic or unpredictable manner.
- Explore the behavior of a system in a chaotic or unpredictable manner.

Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment.
- It monitors your application's health **before**, **during** and **after** a chaos experiment.
- It explores the behavior of a system in a chaotic or unpredictable manner.
- It helps understand the underlying patterns and laws that govern the behavior of these systems, and to use that understanding to predict or control their behavior.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- It helps understand the underlying patterns and laws that govern the behavior of these systems, and to use that understanding to predict or control their behavior.
- Help understand the underlying patterns and laws that govern the behavior of systems, and use that understanding to predict or control their behavior.

## Default/System resilience probes

- You can create system (default probes) at the project level **only once**.
- Once you create a default probe, you can't delete or disable or update it.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- Once you create a default probe, you can't delete or disable or update it.
- Once you create a default probe, you can't delete, disable, or update it.


- You can create system (default probes) at the project level **only once**.
- Once you create a default probe, you can't delete or disable or update it.
- If you have more than one resilience probe in your chaos experiment, you can disable, delete or update the system probe.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- If you have more than one resilience probe in your chaos experiment, you can disable, delete or update the system probe.
- If you have more than one resilience probe in your chaos experiment, you can disable, delete, or update the system probe.

- Default probes are a part of resilience probes and are entered as annotations in the experiment manifest.

## Image registry support
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use default image registry.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use default image registry.
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use the default image registry.


## Image registry support
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use default image registry.
- HCE doesn't provide image registry support at the moment for default probes.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- HCE doesn't provide image registry support at the moment for default probes.
- HCE doesn't currently provide image registry support for default probes.


## License

- Resilience probes are not a part of any subscription, and hence you don't have any limit on the number of probes you can create. With respect to usability, 1,000 probes can be executed in a month.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- Resilience probes are not a part of any subscription, and hence you don't have any limit on the number of probes you can create. With respect to usability, 1,000 probes can be executed in a month.
- Resilience probes are not a part of any subscription, and hence you don't have any limit on the number of probes you can create. Concerning usability, 1,000 probes can be executed in a month.

## Known limitations of resilience probes

* Command probes in the **source** mode for Kubernetes is available for both SMP and HCE SAAS.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you check the wording for this? Source mode in CMD resilience probe is only not available for Linux in SaaS

|----------------------|------------------------------------------|------------------------------------|
| probeTimeout | Yes | string with +d[ms][s][m][h] format |
| interval | Yes | string with +d[ms][s][m][h] format |
| retry | Only available for Kubernetes (optional) | |

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's remove this since we're not advocating the use of this for Resilience Probe, its only present for backwards compatibility

Comment on lines 15 to 20
- Go to [chaos faults](/docs/chaos-engineering/chaos-faults) to understand where probes are used.
- You should have an active chaos infrastructure where you can schedule the chaos experiment.
- Enterprise Hub connectivity status should be active
- Read/write access to the chaos experiment to schedule or navigate to the probe addition UI.
- Read access to the chaos infrastructure to select a chaos infrastructure when creating an experiment.
- Read access to the chaos hub to select faults from the chaos hub while creating an experiment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we check these prerequisites again! To configure a probe we only require the chaos experiment edit permission. It has no direct relation with hub, infra, or faults at all.

@@ -41,7 +58,7 @@ This section describes steps you can follow to create and use a resilience probe
![setup](./static/use-probe/setup-6.png)

:::tip
If you are a first time user, you can create a resilience probe directly from Chaos Studio drawer (from within an experiment). For this, you will see an option to add a system probe (which is a health check system probe) as a one click button. This will not be present if you have configured atleast one resilience probe.
If you are a first time user, you can create a resilience probe directly from Chaos Studio drawer (from within an experiment). For this, you will see an option to add a system probe (which is a health check system probe) as a one click button. This will not be present if you have configured atleast one resilience probe.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not just for first time users, but for users who are using the platform but haven't created any resilience probes yet.

When you wish to enter the probe name in the manifest (manually) as a probeRef annotation, follow the below format:

```
probeRef: '[{"name":"probe-name","mode":"SOT"}]'

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

name has been updated with probeID

```
probeRef: '[{"name":"probe-name","mode":"SOT"}]'
```
Here, `probe-name` is the name of your probe.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be the ID instead


:::info notes
- **Legacy probes support (Backward compatibility)***: Users can still use [legacy probes](/docs/chaos-engineering/features/resilience-probes/cmd-probe).
- **Audit integration**: There are no audit events for resilience probes.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are audit integrations for resilience probes

@bot-gitexp-user
Copy link

Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://66825881f102c1ea23c4e00f--harness-developer.netlify.app

@bot-gitexp-user
Copy link

Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://66825f5be36167c3511494a3--harness-developer.netlify.app

@bot-gitexp-user
Copy link

Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://6682822ff2cf5a06e471a9f5--harness-developer.netlify.app

@bot-gitexp-user
Copy link

Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://6682ab960473b71432101392--harness-developer.netlify.app

@SmritiSatya SmritiSatya merged commit 6ca6e01 into main Jul 1, 2024
2 checks passed
@SmritiSatya SmritiSatya deleted the CHAOS-5921 branch July 1, 2024 17:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Content - chore/cleanup Content flag: Cleanup or edits on previously published content DO NOT MERGE PR is not ready to be merged. Review - Editorial Needs style/editorial review from docs council Review - PM Needs review from PM
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants