-
Notifications
You must be signed in to change notification settings - Fork 170
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CHAOS-5921 Revamped resilience probes #7224
Conversation
Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://667e8ef4fc8ca00706714cfc--harness-developer.netlify.app |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good, added a few suggestions for you to consider
sidebar_position: 1 | ||
description: Resilience probes, their importance and types | ||
--- | ||
This section introduces you to resilience probes, its importance, types, and generic properties. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This section introduces you to resilience probes, its importance, types, and generic properties. | |
This topic introduces you to resilience probes, their importance, types, and generic properties. |
|
||
## What is a resilience probe? | ||
|
||
Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment. | |
Resilience probes (or probes) are pluggable health checkers defined within the chaos engine for any chaos experiment. Resilience probes do the following: |
## What is a resilience probe? | ||
|
||
Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment. | ||
- It monitors your application's health **before**, **during** and **after** a chaos experiment. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- It monitors your application's health **before**, **during** and **after** a chaos experiment. | |
- Monitor your application's health **before**, **during**, and **after** a chaos experiment. |
|
||
Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment. | ||
- It monitors your application's health **before**, **during** and **after** a chaos experiment. | ||
- It explores the behavior of a system in a chaotic or unpredictable manner. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- It explores the behavior of a system in a chaotic or unpredictable manner. | |
- Explore the behavior of a system in a chaotic or unpredictable manner. |
Resilience probes (or probes) are pluggable health-checkers defined within the chaos engine for any chaos experiment. | ||
- It monitors your application's health **before**, **during** and **after** a chaos experiment. | ||
- It explores the behavior of a system in a chaotic or unpredictable manner. | ||
- It helps understand the underlying patterns and laws that govern the behavior of these systems, and to use that understanding to predict or control their behavior. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- It helps understand the underlying patterns and laws that govern the behavior of these systems, and to use that understanding to predict or control their behavior. | |
- Help understand the underlying patterns and laws that govern the behavior of systems, and use that understanding to predict or control their behavior. |
## Default/System resilience probes | ||
|
||
- You can create system (default probes) at the project level **only once**. | ||
- Once you create a default probe, you can't delete or disable or update it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Once you create a default probe, you can't delete or disable or update it. | |
- Once you create a default probe, you can't delete, disable, or update it. |
|
||
- You can create system (default probes) at the project level **only once**. | ||
- Once you create a default probe, you can't delete or disable or update it. | ||
- If you have more than one resilience probe in your chaos experiment, you can disable, delete or update the system probe. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- If you have more than one resilience probe in your chaos experiment, you can disable, delete or update the system probe. | |
- If you have more than one resilience probe in your chaos experiment, you can disable, delete, or update the system probe. |
- Default probes are a part of resilience probes and are entered as annotations in the experiment manifest. | ||
|
||
## Image registry support | ||
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use default image registry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use default image registry. | |
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use the default image registry. |
|
||
## Image registry support | ||
- You can configure the image registry to be used with the default probes. If you haven't configured a probe yet, the experiment will use default image registry. | ||
- HCE doesn't provide image registry support at the moment for default probes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- HCE doesn't provide image registry support at the moment for default probes. | |
- HCE doesn't currently provide image registry support for default probes. |
|
||
## License | ||
|
||
- Resilience probes are not a part of any subscription, and hence you don't have any limit on the number of probes you can create. With respect to usability, 1,000 probes can be executed in a month. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Resilience probes are not a part of any subscription, and hence you don't have any limit on the number of probes you can create. With respect to usability, 1,000 probes can be executed in a month. | |
- Resilience probes are not a part of any subscription, and hence you don't have any limit on the number of probes you can create. Concerning usability, 1,000 probes can be executed in a month. |
## Known limitations of resilience probes | ||
|
||
* Command probes in the **source** mode for Kubernetes is available for both SMP and HCE SAAS. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you check the wording for this? Source mode in CMD resilience probe is only not available for Linux in SaaS
|----------------------|------------------------------------------|------------------------------------| | ||
| probeTimeout | Yes | string with +d[ms][s][m][h] format | | ||
| interval | Yes | string with +d[ms][s][m][h] format | | ||
| retry | Only available for Kubernetes (optional) | | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's remove this since we're not advocating the use of this for Resilience Probe, its only present for backwards compatibility
- Go to [chaos faults](/docs/chaos-engineering/chaos-faults) to understand where probes are used. | ||
- You should have an active chaos infrastructure where you can schedule the chaos experiment. | ||
- Enterprise Hub connectivity status should be active | ||
- Read/write access to the chaos experiment to schedule or navigate to the probe addition UI. | ||
- Read access to the chaos infrastructure to select a chaos infrastructure when creating an experiment. | ||
- Read access to the chaos hub to select faults from the chaos hub while creating an experiment. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we check these prerequisites again! To configure a probe we only require the chaos experiment edit permission. It has no direct relation with hub, infra, or faults at all.
@@ -41,7 +58,7 @@ This section describes steps you can follow to create and use a resilience probe | |||
 | |||
|
|||
:::tip | |||
If you are a first time user, you can create a resilience probe directly from Chaos Studio drawer (from within an experiment). For this, you will see an option to add a system probe (which is a health check system probe) as a one click button. This will not be present if you have configured atleast one resilience probe. | |||
If you are a first time user, you can create a resilience probe directly from Chaos Studio drawer (from within an experiment). For this, you will see an option to add a system probe (which is a health check system probe) as a one click button. This will not be present if you have configured atleast one resilience probe. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not just for first time users, but for users who are using the platform but haven't created any resilience probes yet.
When you wish to enter the probe name in the manifest (manually) as a probeRef annotation, follow the below format: | ||
|
||
``` | ||
probeRef: '[{"name":"probe-name","mode":"SOT"}]' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
name
has been updated with probeID
``` | ||
probeRef: '[{"name":"probe-name","mode":"SOT"}]' | ||
``` | ||
Here, `probe-name` is the name of your probe. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be the ID instead
|
||
:::info notes | ||
- **Legacy probes support (Backward compatibility)***: Users can still use [legacy probes](/docs/chaos-engineering/features/resilience-probes/cmd-probe). | ||
- **Audit integration**: There are no audit events for resilience probes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are audit integrations for resilience probes
Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://66825881f102c1ea23c4e00f--harness-developer.netlify.app |
Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://66825f5be36167c3511494a3--harness-developer.netlify.app |
Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://6682822ff2cf5a06e471a9f5--harness-developer.netlify.app |
Please check the Execution Link of the Pipeline for the Website Draft URL. This is located in the Preview Step behind the Harness VPN and also is available in #hdh_alerts. E.g Website Draft URL: https://unique-id--harness-developer.netlify.app. Current Draft URL is: https://6682ab960473b71432101392--harness-developer.netlify.app |
Revamp resilience probes: