-
Notifications
You must be signed in to change notification settings - Fork 9.2k
HDDS-594. SCM CA: DN sends CSR and uses certificate issued by SCM. #547
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
@ajayydv , thanks for the update. Please fix the unit test failure testGenerateCSRWithInvalidParams and checkstyle issue. I will post another comments on the 2nd revision shortly. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
| LOG.error("DN security initialization failed."); | ||
| throw new RuntimeException("DN security initialization failed."); | ||
| case RECOVER: | ||
| LOG.error("DN security initialization failed. OM certificate is " + |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we reword the error message for recovery?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added case info to all cases.
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/HddsDatanodeService.java
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add todo for handling multiple SCM instances case?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we throw RetriableException instead of RuntimeException so that DN can retry if SCM is not available (e.g., reboot, not fully boot up yet)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think IPC layer already has logic to retry for socket timeout related errors. If it fails there (unrecoverable), then its best to shutdown DN.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given that we remove the SCM_ID/CLUSTER_ID from the CSR, can we assert the certificate returned contains a valid SCM_ID and CLUSTER_ID here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is unit test for only CSR generation on DN side. Updated Default Approver on SCM side to add scm id and cluster id if not already present. Added corresponding tests in TestDefaultCAServer.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we need server side change to populate the SCM_ID/CLUSTER_ID into the certificate returned?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Latest commit handles that.
hadoop-hdds/common/src/main/java/org/apache/hadoop/hdds/HddsUtils.java
Outdated
Show resolved
Hide resolved
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/HddsDatanodeService.java
Outdated
Show resolved
Hide resolved
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/HddsDatanodeService.java
Outdated
Show resolved
Hide resolved
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
rebased with trunk. |
|
💔 -1 overall
This message was automatically generated. |
|
why some of the changes from HDDS-1118 are shown up as part of the files changed? |
|
@xiaoyuyao rebase seems to have pushed spurious commit. Forced reset it to intended commits. |
|
+1 pending Jenkins. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
unit test failure seem unrelated, pass locally. |
|
I've tested the patch with docker and validated the DN initialization works as expected. The ci author/acceptance test does not seem to be related to this patch. Will merge it shortly. |
…teFunction when input value is null Currently, the behavior of putting a null value is inconsistent: it is a delete for RocksDB, and not supported in in-memory store, and on a case-by-case basis for remote tables. It is desirable to unify the behavior. Furthermore, it eases the writing of a change captured stream to a table. A change captured stream contains typically 3 types of events: INSERT, UPDATE and DELETE, and they need to be applied properly when written to a table to produce a correct snapshot. In a change captured stream the payload of a DELETE event is typically is null, and this would result in a delete operation to a table in sendTo() operator. Author: Wei Song <wsong@wsong-mn2.linkedin.biz> Closes apache#547 from weisong44/table-fix
No description provided.