[PM-3007] Caching user policies on PolicyService variable #3117

r-tome · 2023-07-18T14:10:25Z

Type of change

- [ ] Bug fix
- [ ] New feature development
- [X] Tech debt (refactoring, code cleanup, dependency upgrades, etc)
- [ ] Build/deploy pipeline (DevOps)
- [ ] Other

Objective

Enhance performance and reduce SQL operations by modifying the stored procedure OrganizationUser_ReadByUserIdWithPolicyDetails to retrieve all policies and filter them in memory for each type, thereby minimizing the number of SQL operations required. The filtered policies will be cached and utilized by the scoped service IPolicyService.

Code changes

src/Core/Repositories/IOrganizationUserRepository.cs: Removed the parameter PolicyType so that the method returns all policy types
src/Core/Services/Implementations/PolicyService.cs: Added a private variable to cache the obtained user policies from the repository
src/Infrastructure.Dapper/Repositories/OrganizationUserRepository.cs: Removed the parameter PolicyType so that the method returns all policy types
src/Infrastructure.EntityFramework/Repositories/OrganizationUserRepository.cs: Removed the parameter PolicyType so that the method returns all policy types
src/Sql/dbo/Stored Procedures/OrganizationUser_ReadByUserIdWithPolicyDetails.sql: Removed the parameter PolicyType so that the method returns all policy types and also replaced the nested query with left joins
test/Core.Test/Services/PolicyServiceTests.cs: Updated unit tests
test/Infrastructure.EFIntegration.Test/Repositories/OrganizationUserRepositoryTests.cs: Updated unit tests
util/Migrator/DbScripts/2023-07-18_00_OrganizationUserReadByUserIdWithPolicyDetails.sql: SQL migration script

Before you submit

Please check for formatting errors (dotnet format --verify-no-changes) (required)
If making database changes - make sure you also update Entity Framework queries and/or migrations
Please add unit tests where it makes sense to do so (encouraged but not required)
If this change requires a documentation update - notify the documentation team
If this change has particular deployment requirements - notify the DevOps team

r-tome · 2023-07-18T14:13:46Z

I've placed the cached policies inside PolicyService but maybe it's more suited inside CurrentContext?

bitwarden-bot · 2023-07-18T14:29:09Z

Checkmarx One – Scan Summary & Details – ae9a48e1-7521-477a-b6d5-26c4591563e5

New Issues

Severity	Issue	Source File / Package	Checkmarx Insight
	CSRF	/src/Api/SecretsManager/Controllers/ServiceAccountsController.cs: 60	Attack Vector

eliykat · 2023-07-18T23:16:24Z

I've placed the cached policies inside PolicyService but maybe it's more suited inside CurrentContext?

CurrentContext is already doing too much in my opinion. I think it's much better to have this cached within the domain-specific service, which is exactly what you've done.

I'm curious whether you've compared the query execution plans for the old & new sproc? I'm sure we're going to see some boost from the caching either way, so I'd like to know that we've also improved the sproc itself. If you want any help with this let me know.

r-tome · 2023-07-19T09:21:09Z

CurrentContext is already doing too much in my opinion. I think it's much better to have this cached within the domain-specific service, which is exactly what you've done.

That is what I thought but I needed validation 😅

I'm curious whether you've compared the query execution plans for the old & new sproc? I'm sure we're going to see some boost from the caching either way, so I'd like to know that we've also improved the sproc itself. If you want any help with this let me know.

I did compare them! Here are both execution plans (old on top):

I think its improved by having less nested loops but that my non-expert opinion. Let me know your thoughts.

withinfocus · 2023-07-19T13:33:25Z

How'd you get these query plans by chance? You'll need the actual plans from production.

r-tome · 2023-07-19T13:41:04Z

How'd you get these query plans by chance? You'll need the actual plans from production.

That is true, who can I ping to get those? These are just from my local dev environment.

withinfocus · 2023-07-19T14:05:50Z

Heh yeah I was asking the same questions to CloudOps and there wasn't an answer. For your sake I think you need to make an ad hoc request for them to jump in and pull some actual execution plans for you.

eliykat · 2023-07-20T04:08:38Z

@withinfocus I was aware of that limitation, but I thought using local query plans would be the next best thing. But comparing them, they're not even close, so that's good to know.

I guess that means we also won't know of any measurable improvements until the new sproc has been running in prod for a while?

withinfocus · 2023-07-20T12:43:35Z

In our situation that's pretty much correct. The usual course of action is to snapshot the table or simulate size and fragmentation so that it could be replicated elsewhere, but we are still adopting better database testing practices.

MGibson1 · 2023-07-20T13:49:22Z

I guess that means we also won't know of any measurable improvements until the new sproc has been running in prod for a while?

It's worth checking qa cloud to see if that matches a little better, but this is probably true.

kspearrin

This seems like an acceptable improvement, but there is certainly much more than can be done to optimize the query within the database alone. For example, I see lots of index scanning that have a large cost to the overall execution plan.

…organization-user-read-by-user-id-with-policy-details-sproc # Conflicts: # src/Core/Repositories/IOrganizationUserRepository.cs

r-tome · 2023-08-01T15:48:27Z

@kspearrin I experimented with adding two new indexes and changing an existing one to include some necessary columns.

CREATE NONCLUSTERED INDEX [IX_ProviderUser_UserIdProviderIdStatus]
        ON [dbo].[ProviderUser]([UserId] ASC, [ProviderId] ASC, [Status] ASC)
        WITH (ONLINE = ON);

CREATE NONCLUSTERED INDEX [IX_ProviderOrganization_ProviderIdOrganizationId]
        ON [dbo].[ProviderOrganization]([ProviderId] ASC, [OrganizationId] ASC);
        WITH (ONLINE = ON);

CREATE NONCLUSTERED INDEX [IX_OrganizationUser_UserIdOrganizationIdStatusEmail]
            ON [dbo].[OrganizationUser]([UserId] ASC, [OrganizationId] ASC, [Status] ASC, [Email] ASC)
            INCLUDE ([AccessAll], [Type], [Permissions])
            WITH (ONLINE = ON);
-- Drop existing index that did not include the [Email] column
DROP INDEX [IX_OrganizationUser_UserIdOrganizationIdStatus] ON [dbo].[OrganizationUser];

Here is the old plan when I ran the query locally:

Here is the new plan with the indexes changes:

With these changes I managed to reduce the amount of index scans but of course this is only my local environment so comparing the actual numbers is pointless.

Do you think we should try to add the indexes?

kspearrin · 2023-08-01T15:53:06Z

@r-tome I think those are good results, however, let's hold off on adding lots of indexes right now until we have the right people available to evaluate our indexing strategy. We can always add them on-demand if this query becomes more of a problem in production.

withinfocus · 2023-08-03T17:00:02Z

util/Migrator/DbScripts/2023-07-18_00_OrganizationUserReadByUserIdWithPolicyDetails.sql

+        WHERE U.[Id] = @UserId AND OU.[Email] = U.[Email] AND OU.[Status] = 0 -- 'Invited' OrgUsers are not linked to a UserId yet, so we have to look up their email
+    )
+END
+GO


⛏️ Newline at the end.

withinfocus · 2023-08-03T17:00:20Z

src/Sql/dbo/Stored Procedures/OrganizationUser_ReadByUserIdWithPolicyDetails.sql

+    OR EXISTS (
+        SELECT 1
+        FROM [dbo].[UserView] U
+        WHERE U.[Id] = @UserId AND OU.[Email] = U.[Email] AND OU.[Status] = 0 -- 'Invited' OrgUsers are not linked to a UserId yet, so we have to look up their email
    )
 END


⛏️ Newline at the end.

)" This reverts commit 78588d0.

… SQL CPU (#3203) * Revert "[PM-3007] Caching user policies on PolicyService variable (#3117)" This reverts commit 78588d0. * Don't delete old migration script * Add migration to revert sproc

… SQL CPU (#3203) * Revert "[PM-3007] Caching user policies on PolicyService variable (#3117)" This reverts commit 78588d0. * Don't delete old migration script * Add migration to revert sproc (cherry picked from commit fc814ff)

[PM-3007] Caching user policies on PolicyService variable

66252a1

r-tome requested review from MGibson1 and kspearrin July 18, 2023 14:26

r-tome marked this pull request as ready for review July 19, 2023 10:19

kspearrin previously approved these changes Jul 31, 2023

View reviewed changes

Merge branch 'master' into PM-3007-resolve-high-db-resource-usage-by-…

f3a996b

…organization-user-read-by-user-id-with-policy-details-sproc # Conflicts: # src/Core/Repositories/IOrganizationUserRepository.cs

r-tome dismissed kspearrin’s stale review via f3a996b August 1, 2023 15:45

kspearrin previously approved these changes Aug 1, 2023

View reviewed changes

bitwarden-devops-bot temporarily deployed to QA Cloud August 3, 2023 14:45 Inactive

withinfocus reviewed Aug 3, 2023

View reviewed changes

[PM-3007] Added missing newlines on sql files

f29ae94

r-tome dismissed kspearrin’s stale review via f29ae94 August 3, 2023 17:08

withinfocus approved these changes Aug 3, 2023

View reviewed changes

r-tome merged commit 78588d0 into master Aug 3, 2023
44 checks passed

r-tome deleted the PM-3007-resolve-high-db-resource-usage-by-organization-user-read-by-user-id-with-policy-details-sproc branch August 3, 2023 17:36

eliykat added a commit that referenced this pull request Aug 16, 2023

Revert "[PM-3007] Caching user policies on PolicyService variable (#3117

4855708

)" This reverts commit 78588d0.

eliykat mentioned this pull request Aug 16, 2023

[AC-1597] Revert GetByUserIdWithPolicyDetailsAsync changes to unblock SQL CPU #3203

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PM-3007] Caching user policies on PolicyService variable #3117

[PM-3007] Caching user policies on PolicyService variable #3117

r-tome commented Jul 18, 2023

r-tome commented Jul 18, 2023

bitwarden-bot commented Jul 18, 2023 •

edited

Loading

eliykat commented Jul 18, 2023 •

edited

Loading

r-tome commented Jul 19, 2023

withinfocus commented Jul 19, 2023

r-tome commented Jul 19, 2023

withinfocus commented Jul 19, 2023

eliykat commented Jul 20, 2023

withinfocus commented Jul 20, 2023

MGibson1 commented Jul 20, 2023

kspearrin left a comment

r-tome commented Aug 1, 2023

kspearrin commented Aug 1, 2023

withinfocus Aug 3, 2023

withinfocus Aug 3, 2023

[PM-3007] Caching user policies on PolicyService variable #3117

[PM-3007] Caching user policies on PolicyService variable #3117

Conversation

r-tome commented Jul 18, 2023

Type of change

Objective

Code changes

Before you submit

r-tome commented Jul 18, 2023

bitwarden-bot commented Jul 18, 2023 • edited Loading

New Issues

eliykat commented Jul 18, 2023 • edited Loading

r-tome commented Jul 19, 2023

withinfocus commented Jul 19, 2023

r-tome commented Jul 19, 2023

withinfocus commented Jul 19, 2023

eliykat commented Jul 20, 2023

withinfocus commented Jul 20, 2023

MGibson1 commented Jul 20, 2023

kspearrin left a comment

Choose a reason for hiding this comment

r-tome commented Aug 1, 2023

kspearrin commented Aug 1, 2023

withinfocus Aug 3, 2023

Choose a reason for hiding this comment

withinfocus Aug 3, 2023

Choose a reason for hiding this comment

bitwarden-bot commented Jul 18, 2023 •

edited

Loading

eliykat commented Jul 18, 2023 •

edited

Loading