[Statistics] Reduce DB size by removing unused data - edge server IP #3622

Closed
skofman1 opened this issue Mar 4, 2017 · 1 comment

skofman1 commented Mar 4, 2017

Impact on database

Removing this data could reduce the database size by a total of 46,284.26 MB (about 45.2 GB)!

Removing the non-clustered indexes listed below would also improve the performance of insert/delete operations against the [dbo].[Fact_Download] and [dbo].[Fact_Dist_Download] tables.

In addition, we would generate less churn on the data because the clean-up operations for IP-address facts would no longer be needed.

The following database objects would become obsolete:

  • Table [dbo].[Fact_IpAddress] (size = 3.22 MB)
  • Table column [dbo].[Fact_Download].[Fact_EdgeServer_IpAddress_Id] (size = 2,585.75 MB)
  • Table column [dbo].[Fact_Dist_Download].[Fact_EdgeServer_IpAddress_Id] (size = 23.63 MB)
  • Stored Procedure [dbo].[CleanupFactIpAddress]
  • Stored Procedure [dbo].[EnsureIpAddressFactsExist]
  • SQL Type [dbo].[IpAddressFactTableType]
  • Non-clustered index [dbo].[Fact_Download].[Fact_Download_NCI_EdgeServer_IpAddress] (size = 43,278.21 MB)
  • Non-clustered index [dbo].[Fact_Dist_Download].[Fact_Dist_Download_NCI_EdgeServer_IpAddress] (size = 392.80 MB)
  • Non-clustered index [dbo].[Fact_IpAddress].[IX_Fact_IpAddress_UniqueIndex] (size = 672 KB)
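
As a quick sanity check on the quoted total, the sizes listed above can be summed. This is only a sketch: the figures are copied from the list, the 672 KB index is converted to MB assuming 1 MB = 1024 KB, and the dictionary and function names are made up for illustration. The sum agrees with the 46,284.26 MB figure to within rounding.

```python
# Sizes in MB, copied from the list of obsolete objects in this issue.
# The last entry is listed as 672 KB and converted here (assumes 1 MB = 1024 KB).
OBSOLETE_OBJECT_SIZES_MB = {
    "table Fact_IpAddress": 3.22,
    "column Fact_Download.Fact_EdgeServer_IpAddress_Id": 2585.75,
    "column Fact_Dist_Download.Fact_EdgeServer_IpAddress_Id": 23.63,
    "index Fact_Download_NCI_EdgeServer_IpAddress": 43278.21,
    "index Fact_Dist_Download_NCI_EdgeServer_IpAddress": 392.80,
    "index IX_Fact_IpAddress_UniqueIndex": 672 / 1024,
}


def total_size_mb(sizes_mb):
    """Return the combined size of all listed objects, in MB."""
    return sum(sizes_mb.values())


total_mb = total_size_mb(OBSOLETE_OBJECT_SIZES_MB)
total_gb = total_mb / 1024  # assumes 1 GB = 1024 MB

# Share of the total taken by the largest single object.
largest_name = max(OBSOLETE_OBJECT_SIZES_MB, key=OBSOLETE_OBJECT_SIZES_MB.get)
largest_share = OBSOLETE_OBJECT_SIZES_MB[largest_name] / total_mb

print(f"total: {total_mb:,.2f} MB ({total_gb:.1f} GB)")
print(f"largest: {largest_name} ({largest_share:.0%} of total)")
```

The breakdown shows that the non-clustered index on [dbo].[Fact_Download] alone accounts for over 90% of the estimated savings.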

Impact on jobs

The Stats.ImportAzureCdnStatistics job would no longer need to deal with EdgeServer IP-address data.

We could:

  • avoid the need to check whether an EdgeServer IP-address already exists and, if not, create it during import;
  • avoid the need to keep already imported EdgeServer IP-addresses in an in-memory cache;
  • reduce the complexity of the import by no longer linking EdgeServer IP-address identifiers to the download facts.
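
To make the three points above concrete, here is a minimal sketch of the kind of lookup-or-create step with an in-memory cache that the import would no longer need. It is not the actual Stats.ImportAzureCdnStatistics code, which is not shown in this issue: every class, function, and field name below is hypothetical, and a plain dictionary stands in for the [dbo].[Fact_IpAddress] table.

```python
from typing import Dict, List


class EdgeServerIpCache:
    """Hypothetical lookup-or-create helper with an in-memory cache."""

    def __init__(self) -> None:
        self._cache: Dict[str, int] = {}   # in-memory cache: IP -> fact id
        self._table: Dict[str, int] = {}   # stand-in for [dbo].[Fact_IpAddress]
        self.db_round_trips = 0            # counts simulated database calls

    def get_or_create_id(self, ip: str) -> int:
        # Serve from the cache when this IP was already seen during the import.
        if ip in self._cache:
            return self._cache[ip]
        # Otherwise check the (simulated) table and create the row if missing.
        self.db_round_trips += 1
        if ip not in self._table:
            self._table[ip] = len(self._table) + 1
        self._cache[ip] = self._table[ip]
        return self._cache[ip]


def build_facts_with_ip(entries: List[dict], cache: EdgeServerIpCache) -> List[dict]:
    """Current shape (assumed): each download fact links to an IP-address id."""
    return [
        {
            "package": e["package"],
            "edge_server_ip_id": cache.get_or_create_id(e["edge_server_ip"]),
        }
        for e in entries
    ]


def build_facts_without_ip(entries: List[dict]) -> List[dict]:
    """Proposed shape: no IP lookup, no cache, no link on the fact."""
    return [{"package": e["package"]} for e in entries]


entries = [
    {"package": "A", "edge_server_ip": "10.0.0.1"},
    {"package": "B", "edge_server_ip": "10.0.0.2"},
    {"package": "C", "edge_server_ip": "10.0.0.1"},
]
cache = EdgeServerIpCache()
facts_with_ip = build_facts_with_ip(entries, cache)
facts_without_ip = build_facts_without_ip(entries)
```

In the second variant the cache class, the existence check, and the extra column on each fact all disappear, which is the simplification the list describes.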

skofman1 added this to the S115 - 2017.3.6 milestone Mar 4, 2017
xavierdecoster changed the title from "Reduce statistics DB size by removing unused dimension - edge server IP" to "[Statistics] Reduce DB size by removing unused data - edge server IP" Mar 4, 2017
skofman1 removed this from the S116 - 2017.3.27 milestone Apr 5, 2017

skofman1 commented

We decided to go in a different direction to reduce DB size. Will reopen if needed.
