Skip to content

[GCP] Data Catalog improvements #3988

@mzueva

Description

@mzueva

Background
At the moment we observe some issues with data catalog including:

  • low performance for large indices
  • instability of security logs
  • duplication of indices

Approach
The following solutions shall be implemented:

  • instability of security logs is addressed via [GCP Logging] Cloud Logging integration #3947, as audit logs are now stored and managed using GCP APIs
  • low performance for large indices:
    • allow to exclude files from indexing
    • support multi node deployment for Elasticsearch on GKE to improve service performance
  • duplication of indices: introduce additional clean up logic during index management

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions