Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Auto delete job history logs after specific period #39358

Open
venkateshkelevo opened this issue Jun 10, 2024 · 4 comments
Open

Auto delete job history logs after specific period #39358

venkateshkelevo opened this issue Jun 10, 2024 · 4 comments
Labels
area/platform issues related to the platform community team/platform-move type/enhancement New feature or request

Comments

@venkateshkelevo
Copy link

Topic

Auto delete job history logs after specific period

Relevant information

We have used helm charts(Kubernetes environment) to deploy airbyte platform. Used the flag TEMPORAL_HISTORY_RETENTION_IN_DAYS="7" to auto delete the job history logs after 7 days, but the cleanup is not happening and logs are still available. Can you provide a way how can we achieve this?

helm command:
helm install -n foresight-infra airbyte airbyte/airbyte --set global.env_vars.TEMPORAL_HISTORY_RETENTION_IN_DAYS="7"

Thanks.

@marcosmarxm
Copy link
Member

@venkateshkelevo this flag will clean temporal database logs. Is there any other logs you want to clean besides those?

@venkateshkelevo
Copy link
Author

Our basic problem is airbyte-db-0 pod pvc, used space is getting increased slowly. I assume the job history logs and metadata getting stored in db because of that used space is getting increased, we would like to cleanup the db space automatically.

We configured a connector to pull the data from rest source using cron job which runs every two minutes, attached screenshot for logs which I am referring to.

image

Please suggest a way to auto clean up the db pvc space.

Thanks.

@gingeard
Copy link
Contributor

gingeard commented Aug 29, 2024

We have a similar situation:
after a week of data processing with only one connection set up, Airbyte's Postgres DB consumed 1GB already:

  1. db-airbyte db:
image
  1. temporal db:
image

If I understand correctly, setting the TEMPORAL_HISTORY_RETENTION_IN_DAYS parameter can help clean up the temporal db, but not the db-airbyte one.

@iulianapostol-ozone
Copy link

iulianapostol-ozone commented Sep 30, 2024

Would someone suggest me a temporary solution for this ? I know in case of minio space issue we can kubectl into minio (in case of k8s) and delete manually the log files. But what about the psql DB? Can someone suggest me a way to clean up the jobs database ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/platform issues related to the platform community team/platform-move type/enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

6 participants