Error on cadence-system domain when clean start #2617

Closed
vancexu opened this issue Sep 27, 2019 · 3 comments

@vancexu
Contributor

vancexu commented Sep 27, 2019

Starting Cadence with a clean DB setup sometimes produces the errors below.
This causes serious ServiceBusyError failures and breaks the experience of users who run Cadence in Docker for integration tests (such as the cadence-client integration tests):

{"level":"error","ts":"2019-09-27T13:24:48.361-0700","msg":"error starting cadence-sys-history-scanner-workflow workflow","service":"cadence-worker","error":"EntityNotExistsError{Message: Domain: cadence-system}","logging-call-at":"scanner.go:164","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/service/worker/scanner.(*Scanner).startWorkflow\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/service/worker/scanner/scanner.go:164\ngithub.com/uber/cadence/service/worker/scanner.(*Scanner).startWorkflowWithRetry.func1\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/service/worker/scanner/scanner.go:150\ngithub.com/uber/cadence/common/backoff.Retry\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/backoff/retry.go:99\ngithub.com/uber/cadence/service/worker/scanner.(*Scanner).startWorkflowWithRetry\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/service/worker/scanner/scanner.go:149"}
{"level":"error","ts":"2019-09-27T13:24:54.470-0700","msg":"Error refreshing domain cache","service":"cadence-frontend","error":"ServiceBusyError{Message: Persistence Max QPS Reached.}","logging-call-at":"domainCache.go:359","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/cache.(*domainCache).refreshLoop\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/cache/domainCache.go:359"}
{"level":"error","ts":"2019-09-27T13:24:54.472-0700","msg":"Error refreshing domain cache","service":"cadence-frontend","error":"ServiceBusyError{Message: Persistence Max QPS Reached.}","logging-call-at":"domainCache.go:359","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/cache.(*domainCache).refreshLoop\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/cache/domainCache.go:359"}
{"level":"info","ts":"2019-09-27T13:24:54.521-0700","msg":"Get dynamic config","name":"history.timerProcessorCompleteTimerFailureRetryCount","value":"10","default-value":"10","logging-call-at":"config.go:58"}
{"level":"info","ts":"2019-09-27T13:24:54.521-0700","msg":"Get dynamic config","name":"history.transferProcessorCompleteTransferFailureRetryCount","value":"10","default-value":"10","logging-call-at":"config.go:58"}
{"level":"error","ts":"2019-09-27T13:25:04.475-0700","msg":"Error refreshing domain cache","service":"cadence-frontend","error":"ServiceBusyError{Message: Persistence Max QPS Reached.}","logging-call-at":"domainCache.go:359","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/cache.(*domainCache).refreshLoop\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/cache/domainCache.go:359"}
{"level":"error","ts":"2019-09-27T13:25:14.475-0700","msg":"Error refreshing domain cache","service":"cadence-frontend","error":"ServiceBusyError{Message: Persistence Max QPS Reached.}","logging-call-at":"domainCache.go:359","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/cache.(*domainCache).refreshLoop\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/cache/domainCache.go:359"}
{"level":"error","ts":"2019-09-27T13:25:24.475-0700","msg":"Error refreshing domain cache","service":"cadence-frontend","error":"ServiceBusyError{Message: Persistence Max QPS Reached.}","logging-call-at":"domainCache.go:359","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/cache.(*domainCache).refreshLoop\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/cache/domainCache.go:359"}
{"level":"error","ts":"2019-09-27T13:25:34.475-0700","msg":"Error refreshing domain cache","service":"cadence-frontend","error":"ServiceBusyError{Message: Persistence Max QPS Reached.}","logging-call-at":"domainCache.go:359","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/cache.(*domainCache).refreshLoop\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/cache/domainCache.go:359"}
{"level":"error","ts":"2019-09-27T13:25:44.475-0700","msg":"Error refreshing domain cache","service":"cadence-frontend","error":"ServiceBusyError{Message: Persistence Max QPS Reached.}","logging-call-at":"domainCache.go:359","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/cache.(*domainCache).refreshLoop\n\t/Users/boweixu/gocode/src/github.com/uber/cadence/common/cache/domainCache.go:359"}
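
For integration tests that hit the EntityNotExistsError on a fresh start, one possible client-side workaround is to retry with backoff until the domain becomes visible. The Go sketch below is only an illustration of that pattern, not what the server does: `errDomainNotFound`, `retryWhileNotFound`, and the simulated `check` function are hypothetical names, and a real test would have to map the error returned by its Cadence client onto the sentinel.

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// errDomainNotFound is a hypothetical stand-in for the EntityNotExistsError
// seen in the log above; a real test would map its client's error to it.
var errDomainNotFound = errors.New("domain not found")

// retryWhileNotFound calls check until it succeeds, returns an error other
// than errDomainNotFound, or maxAttempts is exhausted. The delay doubles
// after every failed attempt.
func retryWhileNotFound(check func() error, maxAttempts int, initialDelay time.Duration) error {
	delay := initialDelay
	var err error
	for attempt := 1; attempt <= maxAttempts; attempt++ {
		err = check()
		if err == nil {
			return nil
		}
		if !errors.Is(err, errDomainNotFound) {
			return err // unrelated failure: do not retry
		}
		if attempt < maxAttempts {
			time.Sleep(delay)
			delay *= 2
		}
	}
	return fmt.Errorf("gave up after %d attempts: %w", maxAttempts, err)
}

func main() {
	calls := 0
	// Simulated check: the domain becomes visible on the third call.
	check := func() error {
		calls++
		if calls < 3 {
			return errDomainNotFound
		}
		return nil
	}
	err := retryWhileNotFound(check, 5, 10*time.Millisecond)
	fmt.Println("error:", err, "calls:", calls)
}
```

Retrying only on the not-found sentinel keeps other failures, such as the ServiceBusyError above, visible instead of hiding them behind the retry loop.
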
vancexu added a commit that referenced this issue Sep 30, 2019
@mfateev
Contributor

mfateev commented Oct 5, 2019

I'm not sure if it is related, but this is what I got when running docker-compose for the first time:

cadence_1      | {"level":"info","ts":"2019-10-05T03:31:44.579Z","msg":"cadence-system domain does not exist, attempting to register domain","service":"cadence-worker","logging-call-at":"service.go:339"}
cadence_1      | {"level":"error","ts":"2019-10-05T03:31:44.590Z","msg":"Operation failed with internal error.","service":"cadence-worker","error":"InternalServiceError{Message: CreateDomain operation failed because of uuid collision.}","metric-scope":31,"logging-call-at":"persistenceMetricClients.go:835","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/cadence/common/log/loggerimpl/logger.go:133\ngithub.com/uber/cadence/common/persistence.(*metadataPersistenceClient).updateErrorMetric\n\t/cadence/common/persistence/persistenceMetricClients.go:835\ngithub.com/uber/cadence/common/persistence.(*metadataPersistenceClient).CreateDomain\n\t/cadence/common/persistence/persistenceMetricClients.go:729\ngithub.com/uber/cadence/service/worker.(*Service).registerSystemDomain\n\t/cadence/service/worker/service.go:374\ngithub.com/uber/cadence/service/worker.(*Service).ensureSystemDomainExists\n\t/cadence/service/worker/service.go:340\ngithub.com/uber/cadence/service/worker.(*Service).Start\n\t/cadence/service/worker/service.go:163\nmain.execute\n\t/cadence/cmd/server/server.go:225"}
cadence_1      | {"level":"fatal","ts":"2019-10-05T03:31:44.590Z","msg":"failed to register system domain","service":"cadence-worker","error":"InternalServiceError{Message: CreateDomain operation failed because of uuid collision.}","logging-call-at":"service.go:395","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Fatal\n\t/cadence/common/log/loggerimpl/logger.go:139\ngithub.com/uber/cadence/service/worker.(*Service).registerSystemDomain\n\t/cadence/service/worker/service.go:395\ngithub.com/uber/cadence/service/worker.(*Service).ensureSystemDomainExists\n\t/cadence/service/worker/service.go:340\ngithub.com/uber/cadence/service/worker.(*Service).Start\n\t/cadence/service/worker/service.go:163\nmain.execute\n\t/cadence/cmd/server/server.go:225"}
cadence_1      | 2019/10/05 03:31:44 Command exited with error: exit status 1
docker_cadence_1 exited with code 1
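
When comparing output like this across runs, it can help to pull out just the error and fatal entries. Below is a minimal Go sketch, assuming the log lines are JSON objects with the `level`, `service`, `msg`, and `error` keys seen above, optionally preceded by a docker-compose prefix. The `logEntry` type and `parseLogLine` helper are hypothetical names, and the sample line is abridged from the fatal entry above (timestamp and stack trace dropped).

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// logEntry holds the fields used in the log lines quoted in this thread.
// Other keys (ts, stacktrace, ...) are ignored by the decoder.
type logEntry struct {
	Level   string `json:"level"`
	Service string `json:"service"`
	Msg     string `json:"msg"`
	Error   string `json:"error"`
}

// parseLogLine drops an optional docker-compose prefix such as
// "cadence_1      | " and decodes the JSON object that follows.
// It returns false for lines that carry no valid JSON object.
func parseLogLine(line string) (logEntry, bool) {
	var e logEntry
	start := strings.Index(line, "{")
	if start < 0 {
		return e, false
	}
	if err := json.Unmarshal([]byte(line[start:]), &e); err != nil {
		return e, false
	}
	return e, true
}

func main() {
	lines := []string{
		// Abridged from the fatal entry above.
		`cadence_1      | {"level":"fatal","msg":"failed to register system domain","service":"cadence-worker","error":"InternalServiceError{Message: CreateDomain operation failed because of uuid collision.}"}`,
		`cadence_1      | 2019/10/05 03:31:44 Command exited with error: exit status 1`,
	}
	for _, l := range lines {
		e, ok := parseLogLine(l)
		if ok && (e.Level == "error" || e.Level == "fatal") {
			fmt.Printf("[%s] %s: %s (%s)\n", e.Level, e.Service, e.Msg, e.Error)
		}
	}
}
```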

@BrendanBall

I'm currently evaluating Cadence and was happy to see the docker-compose files, but they don't actually work :/ FYI, I got similar errors with MySQL.

@vancexu
Contributor Author

vancexu commented Oct 7, 2019

"CreateDomain operation failed because of uuid collision" is caused by a recent change. It was fixed by #2639.

vancexu closed this as completed Oct 7, 2019