GitHub - pg2es/search-replica: PostgreSQL logical decoding to Elasticsearch/Opensearch

Search Replica

Replicates PostgreSQL primary database into Elasticsearch/OpenSearch index read-only replica.

Does not rely on additional database queries or external queues, using exclusively logical replication protocol, allowing almost realtime consistent synchronisation without external dependencies.

Focused on speed and efficiency.

Docs | DockerHub | Try it

Consistent and fault tolerant, without dependencies
Thanks to PostgreSQL replication slots mechanisms.
Initial (re)indexing
Using COPY command
Uses native PG protocol
Both Text and Binary form.
Full DB types support
json fields, composite types, arrays, enums... Except of arrays of composite types.
Native Parent/Child join Including document _id and routing control
Limited denormalization & document modifications. check inlining
Bulk requests Data is flushed to Elasticsearch/OpenSearch in bulk.

Configuration

ConfTags

You can

set routing and document _id fields;
rename or skip fields;
define parent/child join field;
inline rows as object into parent document;
set custom inlining script;
~~set templated fields~~ (planned)
~~json-path names~~ (planned)

Using COMMENTs in your database schema. Check syntax and description

Env Config

Variable	Default	Description
PG_SLOT	pg2es	replication slot name
PG_PUBLICATION	search	publication name
PGHOST	localhost
PGPORT	5432
PGDATABASE	-
PGUSER	-
PGPASSWORD	-
SEARCH_HOST	-	URL or host of ElasticSearch/OpenSearch
SEARCH_USERNAME	-	optional
SEARCH_PASSWORD	-	optional
SEARCH_BULK_SIZE	4	(MB) Bulk request size limit.
SEARCH_PUSH_INTERVAL	30s	idle push interval, when there is no enough rows for full bulk request.
SEARCH_PUSH_THROTTLE	500ms	hard limit. At most one request during this period.
SEARCH_PUSH_DEBOUNCE	500ms	delays bulk after idle, to fetch related data.
LOG_FORMAT	json	json or cli
LOG_LEVEL	warn	from debug to fatal

Notes

The script is single threaded* (not a bottleneck)... Separate goroutine is used to make ES requests.
Links between Database <-> Schema <-> Table <-> Column, shoudld be considered read only, and safe for multithread use... (not yet)
It's fast. All the the efforts shuld be towards readability, reliability and functionality.

Known Limitations:

No 1:1 inlines (yet)
Delete document deletes all inlines (AKA DELETE CASCADE), and they can not be restored.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
conftags		conftags
demo		demo
postgres		postgres
search		search
.dockerignore		.dockerignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
config.go		config.go
go.mod		go.mod
go.sum		go.sum
how-it-works.md		how-it-works.md
main.go		main.go
state.go		state.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Search Replica

Docs | DockerHub | Try it

Configuration

ConfTags

Env Config

Notes

Known Limitations:

About

Releases 4

Languages

License

pg2es/search-replica

Folders and files

Latest commit

History

Repository files navigation

Search Replica

Docs | DockerHub | Try it

Configuration

ConfTags

Env Config

Notes

Known Limitations:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 4

Languages