Neos CMS ElasticSearch indexer based on the Flowpack JobQueue (to handle big indexing tasks, 50,000+ nodes)

Unikka/Flowpack.ElasticSearch.ContentRepositoryQueueIndexer

Neos CMS Elasticsearch indexer based on a job queue

This package can be used to index a large number of nodes in Elasticsearch indexes. It uses the Flowpack JobQueue packages to handle the indexing asynchronously.

Breaking change after an upgrade to 3.0

Previously, the Beanstalkd queue package was installed by default. This is no longer the case.

Install and configure your Queue package

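Install the Queue package that fits your needs.

Available packages include flowpack/jobqueue-beanstalkd and flowpack/jobqueue-doctrine, both of which are used in the examples in this document.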

Please check the package documentation for specific configurations.

The default configuration uses Beanstalkd, but you need to install it manually:

composer require flowpack/jobqueue-beanstalkd

Check Settings.yaml and adapt the className to match the Queue package you installed:

Flowpack:
  JobQueue:
    Common:
      queues:
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer':
          className: 'Flowpack\JobQueue\Beanstalkd\Queue\BeanstalkdQueue'

        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer.Live':
          className: 'Flowpack\JobQueue\Beanstalkd\Queue\BeanstalkdQueue'

If you use the Doctrine package, you have to set the tableName manually:

Flowpack:
  JobQueue:
    Common:
      queues:
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer':
          className: 'Flowpack\JobQueue\Doctrine\Queue\DoctrineQueue'
          options:
            tableName: 'flowpack_jobqueue_QueueIndexer'

        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer.Live':
          className: 'Flowpack\JobQueue\Doctrine\Queue\DoctrineQueue'
          options:
            tableName: 'flowpack_jobqueue_QueueIndexerLive'

Batch Indexing

How to build indexing jobs

flow nodeindexqueue:build --workspace live

How to process indexing jobs

You can use this CLI command to process indexing jobs:

flow nodeindexqueue:work --queue batch
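
The build and work commands above can be chained in a small wrapper. The following is a minimal sketch and not part of the package: `FLOW_BIN`, `DRY_RUN`, `run`, and `reindex_workspace` are hypothetical names introduced for illustration, and it assumes the script is run from the Flow root directory.

```shell
#!/bin/sh
# Sketch: queue indexing jobs for a workspace, then process the batch queue.
# FLOW_BIN and DRY_RUN are hypothetical helper variables, not package settings.
FLOW_BIN="${FLOW_BIN:-./flow}"

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Build the jobs first; only start a worker if the build step succeeded.
reindex_workspace() {
    workspace="${1:-live}"
    run "$FLOW_BIN" nodeindexqueue:build --workspace "$workspace" || return 1
    run "$FLOW_BIN" nodeindexqueue:work --queue batch
}
```

Calling `reindex_workspace live` runs both commands in order. With `DRY_RUN=1` set, the commands are only printed, which is handy for checking the invocation before running it against a real index.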

Live Indexing

You can disable async live indexing by editing Settings.yaml:

Flowpack:
  ElasticSearch:
    ContentRepositoryQueueIndexer:
      enableLiveAsyncIndexing: false

You can use this CLI command to process live indexing jobs:

flow nodeindexqueue:work --queue live

Supervisord configuration

You can use tools like supervisord to manage long-running processes. Below you can find a basic configuration:

[supervisord]

[supervisorctl]

[program:elasticsearch_batch_indexing]
command=php flow nodeindexqueue:work --queue batch
stdout_logfile=AUTO
stderr_logfile=AUTO
numprocs=4
process_name=elasticsearch_batch_indexing_%(process_num)02d
environment=FLOW_CONTEXT="Production"
autostart=true
autorestart=true
stopsignal=QUIT

[program:elasticsearch_live_indexing]
command=php flow nodeindexqueue:work --queue live
stdout_logfile=AUTO
stderr_logfile=AUTO
numprocs=4
process_name=elasticsearch_live_indexing_%(process_num)02d
environment=FLOW_CONTEXT="Production"
autostart=true
autorestart=true
stopsignal=QUIT
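
Once a configuration like the one above is in place, a running supervisord has to be told to pick it up. The following is a minimal sketch using the standard `supervisorctl` actions `reread`, `update`, and `status`; the `SUPERVISORCTL` and `DRY_RUN` variables and the `sv` and `reload_indexing_workers` functions are hypothetical helpers, and the sketch assumes `supervisorctl` can already reach your supervisord instance.

```shell
#!/bin/sh
# Sketch: make supervisord load new program definitions and report their state.
# SUPERVISORCTL and DRY_RUN are hypothetical helper variables.
SUPERVISORCTL="${SUPERVISORCTL:-supervisorctl}"

# Print the supervisorctl call when DRY_RUN=1, otherwise execute it.
sv() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$SUPERVISORCTL $*"
    else
        "$SUPERVISORCTL" "$@"
    fi
}

reload_indexing_workers() {
    # Re-read the configuration files without touching running processes.
    sv reread || return 1
    # Apply the changes: start added program groups, restart changed ones.
    sv update || return 1
    # List every managed process with its current state.
    sv status
}
```

After `reload_indexing_workers` has run, the status output should list the batch and live worker processes named by the `process_name` patterns in the configuration.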

Acknowledgments

Development sponsored by ttree ltd - neos solution provider.

We do our best to craft this package with lots of love. We are open to sponsoring and support requests; just contact us.

License

Licensed under MIT, see LICENSE
