Quickstart | Docs | Tutorials | Chat
Check out our blog post to grasp what we have been doing for the last months.
NucliaDB is a distributed search engine built from the ground up to offer high accuracy and semantic search on unstructured data. By mere mortals for mere mortals, NucliaDB's architecture is as simple as possible to be scalable and deliver what an NLP Database requires
NucliaDB is written in Rust and Python and built on top of the mighty tantivy library. We designed it to index big datasets and provide multi-teanant suport.
- Store original data, extracting and understanding data on object and blob storage
- Index fields, paragraphs, and semantic sentences on index storage
- Cloud extraction and understanding with Nuclia Understanding APIβ’
- Cloud connection to train ML models with Nuclia Learning APIβ’
- Container security based with Reader, Manager, Writer Roles
- Resources with multiple fields and metadata
- Text/HTML/Markdown plain fields support
- File field support with direct upload and TUS upload
- Link field support
- Conversation field support
- Blocks/Layout field support
- Eventual consistency transactions based on Nats.io
- Distributed source of truth with TiKV and Redis support
- Blob support with S3-compatible API and GCS
- Replication of index storage
- Distributed search
- Cloud-native: Kubernetes only
- Blob support with Azure Blob storage
- Index relations on index storage
Trying NucliaDB is super easy! You can extend your knowledge with the following readings:
- Run NucliaDB with docker with a single command!
- Join forces with Nuclia to run NucliaDB with unlimited power
- We use own concepts as Knowledge Box in NucliaDB, so ensure you read about our basic concepts
- Further dive into our tutorials, reference or internal documentation
- Chat with us in Discord
- π Blog Posts
- Follow us on Twitter
- Do you want to work with us?
The core difference and advantage of NucliaDB is its architecture built from the ground up for cloud and unstructured data. Its vector index plus standard keyword and fuzzy search provide an API to use all extracted and learned information from Nuclia, understanding API and provide super NLP powers to any application with low code and peace of mind.
NucliaDB is open-source under the GNU Affero General Public License Version 3 - AGPLv3. Fundamentally, this means that you are free to use NucliaDB for your project, as long as you don't modify NucliaDB. If you do, you have to make the modifications public.
Our business model relies on our normalization API, this one is based on Nuclia Learning API
and Nuclia Understanding API
. This two APIs offers transformation of unstructured data to NucliaDB compatible data with AI. We also offer NucliaDB as a service at our multi-cloud provider infrastructure: https://nuclia.cloud.
We are always super happy to have contributions: code, documentation, issues, feedback, or even saying hello on discord! Here is how you can get started:
- Have a look through GitHub issues labeled "Good first issue".
- Read our Contributor Covenant Code of Conduct
- Create a fork of NucliaDB and submit your pull request!
β¨ And to thank you for your contributions, claim your swag by emailing us at info at nuclia.com.