From 2b2b094a5ae8e0007a83032a4d01bee456b6afcb Mon Sep 17 00:00:00 2001
From: Ash Vardanian <1983160+ashvardanian@users.noreply.github.com>
Date: Wed, 19 Jul 2023 18:11:07 +0400
Subject: [PATCH 1/3] Docs: Dataset links
---
README.md | 21 +++++++++++----------
1 file changed, 11 insertions(+), 10 deletions(-)
diff --git a/README.md b/README.md
index e7cff16a..2f51fd73 100644
--- a/README.md
+++ b/README.md
@@ -153,9 +153,7 @@ This can result in **20x costs reduction** on AWS and other public clouds.
```py
index.save("index.usearch")
-direct_copy = index.copy()
loaded_copy = index.load("index.usearch")
-
view = Index.restore("index.usearch", view=True)
other_view = Index(ndim=..., metric=CompiledMetric(...))
@@ -234,16 +232,19 @@ server.run()
We have pre-processed some commonly used datasets, cleaning the images, producing the vectors, and pre-building the index.
-| Dataset | Size | Images | Preprocessed |
-| :----------------------------------- | ---: | -----: | --------------------: |
-| [Unsplash 25K][unsplash-25k-origin] | - | 25 K | [HF][unsplash-25k-hf] |
-| [Createve Captions 3M][cc-3m-origin] | - | 3 M | [HF][cc-3m-hf] |
+| Dataset | Modalities | Images | Download |
+| :------------------------------------- | --------------------: | -----: | ------------------------------------: |
+| [Unsplash 25K][unsplash-25k-origin] | Images & Descriptions | 25 K | [HuggingFace / Unum][unsplash-25k-hf] |
+| [Conceptual Captions 3M][cc-3m-origin] | Images & Descriptions | 3 M | [HuggingFace / Unum][cc-3m-hf] |
+| [Arxiv 2M][arxiv-2m-origin] | Titles & Abstracts | 2 M | [HuggingFace / Unum][arxiv-2m-hf] |
+[unsplash-25k-origin]: https://github.com/unsplash/datasets
+[cc-3m-origin]: https://huggingface.co/datasets/conceptual_captions
+[arxiv-2m-origin]: https://www.kaggle.com/datasets/Cornell-University/arxiv
-[unsplash-25k-origin]: https://unum.cloud
-[cc-3m-origin]: https://unum.cloud
-[unsplash-25k-hf]: https://unum.cloud
-[cc-3m-hf]: https://unum.cloud
+[unsplash-25k-hf]: https://huggingface.co/datasets/unum-cloud/ann-unsplash-25k
+[cc-3m-hf]: https://huggingface.co/datasets/unum-cloud/ann-cc-3m
+[arxiv-2m-hf]: https://huggingface.co/datasets/unum-cloud/ann-arxiv-2m
### USearch + RDKit = Molecular Search
From 7a9a8c4d2361a713b67283e3051267b5c4ed7509 Mon Sep 17 00:00:00 2001
From: Ash Vardanian <1983160+ashvardanian@users.noreply.github.com>
Date: Wed, 19 Jul 2023 18:11:18 +0400
Subject: [PATCH 2/3] Make: WASMer publish script
---
.github/workflows/release.yml | 21 ++++++++++++---------
1 file changed, 12 insertions(+), 9 deletions(-)
diff --git a/.github/workflows/release.yml b/.github/workflows/release.yml
index e6236de6..8a4e8c12 100644
--- a/.github/workflows/release.yml
+++ b/.github/workflows/release.yml
@@ -271,11 +271,18 @@ jobs:
tags: ${{ steps.meta.outputs.tags }}
labels: ${{ steps.meta.outputs.labels }}
- build_publish_wasm:
+ publish_wasm:
name: WASM Build
+ needs: versioning
runs-on: ubuntu-latest
+ permissions:
+ contents: read
+ packages: write
+
steps:
- uses: actions/checkout@v3
+ with:
+ ref: 'main'
- run: git submodule update --init --recursive
- name: Prepare WASM Environment
@@ -288,23 +295,19 @@ jobs:
./emsdk/emsdk activate latest && source ./emsdk/emsdk_env.sh
emcmake cmake -DUSEARCH_BUILD_TEST=0 -DUSEARCH_BUILD_BENCHMARK=0 -DUSEARCH_BUILD_WASM=1 -B ./build && emmake make -C ./build
- - name: Install wasmer cli
+ - name: Trigger Wasmer.io CLI
run: |
curl https://get.wasmer.io -sSfL | sh
source ~/.wasmer/wasmer.sh
-
- - name: Login Wasmer.io
- run: wasmer login --token "${{ secrets.WASMER_TOKEN }}"
-
- - name: Publish binary to Wasmer.io
- run: wasmer publish
+ wasmer login --token "${{ secrets.WASMER_TOKEN }}"
+ wasmer publish
build_docs:
name: Build Docs
runs-on: ubuntu-22.04
if: ${{ always() }}
- needs: [publish_python, publish_javascript, publish_rust, publish_java, publish_swift, publish_docker]
+ needs: [publish_python, publish_javascript, publish_rust, publish_java, publish_swift, publish_docker, publish_wasmer]
permissions:
contents: write
steps:
From ad507171ee0790a17274095a0f319f2c3a05e2f6 Mon Sep 17 00:00:00 2001
From: Ash Vardanian <1983160+ashvardanian@users.noreply.github.com>
Date: Wed, 19 Jul 2023 19:19:21 +0400
Subject: [PATCH 3/3] Make: Fix documentation build path
---
README.md | 49 ++++++++++++++++++++-------------------
docs/golang/index.rst | 2 +-
docs/index.rst | 5 ++--
docs/javascript/index.rst | 2 +-
docs/python/index.rst | 2 +-
docs/rust/index.rst | 2 +-
docs/swift/index.rst | 2 +-
javascript/README.md | 9 +++++++
wasm/README.md | 13 +++++++++++
wolfram/README.md | 13 +++++++++++
10 files changed, 68 insertions(+), 31 deletions(-)
create mode 100644 wasm/README.md
create mode 100644 wolfram/README.md
diff --git a/README.md b/README.md
index 2f51fd73..b7709019 100644
--- a/README.md
+++ b/README.md
@@ -20,34 +20,35 @@ Vector Search Engine
Euclidean • Angular • Jaccard • Hamming • Haversine • User-Defined Metrics
-C++11 •
-Python •
-JavaScript •
-Java •
-Rust •
-Objective-C •
-Swift •
-GoLang •
-Wolfram
+C++11 •
+Python •
+JavaScript •
+Java •
+Rust •
+C99 •
+Objective-C •
+Swift •
+GoLang •
+Wolfram
-Linux • MacOS • Windows • Docker • WebAssembly 🔜
+Linux • MacOS • Windows • Docker • WebAssembly