Skip to content

[vector]support lumina#7330

Draft
jerry-024 wants to merge 31 commits intoapache:masterfrom
jerry-024:support_lumina
Draft

[vector]support lumina#7330
jerry-024 wants to merge 31 commits intoapache:masterfrom
jerry-024:support_lumina

Conversation

@jerry-024
Copy link
Contributor

@jerry-024 jerry-024 commented Mar 2, 2026

Purpose

Introduce a new paimon-lumina module that integrates Lumina as a vector index backend for Apache Paimon's global index framework.

Lumina is a DiskANN graph-based approximate nearest neighbor (ANN) search library developed by
the Alibaba Storage Service Team. It is accessed from Java via JNI through the lumina-jni
artifact.

Tests

  • LuminaVectorGlobalIndexTest
  • LuminaVectorGlobalIndexScanTest
  • LuminaVectorIndexTest

API and Format

Documentation

Added paimon-lumina/README.md explaining the module purpose, supported index types, metrics,
and all configuration options.

Generative AI tooling

@jerry-024 jerry-024 marked this pull request as draft March 2, 2026 09:33

<dependency>
<groupId>org.apache.paimon</groupId>
<artifactId>paimon-lumina-jni</artifactId>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

com.aliyun.lumina
lumina-jni

import scala.collection.JavaConverters._

/** End-to-end tests for Lumina vector index read/write operations on Spark 3.5. */
class LuminaVectorIndexE2ETest extends PaimonSparkTestBase {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just move these tests to spark-ut?

<version>1.4-SNAPSHOT</version>
</parent>

<artifactId>paimon-lumina</artifactId>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just one paimon-lumina is OK, no need to have index and e2e.

<version>1.4-SNAPSHOT</version>
</parent>

<artifactId>paimon-lumina</artifactId>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please create a README.md to this, explain what is lumina.

@jerry-024 jerry-024 closed this Mar 6, 2026
@jerry-024 jerry-024 reopened this Mar 6, 2026
@jerry-024 jerry-024 force-pushed the support_lumina branch 2 times, most recently from 33d5790 to 209a6f7 Compare March 9, 2026 01:59
@jerry-024 jerry-024 requested a review from Copilot March 9, 2026 02:05

This comment was marked as resolved.

@jerry-024 jerry-024 closed this Mar 9, 2026
@jerry-024 jerry-024 reopened this Mar 9, 2026
@jerry-024 jerry-024 closed this Mar 9, 2026
@jerry-024 jerry-024 reopened this Mar 9, 2026
@jerry-024 jerry-024 requested a review from Copilot March 9, 2026 09:03
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 20 out of 20 changed files in this pull request and generated 5 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

* upstream/master: (31 commits)
  [python] Correct pyroaring version in requirements
  [test] Enrich table name in MergeIntoTableTestBase
  [test] Fix unstable test: FileStoreLookupFunctionTest.testLookupScanLeak
  [python] Remove useless parameters in blob_writer.py
  [rest] Introduce reset consumer api in rest (apache#7372)
  [python] Introduce Python CLI table snapshot (apache#7373)
  [python] Fix time mapping by always using time32 (apache#7371)
  [python] Introduce catalog list-dbs in Python CLI (apache#7366)
  [hotfix][docs] replace the documentation URL for data-types blob (apache#7370)
  [python] Add i short name to ignore-if-not-exists for database
  [python] Support dynamic bucket write (apache#7363)
  [python] Introduce database related commands to Python CLI (apache#7362)
  [python] Introduce table import, drop and alter to Python CLI (apache#7360)
  [python] Fix py version required in README (apache#7361)
  [test] Add missing LeafFunction coverage in PredicateJsonSerdeTest (apache#7359)
  [python] support predicate NotBetween, Like function (apache#7352)
  [python] Introduce Paimon CLI in PyPaimon (apache#7358)
  [doc] Document ignoreIfExists for create_branch
  [core] Add IF NOT EXISTS support for create_branch operation (apache#7353)
  [python] Remove oss and pylance from requirements deps
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants