Feat: Released FastAPI Service for RAG-Anything #190
LaansDole wants to merge 57 commits into HKUDS:main
Conversation
- Add documentation for running in an offline environment.
- Add a script to create a local tiktoken model cache (see the sketch after this list).
- Update env.example with TIKTOKEN_CACHE_DIR.
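As an illustration of what such a cache script can do, here is a minimal sketch; it relies only on tiktoken's TIKTOKEN_CACHE_DIR environment variable (the one referenced in env.example), and the cache path and encoding name are placeholder choices, not the PR's actual script:

```python
# Sketch of warming a local tiktoken cache for offline use.
# The directory path and encoding name are placeholder assumptions.
import os

os.environ["TIKTOKEN_CACHE_DIR"] = "./tiktoken_cache"  # set before loading

import tiktoken

# The first call downloads the BPE files into TIKTOKEN_CACHE_DIR;
# subsequent runs in an offline environment read them from the cache.
enc = tiktoken.get_encoding("cl100k_base")
print(len(enc.encode("warm the cache")))
```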
Sync fork
@codex review
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: a9d6e35d54
```python
if name:
    converted_lines.append(
        f"{name}<|#|>{entity_type}<|#|>{description}"
    )
```
Emit LightRAG entity prefix in format conversion
When converting multimodal JSON responses, `_convert_to_lightrag_format` emits lines like `name<|#|>TYPE<|#|>desc` without the `entity<|#|>` prefix that LightRAG-format strings use elsewhere in this repo (see `BaseModalProcessor._convert_to_lightrag_format` in raganything/modalprocessors.py). Because `_process_extraction_result` expects the tagged tuple format, these untagged lines are treated as invalid, and the extracted entities/relations for multimodal chunks end up empty. This shows up whenever the LLM returns JSON entities/relations and this conversion path runs.
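As a rough illustration of the fix the review suggests, a minimal sketch follows; the exact tag string is an assumption and should be checked against `BaseModalProcessor._convert_to_lightrag_format` in raganything/modalprocessors.py:

```python
# Sketch: prepend the entity tag so _process_extraction_result
# recognizes the line as a tagged tuple (tag string assumed).
if name:
    converted_lines.append(
        f"entity<|#|>{name}<|#|>{entity_type}<|#|>{description}"
    )
```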
| """Cancel all background tasks""" | ||
| tasks_to_cancel = [task for task in background_tasks if not task.done()] | ||
| if tasks_to_cancel: | ||
| logger.info(f"Cancelling {len(tasks_to_cancel)} running tasks...") | ||
| await asyncio.gather(*tasks_to_cancel, return_exceptions=True) |
Actually cancel background tasks on shutdown
`cancel_all_background_tasks` only awaits the still-running tasks without cancelling them or signaling `shutdown_event`. During shutdown (e.g., while a long `process_document_complete` call or query is in flight), this will block indefinitely waiting for those tasks to finish instead of cancelling them, preventing the server from shutting down promptly.
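A minimal sketch of the cancellation the review asks for, assuming the `background_tasks` collection, `shutdown_event`, and `logger` names from this PR as module-level objects:

```python
import asyncio

async def cancel_all_background_tasks():
    """Cancel all background tasks and signal shutdown."""
    shutdown_event.set()  # let cooperatively looping tasks exit
    tasks_to_cancel = [task for task in background_tasks if not task.done()]
    if tasks_to_cancel:
        logger.info(f"Cancelling {len(tasks_to_cancel)} running tasks...")
        for task in tasks_to_cancel:
            task.cancel()  # request cancellation instead of just awaiting
        # return_exceptions=True absorbs the CancelledError each task
        # raises, so shutdown proceeds instead of blocking indefinitely
        await asyncio.gather(*tasks_to_cancel, return_exceptions=True)
```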
Thank you very much for your contribution. The content is quite extensive, so we may need some time to review it carefully. We truly appreciate your effort and patience.
Hi @LarFii, please let me know if you'd like to request any changes or additional information.
I’m very sorry I forgot to mention this earlier. We’ve recently been working on integrating LightRAG and RAG-Anything, so we may need to wait until this work is finished. My apologies again for the delay.
Description
Spin up a minimal API server to query and process documents using RAG-Anything with any OpenAI-compatible backend (LM Studio, Ollama, vLLM, DeepSeek, etc.).
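To give a feel for how a client might talk to the service, here is a sketch; the `/query` path, port, and JSON payload shape are hypothetical placeholders, not this PR's actual API contract, and should be checked against the routes defined under api/:

```python
# Hypothetical client sketch; the endpoint path, port, and payload
# shape are placeholders, not the actual API of this PR.
import requests

resp = requests.post(
    "http://127.0.0.1:8000/query",  # assumed local uvicorn default
    json={"query": "Summarize the uploaded patient records"},
    timeout=60,
)
resp.raise_for_status()
print(resp.json())
```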
Quick start (using uv):
Command Reference:
| Command | Runs |
| --- | --- |
| `make server` | `uv run uvicorn api.app:app --reload` |
| `make integration-test` | `uv run python api/core_endpoint_test.py api/datasets/patient_records_small.xlsx` |
| `make mock-test` | `uv run python api/core_endpoint_test.py api/datasets/medical_symptoms_small.xlsx` |
| `make dev` | `uv run uvicorn api.app:app &` |
| `make stop` | `pkill -f "uvicorn api.app:app"` |

Related Issues
[Reference any related issues or tasks addressed by this pull request.]
Changes Made
Checklist
Additional Notes
[Add any additional notes or context for the reviewer(s).]