Description
Self Checks
- I have searched for existing issues, including closed ones.
- I confirm that I am using English to submit this report (Language Policy).
- Non-English title submissions will be closed directly (Language Policy).
- Please do not modify this template :) and fill in all the required fields.
Describe your problem
The RAGFlow version is v0.17.2 (full edition), and the chunk method is Laws.
2025-03-20 18:24:54,293 INFO 15 172.18.0.6 - - [20/Mar/2025 18:24:54] "GET /v1/document/list?kb_id=f0d55d7c057211f096b92a193f0225ba&keywords=&page_size=10&page=1 HTTP/1.1" 200 -
2025-03-20 18:24:59,967 ERROR 17 LLMBundle.encode can't update token usage for 1ee1cb70057211f08d722a193f0225ba/EMBEDDING used_tokens: 2874
2025-03-20 18:25:00,256 WARNING 17 set_progress(219e77ae057311f082802a193f0225ba) got exception DoesNotExist
2025-03-20 18:25:09,374 INFO 15 172.18.0.6 - - [20/Mar/2025 18:25:09] "GET /v1/document/list?kb_id=f0d55d7c057211f096b92a193f0225ba&keywords=&page_size=10&page=1 HTTP/1.1" 200 -
2025-03-20 18:25:17,390 INFO 17 task_consumer_0 reported heartbeat: {"name": "task_consumer_0", "now": "2025-03-20T18:25:17.389+08:00", "boot_at": "2025-03-20T17:58:12.996+08:00", "pending": 3, "lag": 0, "done": 0, "failed": 0, "current": {"219e77ae057311f082802a193f0225ba": {"id": "219e77ae057311f082802a193f0225ba", "doc_id": "1ffcc5e0057311f0b02d2a193f0225ba", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "f0d55d7c057211f096b92a193f0225ba", "parser_id": "naive", "parser_config": {"pages": [[1, 1000000]]}, "name": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "type": "doc", "location": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "size": 234212, "tenant_id": "1ee1cb70057211f08d722a193f0225ba", "language": "English", "embd_id": "BAAI/bge-large-zh-v1.5@BAAI", "pagerank": 0, "kb_parser_config": {"pages": [[1, 1000000]]}, "img2txt_id": "Qwen/QVQ-72B-Preview@SILICONFLOW", "asr_id": "", "llm_id": "deepseek-ai/DeepSeek-R1@SILICONFLOW", "update_time": 1742465249206, "task_type": ""}, "12ebde7a057511f0bc0f2a193f0225ba": {"id": "12ebde7a057511f0bc0f2a193f0225ba", "doc_id": "1ffcc5e0057311f0b02d2a193f0225ba", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "f0d55d7c057211f096b92a193f0225ba", "parser_id": "naive", "parser_config": {"pages": [[1, 1000000]]}, "name": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "type": "doc", "location": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "size": 234212, "tenant_id": "1ee1cb70057211f08d722a193f0225ba", "language": "English", "embd_id": "BAAI/bge-large-zh-v1.5@BAAI", "pagerank": 0, "kb_parser_config": {"layout_recognize": "DeepDOC", "auto_keywords": 0, "auto_questions": 0, "raptor": {"use_raptor": false}, "graphrag": {"use_graphrag": false}}, "img2txt_id": "Qwen/QVQ-72B-Preview@SILICONFLOW", "asr_id": "", "llm_id": "deepseek-ai/DeepSeek-R1@SILICONFLOW", "update_time": 1742466083588, "task_type": ""}, 
"22ed7cde057511f093f02a193f0225ba": {"id": "22ed7cde057511f093f02a193f0225ba", "doc_id": "211c905c057511f0b0982a193f0225ba", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "f0d55d7c057211f096b92a193f0225ba", "parser_id": "laws", "parser_config": {"layout_recognize": "DeepDOC", "auto_keywords": 0, "auto_questions": 0, "raptor": {"use_raptor": false}, "graphrag": {"use_graphrag": false}}, "name": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "type": "doc", "location": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "size": 234212, "tenant_id": "1ee1cb70057211f08d722a193f0225ba", "language": "English", "embd_id": "BAAI/bge-large-zh-v1.5@BAAI", "pagerank": 0, "kb_parser_config": {"layout_recognize": "DeepDOC", "auto_keywords": 0, "auto_questions": 0, "raptor": {"use_raptor": false}, "graphrag": {"use_graphrag": false}}, "img2txt_id": "Qwen/QVQ-72B-Preview@SILICONFLOW", "asr_id": "", "llm_id": "deepseek-ai/DeepSeek-R1@SILICONFLOW", "update_time": 1742466110418, "task_type": ""}}}
2025-03-20 18:25:24,453 INFO 15 172.18.0.6 - - [20/Mar/2025 18:25:24] "GET /v1/document/list?kb_id=f0d55d7c057211f096b92a193f0225ba&keywords=&page_size=10&page=1 HTTP/1.1" 200 -
2025-03-20 18:25:25,559 ERROR 17 LLMBundle.encode can't update token usage for 1ee1cb70057211f08d722a193f0225ba/EMBEDDING used_tokens: 2106
2025-03-20 18:25:25,797 INFO 17 set_progress(22ed7cde057511f093f02a193f0225ba), progress: 0.7004123711340206, progress_msg:
2025-03-20 18:25:38,423 ERROR 17 LLMBundle.encode can't update token usage for 1ee1cb70057211f08d722a193f0225ba/EMBEDDING used_tokens: 2800
2025-03-20 18:25:38,615 WARNING 17 set_progress(12ebde7a057511f0bc0f2a193f0225ba) got exception DoesNotExist
2025-03-20 18:25:47,399 INFO 17 task_consumer_0 reported heartbeat: {"name": "task_consumer_0", "now": "2025-03-20T18:25:47.399+08:00", "boot_at": "2025-03-20T17:58:12.996+08:00", "pending": 3, "lag": 0, "done": 0, "failed": 0, "current": {"219e77ae057311f082802a193f0225ba": {"id": "219e77ae057311f082802a193f0225ba", "doc_id": "1ffcc5e0057311f0b02d2a193f0225ba", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "f0d55d7c057211f096b92a193f0225ba", "parser_id": "naive", "parser_config": {"pages": [[1, 1000000]]}, "name": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "type": "doc", "location": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "size": 234212, "tenant_id": "1ee1cb70057211f08d722a193f0225ba", "language": "English", "embd_id": "BAAI/bge-large-zh-v1.5@BAAI", "pagerank": 0, "kb_parser_config": {"pages": [[1, 1000000]]}, "img2txt_id": "Qwen/QVQ-72B-Preview@SILICONFLOW", "asr_id": "", "llm_id": "deepseek-ai/DeepSeek-R1@SILICONFLOW", "update_time": 1742465249206, "task_type": ""}, "12ebde7a057511f0bc0f2a193f0225ba": {"id": "12ebde7a057511f0bc0f2a193f0225ba", "doc_id": "1ffcc5e0057311f0b02d2a193f0225ba", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "f0d55d7c057211f096b92a193f0225ba", "parser_id": "naive", "parser_config": {"pages": [[1, 1000000]]}, "name": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "type": "doc", "location": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "size": 234212, "tenant_id": "1ee1cb70057211f08d722a193f0225ba", "language": "English", "embd_id": "BAAI/bge-large-zh-v1.5@BAAI", "pagerank": 0, "kb_parser_config": {"layout_recognize": "DeepDOC", "auto_keywords": 0, "auto_questions": 0, "raptor": {"use_raptor": false}, "graphrag": {"use_graphrag": false}}, "img2txt_id": "Qwen/QVQ-72B-Preview@SILICONFLOW", "asr_id": "", "llm_id": "deepseek-ai/DeepSeek-R1@SILICONFLOW", "update_time": 1742466083588, "task_type": ""}, 
"22ed7cde057511f093f02a193f0225ba": {"id": "22ed7cde057511f093f02a193f0225ba", "doc_id": "211c905c057511f0b0982a193f0225ba", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "f0d55d7c057211f096b92a193f0225ba", "parser_id": "laws", "parser_config": {"layout_recognize": "DeepDOC", "auto_keywords": 0, "auto_questions": 0, "raptor": {"use_raptor": false}, "graphrag": {"use_graphrag": false}}, "name": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "type": "doc", "location": "\u4e2d\u534e\u4eba\u6c11\u5171\u548c\u56fd\u6c11\u6cd5\u5178.docx", "size": 234212, "tenant_id": "1ee1cb70057211f08d722a193f0225ba", "language": "English", "embd_id": "BAAI/bge-large-zh-v1.5@BAAI", "pagerank": 0, "kb_parser_config": {"layout_recognize": "DeepDOC", "auto_keywords": 0, "auto_questions": 0, "raptor": {"use_raptor": false}, "graphrag": {"use_graphrag": false}}, "img2txt_id": "Qwen/QVQ-72B-Preview@SILICONFLOW", "asr_id": "", "llm_id": "deepseek-ai/DeepSeek-R1@SILICONFLOW", "update_time": 1742466110418, "task_type": ""}}}
2025-03-20 18:25:49,116 INFO 15 172.18.0.6 - - [20/Mar/2025 18:25:49] "GET /v1/kb/detail?kb_id=f0d55d7c057211f096b92a193f0225ba HTTP/1.1" 200 -
2025-03-20 18:25:49,254 INFO 15 172.18.0.6 - - [20/Mar/2025 18:25:49] "GET /v1/document/list?kb_id=f0d55d7c057211f096b92a193f0225ba&keywords=&page_size=10&page=1 HTTP/1.1" 200 -
2025-03-20 18:25:49,256 INFO 15 172.18.0.6 - - [20/Mar/2025 18:25:49] "GET /v1/kb/f0d55d7c057211f096b92a193f0225ba/knowledge_graph HTTP/1.1" 200 -
2025-03-20 18:25:49,258 INFO 15 172.18.0.6 - - [20/Mar/2025 18:25:49] "GET /v1/user/tenant_info HTTP/1.1" 200 -
2025-03-20 18:25:49,269 INFO 15 172.18.0.6 - - [20/Mar/2025 18:25:49] "GET /v1/user/info HTTP/1.1" 200 -
2025-03-20 18:26:02,796 ERROR 17 LLMBundle.encode can't update token usage for 1ee1cb70057211f08d722a193f0225ba/EMBEDDING used_tokens: 2786
2025-03-20 18:26:03,044 WARNING 17 set_progress(219e77ae057311f082802a193f0225ba) got exception DoesNotExist
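
The heartbeat entries above are hard to read by eye, so here is a minimal sketch for pulling out the task list. It assumes each heartbeat is the JSON object that follows `reported heartbeat: `; the function name `summarize_heartbeat` and the trimmed `sample` string are illustrative only and are not part of RAGFlow. The sample keeps just the task IDs, `doc_id`, and `parser_id` values copied from the 18:25:17 heartbeat.

```python
import json

# Text that precedes the JSON payload in the heartbeat log lines above.
MARKER = "reported heartbeat: "


def summarize_heartbeat(line: str) -> list[tuple[str, str, str]]:
    """Return (task_id, doc_id, parser_id) for every task under "current"."""
    payload = json.loads(line.split(MARKER, 1)[1])
    return [
        (task_id, task["doc_id"], task["parser_id"])
        for task_id, task in payload["current"].items()
    ]


# 18:25:17 heartbeat from the log, trimmed to the fields used here (hypothetical
# shortening for readability; the IDs and parser_id values are copied verbatim).
sample = (
    "2025-03-20 18:25:17,390 INFO 17 task_consumer_0 reported heartbeat: "
    '{"name": "task_consumer_0", "pending": 3, "current": {'
    '"219e77ae057311f082802a193f0225ba": '
    '{"doc_id": "1ffcc5e0057311f0b02d2a193f0225ba", "parser_id": "naive"}, '
    '"12ebde7a057511f0bc0f2a193f0225ba": '
    '{"doc_id": "1ffcc5e0057311f0b02d2a193f0225ba", "parser_id": "naive"}, '
    '"22ed7cde057511f093f02a193f0225ba": '
    '{"doc_id": "211c905c057511f0b0982a193f0225ba", "parser_id": "laws"}}}'
)

for task_id, doc_id, parser_id in summarize_heartbeat(sample):
    print(task_id, doc_id, parser_id)
```

Read this way, the two task IDs that hit `set_progress(...) got exception DoesNotExist` (`219e77ae...` and `12ebde7a...`) are the `naive` tasks for document `1ffcc5e0...`, while the `laws` task `22ed7cde...` is the one reporting progress.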