feat: support LLM understand video #9828

hjlarry · 2024-10-25T02:56:51Z

Checklist:

Important

Please review the checklist below before submitting your pull request.

Please open an issue before creating a PR or link to an existing issue
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I ran dev/reformat(backend) and cd web && npx lint-staged(frontend) to appease the lint gods

Description

change list

Support converting a video file into a VideoPromptMessageContent
Support video understanding with the Qwen model
Support video understanding with the Zhipu model
When a new agent app is created with the vision feature enabled, video files are allowed for upload by default
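
As a rough illustration of the first item, the sketch below shows one way a video file could be turned into a prompt content part in either base64 or URL form. It is not the code from this PR: `VideoContentSketch` is a hypothetical stand-in for VideoPromptMessageContent, its field names are assumptions, and treating `base64` as the default for `MULTIMODAL_SEND_VIDEO_FORMAT` is also an assumption.

```python
import base64
import os
from dataclasses import dataclass
from typing import Optional


@dataclass
class VideoContentSketch:
    """Hypothetical stand-in for VideoPromptMessageContent (field names assumed)."""

    data: str  # either a data URI or a plain URL
    type: str = "video"


def build_video_content(
    raw: bytes,
    mime_type: str,
    remote_url: Optional[str],
    send_format: str,
) -> VideoContentSketch:
    """Wrap a video as a content part, using the requested send format."""
    if send_format == "url":
        if not remote_url:
            raise ValueError("the url send format needs a reachable URL")
        return VideoContentSketch(data=remote_url)
    if send_format == "base64":
        encoded = base64.b64encode(raw).decode("ascii")
        return VideoContentSketch(data=f"data:{mime_type};base64,{encoded}")
    raise ValueError(f"unknown send format: {send_format}")


def configured_send_format() -> str:
    # Assumption: the variable holds "base64" or "url", defaulting to "base64".
    return os.environ.get("MULTIMODAL_SEND_VIDEO_FORMAT", "base64")
```

A caller would pass `configured_send_format()` as `send_format`, so the same upload can be sent inline or by reference depending on deployment settings.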

restrictions

The Qwen model requires users to submit a work order to apply for access, and it only supports the URL send mode
Existing agent apps already have their model config saved in the database, so they cannot upload video files
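
The two restrictions can be pictured with the minimal sketch below. All names here are hypothetical and not taken from the PR: the provider key `"qwen"`, the idea of overriding the configured format instead of raising an error, and the shape of the saved allow-list are assumptions made only for illustration.

```python
# Assumption: providers that accept video only by URL are tracked in a set.
URL_ONLY_PROVIDERS = {"qwen"}


def effective_send_format(provider: str, configured: str) -> str:
    """Return the send format actually usable for the given provider."""
    if provider in URL_ONLY_PROVIDERS:
        return "url"
    return configured


def can_upload(saved_allowed_types: list[str], file_type: str) -> bool:
    """An app saved before this change has no "video" entry, so video is rejected."""
    return file_type in saved_allowed_types
```

Under these assumptions, an app whose saved config lists only `["image"]` keeps rejecting video until its config is saved again with video included.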

preview

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update, included: Dify Document
Improvement, including but not limited to code refactoring, performance optimization, and UI/UX improvement
Dependency upgrade

Testing Instructions

Test locally:

Test sending a local video file in base64 format
Test sending a remote video file in base64 format
Test sending a remote video file in URL format
Test the workflow's LLM node

test link: https://cloud.video.taobao.com/vod/S8T54f_w1rkdfLdYjL3S5zKN9CrhkzuhRwOhF313tIQ.mp4
test file: https://github.com/user-attachments/assets/f5264e1f-e360-4b07-a201-da1216b0310a

laipz8200

LGTM

laipz8200 · 2024-11-08T04:02:04Z

Hi @hjlarry, could you please also add the new environment variable into the .env.example and the docker-compose.yaml file under the docker/?

hjlarry · 2024-11-08T07:12:56Z

Hi @hjlarry, could you please also add the new environment variable into the .env.example and the docker-compose.yaml file under the docker/?

sure, I will add it later.

hjlarry added 4 commits October 24, 2024 09:11

support video prompt message

ab290fb

add VideoPromptMessageContent to run zhipuAI

4d5e2e5

support agent app upload video

59e843a

support llm node of workflow can transfer video

c61f26b

dosubot bot added the size:M (this PR changes 30-99 lines, ignoring generated files) and ⚙️ feat:model-runtime labels Oct 25, 2024

crazywoola requested review from laipz8200 and takatost October 25, 2024 07:38

laipz8200 approved these changes Nov 7, 2024


dosubot bot added the lgtm label (this PR has been approved by a maintainer) Nov 7, 2024

laipz8200 requested a review from iamjoel November 7, 2024 11:29

iamjoel approved these changes Nov 8, 2024


laipz8200 merged commit 033ab54 into langgenius:main Nov 8, 2024
7 checks passed

hjlarry mentioned this pull request Nov 8, 2024

chore: add MULTIMODAL_SEND_VIDEO_FORMAT to docker's env #10458

Merged

12 tasks

crazywoola mentioned this pull request Nov 8, 2024

Run failed: file type video is not supported #10453

Closed

5 tasks

laipz8200 mentioned this pull request Nov 11, 2024

chore: update version to 0.11.1 across all configurations and Docker images #10539

Merged

AlwaysBluer pushed a commit to AlwaysBluer/dify that referenced this pull request Nov 14, 2024

feat: support LLM understand video (langgenius#9828)

f23c0e7

Dongnc1017 mentioned this pull request Nov 15, 2024

LLM video understanding #10720

Closed

5 tasks

hanqingwu mentioned this pull request Nov 15, 2024

Can glm4v_server.py support video analysis? THUDM/GLM-4#653

Closed

idonotknow pushed a commit to AceDataCloud/Dify that referenced this pull request Nov 16, 2024

feat: support LLM understand video (langgenius#9828)

b78f3c6
