[ERNIE vil2.0] Update image text retrieval data preprocessing #6251
Thanks for your contribution!
import numpy as np
import paddle
import pyarrow as pa
Do we need to add a requirements.txt?
A requirements.txt has been added.
python preprocess/create_lmdb_dataset.py \
python preprocess/create_arrow_dataset.py \
Where is this file, create_arrow_dataset.py?
It is in the preprocess folder of ernie-vil2.0.
Codecov Report
@@            Coverage Diff             @@
##           develop    #6251     +/-   ##
===========================================
- Coverage    63.48%   63.35%   -0.14%
===========================================
  Files          515      516       +1
  Lines        73714    74305     +591
===========================================
+ Hits         46800    47073     +273
- Misses       26914    27232     +318
LGTM