fastNLP V0.3.1 #132
Merged
- refine & fix Transformer Encoder - refine & speed up biaffine parser
* move used readers from reproduction to io/dataset_loader.py (API shall not call anything from reproduction/)
* Replace the asserts in dataset.py with raised errors * Add try-except to the trainer to catch EarlyStopError * Optimize the trainer code * Add tests for callbacks
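The trainer change above can be pictured with a toy loop. This is only a minimal sketch, not the fastNLP implementation: the `EarlyStopError` class here is a local stand-in, and `EarlyStopCallback`, `train`, and the `patience` parameter are hypothetical names chosen for illustration.

```python
class EarlyStopError(Exception):
    """Local stand-in: raised by a callback to ask the trainer to stop."""


class EarlyStopCallback:
    """Hypothetical callback: stop once validation loss has failed to
    improve for more than `patience` consecutive epochs."""

    def __init__(self, patience):
        self.patience = patience
        self.best = float("inf")
        self.wait = 0

    def on_valid_end(self, loss):
        if loss < self.best:
            self.best, self.wait = loss, 0
        else:
            self.wait += 1
            if self.wait > self.patience:
                raise EarlyStopError("validation loss stopped improving")


def train(valid_losses, callback):
    """Toy loop: each entry of valid_losses stands in for one epoch."""
    epochs_run = 0
    try:
        for loss in valid_losses:
            epochs_run += 1
            callback.on_valid_end(loss)
    except EarlyStopError:
        # Early stopping is an expected way to end training, not a crash.
        pass
    return epochs_run


epochs = train([0.9, 0.7, 0.8, 0.75, 0.9, 0.6], EarlyStopCallback(patience=1))
print(epochs)
```

Catching the exception inside the training loop lets a callback end training from any hook without the caller having to wrap the trainer itself.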
2. FieldArray uses AutoPadder by default; AutoPadder behaves the same as the earlier code did without a padder 3. To solve the two-dimensional padding problem, EngChar2dPadder is introduced for padding characters 4. Add a padding tutorial.
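For readers unfamiliar with the two-dimensional padding problem, here is a rough sketch of what character-level padding has to do: every sentence is padded to the same number of words, and every word to the same number of characters. The function `pad_chars_2d` and its `pad_val` parameter are hypothetical and written in plain Python; this is not the code of EngChar2dPadder.

```python
def pad_chars_2d(batch, pad_val=0):
    """Pad a batch of sentences, each a list of words, each a list of
    character ids, to a uniform (batch, max_words, max_chars) layout."""
    max_words = max(len(sent) for sent in batch)
    max_chars = max((len(word) for sent in batch for word in sent), default=0)
    padded = []
    for sent in batch:
        # Pad each existing word on the right up to max_chars.
        rows = [list(word) + [pad_val] * (max_chars - len(word)) for word in sent]
        # Add all-padding rows for the words this sentence is missing.
        rows += [[pad_val] * max_chars for _ in range(max_words - len(sent))]
        padded.append(rows)
    return padded


batch = [
    [[1, 2, 3], [4]],  # sentence with two words
    [[5, 6]],          # sentence with one word
]
padded = pad_chars_2d(batch)
print(padded)
```

A one-dimensional padder only has to equalize sentence lengths; the second axis (characters per word) is what requires a dedicated padder.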
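* Refactor the dtype detection code so that FieldArray initialization and append share it, for better code reuse * Type checking is now entirely the responsibility of FieldArray, with DataSet cooperating Tests: * Tidy up the dtype-related test code * Add tests for all tutorials Other: * Complete a full Conll dataset loader * Upgrade the POS tag model training script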
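The idea of sharing one dtype check between initialization and append could look roughly like the following. This is a simplified sketch under assumptions: `detect_dtype` and the `Field` class are hypothetical names, and the real FieldArray handles more cases than shown here.

```python
def detect_dtype(value):
    """Return the Python type of a scalar, or the common element type of a
    (possibly nested) list. Mixed element types are rejected."""
    if isinstance(value, list):
        if not value:
            raise ValueError("cannot infer a dtype from an empty list")
        types = {detect_dtype(v) for v in value}
        if len(types) > 1:
            raise TypeError(f"mixed element types: {sorted(t.__name__ for t in types)}")
        return types.pop()
    return type(value)


class Field:
    """Hypothetical container: __init__ reuses append, so the type check
    lives in exactly one place."""

    def __init__(self, name, content):
        self.name = name
        self.content = []
        self.dtype = None
        for item in content:
            self.append(item)

    def append(self, item):
        dtype = detect_dtype(item)
        if self.dtype is None:
            self.dtype = dtype  # first item fixes the field's dtype
        elif dtype is not self.dtype:
            raise TypeError(
                f"field {self.name!r} holds {self.dtype.__name__}, "
                f"got {dtype.__name__}"
            )
        self.content.append(item)


field = Field("words", [[1, 2], [3]])
field.append([4, 5])
print(field.dtype.__name__, len(field.content))
```

Because the container owns the check, a dataset wrapper built on top of it only needs to forward values and let the error propagate.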
* Add test: FieldArray initialization
* Add two kinds of Callback * Improve the Trainer's error catching
* rename callback methods. Use fastai's notation. * add a new callback method - on_valid_begin
* load pre-trained BERT weights from local binary * add tests
* 升级parser API和模型 * update docs: add new pages for tutorials * upgrade CWS api download source * add a new method for dataset field access * add introduction for bert * add more unit tests for api/processor * remove unused test data. Add new test data.
Codecov Report
@@ Coverage Diff @@
## master #132 +/- ##
=========================================
+ Coverage 68% 74.5% +6.49%
=========================================
Files 90 88 -2
Lines 6286 7245 +959
=========================================
+ Hits 4275 5398 +1123
+ Misses 2011 1847 -164
xuyige approved these changes on Feb 5, 2019
New features:
EngChar2dPadder (solves two-dimensional padding)
DataSet (types accepted at initialization)
Bugs fixed:
FieldArray
Code structure refined:
reproduction/
Testing:
Tutorials: