【Hackathon 10th Spring No.10】ECDFormer模型复现任务Datasets部分#246
Closed
PlumBlossomMaid wants to merge 3 commits intoPaddlePaddle:developfrom
Closed
【Hackathon 10th Spring No.10】ECDFormer模型复现任务Datasets部分#246PlumBlossomMaid wants to merge 3 commits intoPaddlePaddle:developfrom
PlumBlossomMaid wants to merge 3 commits intoPaddlePaddle:developfrom
Conversation
|
Thanks for your contribution! |
Removed duplicate import of OMol25Dataset.
15 tasks
Author
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
【Hackathon 10th Spring No.10】ECDFormer模型复现任务 - Datasets部分
📌 PR说明
一、概述
本PR为ECDFormer模型复现任务的Datasets部分,是四个核心模块(Models/Datasets/Train/Docs)中的第二个。ECDFormer模型需要处理ECD(电子圆二色谱)和IR(红外光谱)两种谱图数据,本PR提供了两个完全自治的数据集模块。
二、本次PR内容
2.1 新增文件
在
ppmat/datasets/目录下新增两个自治数据集模块:ECDFormerDataset模块(ECD光谱数据):
IRDataset模块(IR光谱数据):
2.2 核心特性
scipy.signal.find_peaks2.3 设计亮点
index_all中的文件,避免1小时全量扫描place_env确保数据加载在CPU进行,避免GPU资源浪费collate_fn将Data对象解包为Tensor字典三、后续PR计划
本PR完成后,将陆续提交:
Models部分(已完成)四、测试情况
五、相关链接