Can I use datasets that contain multiple different molecules (i.e. different number of atoms)? #200

Answered by Linux-cpp-lisp
jcartus asked this question in Q&A
Hi @jcartus,

This is definitely possible, just not with the NpzDataset, which we use in example.yaml and minimal.yaml because the data is already in that format. We recommend converting your dataset into the extxyz format and using the ASE dataset feature:

dataset: ase
dataset_file_name: whatever.xyz
ase_args:
  format: extxyz

You can also use any other format that ASE can read; just change the format entry under ase_args accordingly.

Please see full.yaml for more details: https://github.com/mir-group/nequip/blob/main/configs/full.yaml#L69-L95

Answer selected by jcartus