Conversation

@avelinoapheris

  • Allow training without templates. The current setup crashes if the template directories are not given, but we have use cases where we would like to train without them.
  • Allow running a validation dataset whose cache does not have the "token_count" or "use_metric" fields. This is useful if one wants to run .validate on the training dataset.
  • Fix a bug that required the "ema" weights to be present when loading a checkpoint manually. There are cases where we want to start from a public checkpoint that contains only the model weights (see the sketch after this list).
  • Expose all the arguments for the Lightning checkpoint callback.
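A minimal sketch of the EMA fix described above; the loader function and the checkpoint keys ("model", "ema") are illustrative assumptions, not the actual OpenFold 3 API:

import torch

def load_checkpoint(path, model, ema=None):
    # Hypothetical loader that tolerates checkpoints without EMA weights.
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["model"])
    # Public checkpoints may ship only the model weights; restore
    # the EMA state only when it is actually present.
    if ema is not None and "ema" in ckpt:
        ema.load_state_dict(ckpt["ema"])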


@jnwei left a comment


Thank you for adding the fixes for supporting fine-tuning and enabling computation of metrics on the training set.

One suggestion: would it be possible to add a test checking that the case where dataset_config.dataset_paths.template_structure_array_directory is None works as intended? Perhaps a variation of the existing tests for the training dataset runner yamls would be suitable, e.g. https://github.com/aqlaboratory/openfold-3/blob/main/openfold3/tests/test_entry_points.py#L63
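A rough sketch of what such a test might look like; the fixture and helper names (base_runner_config, run_training) are illustrative assumptions rather than the actual test utilities:

import copy

def test_training_without_template_directory(base_runner_config, tmp_path):
    # Drop the template directory to mimic training without templates.
    config = copy.deepcopy(base_runner_config)
    config["dataset_config"]["dataset_paths"][
        "template_structure_array_directory"
    ] = None
    # The run should complete instead of crashing on the missing directory.
    run_training(config, output_dir=tmp_path)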

k = np.min([np.random.randint(0, l), n_templates])

-if k > 0:
+if (k > 0) and (template_cache_directory is not None):

Could the parentheses around the individual comparisons be removed? I believe they will cause an issue with our ruff formatter.
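For reference, the ruff-friendly form of that condition simply drops the parentheses:

if k > 0 and template_cache_directory is not None: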

class CheckpointConfig(BaseModel):
    """Settings for training checkpoint writing."""

    # extra="allow" keeps unrecognized fields instead of rejecting them.
    model_config = PydanticConfigDict(extra="allow")

Is extra="allow" required to support backwards compatibility with old versions of the code?

We generally prefer to set extra="forbid" to help catch instances where a field might be silently ignored if it is not recognized by the model (e.g. in the case of a typo or a new field).
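As an illustration (assuming pydantic v2; every_n_epochs is a made-up field, not the real schema), extra="forbid" turns a typo into an error instead of a silent no-op:

from pydantic import BaseModel, ConfigDict, ValidationError

class CheckpointConfig(BaseModel):
    model_config = ConfigDict(extra="forbid")
    every_n_epochs: int = 1

try:
    CheckpointConfig(every_n_epoch=2)  # typo: missing the trailing "s"
except ValidationError as err:
    print(err)  # reports "Extra inputs are not permitted"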
