Use Pathlib everywhere #23

rusheb · 2023-02-20T22:12:57Z

Let's use pathlib everywhere and stop passing strings around.

This could also be a good opportunity to add more unit tests.

valedan · 2023-03-06T20:45:57Z

@rusheb Can you clarify where these changes are needed?

rusheb · 2023-03-07T14:50:50Z

I just did a quick search for the regex path.*: str. I doubt this is exhaustive, but probably a good start:

10 results - 6 files

maze_transformer/evaluation/eval_model.py:
   45  def load_model_with_configs(
   46:     model_path: str, data_cfg_class: type, verbose: bool = False
   47  ) -> LoadedModelConfigs:

  179      # split the tokens into maze (prompt) and path
  180:     path_start_token: str = SPECIAL_TOKENS["start_path"]
  181      path_start_idx: int = tokens.index(path_start_token) + 1

  242  def generate_plot_predicted_path(
  243:     model_path: str,
  244      n_tokens_pred: int = 5,

maze_transformer/evaluation/plot_loss.py:
  12  def plot_loss(
  13:     log_path: str,
  14      window_sizes: int | Iterable[int] | str = (10, 50, 100),

maze_transformer/training/mazedataset.py:
  300          self,
  301:         path_base: str = "data/test-001",
  302          do_config: bool = True,

  345          cls,
  346:         path_base: str,
  347          do_config: bool = False,

maze_transformer/training/training.py:
  209      # ==================================================
  210:     final_model_path: str = output_dir / TRAIN_SAVE_FILES.model_final
  211      logger.saving(f"saving final model to {final_model_path.as_posix()}", 10)

scripts/create_dataset.py:
   36  def create_dataset(
   37:     path_base: str,
   38      n_mazes: int,

  103  
  104: def load(path: str) -> None:
  105      d = MazeDataset.disk_load(path, do_tokenized=True)

scripts/train_model.py:
  18  
  19: def train_model(basepath: str, cfg_name: str = "tiny-v1"):
  20      dataset = MazeDataset.disk_load(basepath, do_config=True, do_tokenized=True)

valedan · 2023-03-07T15:50:30Z

Ah okay great, this is a good starting point! I guess part of this ticket is finding a good way to locate places where we're doing things the bad way.

rusheb · 2023-03-07T15:56:43Z

In my opinion if we replaced all of the above (and added unit tests for all those functions?) then that would be sufficient to close this ticket. Any stragglers could be picked up on an ad-hoc bases.

valedan · 2023-03-22T21:34:40Z

Nice article about some of the benefits of using Pathlib: https://treyhunner.com/2018/12/why-you-should-be-using-pathlib/

rusheb added the good first issue Good for newcomers label Mar 2, 2023

mivanit added the code-quality code quality improvement label Sep 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Pathlib everywhere #23

Use Pathlib everywhere #23

rusheb commented Feb 20, 2023 •

edited

Loading

valedan commented Mar 6, 2023

rusheb commented Mar 7, 2023

valedan commented Mar 7, 2023

rusheb commented Mar 7, 2023

valedan commented Mar 22, 2023

Use Pathlib everywhere #23

Use Pathlib everywhere #23

Comments

rusheb commented Feb 20, 2023 • edited Loading

valedan commented Mar 6, 2023

rusheb commented Mar 7, 2023

valedan commented Mar 7, 2023

rusheb commented Mar 7, 2023

valedan commented Mar 22, 2023

rusheb commented Feb 20, 2023 •

edited

Loading