Skip to content

Working on a large git repo #711

Open
@last-partizan

Description

@last-partizan

I tried this on relatively large old repo, and i'm getting this error (after waiting for a few minutes):

2024-10-10 10:30:07,184 Checking rwarning: exhaustive rename detection was skipped due to too many files.
warning: you may want to set your diff.renameLimit variable to at least 77823 and retry the command.
Exception in thread Thread-1 (_worker_function):
Traceback (most recent call last):
  File "/home/serg/.local/pipx/venvs/seagoat/lib/python3.12/site-packages/seagoat/queue/base_queue.py", line 78, in _worker_function
    task = self._task_queue.get(timeout=0.1)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.12/queue.py", line 179, in get
    raise Empty
_queue.Empty

During handling of the above exception, another exception occurred:
...
  File "/home/serg/.local/pipx/venvs/seagoat/lib/python3.12/site-packages/seagoat/repository.py", line 119, in top_files
    (self.get_file(filename), score)
     ^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/serg/.local/pipx/venvs/seagoat/lib/python3.12/site-packages/seagoat/repository.py", line 134, in get_file
    self.get_file_object_id(filename),
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/serg/.local/pipx/venvs/seagoat/lib/python3.12/site-packages/seagoat/repository.py", line 45, in get_file_object_id
    subprocess.check_output(

After raising renameLimit, it again fails on same line, but with IndexError. Upon investigation, it tries to access src/app/node_modules/abbrev/README.md, which should be ignored, because node_modules is in my .gitignore. But, not in root, so maybe it treats ignore patterns differenly ...

output is '' - empty string, and it fails to access second part after split().

I can make a patch for it, if you suggest proper way to do it. My first guess is to return None for such cases and filter them later...

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions