refactor: Do one prediction per input sequence, easier experimentation #27

carlosgjs · 2024-01-24T18:56:09Z

The prediction/evaluation interfaces used List[List[str]] as the prediction type to account for (a) batch inference, i.e. generating docs for multiple code snippets in one pass, and (b) multiple predictions per input (sampling). We have now decided to use deterministic prediction (temp=0), so the second level is no longer needed.

This PR changes the prediction return type to List[str], which also simplifies the code.
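To illustrate the shape change, here is a minimal sketch. The names `fake_generate`, `predict_old` and `predict_new` are hypothetical stand-ins and not the functions in this repo; the stub replaces the real model call so the example runs on its own.

```python
from typing import List


def fake_generate(snippet: str) -> str:
    """Hypothetical stand-in for a deterministic (temp=0) model call."""
    return f"Docs for: {snippet}"


def predict_old(snippets: List[str], num_samples: int = 1) -> List[List[str]]:
    """Old shape: outer list is the batch, inner list holds samples per input."""
    return [[fake_generate(s) for _ in range(num_samples)] for s in snippets]


def predict_new(snippets: List[str]) -> List[str]:
    """New shape: exactly one prediction per input, aligned by index."""
    return [fake_generate(s) for s in snippets]


snippets = ["def add(a, b): return a + b", "def neg(x): return -x"]
old = predict_old(snippets)
new = predict_new(snippets)

# With one deterministic sample per input, the inner list carried no information:
# callers previously had to index [0] on every element.
flattened_old = [samples[0] for samples in old]
```

With a single deterministic prediction per input, `flattened_old` and `new` hold the same values, so callers no longer need the extra index.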

Additional changes:

Replace the max_length parameter with max_new_tokens
Break up the eval function into eval_promp for easier prompt experimentation
Update the generate.ipynb notebook accordingly
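As a rough sketch of why the max_new_tokens switch matters: this assumes the two parameters follow the semantics of Hugging Face transformers' `generate`, where max_length bounds prompt plus generated tokens and max_new_tokens bounds only the generated tokens. The PR does not state which library is used, and the helper names below are hypothetical.

```python
def budget_with_max_length(prompt_tokens: int, max_length: int) -> int:
    """Tokens left for generation when the cap covers prompt + output (assumed semantics)."""
    return max(0, max_length - prompt_tokens)


def budget_with_max_new_tokens(prompt_tokens: int, max_new_tokens: int) -> int:
    """Tokens left for generation when the cap covers output only (assumed semantics)."""
    return max_new_tokens


# Under max_length, a longer prompt silently shrinks the room left for the docs.
short_prompt = budget_with_max_length(prompt_tokens=50, max_length=256)
long_prompt = budget_with_max_length(prompt_tokens=240, max_length=256)

# Under max_new_tokens, the generation budget does not depend on prompt size.
fixed_short = budget_with_max_new_tokens(prompt_tokens=50, max_new_tokens=128)
fixed_long = budget_with_max_new_tokens(prompt_tokens=240, max_new_tokens=128)
```

Under these assumptions, a prompt-independent output budget makes it easier to compare prompts of different lengths, which fits the goal of easier prompt experimentation.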

codecov-commenter · 2024-01-24T19:00:37Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (10294bc) 97.00% compared to head (6f2c5da) 97.17%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #27      +/-   ##
==========================================
+ Coverage   97.00%   97.17%   +0.16%     
==========================================
  Files           3        3              
  Lines         167      177      +10     
==========================================
+ Hits          162      172      +10     
  Misses          5        5


refactor: Do one prediction per input sequence, easier experimentation

6f2c5da

carlosgjs requested a review from anujsinha3 January 24, 2024 18:56

anujsinha3 approved these changes Jan 24, 2024


carlosgjs merged commit 3c7e0a0 into main Jan 25, 2024

carlosgjs deleted the carlosg/onepred branch January 25, 2024 01:06
