Conversation
Your PR couldn't be matched to an assignment in this module. Please check that its title is in the correct format, and that you only have one PR per assignment. If this PR is not coursework, please add the NotCoursework label (and message on Slack in #cyf-curriculum, or it will probably not be noticed). If this PR needs review, please add the 'Needs Review' label to this PR after you have resolved the issues listed above.
Great work on both tasks, Alaa! You've successfully applied precomputing to improve the performance of both functions. Here's a summary of the review:
What you did
You tackled two separate functions and improved both:
- `count_letters`: You precomputed two sets (`lower_set` and `upper_set`) before doing any lookups. This is a classic precomputing pattern: build a lookup structure once at O(n) cost, then query it at O(1) per operation. The overall complexity drops from O(n²) to O(n). You also quietly fixed a latent bug: the old `is_upper_case` helper returned `True` for digits and spaces (since `'1'.upper() == '1'`), whereas `char.isupper()` correctly identifies only genuine uppercase letters.
- `common_prefix`: You sort the strings first and then only compare adjacent pairs. This works because of a useful property of lexicographic sorting: if two strings share a prefix P, any string that falls between them in sorted order must also start with P, so the pair with the longest common prefix will always be adjacent. This reduces comparisons from O(n²) to O(n log n).
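Since the diff itself isn't reproduced in this review, here is an illustrative sketch of what the two approaches described above could look like. The function names come from the PR; the bodies and the exact return conventions are my assumptions, not the actual submitted code:

```python
def count_letters(text):
    """Count letters which occur only in upper case in text."""
    # Precompute both lookup sets once (O(n) total), so every later
    # membership test is an O(1) hash lookup instead of an O(n) scan.
    lower_set = {char for char in text if char.islower()}
    upper_set = {char for char in text if char.isupper()}
    # str.isupper() sidesteps the latent bug where '1'.upper() == '1'
    # made digits and spaces look like upper-case letters.
    return len({char for char in upper_set if char.lower() not in lower_set})


def common_prefix(strings):
    """Longest prefix shared by any two of the given strings (assumed spec)."""
    if len(strings) < 2:
        return ""
    # After sorting, the pair sharing the longest prefix is adjacent,
    # so one pass over neighbours suffices after the O(n log n) sort.
    sorted_strings = sorted(strings)
    best = ""
    for index in range(len(sorted_strings) - 1):
        a, b = sorted_strings[index], sorted_strings[index + 1]
        length = 0
        while length < min(len(a), len(b)) and a[length] == b[length]:
            length += 1
        if length > len(best):
            best = a[:length]
    return best
```

For example, `common_prefix(["flower", "flow", "fight"])` sorts to `["fight", "flow", "flower"]` and finds the longest shared prefix, `"flow"`, between the adjacent last two entries.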
What you did well
- The precomputing pattern in `count_letters` is clean and idiomatic: set comprehensions are exactly the right tool here.
- The sorting insight in `common_prefix` is clever and mathematically sound.
- Clear, helpful comments explain the reasoning behind both approaches.
- You kept the public function signatures compatible and didn't break any existing tests.
Minor things to consider (not blockers)
- In `count_letters`, you make two passes over `text` (one for lowercase, one for uppercase). You could do it in one pass by building both sets together. Still O(n), just a small efficiency note.
- In `common_prefix`, the loop `for index in range(len(sorted_strings) - 1)` is correct, but a slightly more Pythonic alternative is `for a, b in zip(sorted_strings, sorted_strings[1:])`, which avoids index arithmetic.
These are optional improvements — the code as written is correct and clear.
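For what it's worth, the two optional tweaks could be sketched roughly like this (the surrounding function bodies are assumed, since the diff isn't shown here):

```python
def count_letters(text):
    """One-pass variant: build both lookup sets in a single scan of text."""
    lower_set, upper_set = set(), set()
    for char in text:  # one O(n) pass instead of two separate comprehensions
        if char.islower():
            lower_set.add(char)
        elif char.isupper():
            upper_set.add(char)
    return len({char for char in upper_set if char.lower() not in lower_set})


def adjacent_pairs(sorted_strings):
    """zip-based pairing: same adjacent pairs, no index arithmetic."""
    return list(zip(sorted_strings, sorted_strings[1:]))
```

Note that `zip` stops at its shorter argument, so `sorted_strings[1:]` being one element shorter naturally yields exactly `len(sorted_strings) - 1` pairs, with no off-by-one risk.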
VERDICT: completed
| """ | ||
| count_letters returns the number of letters which only occur in upper case in the passed string. | ||
| """ | ||
| only_upper = set() |
This is the key precomputing step. By building `lower_set` once here (O(n)), you turn every subsequent membership check into O(1) instead of O(n). The original code did `letter.lower() not in s` where `s` is a plain string; Python's `in` operator on strings is a substring search and costs O(n) per call, making the old function O(n²) overall. Your approach is a textbook example of precomputing a lookup structure.
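A quick, rough way to see that difference in practice is a self-contained timing sketch (not from the PR, just an illustration of string `in` versus set `in`):

```python
import timeit

text = "abcdefghij" * 10_000   # a 100,000-character string
lookup_set = set(text)         # precomputed once: O(n) build cost

# 'z' never occurs, so the substring search scans the whole string: O(n) per call.
substring_time = timeit.timeit(lambda: "z" in text, number=1_000)

# Set membership is a single hash lookup: O(1) per call.
set_time = timeit.timeit(lambda: "z" in lookup_set, number=1_000)

print(substring_time > set_time)
```

On any typical machine the precomputed set wins by orders of magnitude, which is exactly the O(n) versus O(1) per-query gap described above.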
So, what is a better way to write it?