Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: schedule html parser not parsing all courses #982

Merged

Conversation

limwa
Copy link
Member

@limwa limwa commented Sep 29, 2023

This PR fixes an issue with the schedule HTML parser where the parser would only parse the classes from the first course of each student.

Review checklist

  • Terms and conditions reflect the current change
  • Contains enough appropriate tests
  • If aimed at production, writes a new summary in whatsnew/whatsnew-pt-PT
  • Properly adds an entry in changelog.md with the change
  • If PR includes UI updates/additions, its description has screenshots
  • Behavior is as expected
  • Clean, well-structured code

@limwa limwa changed the base branch from develop to master September 29, 2023 21:07
@limwa
Copy link
Member Author

limwa commented Sep 29, 2023

@bdmendes @LuisDuarte1 this one is ready as well, after I format the code 😎

@limwa limwa force-pushed the hotfix/schedule-html-parser-not-parsing-all-courses branch from 9dab75f to 1a93d2a Compare September 29, 2023 21:09
Copy link
Collaborator

@bdmendes bdmendes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not really seeing what is changed. Shouldn't this be equivalent to zipping?

@limwa
Copy link
Member Author

limwa commented Sep 29, 2023

By the way, even with this PR, the parser is not parsing the HTML perfectly. However, this PR makes it parse stuff at least.

@limwa
Copy link
Member Author

limwa commented Sep 29, 2023

I'm not really seeing what is changed. Shouldn't this be equivalent to zipping?

That's why I changed the code to make it a bit clearer.

With IterableZip: IterableZip([1, 2, 3], [4]) -> [[1, 4]]
Without IterableZip: [1, 2, 3].map((e) => [e, 4]) -> [[1, 4], [2, 4], [3, 4]]

@codecov
Copy link

codecov bot commented Sep 29, 2023

Codecov Report

Merging #982 (4699efb) into master (7558258) will decrease coverage by 0%.
Report is 1 commits behind head on master.
The diff coverage is 100%.

Additional details and impacted files
@@          Coverage Diff          @@
##           master   #982   +/-   ##
=====================================
- Coverage      23%    23%   -0%     
=====================================
  Files         143    143           
  Lines        4349   4348    -1     
=====================================
- Hits          964    959    -5     
- Misses       3385   3389    +4     

bdmendes
bdmendes previously approved these changes Sep 29, 2023
@limwa
Copy link
Member Author

limwa commented Sep 29, 2023

The idea is that IterableZip stops when it reaches the end of one of the iterables. When a user has 5 courses, but the network router only returns 1 URL (my case), only the first course is parsed (because IterableZip reached the end of the URLs list)

Copy link
Member

@LuisDuarte1 LuisDuarte1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While I didn't think about the iterablezip bug at the time, I think that this won't work on a session with multiple faculties and different courses between them? The logic seems too hacky as you described in a comment

@limwa
Copy link
Member Author

limwa commented Sep 29, 2023

While I didn't think about the iterablezip bug at the time, I think that this won't work on a session with multiple faculties and different courses between them? The logic seems too hacky as you described in a comment

Indeed it won't. For instance, if a student has 2 faculties, should I create one element with the first faculty and another with the second faculty? That way, I think it'll work. I'm busy right now but I can implement it in a bit!

@limwa
Copy link
Member Author

limwa commented Sep 29, 2023

While I didn't think about the iterablezip bug at the time, I think that this won't work on a session with multiple faculties and different courses between them? The logic seems too hacky as you described in a comment

Indeed it won't. For instance, if a student has 2 faculties, should I create one element with the first faculty and another with the second faculty? That way, I think it'll work. I'm busy right now but I can implement it in a bit!

@LuisDuarte1 I noticed that the solution I outlined wouldn't work. Mainly because I think that the method that parses the HTML only needs the baseUrl used for the request that fetched the schedule. Therefore, I decided to rework the method a bit.

I am not very happy with the way it is currently, but I can't think of another way to match the baseUrl with the request...

@limwa limwa requested a review from LuisDuarte1 September 29, 2023 22:57
Copy link
Collaborator

@bdmendes bdmendes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This seems correct to me now, as it's getting the classes from all courses in all facilities. This is very hard to test, though.
#797 will guarantee that the chosen faculties are correct.

Copy link
Member

@LuisDuarte1 LuisDuarte1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good work!

@LuisDuarte1 LuisDuarte1 merged commit ae31671 into master Sep 30, 2023
@LuisDuarte1 LuisDuarte1 deleted the hotfix/schedule-html-parser-not-parsing-all-courses branch September 30, 2023 15:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants