Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More flexible duplicate search #119

Open
rshadow opened this issue May 28, 2019 · 3 comments
Open

More flexible duplicate search #119

rshadow opened this issue May 28, 2019 · 3 comments

Comments

@rshadow
Copy link

rshadow commented May 28, 2019

There are links that do not match 100%, but are duplicates:

  • Links with a slash at the end. For example: http://example.com and http://example.com/
  • Http and https links. For example: http://example.com/ and https://example.com/
  • Links with empty hashtag: http://example.com/# and http://example.com/

These are the cases that I found during the deduplication of a collection of bookmarks.

@cadeyrn
Copy link
Owner

cadeyrn commented May 28, 2019

Hi,

thanks for the suggestion. HTTP:// and HTTP:// are not the same, there can be different content. But I will consider the other cases.

@seefood
Copy link

seefood commented Jun 4, 2019

I use another extension called Bookmark Dupes because I love their "expert mode" that lets me use regex to find even more dupes (I just passed 10k bookmarks, so that's very useful). Among the stuff I do with it:

  • remove extra strings after #
  • remove extra get parameters like youtube's &feature-youtu.be or other site's &utm_foo=bar and the like
  • strip www. from the hostname
    and lots more.

@cadeyrn cadeyrn added this to the Version 4.2.0 milestone Aug 13, 2023
@tDeContes
Copy link

No need to compare at the time to search for duplicates, if they are handled at the time to update redirections.

About HTTP/HTTPS, there is a problem with the "HTTPS only" mode when a site is available only with HTTP. (Do you need more details / examples ?)

About empty hashtag, is it possible to handle obsolete hashtags or not ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants