Audit? #402
Replies: 2 comments 1 reply
-
Unfortunately I cannot contribute with code, just some thoughts:
Some thoughts on automatic checking before adding:
|
Beta Was this translation helpful? Give feedback.
-
I think this is a good idea. For the porn list, we could automate testing using McAfee (if their rating system & use of a unofficial api are ok) and review manually the ones not detected. We also can test those in a VM, but that will take a long time & is subject to website downtimes, ip blocking, and other problems. We also could use VirusTotal, either via the web interface or the API. |
Beta Was this translation helpful? Give feedback.
-
I don't want to stifle discussion, but after some recent scalability problems, I don't see this happening anytime soon. I'm locking this discussion for the time being. Long term, this is something I think we should revisit. However, I don't believe the project is in a position to look at this currently.
Wanted to start a discussion about an idea I had regarding @WordsOfMe's comments on #350.
Quality is critical for a project like this. Part of that is always coming up with ideas and methods to work to increase quality of the lists. So, I have no idea if this is even a remotely good idea. But brainstorming new ideas to increase quality I think benefits the project.
So. Do we think doing a full audit of every domain on every list would be beneficial to the project? Basically verifying that every one is legitimate and should exist on the list.
I'm not quite sure how this would work. Maybe it can be assisted by automation? Possibly a system can create audit issues based on if domains have recently be transferred owners or changed IP addresses that the domain points to? Then as those issues get marked as closed the system continues to expand and create more issues to end up encapsulating every domain? Maybe there can even be tools to give additional information in those issues that would be beneficial to determine if the domain should exist on the blocklist or not (screenshots of domain content maybe?)?
This would be a massive undertaking. And I'm not sure we are prepared for a project like this currently. I think it would also require some more concrete guidelines about what domains should be included vs not.
Just thought it'd be worth discussing to see if this is something we should work towards.
Beta Was this translation helpful? Give feedback.
All reactions