Description
I am suggesting a new category of Data Hazard called "AI Sourced Data". Suggested symbol: Ouroboros.
These would be cases in which data scraped from the internet or other sources turns out to be AI-generated. That scraped data is then used to train more AI models, creating a negative feedback loop that produces progressively worse models (a failure mode often described as model collapse).
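As a minimal illustration of that loop (a toy sketch written for this issue, not code from any existing project), the snippet below repeatedly fits a Gaussian to samples drawn from its own previous fit. Each generation's "training set" consists entirely of the previous model's output, so small estimation errors feed back into every new generation and the fitted parameters drift away from the original data:

```python
# Toy sketch (illustrative only): a "model" that is just a Gaussian fit,
# repeatedly retrained on samples drawn from its own previous fit. The
# resampled data stands in for AI-generated content being scraped back
# into the training set; estimation errors compound across generations.
import numpy as np

rng = np.random.default_rng(0)

# Generation 0: "real" human-produced data, a standard normal distribution.
data = rng.normal(loc=0.0, scale=1.0, size=200)

for generation in range(1, 21):
    # "Train": estimate the distribution's parameters from the current data.
    mu, sigma = data.mean(), data.std()
    print(f"generation {generation:2d}: mean={mu:+.3f}  std={sigma:.3f}")

    # "Scrape": the next training set is drawn entirely from the model's
    # own output instead of from the original source.
    data = rng.normal(loc=mu, scale=sigma, size=200)
```

Real pipelines obviously involve far more complex models, but the mechanism is the same: once synthetic output dominates the training data, errors accumulate instead of averaging out.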
This can be intentional in some cases, for example "Nightshade", an AI-poisoning tool intended to protect artists' copyright. In many other cases, though, it comes down to oversight during training-data collection or a direct malicious attempt to sabotage a model.
Models trained on AI-sourced data can also reinforce other data hazards, such as existing bias, privacy issues, and more.