Public datasets for analysis (or any other use you like).
Feel free to add more to this list!
- https://labs-repos.iit.demokritos.gr/skel/i-config/downloads/enron-spam/
- https://github.com/clips/pattern/tree/master/test/corpora
tasdik at Acer in ~/Dropbox/projects/datasets on master
$ tree
.
├── email
│   ├── csv
│   │   └── spam-apache.csv
│   └── plaintext
│       ├── corpus1.zip
│       ├── corpus2.zip
│       ├── corpus3.zip
│       ├── enron1.zip
│       ├── enron2.zip
│       ├── enron3.zip
│       ├── enron4.zip
│       ├── enron5.zip
│       └── enron6.zip
├── LICENSE
├── README.md
├── reviews
│   └── short_reviews
│       ├── negative.txt
│       ├── positive.txt
│       └── README.md
└── spelling
    └── spelling-birkbeck.csv

6 directories, 16 files
$
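
To peek inside one of the zipped corpora without unpacking it, something like the following might do. It is only a sketch: `list_archive` is a hypothetical helper (not part of this repo), and the path assumes you run it from the repository root with the layout shown in the tree listing.

```python
import zipfile
from pathlib import Path


def list_archive(path, limit=5):
    """Return the first `limit` file names stored in the zip archive at `path`."""
    with zipfile.ZipFile(path) as archive:
        # namelist() also returns directory entries; keep only regular members.
        names = [n for n in archive.namelist() if not n.endswith("/")]
    return names[:limit]


if __name__ == "__main__":
    # Assumed path, taken from the tree listing; adjust to your checkout.
    archive_path = Path("email/plaintext/enron1.zip")
    if archive_path.exists():
        for name in list_archive(archive_path):
            print(name)
    else:
        print(f"{archive_path} not found; run this from the repository root")
```

The internal layout of each archive is not documented here, so the sketch only lists member names rather than assuming any folder structure.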
MIT License, Tasdik Rahman (@tasdikrahman)
You can find a copy of the License at http://prodicus.mit-license.org/