This is a copyright violation detector running on Wikimedia Cloud Services.
It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.
- earwigbot >= 0.1
- flask >= 0.10.1
- flask-mako >= 0.3
- mako >= 0.7.2
- mwparserfromhell >= 0.3
- mwoauth == 0.3.8
- oursql >= 0.9.3.1
- requests >= 2.9.1
- SQLAlchemy >= 0.9.6
- apsw >= 3.26.0
- uglifyjs >= 3.12.6
- cssnano >= 4.1.10
- postcss-cli >= 8.3.1
-
If using Toolforge, you should clone the repository to
~/www/python/src
, or otherwise symlink it to that directory. A virtualenv should be created at~/www/python/venv
. -
Install all dependencies listed above.
-
Create an SQL database with the
cache
andcache_data
tables defined by earwigbot-plugins. -
Create an earwigbot instance in
.earwigbot
(runearwigbot .earwigbot
). In.earwigbot/config.yml
, fill out the connection info for the database by adding the following to thewiki
section:_copyviosSQL: host: <hostname of database server> db: <name of database>
If additional arguments are needed by
oursql.connect()
, like usernames or passwords, they should be added to the_copyviosSQL
section. -
Run
./build.py
to minify JS and CSS files. -
Start the web server (on Toolforge,
webservice uwsgi-python start
).