Crowdsourced image quality control for the Dark Energy Survey (DES). The code available here is for a web-app that asks users to identify flaws in survey images. The app loads and displays FITS image directly in the browser and works on desktop and mobile devices.
More details can be found in the associated paper (Astronomy & Computing, arXiv).
A live demo version of the exposure checker in action can be found here:
http://des-exp-checker.pmelchior.net
The license is MIT. Feel free to use and modify, but please cite our paper if you do.
The app requires a webserver (tested on apache and nginx) with PHP and SQLite support. Then:
-
Clone the repository and go into the new directory.
-
Copy
htaccess.txt
to.htaccess
and append whatever else may be needed for your installation. -
Execute
sqlite3 files.db < sql/files.sql sqlite3 users.db < sql/users.sql
This step creates two empty database files
files.db
andusers.db
, move them into a directory the webserver can access but that is hidden from direct access. Because of.htaccess
, files and directories that start with a.
are inaccessible when requested through the webserver, so you do:mkdir .db mv files.db users.db .db/
Once there, make sure that the webserver user can read from and write to those files.
-
Load information about the test images into the file database. As you can see from the schema in
sql/files.sql
, each image needs to have 4 items of information:expname TEXT, ccd INT, band TEXT, name TEXT
ccd
andband
should be obvious,expname
is the short but unique name of the exposure, andname
(as a bit of a misnomer) is actually the filename of the FITS file in question. No full paths are needed, you'll set them in$config['fitspath']
next. -
Copy
config.php.template
toconfig.php.inc
and edit as needed (see below). -
On a production environment: Remove
.git
,htaccess-minimum.txt
,config.php.template
, andREADME.md
.
This file contains almost all of the particular configuration for the server and the location of files. Its content is:
$config = array(
"adminemail" => T_STRING, // email address used for contact
"adminname" => T_STRING, // name used for admin emails
"domain" => T_STRING, // webserver domain name
"releases" => T_ARRAY, // list of available releases
"userdb" => T_STRING, // path to the user database
"filedb" => T_ARRAY, // release => path to file database
"fovpath" => T_ARRAY, // release => path to FoV images
"fitspath" => T_ARRAY // release => path to fits files
"images_per_fp" => T_INT, // size of focal plane, used for congrats
"release" => NULL, // empty, set later
"problem_code" => T_ARRAY // problem label => numeric code
);
The code has a mechanism to switch between data releases. For that define a list of releases, e.g.
"releases" => array("SVA1", "Y1A1")
Set the arrays for filedb
, fovpath
,fitspath
with one key-value pair for each release, e.g.
"filedb" => array("SVA1" => ".db/files.sva1.db",
"Y1A1" => ".db/files.y1a1.db")
Open the file release_selector.shtml
and modify as needed. The name of each release must be the content of HTML node of class="release-button"
like
<a class="release-button" href="#">SVA1</a>
Problem classes are defined in two different locations: As textual labels for frontend users and as numbers for the server (the reduce overhead and storage requirements in the file databases).
-
Open
problem_selector.shtml
and modify as needed. For a label to be working as intended, it needs to be the content of an HTML node ofclass="problem"
, e.g.<a class="problem" href="#">Column mask</a>
-
Open
config.php.inc
again and make sure each problem label has a numeric code in$config['problem_code']
, e.g.$config['problem_code'] = array( "OK" => 0, // DO NOT CHANGE! "Other..." => 999, // DO NOT CHANGE! "Awesome!" => 1000, // DO NOT CHANGE! "Column mask" => 1, ... )
In addition to the frontend statistics pages, user generated reports can be queried with an API. The path is
api.php?release=SVA1&problem=Column%20mask
The parameter release
needs to be from $config['releases']
and problem needs to be a key in $config['problem_codes']
. If necessary, both need to be urlencoded.
The API returns JSON of the following form:
[
{"qa_id": number,
"expname": string,
"ccd": number,
"band": string,
"problem": string,
"x": number,
"y": number,
"detail": null or string,
"false_positive": true or false,
"release": string
}
]
The list has one of dictionary per reported problem that matches the request. qa_id
is a unique identifier of the report in the given release
, x/y
are the CCD coordinates of the center of the problem marker, detail
(only set for labels "Other…" and "Awesome!") is a user-generated text to describe the report.