You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Nov 2, 2023. It is now read-only.
Allow to download and store the data, either in a folder or in a MongoDB, about a list of pages.
3
+
This toolkit allow to manipulate and download information about Wikipedia's pages and users through the apis:
4
+
- https://[domain].wikipedia.org/w/api.php
5
+
-http://stats.grok.se/
6
+
The main classes for handling informations are WikiPage and WikiUser.
4
7
5
-
Can be used through command line using the class net.wekeypedia.main.Main.
8
+
This toolkit allows multithreading for downloading the data using the classes DownloadThread and JobManager.
6
9
7
-
Require the mongoDB driver for java to work.
10
+
This toolkit also allow to store the data either in a folder or in a MongoDB (see the classes FolderManager and MongoManager).
11
+
12
+
An example of how to use this toolkit can be found with the class Main.
13
+
14
+
## require
15
+
16
+
Require the mongoDB driver in the classpath for java to work (http://docs.mongodb.org/ecosystem/drivers/java/).
17
+
18
+
# Ant file
19
+
20
+
## documentation
21
+
22
+
To generate the documentation launch.
23
+
24
+
## compilation and running
25
+
26
+
Before compiling, you need to edit the path section at the beginning of the build with the location of the mongoDB java driver. Then you can compile using to 'ant init' and try the toolkit using the class main with 'ant Main'.
27
+
28
+
To generate a runnable jar, you need to edit the manifest section at the end of the build with the location of the mongoDB java driver. Then, launch 'ant jar'.
0 commit comments