RESIN: Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System
-
Run
git clone https://github.com/RESIN-KAIROS/RESIN-pipeline-public
-
Run
cd RESIN-pipeline-public
-
Setup
${KAIROS_LIB}
dir:-
https://github.com/RESIN-KAIROS/RESIN-pipeline-public/blob/api/docker-compose.yaml#L8;
-
Uncompress data package (e.g, LDC2020E33) and put everything under
${KAIROS_LIB}/resin/resin/input/task1
;e.g.,
${KAIROS_LIB}/resin/resin/input/task1/{data,docs,tools}
-
Put schemas library under
${KAIROS_LIB}/resin/resin/schemas
-
-
Download and uncompress external data http://159.89.180.81/demo/resources/edl_data.tar.gz;
Set up dirs:
-
Set up device number for each GPU-based component, e.g.,
https://github.com/RESIN-KAIROS/RESIN-pipeline-public/blob/api/docker-compose.yaml#L44
-
Start APIs using:
docker-compose up
-
Send the following POST message to the main API (https://github.com/RESIN-KAIROS/entrypoint) to start processing:
curl -X POST --header "Content-Type: application/json" -d '{"id": "3fa85f64-5717-4562-b3fc-2c963f66afa6", "runId": "my_run_id", "sender": "string", "time": "2020-11-25T03:34:48.008Z", "content": {"data": "Example source document content here."}, "contentUri": "s3://kairos-experiment-data/performera/"}' http://0.0.0.0:10100/kairos/entrypoint
- Output will be in this dir:
${KAIROS_LIB}/resin/resin/persist/${my_run_id}
- Output will be in this dir:
-
Send the following GET message to check the status:
curl -X GET http://0.0.0.0:10100/kairos/status
Please view source code of each docker component in the docker-compose file.
@article{wen2021resin,
author = {Haoyang Wen and Ying Lin and Tuan M. Lai and Xiaoman Pan and Sha Li and Xudong Lin and Ben Zhou and Manling Li and Haoyu Wang and Hongming Zhang and Xiaodong Yu and Alexander Dong and Zhenhailong Wang and Yi R. Fung and Piyush Mishra and Qing Lyu and Dídac Surís and Brian Chen and Susan W. Brown and Martha Palmer and Chris Callison-Burch and Carl Vondrick and Jiawei Han and Dan Roth and Shih-Fu Chang and Heng Ji},
title = {RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System},
journal = {NAACL Demo Track},
year = {2021}
}