APISeedsGetter

The repository consists of a set of scripts used to extract data from APIs.

CienciaVitae API

The information we want to extract from the CienciaVitae API:

The CienciaVitae ID (e.g. 851A-7C31-0327)
The URLs associated with each CienciaVitae ID

Basic steps to get the information that Arquivo.pt wants from the CienciaVitae API:

Get all CienciaVitae IDs using the endpoint “persons all” (https://api.cienciavitae.pt/v1.1/searches/persons/all)
For each ID returned by the previous step, get all the publications (https://api.cienciavitae.pt/v1.1/curriculum/CIENCIAVITAE_ID?lang=User%20defined)
Extract the information
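
The steps above can be sketched as follows. This is a minimal illustration, not the code in this repository: it assumes the API accepts HTTP Basic authentication, and because the response layout is not described here, `extract_urls` simply scans every string in the decoded JSON for URL-like values. The helper names (`fetch_json`, `extract_urls`, and so on) are hypothetical.

```python
import base64
import json
import re
import urllib.request
from urllib.parse import quote

BASE = "https://api.cienciavitae.pt/v1.1"
URL_RE = re.compile(r"https?://[^\s\"'<>]+")


def persons_url():
    """Endpoint that lists all CienciaVitae IDs (step 1)."""
    return f"{BASE}/searches/persons/all"


def curriculum_url(ciencia_id):
    """Endpoint that returns the curriculum of one ID (step 2)."""
    return f"{BASE}/curriculum/{ciencia_id}?lang={quote('User defined')}"


def fetch_json(url, username, password):
    """GET a URL and decode the JSON body. Basic auth is an assumption."""
    token = base64.b64encode(f"{username}:{password}".encode()).decode()
    request = urllib.request.Request(
        url,
        headers={"Authorization": f"Basic {token}", "Accept": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def extract_urls(node):
    """Collect URL-like strings from any nested JSON value (step 3)."""
    found = set()
    if isinstance(node, dict):
        for value in node.values():
            found |= extract_urls(value)
    elif isinstance(node, list):
        for item in node:
            found |= extract_urls(item)
    elif isinstance(node, str):
        found.update(URL_RE.findall(node))
    return found
```

With these helpers, the URLs for one person would be `extract_urls(fetch_json(curriculum_url(ciencia_id), username, password))`.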

Parameters

-u or --username     --> Username for the CienciaVitae API
-p or --password     --> Password for the CienciaVitae API
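
A minimal sketch of how these two options could be parsed with `argparse`. Marking both as required is an assumption, and the actual script may define them differently.

```python
import argparse


def build_parser():
    """Parser exposing the two documented options."""
    parser = argparse.ArgumentParser(
        description="Extract seeds from the CienciaVitae API"
    )
    # Requiring both options is an assumption made for this sketch.
    parser.add_argument("-u", "--username", required=True,
                        help="Username for the CienciaVitae API")
    parser.add_argument("-p", "--password", required=True,
                        help="Password for the CienciaVitae API")
    return parser


# Parse an explicit argument list instead of sys.argv for illustration.
args = build_parser().parse_args(["-u", "USERNAME", "-p", "PASSWORD"])
```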

Run

python process_api_cienciaviate.py -u USERNAME -p PASSWORD

RCAAP API

The information we want to extract from the RCAAP API:

Links to the PDFs
All the citations from each PDF

Basic steps to get the information that Arquivo.pt wants from the RCAAP API:

Get the publications (e.g. https://www.rcaap.pt/api/v2/search/results/publications)
For each publication, get the entity (e.g. https://www.rcaap.pt/api/v2/entity/a230c2cf-c84f-46e6-9ebc-cc722e9ef0fa)
For each entity, get the URL to the PDF
Extract the information
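
The steps above can be sketched as follows. The response layout of the RCAAP API is not described here, so this illustration makes two loud assumptions: entity IDs are found by scanning the search response for UUID-shaped strings, and PDF links are any `http` strings ending in `.pdf`. The `fetch_json` argument is a hypothetical callable that maps a URL to decoded JSON, which keeps the sketch independent of any HTTP library.

```python
import re

API = "https://www.rcaap.pt/api/v2"
SEARCH_URL = f"{API}/search/results/publications"
UUID_RE = re.compile(
    r"^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$"
)


def entity_url(entity_id):
    """Endpoint that returns a single entity."""
    return f"{API}/entity/{entity_id}"


def walk_strings(node):
    """Yield every string value found in a decoded JSON document."""
    if isinstance(node, dict):
        for value in node.values():
            yield from walk_strings(value)
    elif isinstance(node, list):
        for item in node:
            yield from walk_strings(item)
    elif isinstance(node, str):
        yield node


def collect_pdf_links(fetch_json):
    """Follow publications -> entities -> PDF links."""
    results = fetch_json(SEARCH_URL)
    # Assumption: entity IDs appear as UUID strings in the search response.
    entity_ids = [s for s in walk_strings(results) if UUID_RE.match(s)]
    links = []
    for entity_id in entity_ids:
        entity = fetch_json(entity_url(entity_id))
        # Assumption: a PDF link is any http(s) string ending in ".pdf".
        links += [s for s in walk_strings(entity)
                  if s.startswith("http") and s.lower().endswith(".pdf")]
    return links
```

Passing the fetcher in as an argument makes it possible to try the traversal against canned responses before pointing it at the live API.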

Parameters

-p or --path          --> Location of the processed files
-d or --destination   --> Destination of the output files

Setup

Start the Tika server in a separate shell:

java -mx1000m -jar tika-server-1.20.jar --port=9998
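
Once the server is listening on port 9998, a PDF can be converted to plain text by sending it to Tika's `/tika` endpoint with a PUT request. The sketch below shows one way to do that with the standard library. Whether `rcaap_api.py` talks to the server this way or through a client library is not stated here, so the function names are hypothetical.

```python
import urllib.request

TIKA_URL = "http://localhost:9998/tika"  # port taken from the command above


def build_tika_request(pdf_bytes, tika_url=TIKA_URL):
    """Build a PUT request asking Tika to return the document as plain text."""
    return urllib.request.Request(
        tika_url,
        data=pdf_bytes,
        method="PUT",
        headers={"Content-Type": "application/pdf", "Accept": "text/plain"},
    )


def extract_text(pdf_bytes):
    """Send the PDF to the running Tika server and return the extracted text."""
    with urllib.request.urlopen(build_tika_request(pdf_bytes)) as response:
        return response.read().decode("utf-8")
```

`extract_text` needs the server to be running. `build_tika_request` can be inspected without any network access.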

Run

python rcaap_api.py

Scholar Tecnico API

See the README inside the ScholarTecnico_API folder for instructions.
