Concyclics/OpenReviewSpider

OpenReviewSpider

Architecture

  • Metadata (JSON)
  • Database (SQLite)
  • Incremental update
  • Query co-authors by date (COI assist)
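As a rough illustration of the co-author query, the sketch below finds everyone who shares a paper with a given author on or after a cutoff date. The table names `papers` and `author_paper_edges` come from the flowchart in this README; the column names (`paper_id`, `author_id`, `submitted_on`), the sample rows, and the helper `coauthors_since` are assumptions made for this example and may not match the real schema.

```python
import sqlite3

# Hypothetical schema: table names follow the README flowchart,
# column names are assumed for illustration only.
SCHEMA = """
CREATE TABLE papers (paper_id TEXT PRIMARY KEY, title TEXT, submitted_on TEXT);
CREATE TABLE author_paper_edges (author_id TEXT, paper_id TEXT);
"""


def coauthors_since(conn, author_id, since):
    """Return sorted ids of authors sharing a paper with `author_id`
    that was submitted on or after `since` (ISO date string)."""
    rows = conn.execute(
        """
        SELECT DISTINCT other.author_id
        FROM author_paper_edges AS me
        JOIN author_paper_edges AS other ON other.paper_id = me.paper_id
        JOIN papers AS p ON p.paper_id = me.paper_id
        WHERE me.author_id = ?
          AND other.author_id <> me.author_id
          AND p.submitted_on >= ?
        ORDER BY other.author_id
        """,
        (author_id, since),
    ).fetchall()
    return [row[0] for row in rows]


# Made-up sample data in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO papers VALUES (?, ?, ?)",
    [("p1", "Old paper", "2019-05-01"), ("p2", "Recent paper", "2024-01-15")],
)
conn.executemany(
    "INSERT INTO author_paper_edges VALUES (?, ?)",
    [("~Alice1", "p1"), ("~Bob1", "p1"), ("~Alice1", "p2"), ("~Carol1", "p2")],
)

print(coauthors_since(conn, "~Alice1", "2022-01-01"))  # ['~Carol1']
```

Storing dates as ISO strings keeps the `>=` comparison correct without any date parsing, which is one plausible reason to choose that format in SQLite.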
flowchart TD
    RUN["OpenReviewSpider.run()"] --> GETCONF["get_all_conferences()"]
    GETCONF --> FORCONF{"for conf in conferences"}
    FORCONF -- "not in DB" --> WALK["walk_conference(conf)"]
    WALK --> GETSUB["get_submissions(conf)"]
    GETSUB --> FORSUB{"for submission in submissions"}
    FORSUB --> GETAUTH["get_authorids(submission)"]
    GETAUTH --> FORAUTH{"for author_id in authorids"}
    FORAUTH -- "not in DB" --> GETPROF["get_profile(author_id)"]
    GETPROF -- "found" --> ADDPROF["DB.add_profile(profile)"]
    GETPROF -- "not found" --> ADDEMPTY["DB.add_empty_profile(author_id)"]
    FORSUB --> ADDPAPER["DB.add_paper(submission, conf)"]
    WALK --> ADDCONF["DB.add_conference(conf, len(submissions))"]
    ADDPROF --> AUTHORS[("authors table")]
    ADDEMPTY --> AUTHORS
    ADDPAPER --> PAPERS[("papers table")]
    ADDPAPER --> EDGES[("author_paper_edges table")]
    ADDPAPER --> REVIEWS[("reviews table")]
    ADDCONF --> CONFS[("conferences table")]
    style Spider fill:#e0f7fa,stroke:#26c6da,stroke-width:2px
    style DB fill:#f1f8e9,stroke:#8bc34a,stroke-width:2px
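The incremental behaviour in the flowchart (the two "not in DB" branches) can be sketched roughly as follows. This is not the project's code: the function `run`, the fetcher callbacks, the column names, and the fake data are all hypothetical, and the `reviews` table is left out for brevity. Only the table names and the order of steps are taken from the diagram.

```python
import sqlite3

# Assumed minimal schema; only the table names come from the flowchart.
SCHEMA = """
CREATE TABLE conferences (conf_id TEXT PRIMARY KEY, n_submissions INTEGER);
CREATE TABLE authors (author_id TEXT PRIMARY KEY, name TEXT);
CREATE TABLE papers (paper_id TEXT PRIMARY KEY, conf_id TEXT);
CREATE TABLE author_paper_edges (author_id TEXT, paper_id TEXT);
"""


def run(conn, conferences, fetch_submissions, fetch_profile):
    """Walk only conferences missing from the DB; fetch only unseen profiles."""
    for conf in conferences:
        seen = conn.execute(
            "SELECT 1 FROM conferences WHERE conf_id = ?", (conf,)
        ).fetchone()
        if seen:
            continue  # conference already crawled on an earlier run
        submissions = fetch_submissions(conf)
        for sub in submissions:
            for author_id in sub["authorids"]:
                known = conn.execute(
                    "SELECT 1 FROM authors WHERE author_id = ?", (author_id,)
                ).fetchone()
                if known:
                    continue
                # A missing profile (None) is stored as an empty row so the
                # same id is not requested again on the next run.
                name = fetch_profile(author_id)
                conn.execute("INSERT INTO authors VALUES (?, ?)", (author_id, name))
            conn.execute("INSERT INTO papers VALUES (?, ?)", (sub["id"], conf))
            conn.executemany(
                "INSERT INTO author_paper_edges VALUES (?, ?)",
                [(a, sub["id"]) for a in sub["authorids"]],
            )
        # Recorded last, so an interrupted walk is retried on the next run.
        conn.execute("INSERT INTO conferences VALUES (?, ?)", (conf, len(submissions)))
    conn.commit()


# Fake data standing in for the OpenReview API.
FAKE = {
    "ConfA": [{"id": "p1", "authorids": ["~Alice1", "~Bob1"]}],
    "ConfB": [{"id": "p2", "authorids": ["~Alice1"]}],
}
calls = []


def fake_profile(author_id):
    calls.append(author_id)
    return None if author_id == "~Bob1" else author_id.strip("~")


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
run(conn, ["ConfA"], FAKE.__getitem__, fake_profile)
run(conn, ["ConfA", "ConfB"], FAKE.__getitem__, fake_profile)
print(calls)  # ['~Alice1', '~Bob1']
```

On the second call, `ConfA` is skipped entirely and `~Alice1` is not fetched again for `ConfB`, so each profile is requested only once.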

How to Run

Requirements

python==3.12
openreview-py>=1.50
requests>=2.22.0
tqdm

Set Account

Edit src/main.py and set your OpenReview username and password:

import openreview
from openReviewSpider import OpenReviewSpider

if __name__ == '__main__':
    # Log in to the OpenReview API with your account
    client = openreview.api.OpenReviewClient(
        baseurl='https://api2.openreview.net',
        username='your_username',  # Replace with your OpenReview username
        password='your_password',  # Replace with your OpenReview password
    )

    # Second argument is the directory where the spider stores its data
    spider = OpenReviewSpider(client, '~/data/')
    spider.run()
    print("OpenReview Spider finished running.")
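If you would rather not keep a password in src/main.py, one option is to read it from the environment. The sketch below is only a suggestion: the variable names `OPENREVIEW_USERNAME` and `OPENREVIEW_PASSWORD` and the helpers `load_credentials` and `build_client` are invented for this example and are not part of the repository. The `OpenReviewClient` call mirrors the one shown above.

```python
import os


def load_credentials(env=os.environ):
    """Read the account from two (assumed) environment variables."""
    try:
        return env["OPENREVIEW_USERNAME"], env["OPENREVIEW_PASSWORD"]
    except KeyError as missing:
        raise SystemExit(f"Set the {missing.args[0]} environment variable first.")


def build_client(env=os.environ):
    """Create the same client as in main.py, without hard-coded secrets."""
    import openreview  # imported lazily so this sketch loads without the package

    username, password = load_credentials(env)
    return openreview.api.OpenReviewClient(
        baseurl="https://api2.openreview.net",
        username=username,
        password=password,
    )


# Python does not expand "~" in paths by itself; whether OpenReviewSpider
# does so internally is not stated here, so expanding it up front is the
# safe choice.
data_dir = os.path.expanduser("~/data/")
```

With this in place, main.py could call `OpenReviewSpider(build_client(), data_dir)` after exporting the two variables in your shell.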

Run

cd src
python main.py

Data Analysis

See DataVisualization.
