Skip to content

Mustabsar1/WRDSDownloadScripts

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WRDSDownloadScripts

This repo contains IPython jupyter notebook scripts to download specific big datasets from WRDS and calculate statistics at a more aggregated level. A naive query is not possible due to massive amounts of data (e.g. intraday bond transactions of 1GB per day) at a disaggregated level. The scripts breaks up the queries into manageable chunks and perform a loop.

Requirements

Datasets

  • TRACE Bond intraday transaction data: run TRACE.ipynb
    • script computes daily statistics from intraday bond transactions and create a dataset at the bond-day level
  • Calculate earnings Response Coefficients (ERC): run ERC script.ipynb
    • script computes the necessary variables to calculate ERC's: earnings announcement returns, summary of analyst earnings forecasts and actual forecasts from IBES

personal notes

  • Here is a general guide on how to connect to WRDS with python
  • WRDS server session: this approach is for when you want to get data and compute stuff on the WRDS server. It is possible to execute python scripts. But it is unclear to me how to execute SQL commands that include computations such as merges, as opposed to only selecting and downloading data.
    • ssh USERNAME@wrds-cloud.wharton.upenn.edu to login
    • qrsh to get into compute node

About

Scripts to download large datasets from WRDS

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 74.6%
  • Python 25.4%