Skip to content
/ rfna Public

Web page scraping for eFloras, including the Flora of North America

License

Notifications You must be signed in to change notification settings

sckott/rfna

Repository files navigation

rfna

Build Status Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.

This set of functions scrapes web content, and allows searches of the content, on the eFloras website, including Flora of North America, etc.

The website: http://www.efloras.org

Installation

Install rfna

remotes::install_github("sckott/rfna")
library('rfna')

Usage

Get families

res <- get_families("fna")
res$names[1:10]
#>  [1] "Achatocarpaceae"  "Acoraceae"        "Agavaceae"        "Aizoaceae"       
#>  [5] "Alismataceae"     "Aloaceae"         "Amaranthaceae"    "Amblystegiaceae" 
#>  [9] "Andreaeaceae"     "Andreaeobryaceae"

Get genera

res <- get_genera(from='fna', family='Asteraceae')
res$names[1:10]
#>  [1] "Asteraceae"     "Acamptopappus"  "Acanthospermum" "Achillea"      
#>  [5] "Achyrachaena"   "Acmella"        "Acourtia"       "Acroptilon"    
#>  [9] "Adenocaulon"    "Adenophyllum"

Parse a page

pg1<-'http://www.efloras.org/browse.aspx?flora_id=1&start_taxon_id=10074&page=1'
head(parse_page(pg1))
#>             name     id
#> 1     Asteraceae  10074
#> 2  Acamptopappus 100070
#> 3 Acanthospermum 100132
#> 4       Achillea 100191
#> 5   Achyrachaena 100226
#> 6        Acmella 100279
#>                                                                 url
#> 1  http://www.efloras.org/florataxon.aspx?flora_id=1&taxon_id=10074
#> 2 http://www.efloras.org/florataxon.aspx?flora_id=1&taxon_id=100070
#> 3 http://www.efloras.org/florataxon.aspx?flora_id=1&taxon_id=100132
#> 4 http://www.efloras.org/florataxon.aspx?flora_id=1&taxon_id=100191
#> 5 http://www.efloras.org/florataxon.aspx?flora_id=1&taxon_id=100226
#> 6 http://www.efloras.org/florataxon.aspx?flora_id=1&taxon_id=100279

Get state (paleate or epaleate) of receptacle.

url <- 'http://www.efloras.org/florataxon.aspx?flora_id=1&taxon_id=102552'
receptacle(url)
#> [1] "Argyranthemum" "epaleate"      "epaleate"

Meta

About

Web page scraping for eFloras, including the Flora of North America

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Contributors 4

  •  
  •  
  •  
  •