
Conversation

@nealrichardson (Member) commented Jul 25, 2019

No description provided.

@nealrichardson (Member, Author):

It would be cool to use a real/interesting example Parquet file here--anyone know of any? I found a few online but they're all multi-file partitioned things, which we don't have good support for in R yet.

Member:

I always like to use the New York Taxi trip dataset for Parquet examples: a month of data has a decent size but loads very quickly. Sadly, there is no official source for a Parquet version of it.

@nealrichardson (Member, Author):

@wesm you may especially want to review this section for historical accuracy and current policy stance.

@wesm (Member) commented Jul 25, 2019

Will review. We'll have to be careful about what we call a "release" on this blog, since that has a very specific meaning in Apache-land. When in doubt, say "Available on CRAN" rather than "Released on CRAN"

Member:

Link to CRAN (for people who don't know what that is)?

Member:

The "list of PPAs" is a bit too specific. Say "See ... to find pre-compiled binary packages for some common Linux distributions such as Debian, Ubuntu, CentOS, and Fedora. Other Linux distributions must install the libraries from source."

Member:

Maybe say "Apache Parquet support" here

Member:

I think you need to qualify that this is "preliminary" read and write support that is in its early stages of development. Otherwise you're setting the wrong expectations. It would be accurate (and helpful) to state that the Python Arrow library has much richer support for Parquet files, including multi-file datasets, and we hope to achieve feature equivalency in the next 12 months.

@wesm (Member), Jul 26, 2019:

It's accurate to say "includes a much faster implementation of the Feather file format"

When you say it was one of the "initial products coming out of the Arrow project" -- it actually wasn't. Perhaps say "was one of the initial applications of Apache Arrow for Python and R".

Member:

Maybe you want to say that we will look at adapting the "feather" package to be based on "arrow" (though this could upset some users).

Member:

I think if you say "Parquet supports various compression formats" it might bring up some canards with the R community. It's simpler to say that "Parquet is optimized to create small files and as a result can be more expensive to read locally, but it performs very well with remote storage like HDFS or Amazon S3. Feather is designed for fast local reads, particularly with solid state drives, and is not intended for use with remote storage systems. Feather files can be memory-mapped and read in Arrow format without any deserialization while Parquet files always must be decompressed and decoded."

Member:

This is the first time you reference "Spark" in the article -- you need to use "Apache Spark"

Member:

To avoid peanuts being hurled from the gallery, you may want to state here that the functions like read_csv_arrow are being developed to optimize for the memory layout of the Arrow columnar format, and are not intended as a replacement for "native" functions that return R data.frame, for example.

Member:

nit: change filename to remove "release"

Aug 8, 2019: @wesm changed the title from "ARROW-6041: [Website] Blog post announcing R package release" to "ARROW-6041: [Website] Blog post announcing R library availability on CRAN".