Skip to content

Added baseball.csv #117

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1 commit into from
Closed

Added baseball.csv #117

wants to merge 1 commit into from

Conversation

chrisjordansquire
Copy link

This seems to work. Was that what you had in mind?

@wesm
Copy link
Member

wesm commented Aug 30, 2011

That's cool. I was thinking about including a bigger piece to write meaningful/interesting groupby examples. I haven't looked into the license for the baseball data, plyr is MIT license so I assume that data is just public domain but...

@chrisjordansquire
Copy link
Author

Didn't think about that.

It comes from http://www.baseball-databank.org/

They say their data is open for use by individual researchers, but other uses require their permission. On the other hand, I read the MIT license as saying copying any part of the software and its documentation is fine, and the data was released with the software. So it seems like the only real question is if the MIT license should be included with the baseball.csv file.

(The MIT license only says it needs repeating when 'substantial' parts of the software are copied.)

@chrisjordansquire
Copy link
Author

Another potential solution is that I'm currently writing a little function to generate arbitrarily sized biostatistics-like data sets. I'd been intending it as a learning tool to let me explore pandas functionality for datasets more like what I'm used to (e.g., lots of categorical data), but it could be adapted to creating large tables of data for this as well.

@wesm
Copy link
Member

wesm commented Sep 13, 2011

I included a little chunk (100 rows) in doc/data so we should be all set

@wesm wesm closed this Sep 13, 2011
dan-nadler pushed a commit to dan-nadler/pandas that referenced this pull request Sep 23, 2019
Support Python 3.5 in Travis and fix issues with pandas 0.18.0
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants