tablib vs dataset vs pandas: what areas do they cover, what is tablib's feature vision? #136

tony · 2014-01-17T21:55:19Z

@kennethreitz: Greetings, I saw https://github.com/kennethreitz/tablib/issues/124, I didn't want to burden with another issue, but I have a few questions about tablib to articulate:

Where does tablib stand against https://github.com/pudo/dataset?

It's not a duplicate effort, but I find the interoperability with databases to be a great value add and related.

Actually, from my pov, I feel a bit of redundancy, (I hope I convey this correctly) I have a need to import tabular into a table-friendly data into python object ( DataSet (tablib) and table (dataset). tablib can import from a wide variety of places, dataset maintains DB back-end.

I could possibly glue both together somehow. I'm not sure it would look elegant or / come a performance cost.

Is there a chance to potentially kill two birds with one stone and see if collaboration / cooperation is possible?
Where does tablib stand against https://github.com/pydata/pandas and its DataFrame's?

I think Pandas is a little too heavy to bring into an application I'd like to keep lite. It has a dependency of numpy. However, it covers a lot turf, I also see some interesting activity for Pandas DataFrame's and sql at ENH: sql support pandas-dev/pandas#4163.
This question is for @kennethreitz and everybody. I'm interested speed ups for python containers and data structures. Does cython or python C API stand any chance of offering a speed up if Dataset, Databook, and / or any parts of tablib were written in C? Is there anything in tablib that stands to benefit from C optimization?
Overall

What areas does tablib cover? What does it not? To be honest, I would really like a go-to for relational and tabular data without the overhead of pandas.

The text was updated successfully, but these errors were encountered:

kennethreitz · 2016-02-07T12:41:05Z

Dataset is a SQL library, Tablib is not.
Pandas is an incredibly powerful data science tool. Tablib is intended for small dataset creation and exporting into friendly formats.
No interest or need for speedups. This isn't a scientific library, and I want it to be as simple to install as possible. If you want things to run faster, I suggest you use PyPy as your runtime.

iurisilvio added the question label Sep 6, 2014

kennethreitz closed this as completed Feb 7, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tablib vs dataset vs pandas: what areas do they cover, what is tablib's feature vision? #136

tablib vs dataset vs pandas: what areas do they cover, what is tablib's feature vision? #136

tony commented Jan 17, 2014

kennethreitz commented Feb 7, 2016

tablib vs dataset vs pandas: what areas do they cover, what is tablib's feature vision? #136

tablib vs dataset vs pandas: what areas do they cover, what is tablib's feature vision? #136

Comments

tony commented Jan 17, 2014

kennethreitz commented Feb 7, 2016