You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It's not a duplicate effort, but I find the interoperability with databases to be a great value add and related.
Actually, from my pov, I feel a bit of redundancy, (I hope I convey this correctly) I have a need to import tabular into a table-friendly data into python object ( DataSet (tablib) and table (dataset). tablib can import from a wide variety of places, dataset maintains DB back-end.
I could possibly glue both together somehow. I'm not sure it would look elegant or / come a performance cost.
Is there a chance to potentially kill two birds with one stone and see if collaboration / cooperation is possible?
I think Pandas is a little too heavy to bring into an application I'd like to keep lite. It has a dependency of numpy. However, it covers a lot turf, I also see some interesting activity for Pandas DataFrame's and sql at ENH: sql support pandas-dev/pandas#4163.
This question is for @kennethreitz and everybody. I'm interested speed ups for python containers and data structures. Does cython or python C API stand any chance of offering a speed up if Dataset, Databook, and / or any parts of tablib were written in C? Is there anything in tablib that stands to benefit from C optimization?
Overall
What areas does tablib cover? What does it not? To be honest, I would really like a go-to for relational and tabular data without the overhead of pandas.
The text was updated successfully, but these errors were encountered:
Pandas is an incredibly powerful data science tool. Tablib is intended for small dataset creation and exporting into friendly formats.
No interest or need for speedups. This isn't a scientific library, and I want it to be as simple to install as possible. If you want things to run faster, I suggest you use PyPy as your runtime.
@kennethreitz: Greetings, I saw https://github.com/kennethreitz/tablib/issues/124, I didn't want to burden with another issue, but I have a few questions about tablib to articulate:
Where does tablib stand against https://github.com/pudo/dataset?
It's not a duplicate effort, but I find the interoperability with databases to be a great value add and related.
Actually, from my pov, I feel a bit of redundancy, (I hope I convey this correctly) I have a need to import tabular into a table-friendly data into python object (
DataSet
(tablib) andtable
(dataset). tablib can import from a wide variety of places, dataset maintains DB back-end.I could possibly glue both together somehow. I'm not sure it would look elegant or / come a performance cost.
Is there a chance to potentially kill two birds with one stone and see if collaboration / cooperation is possible?
Where does tablib stand against https://github.com/pydata/pandas and its
DataFrame
's?I think Pandas is a little too heavy to bring into an application I'd like to keep lite. It has a dependency of
numpy
. However, it covers a lot turf, I also see some interesting activity for Pandas DataFrame's and sql at ENH: sql support pandas-dev/pandas#4163.This question is for @kennethreitz and everybody. I'm interested speed ups for python containers and data structures. Does cython or python C API stand any chance of offering a speed up if
Dataset
,Databook
, and / or any parts of tablib were written in C? Is there anything in tablib that stands to benefit from C optimization?Overall
What areas does tablib cover? What does it not? To be honest, I would really like a go-to for relational and tabular data without the overhead of pandas.
The text was updated successfully, but these errors were encountered: