-
-
Notifications
You must be signed in to change notification settings - Fork 19.3k
Closed
Milestone
Description
In [1]: DataFrame(data,columns=["a","a"])
...
pandas/pandas/core/internals.pyc in _stack_dict(dct, ref_items, dtype)
1344 stacked = np.empty(shape, dtype=dtype)
1345 for i, item in enumerate(items):
-> 1346 stacked[i] = _asarray_compat(dct[item])
1347
1348 # stacked = np.vstack([_asarray_compat(dct[k]) for k in items])
IndexError: index out of bounds
5e6db32 is a failing test for this.
it looks like _to_sdict threads down to a call to _convert_object_array which builds a dict
keyed on column names, so dupe columns get squashed and you end up with a mismatch
between the length of the columns arg to df.__init__ and the data.
_to_sdict is not used for ndarrays so this doesn't haoppen, I was able to reuse
_init_ndarray for the case of columns being a flat list and have things work as expected.
still, too much code touching this, better left to the core devs to decide how to handle this.
Metadata
Metadata
Assignees
Labels
No labels