Skip to content

BUG in read_csv skipping a row after a row with trailing spaces #8983

Closed
@selasley

Description

@selasley

as reported by @xdliao in #8752, skipping a row after a row with trailing spaces fails to create the expected dataframe
data = """A B C \nD E F \nH I J \n1 2 3 \n4 5 6 \n"""
pd.read_csv(StringIO(data), skiprows=2, delim_whitespace=True) and
pd.read_csv(StringIO(data), skiprows=[0,1], delim_whitespace=True) work as expected, but the dataframes returned by
pd.read_csv(StringIO(data), skiprows=[0,2], delim_whitespace=True) and
pd.read_csv(StringIO(data), skiprows=[1,2], delim_whitespace=True) are incorrect

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions