Skip to content

Please make the parser more robust #724

@cskyan

Description

@cskyan

I encountered some ParseError exceptions when I tried to parse the ntriples files. Some of them are quite easy to be fixed during the runtime such as empty lines, codec issues, etc. I hope that the parser could pre-process the files and deal with these problems or ignore the invalid records. At least, we need to know which lines in our data file have problems. Because we cannot make sure that the downloaded files strictly follow the standard format. If the package just raises the exception without correcting it, it will take more time to parse the whole file. Maybe the impact of neglected data could be accepted when we are processing a large data set.
In my case, I directly modify this line of code. I insert a continue code here to let the program proceed. Otherwise, I cannot get the remaining data when I encounter a ParseError. I know that it is not a good way to skip this exception but it is the fastest way to continue my project. Hope that this suggestion would be accepted.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions