You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I understand the origin of the issue, as you described. However, this won't add the wanted consistency, as some articles will be separated by two empty lines, and some other by three.
Of course, this would make the output parse able, so it can be a solution.
I am sure a simple test condition, or trimming, can render the output simpler to parse.
I understand the origin of the issue, as you described. However, this won't add the wanted consistency, as some articles will be separated by two empty lines, and some other by three.
That's what the .rstrip() is for, it will remove the final newline, if there is one, so that there will always be 2 in the final output.
It does works, however, beside being horribly slow; it has a major issue.
It should separate articles with two empty lines, which it does, sometimes.
\n\n
but NOT on all articles.
\n
This makes the text output impossible to parse on a large scale, since no consistency is taken, it can happen everywhere.
And I am sure peoples had spent a lot of time figuring it out, as I did, and have end-up to abandon their project completly.
:/
The text was updated successfully, but these errors were encountered: