Hi,
I just upgraded to 1.5.3rc5 and started seeing:
It's a bit tricky to reproduce, but I think a reliable way is to save an HTML doc to disk without any BOM or encoding info in the file. Load the file in JRuby in 1.9 mode so that the resulting string has an encoding of "ASCII-8BIT". Now just do
Nokogiri::HTML(File.open('junk_file').read)
and you should get the exception. Note that the test file I'm using does have newlines in it. I'm not quite sure what character is throwing it off. If I change the encoding to UTF-8, Nokogiri is able to construct the document just fine.
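For anyone trying to reproduce this, here is a rough sketch of the setup and of the UTF-8 workaround described above. It is only an illustration under a few assumptions: the helper names `read_binary` and `tag_as_utf8` are made up for this example, the sample HTML is invented, and the file is read in binary mode to guarantee an ASCII-8BIT string (the original report reads it with a plain `File.open`). The sketch does not call Nokogiri itself, so it shows only the encoding labels, not the exception.

```ruby
require 'tempfile'

# Hypothetical helper: read the raw bytes in binary mode, which yields a
# string tagged ASCII-8BIT (also known as BINARY).
def read_binary(path)
  File.open(path, 'rb') { |f| f.read }
end

# Hypothetical helper: return a copy of the same bytes tagged as UTF-8.
# force_encoding only changes the label; it does not transcode the bytes.
def tag_as_utf8(bytes)
  bytes.dup.force_encoding(Encoding::UTF_8)
end

# Invented sample: a small HTML file with newlines and no BOM or charset info.
sample = Tempfile.new(['junk_file', '.html'])
sample.binmode
sample.write("<html>\n<body>\n<p>caf\xC3\xA9</p>\n</body>\n</html>\n")
sample.close

raw  = read_binary(sample.path)  # the kind of string that reportedly fails
utf8 = tag_as_utf8(raw)          # the kind of string that reportedly works

puts raw.encoding
puts utf8.encoding
puts utf8.valid_encoding?

# With Nokogiri loaded, `raw` and `utf8` are the two inputs to compare:
#   Nokogiri::HTML(raw)
#   Nokogiri::HTML(utf8)
```

Re-tagging with `force_encoding` is only safe if the bytes really are valid UTF-8, which is why the sketch checks `valid_encoding?` before handing the string on.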