GzipSource beginnings. #510

swankjesse · 2014-02-03T06:25:29Z

Needs tests in a follow up. In particular for the new
methods on OkBuffer (peekByte, skip, indexOf) and for
all of the weird header cases on gzip data (headers
that include names, comments, extra fields, header
CRCs), plus failing cases for header CRC, CRC and
ISIZE.

codefromthecrypt · 2014-02-03T06:43:40Z

OkHttp seems to be a place where binary formats can hang out in new clothes

codefromthecrypt · 2014-02-03T06:50:04Z

okhttp-protocols/src/main/java/com/squareup/okhttp/internal/bytes/GzipSource.java

+  }
+
+  /** Fills the buffer with at least {@code byteCount} bytes. */
+  private void require(int byteCount, Deadline deadline) throws IOException {


nice. I think I've wanted this elsewhere.

yeah. One catch with making it general is the policy on over-reading. This class will read more data than it needs to; in some situations that could be bad. (Because there's no mechanism to push back bytes you didn't need.)

codefromthecrypt · 2014-02-03T06:53:33Z

only editorial comments beyond the ones you've made. I'm heads down until my talk tomorrow, else I'd finish this for us!

codefromthecrypt · 2014-02-07T01:31:15Z

@swankjesse squash the commit I added if you like. should be good to go

codefromthecrypt · 2014-02-07T13:55:50Z

one sec. forgot the header crc test.

codefromthecrypt · 2014-02-07T14:12:25Z

ok all tests are backfilled. One thing interesting is that it seems a lot of gzip implementations in the wild use an old version of gzip which thinks the HCRC flag is continuation. Very few implement either. For example, if you compress with the HCRC and read it in vanilla osx gzip, it thinks the thing is multipart!!

echo H4sIAgAAAAAAAB0m8yxRL1ZIVAj184xQKK4sLknNVVTwVMjOyy9XKMnILFYEAI2PrTcgAAAA|base64 --decode|gunzip -l -v -gzip: stdin is a a multi-part gzip file -- not supported

swankjesse · 2014-02-07T16:04:09Z

okhttp-protocols/src/test/java/com/squareup/okhttp/internal/bytes/GzipSourceTest.java

+  /**
+   * Allows us to customize a gzip impl someone else wrote so that we can test our implementation.
+   */
+  private static class GZIPOutputStream extends java.util.zip.GZIPOutputStream {


I think I'd prefer to just create some golden base64/hex gzip files than generate them on the fly. With this we're really hacking GzipOutputStream in a way that isn't too natural.

(You should use this mechanism to create those files)

swankjesse · 2014-02-07T16:06:26Z

LGTM

swankjesse · 2014-02-07T16:09:09Z

okhttp-protocols/src/test/java/com/squareup/okhttp/internal/bytes/GzipSourceTest.java

+      gunzip(gzipped);
+      fail();
+    } catch (IOException e) {
+      assertEquals("FHCRC: actual 0x00261d != expected 0x000000", e.getMessage());


Interesting. I was expecting the string format to print 8 hex chars plus 0x, but it included the 0x in the 8 character width.

codefromthecrypt · 2014-02-08T01:17:46Z

okie rewrote the test to use constants and squashed

codefromthecrypt · 2014-02-08T01:18:57Z

okhttp-protocols/src/main/java/com/squareup/okhttp/internal/bytes/OkBuffer.java

@@ -195,7 +241,7 @@ void write(byte[] data, int offset, int byteCount) {
      tail.limit += toCopy;
    }

-    this.byteCount += data.length;
+    this.byteCount += byteCount;


this is a whoops :)

GzipSource beginnings.

codefromthecrypt reviewed Feb 3, 2014
View reviewed changes

swankjesse mentioned this pull request Feb 4, 2014

Bug with OkHttp UnknownLengthHttpInputStream reading zlib streams #507

Closed

swankjesse reviewed Feb 7, 2014
View reviewed changes

GzipSource beginnings.

2c6f99d

codefromthecrypt reviewed Feb 8, 2014
View reviewed changes

swankjesse added a commit that referenced this pull request Feb 8, 2014

Merge pull request #510 from square/jwilson_0203_gzip_beginnings

1b25214

GzipSource beginnings.

swankjesse merged commit 1b25214 into master Feb 8, 2014

swankjesse deleted the jwilson_0203_gzip_beginnings branch February 14, 2014 22:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GzipSource beginnings. #510

GzipSource beginnings. #510

swankjesse commented Feb 3, 2014

codefromthecrypt commented Feb 3, 2014

codefromthecrypt Feb 3, 2014

swankjesse Feb 4, 2014

codefromthecrypt commented Feb 3, 2014

codefromthecrypt commented Feb 7, 2014

codefromthecrypt commented Feb 7, 2014

codefromthecrypt commented Feb 7, 2014

swankjesse Feb 7, 2014

swankjesse Feb 7, 2014

swankjesse commented Feb 7, 2014

swankjesse Feb 7, 2014

codefromthecrypt commented Feb 8, 2014

codefromthecrypt Feb 8, 2014

GzipSource beginnings. #510

GzipSource beginnings. #510

Conversation

swankjesse commented Feb 3, 2014

codefromthecrypt commented Feb 3, 2014

codefromthecrypt Feb 3, 2014

Choose a reason for hiding this comment

swankjesse Feb 4, 2014

Choose a reason for hiding this comment

codefromthecrypt commented Feb 3, 2014

codefromthecrypt commented Feb 7, 2014

codefromthecrypt commented Feb 7, 2014

codefromthecrypt commented Feb 7, 2014

swankjesse Feb 7, 2014

Choose a reason for hiding this comment

swankjesse Feb 7, 2014

Choose a reason for hiding this comment

swankjesse commented Feb 7, 2014

swankjesse Feb 7, 2014

Choose a reason for hiding this comment

codefromthecrypt commented Feb 8, 2014

codefromthecrypt Feb 8, 2014

Choose a reason for hiding this comment