Fix 2 issues #3

mikekap · 2025-11-02T00:12:12Z

Fixes #1
Fixes #2

Copilot

Pull Request Overview

This PR fixes two issues related to HTTP session handling and body size management in the pcap2har tool. The changes add support for HTTP/1.1 keep-alive connections and implement response body truncation for large payloads.

Key Changes:

Added HTTP/1.1 keep-alive connection tracking by introducing sequence counters to differentiate multiple requests on the same TCP stream
Implemented configurable maximum body size limit (10MB default) with truncation and warning messages for oversized responses
Added logic to skip response body checks for HTTP HEAD requests

Reviewed Changes

Copilot reviewed 6 out of 7 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
pcap2har/main.py	Added HTTP/1.1 session sequencing, max body size parameter throughout the pipeline, HEAD request handling, and body truncation logic
tests/test_main.py	Added test for body truncation functionality
tests/test_http.py	Added test for HTTP keep-alive connections
tests/goldens/test_websocket.py-test_websocket_segmented	Updated connection ID format from "1-1" to "1-1-0"
tests/goldens/test_websocket.py-test_websocket_parse	Updated connection ID format from "1-0" to "1-0-0"
tests/goldens/test_http.py-test_chunked_gzip	Updated connection ID format from "1-0" to "1-0-0"

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-11-02T00:13:03Z

pcap2har/main.py

+                )
+                if conv_details[current_session_id].request.url:
+                    # This is a new request on the same connection, increment sequence
+                    http1_sequence_counters[tcp_stream] += 1


The session ID is created before checking if it's a new request, but the counter is only incremented after the check. This creates a race condition where current_session_id uses the old counter value but should use the new one. The counter should be incremented before creating current_session_id, or the session ID should be recreated after incrementing.

Suggested change

http1_sequence_counters[tcp_stream] += 1

http1_sequence_counters[tcp_stream] += 1

current_session_id = (

"1",

tcp_stream,

http1_sequence_counters[tcp_stream],

)

Copilot · 2025-11-02T00:13:03Z

pcap2har/main.py

+                current_session_id = (
+                    "1",
+                    tcp_stream,
+                    http1_sequence_counters[tcp_stream],
+                )
+                if conv_details[current_session_id].request.url:
+                    # This is a new request on the same connection, increment sequence


When a request line is detected and the counter is incremented (line 364), full_stream_id on line 366 uses the incremented counter, but current_session_id still holds the old value. This means the conditional logic accesses one session while the actual processing uses a different session, leading to incorrect session assignment.

Suggested change

current_session_id = (

"1",

tcp_stream,

http1_sequence_counters[tcp_stream],

)

if conv_details[current_session_id].request.url:

# This is a new request on the same connection, increment sequence

# This is a new request on the same connection, increment sequence if needed

if conv_details[("1", tcp_stream, http1_sequence_counters[tcp_stream])].request.url:

Copilot · 2025-11-02T00:13:04Z

pcap2har/main.py

+        if conv.request.method != "HEAD":
+            content_length = conv.response.headers.get("content-length")
+            if content_length and int(content_length[0]) > 0 and not conv.response.body:
+                log_fn(f"{conv!s}: Missing response body")


[nitpick] The indentation change moves the response body check inside the HEAD method guard, but the request body check above (lines 296-298) is not similarly guarded. Consider documenting why HEAD responses specifically need this special handling, or extract this logic into a helper function for clarity.

Copilot

Pull Request Overview

Copilot reviewed 6 out of 7 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-11-02T00:17:05Z

pcap2har/main.py

+
+            full_stream_id = ("1", tcp_stream, http1_sequence_counters[tcp_stream])


The variable current_session_id is assigned but never used. This appears to be dead code that should either be removed or the logic should use current_session_id instead of full_stream_id on line 368 when processing request packets.

Suggested change

full_stream_id = ("1", tcp_stream, http1_sequence_counters[tcp_stream])

full_stream_id = current_session_id

else:

full_stream_id = ("1", tcp_stream, http1_sequence_counters[tcp_stream])

mikekap added 3 commits November 1, 2025 16:54

Add body truncation as an option.

e5c7097

Add bad test

d34353f

Implement HTTP keep-alive properly

08af76e

mikekap requested a review from Copilot November 2, 2025 00:12

Copilot AI reviewed Nov 2, 2025

View reviewed changes

A bit better

0f8210e

mikekap requested a review from Copilot November 2, 2025 00:16

Copilot AI reviewed Nov 2, 2025

View reviewed changes

ugh i can't read

3f3eac6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix 2 issues #3

Fix 2 issues #3

Uh oh!

mikekap commented Nov 2, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Nov 2, 2025

Uh oh!

Copilot AI Nov 2, 2025

Uh oh!

Copilot AI Nov 2, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Nov 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		full_stream_id = ("1", tcp_stream, http1_sequence_counters[tcp_stream])

Fix 2 issues #3

Are you sure you want to change the base?

Fix 2 issues #3

Uh oh!

Conversation

mikekap commented Nov 2, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants