Fix Ollama client streaming issue with stream=True #439
Conversation
Resolves issue SylphAI-Inc#299, where OllamaClient failed with a 'generator' object has no attribute 'raw_response' error when using stream=True.

Changes:
- Modified OllamaClient.parse_chat_completion to return raw generators directly for streaming
- Updated Generator error handling to prevent generator objects in the raw_response field
- Added proper type checking for both sync and async generators
- Updated tests to reflect correct streaming behavior

The fix ensures that streaming generators are handled correctly by the Generator component rather than being incorrectly wrapped in GeneratorOutput.raw_response.
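As a simplified sketch, this first approach amounted to roughly the following (not the literal diff; the GeneratorOutput import path and the non-streaming branch are assumptions, and this shape is revised later in the PR):

from types import AsyncGeneratorType, GeneratorType

from adalflow.core.types import GeneratorOutput

def parse_chat_completion(self, completion):  # method of OllamaClient (sketch)
    # First approach: hand streaming generators back untouched so the caller
    # can iterate over them directly.
    if isinstance(completion, (GeneratorType, AsyncGeneratorType)):
        return completion
    # Non-streaming: wrap the final text in GeneratorOutput as before.
    return GeneratorOutput(data=None, raw_response=completion["message"]["content"])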
adalflow/adalflow/core/generator.py
Outdated
        output = self._post_call(completion)
    except Exception as e:
        log.error(f"Error processing the output: {e}")
        # Check if completion is a generator to avoid placing generator object in raw_response
We almost never change the Generator output: raw_response is for streaming, data is for the final parsed result.
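Concretely, the contract looks roughly like this (illustrative values; assumes GeneratorOutput accepts an iterator in raw_response at runtime, which is what the streaming path here relies on):

from adalflow.core.types import GeneratorOutput

def chunks():
    yield from ["The sky ", "is blue ", "because..."]

# Streaming: the live iterator goes in raw_response; data stays None
# until the stream has been consumed and parsed.
streaming_output = GeneratorOutput(raw_response=chunks(), data=None)

# Finalized: data holds the final parsed result.
final_output = GeneratorOutput(raw_response="The sky is blue because...",
                               data="The sky is blue because...")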
Check the Anthropic client for how to handle streaming. Eventually the standard is to convert to OpenAI's Responses API standard.
try this code:

from adalflow.components.model_client.ollama_client import OllamaClient
from adalflow.core import Generator

stream_generator = Generator(
    model_client=OllamaClient(host="http://localhost:11434"),
    model_kwargs={
        "model": "gpt-oss:20b",
        "stream": True,  # Enable streaming
    },
)

async def test_ollama_streaming():
    # async call with streaming
    output = await stream_generator.acall(prompt_kwargs={"input_str": "Why is the sky blue?"})
    async for chunk in output.raw_response:
        print(chunk["message"]["content"], end="", flush=True)

if __name__ == "__main__":
    import asyncio
    asyncio.run(test_ollama_streaming())
It works for streaming.
Previous implementation broke interface consistency and created architectural problems. Corrected approach:
- OllamaClient consistently returns GeneratorOutput for all cases
- raw_response contains the streaming generator (following the Anthropic client pattern)
- data remains None until the final output is processed
- Removed incorrect type checking from the Generator core component
- Maintains polymorphism across all model clients

This follows the established contract:
- raw_response = streaming chunks/iterator
- data = finalized complete output (processed later)

Fixes maintain full compatibility with the Generator component and preserve all existing functionality (processors, tracking, caching). All tests pass, and integration with the Generator component is verified.
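A sketch of the corrected shape (simplified; the real method also does the full non-streaming parse):

from types import AsyncGeneratorType, GeneratorType

from adalflow.core.types import GeneratorOutput

def parse_chat_completion(self, completion):  # method of OllamaClient (sketch)
    # Always return GeneratorOutput. For streams, the generator itself is
    # stored in raw_response and data stays None until finalized.
    if isinstance(completion, (GeneratorType, AsyncGeneratorType)):
        return GeneratorOutput(data=None, raw_response=completion)
    return GeneratorOutput(data=None, raw_response=completion["message"]["content"])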
@liyin2015 I've updated the implementation based on your feedback. You mentioned that raw_response is for streaming and data is for the final parsed result; the OllamaClient now follows that contract.
check the comments
    except Exception as e:
        log.error(f"Error processing the output processors: {e}")
        output.error = str(e)
        # Check if this is a streaming response (generator/iterator)
This PR does not do much, and we can't force data to None either. data is supposed to be the final complete output, which should be handled in ollama_client: you have to collect the whole stream and save the complete result in this field. You can see an example in https://github.com/SylphAI-Inc/AdalFlow/blob/main/adalflow/adalflow/components/model_client/anthropic_client.py
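A minimal sketch of that pattern (hypothetical helper name; see the linked anthropic_client.py for the real implementation): the client wraps the stream so chunks are still yielded to the caller, and once the stream is exhausted the accumulated text is saved as the complete output.

async def wrap_stream(stream, output):
    # Yield chunks through to the caller while accumulating the full text;
    # when the stream is exhausted, save the complete result in output.data.
    parts = []
    async for chunk in stream:
        parts.append(chunk["message"]["content"])
        yield chunk
    output.data = "".join(parts)

# Usage sketch: output.raw_response = wrap_stream(raw_stream, output)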
Keep the Generator component unchanged.
Summary
Fixes issue #299, where the Ollama client fails with a 'generator' object has no attribute 'raw_response' error when using stream=True.

Correct Implementation
- OllamaClient returns GeneratorOutput for both streaming and non-streaming
- raw_response contains the streaming generator (following the established contract)
- data remains None until the final output is processed by the Generator component

Previous Approach (Fixed)
The initial implementation incorrectly returned raw generators directly, breaking interface consistency and polymorphism across model clients.
Technical Changes
1. OllamaClient (adalflow/components/model_client/ollama_client.py)
2. Generator Core (adalflow/core/generator.py)
3. Test Updates (tests/test_ollama_client.py)

Updated to follow the proper streaming contract:
- parsed.raw_response → streaming iterator
- parsed.data → None (until consumed)
- GeneratorOutput consistency

Test Results
# All tests passing
pytest tests/test_ollama_client.py -v
======================== 10 passed ========================
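For reference, a sketch of the kind of assertion the updated tests make (test name and mock stream are assumptions; parse_chat_completion needs no live server):

from adalflow.components.model_client.ollama_client import OllamaClient
from adalflow.core.types import GeneratorOutput

def test_streaming_returns_generator_in_raw_response():
    # A fake sync stream of Ollama-style chunks.
    def fake_stream():
        yield {"message": {"content": "hello"}}

    client = OllamaClient(host="http://localhost:11434")
    parsed = client.parse_chat_completion(fake_stream())

    assert isinstance(parsed, GeneratorOutput)
    assert parsed.data is None                       # not finalized yet
    assert hasattr(parsed.raw_response, "__iter__")  # streaming iterator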
#299