- 
                Notifications
    You must be signed in to change notification settings 
- Fork 1.2k
Open
Labels
Description
-  Use llama_decodeinstead of deprecatedllama_evalinLlamaclass
-  Implement batched inference support for generateandcreate_completionmethods inLlamaclass
- Add support for streaming / infinite completion
giangluu352001, harry-pham-wise, JackKCWong, bb-worm, ChristianWeyer and 45 moresengiv, ArtyomZemlyak, hamishc, bioshazard, gerred and 16 moreesmeetu, robertritz, zhengzhanpeng, hamishc, ngupta10 and 12 more