@Bojun-Feng Bojun-Feng commented Dec 18, 2023

Fix #775

  • Update Model Family JSON
  • Update README
  • Update Docs

I tested the GGUF model locally with llama.cpp, but I did not test the PyTorch ones because I do not have CUDA support.

I played around with OpenHermes 2.5 on my laptop and believe it is the best local 7B model we have so far. Here are some outputs from the Q2_K quantization (I'm sure other quantizations will perform even better), generated at temperature 0 for deterministic output, in case anyone is interested:

[Screenshots: four captures dated 2023-12-17, taken at 6:29:38 PM, 6:36:45 PM, 8:15:46 PM, and 10:23:32 PM]

@XprobeBot XprobeBot added this to the v0.7.3 milestone Dec 18, 2023
@aresnow1 aresnow1 merged commit 5a77ff9 into xorbitsai:main Dec 18, 2023

Successfully merging this pull request may close these issues.

FEAT: support openhermes 2.5