Release/2.6.0 #1184
Merged
…curity-issues chore: update certifi, idna, zipp versions and add extras in poetry.lock
… bias detection logic
…response structure
…ove, and false questions - idea implemented
…ate CHAT_MODEL_CLASSES structure
…and improve output schema integration
… 'ollama' hub with JSON schema method
…mentation-due-to-openai fix(bug): update model handling in OpenAI and AzureOpenAI configurations
…ntation for Ollama API
…-deepseek Feature/add integration to deepseek
…-tests-for-robust-model-evaluation Feature/implement med halt tests for robust model evaluation
…tion-supports-the-ollama-provider feat: add support for generating templates using Ollama provider
…ting-bug-fixes-in-260-rc-version fixes: resolving the bugs 2_6_0rc versions
…ting-bug-fixes-in-260-rc-version fix: better handling of extra model params in Harness
Prikshit7766 approved these changes on Mar 10, 2025
chore: update version to 2.6.0
📢 Highlights
We are excited to introduce the latest LangTest release, which brings a set of improvements designed to streamline model evaluation and improve overall performance:
🛠 De-biasing Data Augmentation:
We’ve integrated de-biasing techniques into our data augmentation process, ensuring more equitable and representative model assessments.
🔄 Evaluation with Structured Outputs:
LangTest now supports structured output APIs for both OpenAI and Ollama, offering greater flexibility and precision when processing model responses.
🏥 Confidence Testing with Med Halt Tests:
Med Halt tests are now available for confidence evaluation, giving more robust insight into your LLMs’ reliability under diverse conditions.
📖 Expanded Task Support for JSL LLM Models:
QA and Summarization tasks are now fully supported for JSL LLM models, enhancing their capabilities for real-world applications.
🔒 Security Enhancements:
Critical vulnerabilities and security issues have been addressed, reinforcing LangTest's overall stability and safety.
🐛 Resolved Bugs:
We’ve fixed issues with templatic augmentation to ensure consistent, accurate, and reliable outputs across your workflows.
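To illustrate why the structured-output support mentioned above matters for evaluation, here is a minimal sketch of parsing a model's JSON response into a typed object before scoring it. It does not use LangTest's API. The schema, the `Answer` class, the `parse_structured_answer` helper, and the sample response are all hypothetical and exist only for this example.

```python
import json
from dataclasses import dataclass

# Hypothetical schema for a QA-style answer; an assumption, not part of LangTest.
ANSWER_SCHEMA = {
    "type": "object",
    "properties": {
        "answer": {"type": "string"},
        "confidence": {"type": "number"},
    },
    "required": ["answer", "confidence"],
}


@dataclass
class Answer:
    answer: str
    confidence: float


def parse_structured_answer(raw: str) -> Answer:
    """Parse a JSON response and check the fields that ANSWER_SCHEMA requires."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    missing = [key for key in ANSWER_SCHEMA["required"] if key not in data]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    if not isinstance(data["answer"], str):
        raise ValueError("'answer' must be a string")
    confidence = data["confidence"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(confidence, bool) or not isinstance(confidence, (int, float)):
        raise ValueError("'confidence' must be a number")
    return Answer(answer=data["answer"], confidence=float(confidence))


# Stand-in for a response returned by a structured-output API.
raw_response = '{"answer": "Paris", "confidence": 0.92}'
parsed = parse_structured_answer(raw_response)
print(parsed)
```

Because the response arrives in a known shape, the evaluation step can compare `parsed.answer` against the expected value directly instead of extracting it from free text.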