Release/2.6.0 #1184

chakravarthik27 · 2025-03-09T14:21:57Z

📢 Highlights

We are excited to introduce the latest langtest release, bringing you a suite of improvements designed to streamline model evaluation and enhance overall performance:

🛠 De-biasing Data Augmentation:
We’ve integrated de-biasing techniques into our data augmentation process, ensuring more equitable and representative model assessments.
🔄 Evaluation with Structured Outputs:
LangTest now supports structured output APIs for both OpenAI and Ollama, offering greater flexibility and precision when processing model responses.
🏥 Confidence Testing with Med Halt Tests:
Introducing med halt tests for confidence evaluation, enabling more robust insights into your LLMs’ reliability under diverse conditions.
📖 Expanded Task Support for JSL LLM Models:
QA and Summarization tasks are now fully supported for JSL LLM models, enhancing their capabilities for real-world applications.
🔒Security Enhancements:
Critical vulnerabilities and security issues have been addressed, reinforcing the LangTest overall stability and safety.
🐛 Resolved Bugs:
We’ve fixed issues with templatic augmentation to ensure consistent, accurate, and reliable outputs across your workflows.

…curity-issues chore: update certifi, idna, zipp versions and add extras in poetry.lock

… bias detection logic

… models

…method

…scriptions

… class

… prompts

…response structure

…ove, and false questions - idea implemented

…t column names

…o structure

…upport

…essage handling

…ate CHAT_MODEL_CLASSES structure

…and improve output schema integration

… 'ollama' hub with JSON schema method

…ask instructions

…thods

…ndling

…mentation-due-to-openai fix(bug): update model handling in OpenAI and AzureOpenAI configurations

… model name

…ntation for Ollama API

…-deepseek Feature/add integration to deepseek

…-tests-for-robust-model-evaluation Feature/implement med halt tests for robust model evaluation

…tion-supports-the-ollama-provider feat: add support for generating templates using Ollama provider

…meter merging

…tion

…patibility.

…ting-bug-fixes-in-260-rc-version fixes: resolving the bugs 2_6_0rc versions

…nce level

…ting-bug-fixes-in-260-rc-version fix: better handling of extra model params in Harness

…h new notebooks

chore: update version to 2.6.0

chakravarthik27 added 30 commits January 9, 2025 20:07

chore: update certifi, idna, zipp versions and add extras in poetry.lock

77945d2

Merge pull request #1162 from JohnSnowLabs/fix/vulnerabilities-and-se…

c4c831f

…curity-issues chore: update certifi, idna, zipp versions and add extras in poetry.lock

feat: add debiasing functionality with initial approach

c79bb3e

refactor: improve code formatting and readability in debias.py

11fa891

feat: enhance debiasing functionality with improved data handling and…

ad05cc1

… bias detection logic

feat: enhance DebiasTextProcessing with support for OpenAI and Ollama…

7628f87

… models

feat: add Ollama package support in poetry.lock and pyproject.toml

0a67de4

refactor: remove commented-out OpenAI client code in interaction_llm …

02ab09d

…method

feat: add ollama-sdk support in poetry.lock

dfb4440

refactor: rename bias detection classes and update field titles to de…

cd32ec8

…scriptions

fix: linting issues

9de2947

refactor: improve formatting of system prompt in DebiasTextProcessing…

a0294a9

… class

feat: enhance bias detection response structure and improve debiasing…

13b2615

… prompts

feat: add standard bias evaluation prompt and improve bias detection …

46b3714

…response structure

feat: add new robustness classes for false confidence, none of the ab…

d93e3e6

…ove, and false questions - idea implemented

fix: correct typos in bias evaluation prompt and update output datase…

47d5bf7

…t column names

fix: rename "original text" to "biased_text" in debias_info DataFrame

c9dbc5e

feat: add risk level to bias detection response and update debias_inf…

9a50917

…o structure

fix: rename "row" to "row_id" in debias_info DataFrame

259ae69

feat: enhance model handling with additional info and output schema s…

7bd46d4

…upport

feat: add output schema support to model initialization and improve m…

a011ba0

…essage handling

feat: enhance QASample result validation to support Custom Output Schema

6b76e75

feat: enhance model handler to support dynamic module imports and upd…

d82daca

…ate CHAT_MODEL_CLASSES structure

feat: improve error handling for module imports in PretrainedModelForQA

e4f8d7e

feat: refactor model handling to use unified MODEL_CLASSES structure …

dcec433

…and improve output schema integration

feat: extend output schema support in PretrainedModelForQA to include…

8ebaf6a

… 'ollama' hub with JSON schema method

feat: enhance bias evaluation prompt with structured categories and t…

51dc1bd

…ask instructions

feat: add FCT class for clinical tests with transformation and run me…

b313395

…thods

NOTA test is implemented in clincial category.

f94da73

refactor: update FCT and NOTA transform methods to improve options ha…

add3d53

…ndling

chakravarthik27 added 24 commits February 21, 2025 20:16

fix: enhance model type handling in QA and TextGen processing

0cac1c5

refactor: update model handling in OpenAI and AzureOpenAI configurations

54bc5b9

Merge pull request #1178 from JohnSnowLabs/fix/error-in-templatic-aug…

77129e4

…mentation-due-to-openai fix(bug): update model handling in OpenAI and AzureOpenAI configurations

feat: add support for generating templates using Ollama provider

82a8b6e

fix: improve error handling in template generation and update default…

c6c3604

… model name

fix: enhance error messaging in template generation and update docume…

bdba91e

…ntation for Ollama API

Merge pull request #1176 from JohnSnowLabs/feature/add-integration-to…

36472dd

…-deepseek Feature/add integration to deepseek

Merge pull request #1170 from JohnSnowLabs/feature/implement-med-halt…

9365ace

…-tests-for-robust-model-evaluation Feature/implement med halt tests for robust model evaluation

Merge pull request #1180 from JohnSnowLabs/feature/templatic-augmenta…

4be9c0e

…tion-supports-the-ollama-provider feat: add support for generating templates using Ollama provider

fix: handle potential None value in additional_info during model para…

6d77f1e

…meter merging

fix: return None for unsupported model types in text generation check

e8a036d

fix: correctly assign model_type and annotator in QA model initializa…

e1bfb01

…tion

Default model_type for OpenAI and Azure-OpenAI to ensure backward com…

690d270

…patibility.

fix: update conditional check for model_type in PretrainedModelForQA

105150e

Merge pull request #1182 from JohnSnowLabs/fix/issues-found-while-tes…

2fb3969

…ting-bug-fixes-in-260-rc-version fixes: resolving the bugs 2_6_0rc versions

fix: improve handling of additional model parameters in Harness class

9109db2

fix: add handling for additional model information in Harness class

be442eb

Notebook: evaluation with structured outputs

d39e8bf

feat: add enhance_text method for debiasing text based on bias tolera…

2b8dfb0

…nce level

fix: format enhance_text method for improved readability

5843350

fix: update langchain-openai to 0.3.7 and update the fqt and nota tests.

4aeaf7c

Notebook: Added for Med Halt Tests

8c369bd

Notebook: JSL Medical LLM QA and Sum

6f3ea4c

Merge pull request #1183 from JohnSnowLabs/fix/issues-found-while-tes…

817b917

…ting-bug-fixes-in-260-rc-version fix: better handling of extra model params in Harness

chakravarthik27 self-assigned this Mar 9, 2025

chakravarthik27 requested a review from Prikshit7766 March 10, 2025 03:51

chore: update version to 2.6.0 and enhance tutorial documentation wit…

41e9dc7

…h new notebooks

Prikshit7766 approved these changes Mar 10, 2025

View reviewed changes

Merge pull request #1185 from JohnSnowLabs/chore/final_website_updates

05e51d6

chore: update version to 2.6.0

chakravarthik27 merged commit cbfdc33 into main Mar 10, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release/2.6.0 #1184

Release/2.6.0 #1184

chakravarthik27 commented Mar 9, 2025 •

edited

Loading

Release/2.6.0 #1184

Release/2.6.0 #1184

Conversation

chakravarthik27 commented Mar 9, 2025 • edited Loading

📢 Highlights

chakravarthik27 commented Mar 9, 2025 •

edited

Loading