Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
app		app
data		data
htcondor		htcondor
results		results
src/selective_context		src/selective_context
.gitignore		.gitignore
context_manager.py		context_manager.py
main.py		main.py
pyproject.toml		pyproject.toml
qa_manager.py		qa_manager.py
readme.md		readme.md
requirements.txt		requirements.txt
selective_context.py		selective_context.py
test.py		test.py
utils.py		utils.py

Repository files navigation

Selective Context for LLMs

Selective Context compresses your prompt and context to allows LLMs (such as ChatGPT) to process 2x more content. It is especially useful in dealing with long documents and maintaining long conversations without compromising their performance on various tasks!

This repository contains the code and data for the paper: Unlocking Context Constraints of LLMs: Enhancing Context Efficiency of LLMs with Self-Information-Based Content Filtering.

Updates!!

Try our demo on Huggingface Space.

Key Features

Efficient Context Management: Selective Context maximizes the utility of fixed context length in LLMs, allowing them to process long documents and extended conversations more efficiently.
Informativeness Evaluation: Our method employs a base language model to compute self-information for lexical units (sentences, phrases, or tokens) in a context and use it to evaluate their informativeness.
Extensive Evaluation: We provide extensive evaluations of Selective Context on three data sources (arxiv papers, BBC news articles, and conversation transcripts) and four different NLP tasks (summarization, question answering, original context reconstruction, and conversation).

Getting Started

To get started, follow these steps:

Install selective-context via Pypi:
```
pip install selective-context
```

Import SelectiveContext:

from selective_context import SelectiveContext

Compress your prompt and context:

sc = SelectiveContext(model_type='gpt2', lang='en')
context, reduced_content = sc(text)

You can also adjust the reduce ratio:

context, reduced_content = sc(text, reduce_ratio = 0.5)

If you prefer to try with web interface, try our streamlit app:
```
streamlit run app/app.py
```
Or directly visit our Space on Hugging Face Hub.

Code Structure

selective_context.py: A demo for performing context reduction using Selective Context.
context_manager.py: The main module for managing context and implementing the Selective Context algorithm.
main.py: The main script for running experiments and evaluating the effectiveness of Selective Context.
qa_manager.py: A helper module for managing question answering tasks during the experiments.

Experiments

To reproduce the experiments from the paper, run the following command:

python main.py

This will run the experiments on arxiv papers, BBC news articles, and conversation transcripts with four different NLP tasks: summarization, question answering, original context reconstruction, and conversation.

Dataset in the paper

The dataset used in the paper can be found at:

Arxiv: HF Hub
BBC News: HF Hub
ShareGPT.com: HF Hub

The datasets are created by ourselves so if you need citation just use the citation of this tool.

Citation

If you find this repository helpful or use our method in your research, please consider citing our paper:

@misc{li2023unlocking,
      title={Unlocking Context Constraints of LLMs: Enhancing Context Efficiency of LLMs with Self-Information-Based Content Filtering}, 
      author={Yucheng Li},
      year={2023},
      eprint={2304.12102},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

License

This project is licensed under the MIT License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Selective Context for LLMs

Updates!!

Key Features

Getting Started

Code Structure

Experiments

Dataset in the paper

Citation

License

About

Releases 2

Packages

Contributors 2

Languages

liyucheng09/Selective_Context

Folders and files

Latest commit

History

Repository files navigation

Selective Context for LLMs

Updates!!

Key Features

Getting Started

Code Structure

Experiments

Dataset in the paper

Citation

License

About

Topics

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Packages