This project implements an advanced text generation API using GPT-2 from Hugging Face's transformers
library. The model is served via a Flask API, allowing users to generate text from a provided prompt.
- Utilizes GPT-2 for text generation
- Supports advanced text generation parameters:
  - temperature (sampling temperature)
  - top_k (Top-K sampling)
  - top_p (Nucleus sampling)
  - repetition_penalty (to avoid repetition)
Ensure you have Python 3.7 or higher installed. Install the required dependencies before running the application.
- Clone the repository and navigate to the project directory:

      git clone https://github.com/Arshi81099/LLM-RAGs.git
      cd rags-text-generation-api

- Install the required dependencies:

      pip install transformers torch flask
Running the Application
To run the Flask API:

    python app.py
This will start the Flask server on http://127.0.0.1:5000.
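For orientation, here is a minimal sketch of what a server like `app.py` can look like. The actual `app.py` in the repository may differ; the lazy-loading helper `get_generator` and the error handling are illustrative assumptions, not the repository's exact code.

```python
from flask import Flask, request, jsonify

app = Flask(__name__)
_generator = None  # loaded lazily so the server starts quickly


def get_generator():
    """Load the GPT-2 pipeline on first use (the download is large)."""
    global _generator
    if _generator is None:
        from transformers import pipeline  # heavy import deferred
        _generator = pipeline("text-generation", model="gpt2")
    return _generator


@app.route("/generate", methods=["POST"])
def generate():
    data = request.get_json(force=True)
    prompt = data.get("prompt")
    if not prompt:
        return jsonify({"error": "prompt is required"}), 400
    out = get_generator()(
        prompt,
        max_length=int(data.get("max_length", 50)),
        temperature=float(data.get("temperature", 0.9)),
        top_k=int(data.get("top_k", 50)),
        top_p=float(data.get("top_p", 0.85)),
        repetition_penalty=float(data.get("repetition_penalty", 1.2)),
        do_sample=True,
    )
    return jsonify({"generated_text": out[0]["generated_text"]})
```

In the real `app.py`, a final `app.run(host="127.0.0.1", port=5000)` call under `if __name__ == "__main__":` starts the server.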
POST /generate - Generates text based on a given prompt.
Endpoint: /generate
Method: POST
Content-Type: application/json

Body Parameters:
- prompt (string, required): The initial text prompt to generate from.
- max_length (int, optional): Maximum length of the generated text. Default is 50.
- temperature (float, optional): Sampling temperature. Default is 0.9.
- top_k (int, optional): Number of top K tokens to consider. Default is 50.
- top_p (float, optional): Nucleus sampling threshold. Default is 0.85.
- repetition_penalty (float, optional): Penalty for repeating sequences. Default is 1.2.
curl -X POST http://127.0.0.1:5000/generate \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Once upon a time in a faraway land",
    "max_length": 100,
    "temperature": 0.9,
    "top_k": 50,
    "top_p": 0.85,
    "repetition_penalty": 1.2
  }'
{
  "generated_text": "Once upon a time in a faraway land, the world was filled with people who were not of any kind. They had no idea what they wanted to do or how much money it would cost them for their services and that there might be other ways out.\n\"I don't know if you're right,\" said Mr Taylor as he looked at his wife's face from behind her glasses. \"But I'm sure we'll find some way back home where our children will have more freedom than ever\""
}
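The same request can be made from Python. Below is a small client sketch, assuming the `requests` package is installed and the server is running locally; `DEFAULTS`, `build_payload`, and `generate` are illustrative helper names, not part of the API itself.

```python
import requests  # assumed installed: pip install requests

API_URL = "http://127.0.0.1:5000/generate"  # local server started via `python app.py`

# Defaults as documented above; overrides are merged on top of them.
DEFAULTS = {
    "max_length": 50,
    "temperature": 0.9,
    "top_k": 50,
    "top_p": 0.85,
    "repetition_penalty": 1.2,
}


def build_payload(prompt, **overrides):
    """Build the JSON body, rejecting parameters the API does not document."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    return {"prompt": prompt, **DEFAULTS, **overrides}


def generate(prompt, **overrides):
    """POST to /generate and return the generated text."""
    resp = requests.post(API_URL, json=build_payload(prompt, **overrides), timeout=60)
    resp.raise_for_status()
    return resp.json()["generated_text"]
```

For example, `generate("Once upon a time in a faraway land", max_length=100)` mirrors the curl call above.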
You can modify the behavior of text generation by adjusting the following parameters:
- max_length: Controls how long the generated text will be.
- temperature: Higher values (like 0.9) produce more diverse results, while lower values (like 0.7) make the output more deterministic.
- top_k: Restricts the token selection to the top k most probable next tokens.
- top_p: Uses nucleus sampling, keeping the smallest set of most-probable tokens whose cumulative probability reaches p.
- repetition_penalty: A value greater than 1.0 penalizes repetitive tokens in the output.
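To build intuition for how these knobs behave, here is a toy, pure-Python sketch of temperature scaling and Top-K / nucleus filtering over a small probability distribution. The real implementation inside transformers operates on logit tensors; `apply_temperature` and `filter_probs` are illustrative names for this sketch only.

```python
def apply_temperature(probs, temperature):
    """Rescale a probability distribution; equivalent to softmax(logits / T).

    T > 1 flattens the distribution (more diverse), T < 1 sharpens it.
    """
    scaled = [p ** (1.0 / temperature) for p in probs]
    total = sum(scaled)
    return [p / total for p in scaled]


def filter_probs(probs, top_k=0, top_p=1.0):
    """Zero out tokens excluded by Top-K and/or nucleus filtering, then renormalize.

    top_k=0 disables Top-K; top_p=1.0 disables nucleus filtering.
    """
    ranked = sorted(enumerate(probs), key=lambda kv: kv[1], reverse=True)
    keep, cumulative = set(), 0.0
    for rank, (i, p) in enumerate(ranked):
        if top_k and rank >= top_k:
            break  # Top-K: only the k most probable tokens survive
        if cumulative >= top_p:
            break  # nucleus: smallest set whose cumulative probability reaches p
        keep.add(i)
        cumulative += p
    filtered = [p if i in keep else 0.0 for i, p in enumerate(probs)]
    total = sum(filtered)
    return [p / total for p in filtered]
```

For instance, `filter_probs([0.5, 0.3, 0.1, 0.1], top_k=2)` keeps only the two most probable tokens and renormalizes them to `[0.625, 0.375, 0.0, 0.0]`, while `apply_temperature([0.8, 0.2], 2.0)` pulls the distribution toward uniform.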
