
A basic pipeline for a CV management system using a vector database. A refined follow-up project with improved accuracy will be released soon.

chisphung/CV_Extractor_Langchain


Langchain Services

1. Setup

1.1. Download data

This implementation of the Langchain services is based on the AIO Project Langchain Services.

This repository uses data from the Curriculum Vitae (CV) dataset. You can download the full dataset and other necessary files using the following command:

bash data_source/generative_ai/download.sh

Note that the full dataset contains over 3,000 CVs, which may lead to incorrect parsing results. The repository also provides a subset of about 50 CVs for testing purposes.

1.2. Run the service locally

Python version: 3.11.9

python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt
# Start the server
uvicorn src.app:app --host "0.0.0.0" --port 5000 --reload

This will ask for the Google API key, which you can get from the Google Cloud Console. After providing the key, the server will start and you can access it at http://localhost:5000/docs.
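
To confirm the server is running, you can hit the docs page from Python (a minimal sketch using the requests library; it only assumes the URL shown above):

import requests  # pip install requests

# The interactive docs page served by FastAPI; a 200 response means the server is up.
resp = requests.get("http://localhost:5000/docs", timeout=5)
print(resp.status_code)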

1.3. Run the service in Docker

docker compose up -d

Stop the service:

docker compose down

2. Architecture

The service ingests CV PDFs from local files or Google Drive links, extracts information with an LLM, and stores embeddings in a FAISS vector store. The workflow is:

ingestion -> extraction -> storage -> search.

Modules

  • src/rag/file_loader.py – download/load and split documents.
  • src/rag/cv_extractor.py – prompt chain for CV parsing.
  • src/rag/vectorstore.py – persistent FAISS store with metadata support.
  • src/app.py – FastAPI server exposing upload and search endpoints.
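
A minimal sketch of the ingestion -> storage -> search flow, assuming standard LangChain components (PyPDFLoader, RecursiveCharacterTextSplitter, Google Generative AI embeddings, FAISS); the modules listed above may differ in their exact interfaces, and the LLM extraction step in src/rag/cv_extractor.py is omitted here:

# Requires: langchain-community, langchain-text-splitters, langchain-google-genai, faiss-cpu
from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_google_genai import GoogleGenerativeAIEmbeddings
from langchain_community.vectorstores import FAISS

# Ingestion: load a CV PDF and split it into chunks.
docs = PyPDFLoader("resume.pdf").load()
chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100).split_documents(docs)

# Storage: embed the chunks and persist a FAISS index (GOOGLE_API_KEY must be set in the environment).
embeddings = GoogleGenerativeAIEmbeddings(model="models/embedding-001")
store = FAISS.from_documents(chunks, embeddings)
store.save_local("faiss_index")

# Search: retrieve the CV chunks most similar to a query.
for doc in store.similarity_search("python developer", k=3):
    print(doc.metadata.get("source"), doc.page_content[:80])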

API Usage

Upload a CV from a local file:

curl -X POST -F "file=@resume.pdf" http://localhost:5000/upload_cv

Search for candidates:

curl -X POST -H "Content-Type: application/json" \
     -d '{"query": "python developer"}' \
     http://localhost:5000/search_candidates
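
The same calls from Python, as a small sketch using the requests library (endpoint paths are taken from the curl examples above):

import requests  # pip install requests

BASE = "http://localhost:5000"

# Upload a CV from a local file.
with open("resume.pdf", "rb") as f:
    upload = requests.post(f"{BASE}/upload_cv", files={"file": ("resume.pdf", f, "application/pdf")})
print(upload.json())

# Search for candidates matching a free-text query.
search = requests.post(f"{BASE}/search_candidates", json={"query": "python developer"})
print(search.json())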

3. Deployment

LangServe

After the service is running, the LangServe playgrounds are available at the following URLs:

http://localhost:5000/langserve/chat/playground
http://localhost:5000/langserve/generative_ai/playground
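
The served chains can also be called programmatically with LangServe's client (a sketch that assumes the chains are mounted at the paths above; the expected input schema depends on how each chain is defined):

from langserve import RemoteRunnable  # pip install "langserve[client]"

# Connect to the chat chain mounted under /langserve/chat.
chat = RemoteRunnable("http://localhost:5000/langserve/chat")

# Adjust the payload to the chain's input schema; a plain string is only an example.
print(chat.invoke("Summarize the key skills expected of a senior Python developer."))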

Streamlit

You can also deploy the service using Streamlit with the following command:

streamlit run src/streamlit.py
