Plex Knowledge Graph

Plex KG builds a knowledge graph from Plex media data, linking movies, genres, and people through semantic relationships. It enables querying and simple recommendations.

Tools

PlexAPI: collect Plex data.
rdflib: extract/map Plex data to turtle only. Doesn't handle large graphs well because it runs in memory.
pyshacl: validate graphs.
Fuseki: handle RDF storage, query and reasoning.
FastAPI: easily access user facing functions.

Data Flow

Get data from Plex.
Transform Plex data to a graph in the Turtle format.
Create media relationship graph.
Validate graphs against expected shapes in: ./rdf/shapes/.
Upload graphs fuseki.

Note

The docker compose automagically creates a dataset called 'plex' and will be used as the dataset throughout the project. You can view the graphs in fuseki or download them. If you'd like to see an example graph, you can check out ./rdf/example.ttl. That file has example nodes for the media graph and relationship graph. Additionally, ./rdf/ontology.ttl adds to the existing schema.org ontology.

Setup

Developed on Python 3.12.

Generate X-Plex-Client-Identifier for the python client. You can generate it in many different ways but I prefer to do it on the CLI using uuidgen.
Get your authToken by sending a request to https://plex.tv/api/v2/users/signin with the following items in your request body:
- X-Plex-Client-Identifier: from step 1.
- login: email
- password: password
Add X-Plex-Client-Identifier and authToken to your .env.
You might need to change the permission for the directories inside ./fuseki-data/ to 100:100 (fuseki UID and GID).

chown -R 100:100 ./fuseki-data/*

Run

docker compose up

URLs:

http://localhost:3030/ - Fuseki
http://localhost:8000/ - FastAPI

Run queries using fuseki API:

curl POST \
    --data-urlencode "query@{path-to-rq-file}" \
    http://localhost:3030/plex/query | jq

If it's saying unauthorized, use curl's -u parameter: curl -u user:pw ...
If you get an error saying that the URL doesn't support POST requests, ensure that the dataset name is correct.

Note

If you'd like to see the queries being run in the project, you can find them in ./rdf/queries/.

Limitations

To reduce the project's complexity, the media is limited to a single Plex section and a single user. Note: Movies and TV Shows can be considered Plex sections. The project was developed and tested using only movies so the other sections might not even work.

If you would like to see the available Plex sections, you can use the FastAPI endpoint /library, "Get Plex Libraries". I know... confusing names but that's Plex naming scheme. The section ID is called "key" in the dataset.

Another major limitation is that I did not implement any error handling... 😅 I just didn't feel like, lol. You can check docker logs to help clear any blockers you might have.

Oh, and no OWL...

Resources

Note

Check out prereqs.md for some key points on Semantic Web.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
data		data
rdf		rdf
src		src
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
prereqs.md		prereqs.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Plex Knowledge Graph

Tools

Data Flow

Setup

Run

Limitations

Resources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Plex Knowledge Graph

Tools

Data Flow

Setup

Run

Limitations

Resources

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages