This repository visualizes the embedding space for the llama3.2 model.
Mathematically,
Let
To visualize these high-dimensional embeddings, we use dimensionality reduction techniques to project from
Common techniques for reducing dimensionality while preserving structure:
- PCA (Principal Component Analysis): Linear projection that maximizes variance
- t-SNE: Non-linear method that preserves local neighborhood structure
- UMAP: Balances local and global structure preservation
- Install and run ollama
- Pull the llama3.2 model:
ollama pull llama3.2 - Generate embeddings using the API:
curl http://localhost:11434/api/embed -d '{
"model": "llama3.2",
"input": "Your prompt here"
}'The response will contain a 3072-dimensional vector that can then be reduced to 3D for visualization.
llama3.2 has
