This repository contains an unofficial implementation of DarwinLM: Evolutionary Structured Pruning of Large Language Models (Tang et al., 2025; see the citation below).
darwinlm/
├── configs/ # Configuration files
│ ├── models/ # Model-specific configs
│ │ ├── llama2_7b.yaml
│ │ ├── qwen_14b.yaml
│ │ └── mistral_7b.yaml
│ └── base.yaml # Base configuration
├── darwinlm/ # Main package
│ ├── algorithms/ # Core algorithms
│ │ ├── pruning.py # Second-order pruning
│ │ ├── evolution.py # Evolutionary search
│ │ └── training.py # Training-aware selection
│ ├── models/ # Model handling
│ │ └── adapter.py # Model adapter
│ ├── data/ # Data processing
│ │ └── manager.py # Data manager
│ └── utils/ # Utilities
│ ├── config.py # Config management
│ ├── logging.py # Logging utilities
│ ├── checkpointing.py
│ └── metrics.py # Evaluation metrics
├── tests/ # Test suite
├── examples/ # Example scripts
├── setup.py # Package setup
└── README.md # Documentation
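The layout mirrors the method described in the paper: algorithms/pruning.py generates per-layer compression levels with second-order pruning, and algorithms/evolution.py searches over level assignments with training-aware selection. Below is a much-simplified, self-contained sketch of such a search loop, assuming a mutation that moves one unit of sparsity between two layers so the overall budget stays fixed; all function and parameter names are illustrative and do not reflect this package's actual API.

```python
import random

def evolutionary_search(num_layers, num_levels, target_level_sum,
                        fitness_fn, generations=50, offspring_per_gen=8, seed=0):
    """Illustrative search over per-layer sparsity levels.

    `fitness_fn(candidate)` stands in for the training-aware evaluation
    (briefly finetune the pruned model, then measure its loss).
    """
    rng = random.Random(seed)

    # Start from a uniform assignment that satisfies the overall budget.
    parent = [target_level_sum // num_layers] * num_layers
    parent[0] += target_level_sum - sum(parent)
    best_fitness = fitness_fn(parent)

    for _ in range(generations):
        candidates = []
        for _ in range(offspring_per_gen):
            child = parent.copy()
            # Budget-preserving mutation: shift one unit of sparsity
            # from layer j to layer i.
            i, j = rng.sample(range(num_layers), 2)
            if child[i] < num_levels - 1 and child[j] > 0:
                child[i] += 1
                child[j] -= 1
            candidates.append(child)

        # Selection: keep the best offspring only if it improves on the parent.
        scored = [(fitness_fn(c), c) for c in candidates]
        scored.sort(key=lambda t: t[0])
        if scored[0][0] < best_fitness:
            best_fitness, parent = scored[0]

    return parent, best_fitness
```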
# Clone repository
git clone https://github.com/llmsresearch/darwinlm.git
cd darwinlm
# Create virtual environment
python -m venv venv
source venv/bin/activate # Linux/Mac
# or
.\venv\Scripts\activate # Windows
# Install dependencies
pip install -r requirements.txt
# Install package in development mode
pip install -e .
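A quick smoke test can confirm the editable install; the top-level package name `darwinlm` is assumed from the repository layout above.

```python
# smoke_test.py - verify the editable install and check for a GPU.
# The package name `darwinlm` is assumed from the repository layout.
import importlib

import torch

for name in ("torch", "transformers", "darwinlm"):
    module = importlib.import_module(name)
    print(f"{name} {getattr(module, '__version__', '(no __version__)')}")

print("CUDA available:", torch.cuda.is_available())
```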
- Configure your model:
# configs/models/your_model.yaml
model:
  name: "your-model"
  pretrained_path: "path/to/model"
  architecture: "llama"  # or other supported architectures
- Run compression:
# Compress Llama2 model
python examples/compress_llama.py
# Compress Qwen model
python examples/compress_qwen.py
# Compress with custom config
python examples/compress_llama.py model=llama2_7b pruning.sparsity_target=0.6
- The compressed model will be saved in the specified output directory.
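Assuming the output directory contains a Hugging Face-format checkpoint (this depends on the output settings in your config), the compressed model can be sanity-checked with transformers; the path below is a placeholder.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder path: point this at the output directory from your run.
output_dir = "outputs/llama2_7b_compressed"

tokenizer = AutoTokenizer.from_pretrained(output_dir)
model = AutoModelForCausalLM.from_pretrained(output_dir, torch_dtype="auto")
model.to("cuda" if torch.cuda.is_available() else "cpu")
model.eval()

prompt = "Structured pruning removes entire"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```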
The configuration system uses Hydra and is organized as follows:
- configs/base.yaml: Base configuration with default values
- configs/models/: Model-specific configurations
  - llama2_7b.yaml: Llama 2 7B configuration
  - qwen_14b.yaml: Qwen 14B configuration
  - mistral_7b.yaml: Mistral 7B configuration
You can override any config value on the command line:
python examples/compress_llama.py \
  model=llama2_7b \
  pruning.sparsity_target=0.6 \
  training.batch_size=64
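The override syntax above is standard Hydra. A minimal, illustrative entry point (not the repo's actual script; it assumes base.yaml declares a defaults list that maps the models/ group onto the model key) shows where those overrides end up:

```python
# sketch_entry_point.py - illustrative Hydra entry point, not the repo's actual script.
import hydra
from omegaconf import DictConfig, OmegaConf


@hydra.main(config_path="configs", config_name="base", version_base=None)
def main(cfg: DictConfig) -> None:
    # `model=llama2_7b` swaps in configs/models/llama2_7b.yaml (given a matching
    # defaults list); dotted overrides such as pruning.sparsity_target=0.6
    # land in the corresponding nested keys.
    print(OmegaConf.to_yaml(cfg))
    print("model:", cfg.model.name)
    print("sparsity target:", cfg.pruning.sparsity_target)
    print("batch size:", cfg.training.batch_size)


if __name__ == "__main__":
    main()
```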
We welcome contributions! Please check our Contributing Guidelines for details.
Some areas we'd love help with:
- Adding support for new model architectures
- Implementing alternative pruning strategies
- Improving evolution operators
- Enhancing documentation and examples
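As a starting point for the first item, a hypothetical adapter sketch (illustrative only; the real darwinlm/models/adapter.py may differ) shows the kind of hooks a new architecture would need to expose: the transformer blocks and their prunable sublayers.

```python
# Hypothetical adapter interface for illustration only.
from dataclasses import dataclass
from typing import List

import torch.nn as nn


@dataclass
class BlockHandles:
    """Prunable sublayers of one transformer block."""
    attention: nn.Module
    mlp: nn.Module


class ModelAdapter:
    """Maps a model to the layers structured pruning acts on."""

    def get_blocks(self, model: nn.Module) -> List[BlockHandles]:
        raise NotImplementedError


class LlamaAdapter(ModelAdapter):
    def get_blocks(self, model: nn.Module) -> List[BlockHandles]:
        # Hugging Face Llama models expose decoder layers at model.model.layers,
        # each with .self_attn and .mlp submodules.
        return [BlockHandles(layer.self_attn, layer.mlp)
                for layer in model.model.layers]
```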
@article{tang2025darwinlm,
  title={DarwinLM: Evolutionary Structured Pruning of Large Language Models},
  author={Tang, Shengkun and Sieberling, Oliver and Kurtic, Eldar and Shen, Zhiqiang and Alistarh, Dan},
  journal={arXiv preprint arXiv:2502.07780},
  year={2025}
}
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.