File tree
1,979 files changed
+185459
-157813
lines changed- .buildkite- nightly-benchmarks/scripts
- scripts- hardware_ci
- tpu
 
 
- .github- ISSUE_TEMPLATE
- workflows
 
- benchmarks- auto_tune
- cutlass_benchmarks
- disagg_benchmarks
- kernels- deepgemm
 
- multi_turn
 
- cmake- external_projects
 
- csrc- attention- mla- cutlass_sm100_mla- device
- kernel
 
 
 
- core
- cpu
- cutlass_extensions
- moe- marlin_moe_wna16
 
- quantization- cutlass_w4a8
- fp4
- fused_kernels
- gptq_marlin
- machete
- w8a8- cutlass- c3x
- moe
 
- fp8- amd
- nvidia
 
- int8
 
 
- rocm
 
- docker
- docs- api- vllm
 
- assets- deployment
- design/cuda_graphs
 
- community
- configuration
- contributing- model
 
- deployment- frameworks
- integrations
 
- design
- features- quantization
 
- getting_started- installation- cpu
- gpu
 
 
- mkdocs/hooks
- models- extensions
 
- serving
- training
- usage
 
- examples- offline_inference- basic
- kv_load_failure_recovery
- logits_processor
- pooling
 
- online_serving- dashboards- grafana
- perses
 
- disaggregated_serving_p2p_nccl_xpyd
- disaggregated_serving
- elastic_ep
- openai_embedding_long_text
- pooling
- structured_outputs
 
- others
 
- requirements
- tests- async_engine
- basic_correctness
- benchmarks
- compile- piecewise
 
- config
- core- block- e2e
 
 
- cuda
- detokenizer
- distributed
- engine
- entrypoints- llm
- offline_mode
- openai- correctness
- tool_parsers
 
- pooling- correctness
- llm
- openai
 
 
- evals- gpt_oss
- gsm8k- configs
 
 
- fastsafetensors_loader
- kernels- attention
- core
- mamba
- moe- modular_kernel_tools
 
- quantization
 
- kv_transfer
- lora
- metrics
- mistral_tool_use
- model_executor- model_loader- fastsafetensors_loader
- runai_model_streamer
- tensorizer_loader
 
 
- models- language- generation_ppl_test
- generation
- pooling_mteb_test
- pooling
 
- multimodal- generation- vlm_utils
 
- pooling
- processing
 
- quantization
 
- mq_llm_engine
- multimodal
- plugins_tests
- plugins- lora_resolvers
- prithvi_io_processor_plugin/prithvi_io_processor
- vllm_add_dummy_model- vllm_add_dummy_model
 
- vllm_add_dummy_platform- vllm_add_dummy_platform
 
 
- quantization
- reasoning
- runai_model_streamer_test
- samplers
- speculative_decoding/speculators
- standalone_tests
- tokenization
- tool_use- mistral
 
- tools
- tpu- lora
 
- tracing
- transformers_utils
- utils_
- v1- attention
- core
- cudagraph
- distributed
- e2e
- engine
- entrypoints- llm
- openai- responses
 
 
- executor
- generation
- kv_connector- nixl_integration
- unit
 
- kv_offload
- logits_processors
- metrics
- sample
- shutdown
- spec_decode
- structured_output
- tpu- worker
 
- tracing
- worker
 
- vllm_test_utils- vllm_test_utils
 
- weight_loading
- worker
 
- tools- ep_kernels
- pre_commit
- profiler- nsys_profile_tools
 
 
- vllm- adapter_commons
- assets
- attention- backends- mla
 
- layers
- ops
- utils
 
- benchmarks- lib
 
- compilation
- config
- core- block
 
- device_allocator
- distributed- device_communicators
- eplb
- kv_transfer- kv_connector- v1- p2p
 
 
- kv_lookup_buffer
- kv_pipe
 
 
- engine- multiprocessing
- output_processor
 
- entrypoints- cli- benchmark
 
- openai- tool_parsers
 
 
- executor
- inputs
- logging_utils
- lora- layers
- ops- ipex_ops
- torch_ops
- triton_ops
- xla_ops
 
- punica_wrapper
 
- model_executor- layers- fla/ops
- fused_moe- configs
 
- mamba- ops
 
- quantization- compressed_tensors- schemes
- transform- schemes
 
 
- kernels- mixed_precision
- scaled_mm
 
- quark- schemes
 
- utils
 
- rotary_embedding
 
- model_loader
- models
- warmup
 
- multimodal
- platforms
- plugins- io_processors
- lora_resolvers
 
- profiler
- ray
- reasoning
- transformers_utils- chat_templates
- configs- speculators
 
- processors
- tokenizers
 
- triton_utils
- usage
- utils
- v1- attention/backends- mla
 
- core- sched
 
- engine
- executor
- kv_offload- backends
- worker
 
- metrics
- pool
- sample- logits_processor
- ops
- tpu
 
- spec_decode
- structured_output
- worker
 
- worker
 
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
1,979 files changed
+185459
-157813
lines changedLines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
|  | |||
| 368 | 368 |  | |
| 369 | 369 |  | |
| 370 | 370 |  | |
| 371 |  | - | |
|  | 371 | + | |
| 372 | 372 |  | |
| 373 | 373 |  | |
| 374 | 374 |  | |
|  | |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
|  | |||
| 181 | 181 |  | |
| 182 | 182 |  | |
| 183 | 183 |  | |
| 184 |  | - | |
| 185 |  | - | |
|  | 184 | + | |
| 186 | 185 |  | |
| 187 |  | - | |
| 188 | 186 |  | |
| 189 | 187 |  | |
| 190 | 188 |  | |
| 191 | 189 |  | |
| 192 |  | - | |
| 193 |  | - | |
|  | 190 | + | |
| 194 | 191 |  | |
| 195 |  | - | |
| 196 | 192 |  | |
| 197 | 193 |  | |
| 198 | 194 |  | |
|  | |||
Lines changed: 1 addition & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
|  | |||
| 365 | 365 |  | |
| 366 | 366 |  | |
| 367 | 367 |  | |
| 368 |  | - | |
| 369 |  | - | |
|  | 368 | + | |
| 370 | 369 |  | |
| 371 | 370 |  | |
| 372 | 371 |  | |
|  | |||
| 455 | 454 |  | |
| 456 | 455 |  | |
| 457 | 456 |  | |
| 458 |  | - | |
| 459 |  | - | |
| 460 |  | - | |
| 461 |  | - | |
| 462 |  | - | |
| 463 | 457 |  | |
| 464 | 458 |  | |
| 465 | 459 |  | |
|  | |||
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
|  | |||
| 76 | 76 |  | |
| 77 | 77 |  | |
| 78 | 78 |  | |
| 79 |  | - | |
|  | 79 | + | |
| 80 | 80 |  | |
| 81 | 81 |  | |
| 82 | 82 |  | |
|  | |||
| 150 | 150 |  | |
| 151 | 151 |  | |
| 152 | 152 |  | |
| 153 |  | - | |
| 154 |  | - | |
| 155 |  | - | |
| 156 |  | - | |
| 157 |  | - | |
|  | 153 | + | |
|  | 154 | + | |
|  | 155 | + | |
|  | 156 | + | |
|  | 157 | + | |
|  | 158 | + | |
|  | 159 | + | |
|  | 160 | + | |
|  | 161 | + | |
|  | 162 | + | |
| 158 | 163 |  | |
| 159 | 164 |  | |
| 160 | 165 |  | |
|  | |||
| 163 | 168 |  | |
| 164 | 169 |  | |
| 165 | 170 |  | |
|  | 171 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
|  | |||
| 8 | 8 |  | |
| 9 | 9 |  | |
| 10 | 10 |  | |
| 11 |  | - | |
|  | 11 | + | |
| 12 | 12 |  | |
| 13 | 13 |  | |
| 14 | 14 |  | |
| 15 | 15 |  | |
| 16 | 16 |  | |
|  | 17 | + | |
|  | 18 | + | |
|  | 19 | + | |
|  | 20 | + | |
|  | 21 | + | |
|  | 22 | + | |
|  | 23 | + | |
|  | 24 | + | |
|  | 25 | + | |
|  | 26 | + | |
|  | 27 | + | |
|  | 28 | + | |
|  | 29 | + | |
|  | 30 | + | |
|  | 31 | + | |
|  | 32 | + | |
|  | 33 | + | |
|  | 34 | + | |
|  | 35 | + | |
| 17 | 36 |  | |
| 18 | 37 |  | |
| 19 | 38 |  | |
| 20 | 39 |  | |
| 21 | 40 |  | |
| 22 | 41 |  | |
| 23 |  | - | |
|  | 42 | + | |
|  | 43 | + | |
| 24 | 44 |  | |
|  | 45 | + | |
| 25 | 46 |  | |
| 26 | 47 |  | |
| 27 | 48 |  | |
|  | |||
| 43 | 64 |  | |
| 44 | 65 |  | |
| 45 | 66 |  | |
| 46 |  | - | |
|  | 67 | + | |
|  | 68 | + | |
|  | 69 | + | |
| 47 | 70 |  | |
| 48 | 71 |  | |
| 49 | 72 |  | |
|  | |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
|  | |||
| 86 | 86 |  | |
| 87 | 87 |  | |
| 88 | 88 |  | |
| 89 |  | - | |
| 90 |  | - | |
| 91 |  | - | |
| 92 |  | - | |
| 93 | 89 |  | |
| 94 | 90 |  | |
| 95 | 91 |  | |
|  | |||
| 167 | 163 |  | |
| 168 | 164 |  | |
| 169 | 165 |  | |
| 170 |  | - | |
| 171 |  | - | |
| 172 |  | - | |
| 173 |  | - | |
| 174 |  | - | |
| 175 |  | - | |
| 176 | 166 |  | |
| 177 | 167 |  | |
| 178 | 168 |  | |
|  | |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
|  | |||
| 58 | 58 |  | |
| 59 | 59 |  | |
| 60 | 60 |  | |
| 61 |  | - | |
| 62 |  | - | |
| 63 |  | - | |
| 64 |  | - | |
| 65 |  | - | |
|  | 61 | + | |
|  | 62 | + | |
| 66 | 63 |  | |
| 67 | 64 |  | |
| 68 | 65 |  | |
|  | |||

0 commit comments