forked from vllm-project/vllm
Pull requests: ROCm/vllm
[Triton] [355_wip] add shape checking for aiter triton fp4 gemm
    #764 opened Oct 24, 2025 by k50112113
[ROCM] Llama4 VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE support
    #763 opened Oct 24, 2025 by tpopp
CI: Initial the model performance tests workflow
    #747 opened Oct 21, 2025 by gyohuangxin • Draft • 5 tasks
[WIP] Support persistent MLA for ROCm MLA backend
    #739 opened Oct 16, 2025 by ganyi1996ppo • 5 tasks
[FEAT] Add support for AITER bpreshuffle block scale gemm
    #717 opened Sep 27, 2025 by tjtanaavllm • 5 tasks
[Perf] refactor attention backend for perf boost
    #713 opened Sep 26, 2025 by ganyi1996ppo • 5 tasks
[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern
    #705 opened Sep 24, 2025 by xytpai
[ROCm] Add allreduce dispatcher for ROCm device
    #704 opened Sep 24, 2025 by zejunchen-zejun
[ROCm] Add allreduce dispatcher for ROCm device
    #695 opened Sep 18, 2025 by zejunchen-zejun
[ROCm] warpSize is being made non constexpr in ROCm 7.0 (#20330)
    #694 opened Sep 18, 2025 by xudonlyu
[355_wip] Let inductor capture silu+mul+quant pattern and replace them with aiter operator
    #669 opened Sep 11, 2025 by xytpai
support ck-tile fused bias gemm for rocm unquantized gemm
    #668 opened Sep 11, 2025 by eliotwang
add fp8 gemm path choice for rocm_aiter_gemm_w8a8_blockscale
    #659 opened Sep 8, 2025 by zhuyuhua-v