Open
Description
Git commit
commit 3f3769b (HEAD -> master, origin/master, origin/HEAD)
Operating systems
Windows
GGML backends
CUDA
Problem description & steps to reproduce
After replacing my 3090 with a 5070, I see a compilation error:
nvcc fatal : Unsupported gpu architecture 'compute_120'
I found this: ggml-org/whisper.cpp#3030
Adding -DCMAKE_CUDA_ARCHITECTURES="86" to the CMake configure step solved my llama.cpp compilation problem.
Should this fix also be merged into llama.cpp?
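For reference, a sketch of the diagnosis and workaround (assumptions: `--list-gpu-arch` output depends on the installed CUDA toolkit version, and `compute_120` corresponds to Blackwell-generation GPUs such as the RTX 5070, which reportedly require a newer toolkit than the one producing this error):

```shell
# List the compute architectures this nvcc build actually supports;
# if compute_120 is missing, the installed toolkit predates Blackwell.
nvcc --list-gpu-arch

# Workaround: pin CMAKE_CUDA_ARCHITECTURES to an architecture the
# installed toolkit does support ("86" is what worked in my case).
cmake -DGGML_CUDA=ON -DLLAMA_CURL=OFF -DCMAKE_CUDA_ARCHITECTURES="86" ..
cmake --build . --config Release -j 30
```

Upgrading the CUDA toolkit instead of pinning the architecture would presumably also resolve the error.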
First Bad Commit
No response
Compile command
cmake -DGGML_CUDA=ON -DLLAMA_CURL=OFF ..
cmake --build . --config Release -j 30
Relevant log output
nvcc fatal : Unsupported gpu architecture 'compute_120'