
ramalama chat should wait for llama.cpp to initialize #2343

@olliewalsh


Issue Description

llama-server's /models API returns a 503 error while the model is loading. ramalama run handles this, but ramalama chat currently does not. For interactive use this isn't a major issue, but it would be good to fix.
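A fix could mirror what ramalama run already does: poll the server until the model has finished loading before sending the first request. Below is a minimal sketch of that idea; the helper name, base URL, endpoint path, and defaults are assumptions for illustration, not ramalama's actual code or API.

```python
import time
import urllib.error
import urllib.request


def wait_for_server(base_url="http://127.0.0.1:8080", timeout=120, interval=1.0):
    """Poll the server's /models endpoint until it stops returning 503.

    Returns True once a 200 is seen, False if the timeout expires.
    (Hypothetical helper; URL and timeouts are illustrative defaults.)
    """
    deadline = time.monotonic() + timeout
    url = f"{base_url}/models"
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=5) as resp:
                if resp.status == 200:
                    return True
        except urllib.error.HTTPError as exc:
            # 503 means the model is still loading; anything else is unexpected.
            if exc.code != 503:
                raise
        except urllib.error.URLError:
            # Server socket may not even be listening yet.
            pass
        time.sleep(interval)
    return False
```

ramalama chat could call such a helper once at startup and only then open the interactive session, failing with a clear timeout message if the server never becomes ready.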

Worked around this for CI in #2342

Steps to reproduce the issue

Run ramalama chat soon after ramalama serve --detach

Describe the results you received

ramalama chat fails because the server is still loading the model and returns 503.

Describe the results you expected

ramalama chat waits for the server to finish loading the model and then succeeds.

ramalama info output

N/A

Upstream Latest Release

Yes

Additional environment details

No response

Additional information

No response

Metadata


Labels

bug (Something isn't working), good first issue (Good for newcomers)
