LLMBoost metrics
Show information regarding the LLM inference requests
| Metric name | Type | Description |
|---|---|---|
| num_active_requests | gauge | Number of ongoing chat/completion requests in the node |
| num_total_requests_total | counter | Total number of requests served by the LLMBoost engine |