Graceful shutdown isn't working like expected

### Is there an existing issue for this?

- [x] I have searched the existing issues

### Current behavior

This implementation is incorrect: https://github.com/nestjs/terminus/issues/2421 (to read first) & https://github.com/nestjs/terminus/pull/2422

1. on receiving SIGTERM signal, set `readiness` probe to fail with 503, to tell the orchestrator to stop sending requests
2. Wait X seconds to be sure traffic stops being forwarded to the app by Kubernetes (should match the interval of the readiness probe + few seconds, to be sure the orchestrator is aware the pod should stop receive traffic), 
4. proceed to close the webserver (process last requests if there are still some long ones running)
5. proceed to close database connections and others connections & shutdown the app

![Image](https://github.com/user-attachments/assets/e66c6b18-a9d9-4b67-914b-0f6389ef34bf)


### Minimum reproduction code

Load test your NestJS app running in a Kubernetes environment, and trigger a new deployment during this load test. You should notice a few failed requests.

Here is a simple example of load test you can run with [k6](https://k6.io/):

```bash
cat << 'EOF' | k6 run -
import http from 'k6/http';
import { sleep } from 'k6';

export const options = {
  scenarios: {
    constant_request_rate: {
      executor: 'constant-arrival-rate',
      rate: 5,                // 5 iterations per second
      timeUnit: '1s',         // 1 second
      duration: '2m',         // 2 minutes
      preAllocatedVUs: 5,     // Number of VUs to pre-allocate
      maxVUs: 10,            // Maximum number of VUs to allow if needed
    },
  },
};

export default function () {
  http.get('https://your-endpoint.com/livez');
  sleep(1);
}
EOF
```


### Steps to reproduce

_No response_

### Expected behavior

The expected graceful shutdown behaviour from a production-ready NestJs app should be:

1. on receiving SIGTERM signal, ~set `readiness` probe to fail with 503, to tell the orchestrator to stop sending requests~
2. **Wait X seconds** to be sure traffic stops being forwarded to the app by Kubernetes
3. **set `readiness` probe to fail with 503, to tell the orchestrator to stop sending requests**
4. proceed to close the webserver (process last requests if there are still some long ones running)
5. proceed to close database connections and others connections & shutdown the app

![Image](https://github.com/user-attachments/assets/d606fa30-7c60-48cc-b8e5-673c437f0ab4)

Therefore, if the loadbalancer is still sending a request before being aware the endpoint is removed, the requests won't we seen as failed with 502, but instead will still be processed and not lead to downtime during a rolling update.

### Package version

latest

### NestJS version

latest

### Node.js version

latest

### In which operating systems have you tested?

- [x] macOS
- [ ] Windows
- [ ] Linux

### Other

Resources that explains why the few seconds sleep is necessary:
https://learnk8s.io/graceful-shutdown

In the meantime, simply setting a sleep to 0s in Terminus, and adding a lifecycle preStop hook to sleep X sec is enough to fix the behaviour.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Graceful shutdown isn't working like expected #2569

Is there an existing issue for this?

Current behavior

Minimum reproduction code

Steps to reproduce

Expected behavior

Package version

NestJS version

Node.js version

In which operating systems have you tested?

Other

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Graceful shutdown isn't working like expected #2569

Description

Is there an existing issue for this?

Current behavior

Minimum reproduction code

Steps to reproduce

Expected behavior

Package version

NestJS version

Node.js version

In which operating systems have you tested?

Other

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions