Stream Interruption on Ant Media Server 2.14 due to sudden memory spike on Vultr Kubernetes Engine #7465
Unanswered · asked by oleul-bjit in Q&A
All RTMP streams (~268 cameras) stopped simultaneously on Ant Media Server 2.14 running in Kubernetes. Memory spiked, the heap dump API failed, the pod was OOMKilled, and all streams resumed only after the pod restarted.
Environment
Steps to reproduce
Expected behavior
RTMP streams should continue generating .ts files.
Memory usage should remain stable under the expected load (see the monitoring sketch after this list).
Heap dump API should work when triggered under memory pressure to allow analysis.
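For context, a minimal sketch of the kind of in-JVM heap monitoring that could run alongside the server to make such a spike visible in the logs; the class name, interval, and output format are illustrative and not part of Ant Media Server:

```java
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryMXBean;
import java.lang.management.MemoryUsage;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Logs heap usage every 10 seconds so a spike can be correlated with stream errors.
public class HeapUsageLogger {
    public static void main(String[] args) {
        MemoryMXBean memory = ManagementFactory.getMemoryMXBean();
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        scheduler.scheduleAtFixedRate(() -> {
            MemoryUsage heap = memory.getHeapMemoryUsage();
            long usedMb = heap.getUsed() / (1024 * 1024);
            long maxMb = heap.getMax() / (1024 * 1024); // -1 if the max is undefined
            System.out.printf("heap used=%d MB max=%d MB%n", usedMb, maxMb);
        }, 0, 10, TimeUnit.SECONDS);
    }
}
```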
Actual behavior
On 2025-08-18 between 22:11 JST and 22:21 JST, all RTMP streams stopped simultaneously.
Timeline (JST):
22:10:59 → Last successful stream recording observed.
22:11:06 → Stream stop notifications received.
22:11:08 → Multiple MongoInterruptedException from RTMPConnectionExecutor.
22:11:10 → Exception messages logged: Error executing call: Service: null Method: publish Num Params: 2 0: {streamId}?subscriberId=ru9JD0Dg&subscriberCode=758169 1: live.
22:13:41 → Memory usage spiked to ~50%; the heap dump API was triggered but did not work.
22:16:01 → Continuous MongoInterruptedException and publish error logs until this time.
22:20:33 → Pod terminated due to OOMKilled.
22:21:21 → Pod restarted; all streams resumed automatically.
Why did all streams stop simultaneously, and why did the server start logging these exceptions?
What does the following exception message mean, and when and why does it occur?
Error executing call: Service: null Method: publish Num Params: 2
0: {streamId}?subscriberId=ru9JD0Dg&subscriberCode=758169
1: live
Why did memory spike sharply during this exception period?
Why did GC and the heap dump API fail under memory pressure? Is there an alternative, safe way to collect a reliable heap dump in such cases?
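In case it is useful, a minimal sketch of one alternative, assuming the JVM is still responsive and the target path is writable (e.g. a persistent volume so the file survives an OOMKill): the dump can be requested directly from the JVM via HotSpotDiagnosticMXBean instead of the REST API. jcmd <pid> GC.heap_dump and the -XX:+HeapDumpOnOutOfMemoryError flag are standard out-of-process options for the same thing.

```java
import java.lang.management.ManagementFactory;
import com.sun.management.HotSpotDiagnosticMXBean;

// Sketch: trigger an .hprof heap dump from inside the JVM, bypassing the REST endpoint.
// The output path is illustrative; in Kubernetes it should point to a persistent volume.
public class HeapDumpTrigger {
    public static void main(String[] args) throws Exception {
        HotSpotDiagnosticMXBean diag = ManagementFactory.newPlatformMXBeanProxy(
                ManagementFactory.getPlatformMBeanServer(),
                "com.sun.management:type=HotSpotDiagnostic",
                HotSpotDiagnosticMXBean.class);
        // live = true dumps only reachable objects, which typically forces a full GC first.
        diag.dumpHeap("/data/heapdump-" + System.currentTimeMillis() + ".hprof", true);
    }
}
```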
Logs
Logs and heap dump are available here:
https://drive.google.com/drive/folders/1HOIOEYU0-yV9JjufHdmZq1TYwYdddN0x?usp=sharing