Description
In what version(s) of Spring for Apache Kafka are you seeing this issue?
3.3.0 and 3.3.1
Describe the bug
We have an application using Spring Boot v3.3.6 and Spring Kafka v3.3.0. We have seen that the spring.kafka.listener.active metric leads to an ever-increasing number of activeTask instances of Micrometer's DefaultLongTaskTimer, which are never garbage collected.
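To illustrate the accumulation mechanism, here is a minimal sketch using the plain Micrometer API (not the Spring Kafka code itself; the meter name is only indicative). A long task timer keeps every started sample in its active task set until stop() is called on that sample:

```java
import io.micrometer.core.instrument.LongTaskTimer;
import io.micrometer.core.instrument.simple.SimpleMeterRegistry;

public class ActiveTaskAccumulationSketch {

    public static void main(String[] args) {
        SimpleMeterRegistry registry = new SimpleMeterRegistry();
        LongTaskTimer timer = LongTaskTimer.builder("spring.kafka.listener")
                .register(registry);

        // Every start() registers a sample in the timer's active task set.
        // If stop() is never called on the returned sample, it is retained
        // for the lifetime of the timer and is never garbage collected.
        for (int i = 0; i < 1_000; i++) {
            LongTaskTimer.Sample sample = timer.start();
            // sample.stop() is intentionally never called here
        }

        System.out.println("active tasks: " + timer.activeTasks()); // prints 1000
    }
}
```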
The screenshot below shows the memory and CPU usage of the process, which follows a classic memory-leak trend:
The issue does not appear in another of our applications that uses Spring Kafka v3.2.2.
The symptoms are similar to those in spring-projects/spring-security#14030, where the stop method is not called on the observation.
By debugging the doInvokeRecordListener method in KafkaMessageListenerContainer, we can see that the finally block containing the observation.stop call is not executed because the listener is an instance of RecordMessagingMessageListenerAdapter.
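For reference, the Micrometer Observation lifecycle expects stop() to run once processing ends, typically in a finally block. The simplified sketch below illustrates that pattern; it is not the actual doInvokeRecordListener source:

```java
import io.micrometer.observation.Observation;
import io.micrometer.observation.ObservationRegistry;

public class ObservationLifecycleSketch {

    void invokeListener(Runnable listenerCall, ObservationRegistry registry) {
        Observation observation =
                Observation.createNotStarted("spring.kafka.listener", registry).start();
        try {
            listenerCall.run();
        } catch (RuntimeException ex) {
            observation.error(ex);
            throw ex;
        } finally {
            // In the reported bug, the code path taken for
            // RecordMessagingMessageListenerAdapter never reaches the equivalent
            // of this stop() call, so the observation is never stopped and its
            // backing long task timer sample leaks.
            observation.stop();
        }
    }
}
```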
When we disable the spring.cloud.stream.kafka.binder.enableObservation property, system resource consumption seems to return to normal.
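For anyone else affected, this is the workaround we applied, shown here as an application.properties sketch (adapt it to your configuration style):

```properties
# Workaround: disable Kafka binder observation until the leak is fixed
spring.cloud.stream.kafka.binder.enableObservation=false
```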
Further compounding the issue, Prometheus regularly scrapes this metric, which uses even more CPU and leads to timeouts or broken pipes on the scraping endpoint (/actuator/prometheus).
See the stack trace associated with the scraping workload: threaddump-1734718231951.zip
To Reproduce
We have been able to create a minimal sample project to reproduce the issue.
This is a simple Kafka producer / consumer using the latest Spring Boot v3.4.1 and Spring Kafka v3.3.1 versions.
We changed the producer's send interval to 1 millisecond (around 1000 messages per second) to speed up the phenomenon.
After about one hour of running the test, we see millions of active task instances, and the count keeps growing:
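For context, here is a rough sketch of the kind of producer/consumer pair used in the sample (class name, topic, and group id are illustrative; the actual project is linked below and also enables observation on the listener container):

```java
import org.springframework.kafka.annotation.KafkaListener;
import org.springframework.kafka.core.KafkaTemplate;
import org.springframework.scheduling.annotation.EnableScheduling;
import org.springframework.scheduling.annotation.Scheduled;
import org.springframework.stereotype.Component;

@Component
@EnableScheduling
public class LeakReproductionSketch {

    private final KafkaTemplate<String, String> template;

    public LeakReproductionSketch(KafkaTemplate<String, String> template) {
        this.template = template;
    }

    // Send one message every millisecond (~1000 msg/s) to speed up the leak.
    @Scheduled(fixedRate = 1)
    public void produce() {
        template.send("leak-demo-topic", "ping");
    }

    // With listener observation enabled, each consumed record starts an
    // observation; under the reported bug its stop() is never reached,
    // so the active task count keeps growing.
    @KafkaListener(topics = "leak-demo-topic", groupId = "leak-demo-group")
    public void consume(String message) {
        // no-op: the leak is in the instrumentation, not in the business logic
    }
}
```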
Sample