Transcription API File Extension Issue

**Bug description**
Spring AI overrides the file name with `audio.webm` when calling the OpenAI Transcription API. [See the code here](https://github.com/spring-projects/spring-ai/blob/main/models/spring-ai-openai/src/main/java/org/springframework/ai/openai/api/OpenAiAudioApi.java#L163). The newer OpenAI transcription models, `gpt-4o-transcribe` and `gpt-4o-mini-transcribe`, do not accept an `*.mp3` file sent under a `*.webm` file name. They fail with an unsupported error, while `whisper-1` allows it.

**Environment**
Java 21, Spring Boot 3.5.0, Spring AI 1.0 GA

**Steps to reproduce**
This problem can be reproduced with `curl` against the [OpenAI Transcription API](https://platform.openai.com/docs/api-reference/audio/createTranscription).

1. Grab an MP3 file that contains speech.
2. Rename the file to `audio.webm`.
3. Send a request with `curl`, providing the API key and making sure the selected model is one of the `gpt-4o-transcribe` models (not `whisper-1`).
4. Observe the returned error.

Make sure the file you send is an MP3 file renamed to `audio.webm`:
```sh
curl https://api.openai.com/v1/audio/transcriptions \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: multipart/form-data" \
  -F file="@/path/to/file/audio.webm" \
  -F model="gpt-4o-transcribe"
```

**Expected behavior**
Spring AI should not change the file name and then fail silently. It should send the correct file name, or at least the correct file extension.
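
To illustrate the expected behavior, here is a minimal, stdlib-only sketch of how a client could keep the caller's file name in the multipart `file` part and fall back to a default only when no name is available. This is not Spring AI's code: the class `MultipartFilenameSketch` and the helpers `resolveFilename` and `filePartHeader` are hypothetical names made up for this example, and the boundary value is arbitrary.

```java
public class MultipartFilenameSketch {

    /**
     * Hypothetical helper: keeps the caller's file name and uses the
     * fallback only when no name was supplied at all.
     */
    static String resolveFilename(String originalFilename, String fallback) {
        if (originalFilename == null || originalFilename.isBlank()) {
            return fallback;
        }
        return originalFilename;
    }

    /**
     * Builds the header lines that precede the file bytes in a
     * multipart/form-data body. Assumes the file name contains no
     * double quotes or line breaks (no escaping is done here).
     */
    static String filePartHeader(String boundary, String filename) {
        return "--" + boundary + "\r\n"
                + "Content-Disposition: form-data; name=\"file\"; filename=\"" + filename + "\"\r\n"
                + "Content-Type: application/octet-stream\r\n\r\n";
    }

    public static void main(String[] args) {
        // The original name wins; "audio.webm" is only a last resort.
        String name = resolveFilename("speech.mp3", "audio.webm");
        System.out.print(filePartHeader("boundary123", name));
    }
}
```

With this approach an MP3 upload would reach the API with a `.mp3` extension in the `Content-Disposition` header, and `audio.webm` would only be used when the caller provides no name.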

