Description
OpenAI recently released their new image generation model via API: gpt-image-1.
It would be great to add support for it in the gem.
Notes
- For image inputs, gpt-image-1 uses a different endpoint: images/edits.
- Currently, RubyLLM#paint uses the images/generations endpoint (code).
- It's a bit odd that OpenAI introduced a separate endpoint instead of simply allowing an image input on generations, but that's the current setup.
Proposal:
I suggest introducing a new method, maybe RubyLLM#edit, to support the images/edits endpoint separately without changing the existing paint method.
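To make the difference concrete, here is a minimal sketch of what a request to images/edits could look like using only Ruby's standard library. The helper name `build_image_edit_request` and its parameters are assumptions made for illustration, not part of RubyLLM; the sketch only builds the request and does not send it.

```ruby
require "net/http"
require "uri"
require "stringio"

# Hypothetical helper (not part of RubyLLM): builds, but does not send,
# a multipart POST for OpenAI's images/edits endpoint.
def build_image_edit_request(prompt:, image_io:, filename:, api_key:)
  uri = URI("https://api.openai.com/v1/images/edits")
  request = Net::HTTP::Post.new(uri)
  request["Authorization"] = "Bearer #{api_key}"
  # images/edits expects multipart/form-data rather than a JSON body.
  request.set_form(
    [
      ["model", "gpt-image-1"],
      ["prompt", prompt],
      ["image", image_io, { filename: filename, content_type: "image/png" }]
    ],
    "multipart/form-data"
  )
  request
end

# Building the request needs no network access; any IO-like object works here.
request = build_image_edit_request(
  prompt: "transform this image into ghibli style",
  image_io: StringIO.new("fake image bytes"),
  filename: "me.png",
  api_key: ENV.fetch("OPENAI_API_KEY", "sk-placeholder")
)
puts request.path
puts request["Content-Type"]
```

Sending it would then be a matter of passing the request to `Net::HTTP.start("api.openai.com", 443, use_ssl: true) { |http| http.request(request) }`, with a real file handle in place of the `StringIO`.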
Or update RubyLLM#paint to accept an image input:
RubyLLM.paint "transform this image into ghibli style", with: { image: some_image }
WDYT?
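If paint were extended this way, the endpoint could be chosen from the presence of an image. The following is only a sketch of that routing idea; `paint_request` is a hypothetical helper and does not reflect how RubyLLM is actually implemented.

```ruby
# Hypothetical routing helper (not RubyLLM's actual implementation):
# returns the endpoint and payload a paint call would need.
def paint_request(prompt, model: "gpt-image-1", with: nil)
  image = with && with[:image]
  if image
    # An input image was supplied, so the call has to go to images/edits.
    { endpoint: "images/edits", payload: { model: model, prompt: prompt, image: image } }
  else
    # No image: keep using images/generations, as paint does today.
    { endpoint: "images/generations", payload: { model: model, prompt: prompt } }
  end
end

p paint_request("a sunset over mountains")
p paint_request("transform this image into ghibli style", with: { image: "me.png" })
```

The appeal of this option is that existing callers of paint would be unaffected, since the `with:` argument is optional.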
Lastly, I am wondering if this would be possible (instead of just #paint):
chat = RubyLLM.chat
chat.draw "transform me into ghibli style", with: { image: "me.png" }
# Follow-up request that reuses the image from the previous turn
chat.draw "can you turn the landscape into blue pastel colors?"
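One way to picture the conversational variant is an object that remembers the image it is working on, so follow-up prompts do not need to pass it again. The class below is purely illustrative: `DrawingSession` and its `draw` method are hypothetical names, and the method returns a description of the request rather than calling any API.

```ruby
# Hypothetical class for illustration only; it is not an existing RubyLLM API.
class DrawingSession
  attr_reader :current_image

  def initialize
    @current_image = nil
  end

  # Describes the request a draw call would make, remembering the image
  # so later prompts can refer to it implicitly.
  def draw(prompt, with: nil)
    @current_image = with[:image] if with && with[:image]
    endpoint = @current_image ? "images/edits" : "images/generations"
    { endpoint: endpoint, prompt: prompt, image: @current_image }
  end
end

session = DrawingSession.new
p session.draw("transform me into ghibli style", with: { image: "me.png" })
p session.draw("can you turn the landscape into blue pastel colors?")
```

A real implementation would probably store the generated result after each turn instead of the original input, so that each follow-up edits the latest image.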