Support larger memory instance types for RepackModelStep #4098

@michael-lum

Describe the feature you'd like
Please either provide a default instance type with more memory or allow the instance type of the RepackModelStep to be configured. This step is currently added automatically for the CreateModel step when custom code is provided, but its instance type cannot be configured.

How would this feature be used? Please describe.
For large language models, model sizes can be larger than 30MB. This feature would allow custom code to be provided for large model artifacts while still using the automated repackaging step.

Describe alternatives you've considered
The only other alternatives are to manually package the inference and loading code directly into the model zip or to implement the packaging logic ourselves.
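
For reference, the manual alternative could look roughly like the sketch below. It is not the SDK's repack implementation. It assumes the model artifact is a local gzipped tarball (for example `model.tar.gz`) and that the inference code belongs in a top-level `code/` directory inside the archive. The function name `repack_model` and all paths are hypothetical.

```python
import shutil
import tarfile
import tempfile
from pathlib import Path


def repack_model(model_tar, code_dir, output_tar):
    """Bundle the files from `code_dir` into a copy of `model_tar` under code/.

    Hypothetical helper: the code/ layout is an assumption about what the
    serving container expects, not something confirmed by this issue.
    Returns the sorted member names of the new archive.
    """
    with tempfile.TemporaryDirectory() as tmp:
        work = Path(tmp)

        # Unpack the original model artifact into a scratch directory.
        with tarfile.open(model_tar, "r:gz") as tar:
            tar.extractall(work)

        # Replace any existing code/ directory with the custom code.
        target = work / "code"
        if target.exists():
            shutil.rmtree(target)
        shutil.copytree(code_dir, target)

        # Write a new archive with the model files and code/ at the top level.
        with tarfile.open(output_tar, "w:gz") as tar:
            for item in sorted(work.iterdir()):
                tar.add(item, arcname=item.name)

    with tarfile.open(output_tar, "r:gz") as tar:
        return sorted(tar.getnames())
```

The unpack and repack both go through local disk, so this needs scratch space for at least the extracted model plus the new archive. The repacked file would still have to be uploaded to S3 separately before being referenced from the CreateModel step.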

Additional context
Code Link
