Reduce size of runtime-adapter image (exclude Python/tensorflow to convert keras models)

The current image weight is very high (2.14Gb) which slows down the predictor's uptime.

Correct me if I'm wrong please, but the only reason the adapter needs to install tensorflow is to convert keras models to tensorflow models, which sounds weird to do it on runtime and not in advance, see 

https://github.com/kserve/modelmesh-runtime-adapter/blob/f9781d287d31ec40c7c3eb77d5ac12eb68622aaa/model-mesh-triton-adapter/server/utils.go#L63-L64

https://github.com/kserve/modelmesh-runtime-adapter/blob/f9781d287d31ec40c7c3eb77d5ac12eb68622aaa/Dockerfile#L145
https://github.com/kserve/modelmesh-runtime-adapter/blob/f9781d287d31ec40c7c3eb77d5ac12eb68622aaa/Dockerfile#L164
https://github.com/kserve/modelmesh-runtime-adapter/blob/f9781d287d31ec40c7c3eb77d5ac12eb68622aaa/Dockerfile#L172

If we remove this option, we can remove the tensorflow installation, and since python is needed only for that, removing the entire python installation.
This reduces the image size from 2.14 GB to 256Mb.

Can we just remove it? If not, can we have two images, the original one and a new slim one?



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reduce size of runtime-adapter image (exclude Python/tensorflow to convert keras models) #59

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	func convertKerasToTF(kerasFile string, targetPath string, ctx context.Context, loggr logr.Logger) error {
	cmd := exec.Command("python", "/opt/scripts/tf_pb.py", kerasFile, targetPath)

Reduce size of runtime-adapter image (exclude Python/tensorflow to convert keras models) #59

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions