coremltools Error: ValueError: perm should have the same length as rank(x): 3 != 2

Charlie Fish · 2 years ago

coremltools Error: ValueError: perm should have the same length as rank(x): 3 != 2

jack@lemmy.nz · 2 years ago

This is a great minimal example! You’re on the right track regarding the input; it’s just that Core ML expects the input shape to be fully defined, meaning it must be a 2D tensor of shape (batch_size, sequence_length).

If you change the conversion inputs line to be

inputs=[ct.TensorType(shape=(32, max_len), name="embedding_input", dtype=np.int32)],

instead of a 1-dimensional tensor you should be fine.

Also, you may need to use .mlpackage instead of .mlmodel for the file extension.
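
For context, here is a rough sketch of how the whole conversion call might look with that change. The input name "embedding_input" and the batch size of 32 come from this thread; keras_model, max_len, out_path, and both function names are placeholders, and convert_to="mlprogram" is my assumption about why the .mlpackage extension would be needed.

```python
def fixed_input_shape(max_len, batch_size=32):
    """Return the fully defined 2D shape (batch_size, sequence_length)."""
    if batch_size < 1 or max_len < 1:
        raise ValueError("batch_size and max_len must be positive")
    return (batch_size, max_len)


def convert_to_coreml(keras_model, max_len, batch_size=32,
                      out_path="model.mlpackage"):
    """Convert a trained Keras model (placeholder argument) to Core ML."""
    # Imported lazily so the shape helper above works without these packages.
    import numpy as np
    import coremltools as ct

    mlmodel = ct.convert(
        keras_model,
        source="tensorflow",
        inputs=[
            ct.TensorType(
                shape=fixed_input_shape(max_len, batch_size),
                name="embedding_input",  # input layer name used in this thread
                dtype=np.int32,
            )
        ],
        # Assumption: target the ML Program format, which is saved
        # as an .mlpackage rather than an .mlmodel file.
        convert_to="mlprogram",
    )
    mlmodel.save(out_path)
    return mlmodel
```

Calling fixed_input_shape(100) gives (32, 100), which is the shape the converter then sees for the input.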

Charlie Fish · 2 years ago

Interesting! I’ll try this tonight and see how it goes. Really appreciate your reply tho. I’ll let you know the outcome.

Charlie Fish · 2 years ago

This worked!!! However, it now looks like I have to pass in 32 (batch size) comments in order to run a prediction in Core ML? Kinda strange when I could pass a single string to TensorFlow to run a prediction.
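
One possible workaround for the fixed batch of 32, as a minimal sketch: put the single tokenized comment in the first row, fill the remaining rows with padding, and read only the first row of the output. The helper name pad_to_batch, the pad id of 0, and padding at the end of the sequence are all assumptions here; the padding side in particular has to match whatever was used during training (Keras pad_sequences pads at the front by default). Another option would be to reconvert with a batch dimension of 1, i.e. shape=(1, max_len), which is still a fully defined shape.

```python
def pad_to_batch(token_ids, max_len, batch_size=32, pad_id=0):
    """Build a (batch_size, max_len) nested list from one tokenized comment.

    Row 0 holds the real comment, truncated or padded to max_len.
    The other rows are filler so the fixed batch shape is satisfied.
    """
    row = list(token_ids)[:max_len]
    # Assumption: pad at the end; switch sides if training padded at the front.
    row = row + [pad_id] * (max_len - len(row))
    filler = [[pad_id] * max_len for _ in range(batch_size - 1)]
    return [row] + filler


# Small shapes so the result is easy to inspect by eye.
batch = pad_to_batch([12, 7, 431], max_len=5, batch_size=4)
first_row = batch[0]  # the only row whose prediction matters
```

Before handing this to the model it would need to become an int32 array (for example with np.asarray(batch, dtype=np.int32)), and only the first row of the prediction would be meaningful.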

Also it seems to be much slower than my Create ML model I was playing with. Went from 0.05 ms on average for the Create ML model to 0.47 ms on average for this TensorFlow model. Looks like this TensorFlow model also is running 100% on the CPU (not taking advantage of GPU or Neural Engine).
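
To make timing comparisons like this repeatable, a small harness along these lines might help. It is only a sketch: average_latency_ms and its parameters are names I made up, and predict_fn stands in for whatever callable wraps the model's prediction. The items_per_call parameter is there because one call on a batch of 32 scores 32 comments, so per-call and per-comment latency are different numbers.

```python
import time


def average_latency_ms(predict_fn, example, runs=100, warmup=10,
                       items_per_call=1):
    """Average latency in milliseconds per item over `runs` timed calls."""
    # Warm-up calls are not timed, so one-time setup cost is excluded.
    for _ in range(warmup):
        predict_fn(example)
    start = time.perf_counter()
    for _ in range(runs):
        predict_fn(example)
    elapsed = time.perf_counter() - start
    return elapsed * 1000.0 / (runs * items_per_call)
```

For a model with a fixed batch of 32, passing items_per_call=32 gives a per-comment figure that is more directly comparable to a model that scores one string per call. On the CPU-only observation: coremltools lets you request compute units when loading a model, e.g. ct.models.MLModel(path, compute_units=ct.ComputeUnit.ALL), though whether the GPU or Neural Engine actually gets used is still decided by Core ML.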

Obviously there are some major advantages to using TensorFlow (i.e., I can run in a server environment, I can better control stopping training early based on that val_accuracy metric, etc.). But Create ML seems to really win in other areas, like being able to pass in a simple string (and not having to worry about tokenization), not having to pass in 32 strings in a single prediction, and the performance.

Maybe I should lower my batch_size? I’ve heard there are pros and cons to lowering & increasing batch_size. Haven’t played around with it too much yet.

Am I just missing something in this analysis?

I really appreciate your help and advice!