Google Cloud - Compute Engine VS Machine Learning

Does anyone know the difference between using Google Cloud Machine Learning compared to a virtual machine instance on Google Compute Engine?

I am using Keras with Python 3 and feel like Cloud ML is more restrictive (Python 2.7 only, an older version of TensorFlow, a required project structure...). I assume there are benefits to using Cloud ML over a VM in GCE, but I would like to know what they are.

Discriminate answered 1/6, 2017 at 14:49 Comment(8)
I run TF on simple Ubuntu VMs in Compute Engine, and there you have a lot of flexibility in what libraries to use, etc. From what I understand, in Cloud ML a lot of stuff is done for you behind the scenes, so it's more convenient but you have less flexibility. I thought one big thing regarding Cloud ML is that they actually use TPUs? I haven't seen TPUs available in Compute Engine, so there it's just regular CPUs and now GPUs (although I still haven't managed to get one working!). Also, in terms of pricing, with VMs you just pay for usage time, but with Cloud ML it's a bit trickierPritchett
It seems that for my needs (training faster and not tying up my personal computer), there is no real benefit to using Cloud ML. Regarding TPUs: they are not available now, but they will come to Compute Engine as well; you can connect to Cloud TPUs from custom VM types. I guess my only remaining question now is whether I could/should use the hyperparameter optimisation tool (from Cloud ML) or another tool in the VM (e.g. HyperOpt).Discriminate
For hyperparameter optimisation, use VM tools rather than Cloud MLReinert
@VikasGupta That is indeed what I am planning to do after all (mainly for simplicity). I am still curious why you would use a VM tool instead of the Cloud ML one?Discriminate
I have used VMs, Google Cloud ML and Azure ML. In terms of ready-made features, the managed ML platforms are good, but for flexibility they are not.Reinert
Thanks for the link re TPUs. I've been waiting for that; need to check this out...Pritchett
For me personally, I always need to integrate TF with other, non-TF stuff, so I find it hard to imagine running things in Cloud ML: you have to think about how to connect what you've done in TF with the rest of your logic/tasks. In this sense, using VMs is much easier. Regarding hyperparameter tuning, TensorBoard is very helpful, but of course that's basically a manual process. I normally run a handful of models simultaneously on one VM using tmux and compare how accuracy/cost/etc. progress for all of them in TensorBoard.Pritchett
@VikasGupta Do you know of any such tool, or prior work done with TensorFlow? The hyperparameter tuning in Cloud ML is rather convenient, so I am curious whether there is anything similar. I am looking for something I can run on my local computer first, to get a sense of the boundaries to set in the 'proper' tuning, and maybe even of which parameters are worth tuning (or which combinations, for that matter).Townley
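
The kind of search Cloud ML's tuner (or a library like HyperOpt) performs can be approximated locally with plain random search; the sketch below uses a hypothetical stand-in objective in place of a real training run, so the names and the objective itself are illustrative only:

```python
import random

def train_and_evaluate(learning_rate, hidden_units):
    # Stand-in for a real training run; returns a loss to minimise.
    # (Hypothetical objective whose optimum is lr=0.01, 64 units.)
    return (learning_rate - 0.01) ** 2 + (hidden_units - 64) ** 2 / 1e4

def random_search(trials=20, seed=0):
    rng = random.Random(seed)
    best_loss, best_params = float("inf"), None
    for _ in range(trials):
        params = {
            "learning_rate": 10 ** rng.uniform(-4, -1),   # log-uniform sample
            "hidden_units": rng.choice([16, 32, 64, 128]),
        }
        loss = train_and_evaluate(**params)
        if loss < best_loss:
            best_loss, best_params = loss, params
    return best_loss, best_params

best_loss, best_params = random_search()
print(best_loss, best_params)
```

Running a loop like this locally with a cheap proxy objective is one way to narrow down the parameter ranges before handing the expensive search to a proper tuner.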

Google Cloud ML is a fully managed service whereas Google Compute Engine is not (the latter is IaaS).

Assuming that you just want to know some differences for the case when you have your own model, here you have some:

  • The most noticeable feature of Google Cloud ML is the deployment itself. You don't have to take care of setting up your cluster (that is, scaling), launching it, installing the packages and deploying your model for training. This is all done automatically; in Compute Engine you would have to do it yourself, although you would be unrestricted in what you can install.

    Although you can automate all of that deployment more or less, there is no magic to it. In fact, the logs of a Cloud ML training job show it is quite rudimentary: a cluster of instances is launched, TensorFlow is installed on it, and your model is run with the options you set. This is because TensorFlow is a framework decoupled from Google's internal systems.

  • However, there is a substantial difference between Cloud ML and Compute Engine when it comes to prediction, and I would say that is mostly what you pay for with Cloud ML. You can deploy a model in Cloud ML for online and batch prediction pretty much out of the box. In Compute Engine, you would have to handle all the quirks of TensorFlow Serving yourself, which is not trivial (compared to training your model).

  • Another advantage of Cloud ML is hyperparameter tuning. It is no more than a somewhat smart brute-forcing tool for finding the best combination of hyperparameters for your model. You could probably automate this in Compute Engine, but then you would have to work out the optimisation algorithm yourself to find the combinations of parameter values that improve the objective function (usually maximising your accuracy or reducing your loss).

  • Finally, pricing is slightly different in either service. Until recently, Cloud ML pricing was on a par with its competitors (you paid for computing time in both training and prediction, plus a per-prediction fee, which you could compare against computing time in Compute Engine). Now, however, you only pay for computing time (and it is even cheaper than before), which probably makes managing and scaling your own TensorFlow cluster in Compute Engine pointless in most scenarios.
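
For reference, the managed workflow described in the first two points can be sketched with the gcloud CLI of that era; all job, model, bucket and file names below are hypothetical placeholders, and the exact flags may vary between SDK versions:

```shell
# Submit a training job: Cloud ML provisions the cluster, installs
# TensorFlow and runs trainer.task for you (names are placeholders).
gcloud ml-engine jobs submit training my_training_job \
    --module-name trainer.task \
    --package-path ./trainer \
    --region us-central1 \
    --staging-bucket gs://my-bucket

# Deploy the exported SavedModel for serving.
gcloud ml-engine models create my_model --regions us-central1
gcloud ml-engine versions create v1 \
    --model my_model \
    --origin gs://my-bucket/export/

# Request an online prediction; on Compute Engine you would have to
# run TensorFlow Serving yourself to get an equivalent endpoint.
gcloud ml-engine predict --model my_model --json-instances instances.json
```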

Townley answered 25/12, 2017 at 14:41 Comment(0)
