As a product VP at Google Cloud, Michael Gerstenhaber works entirely on Vertex AI, the company’s unified platform for deploying enterprise AI. It gives him a high-level view of how companies are actually using AI models, and what still needs to be done to unleash the potential of agentic AI.
When I spoke with Gerstenhaber, I was particularly struck by one idea I hadn’t heard before. As he put it, AI models are pushing against three frontiers at once: raw intelligence, response time, and a third quality that has less to do with raw capability than with cost, namely whether a model can be deployed cheaply enough to run at massive, unpredictable scale. It’s a new way of thinking about model capabilities, and a particularly valuable one for anyone trying to push frontier models in a new direction.
This interview has been edited for length and clarity.
Why don’t you start by walking us through your experience in AI so far, and what you do at Google.
I’ve been in AI for about two years now. I was at Anthropic for a year and a half, and I’ve been at Google almost half a year now. I run Vertex AI, Google’s developer platform. Most of our customers are engineers building their own applications. They want access to agentic patterns. They want access to an agentic platform. They want access to the inference of the smartest models in the world. I provide them that, but I don’t provide the applications themselves. That’s for Shopify, Thomson Reuters, and our various customers to provide in their own domains.
What drew you to Google?
Google is, I think, unique in the world in that we have everything from the interface to the infrastructure layer. We can build data centers. We can buy electricity and build power plants. We have our own chips. We have our own model. We have the inference layer that we control. We have the agentic layer we control. We have APIs for memory, for interleaved code writing. We have an agent engine on top of that that ensures compliance and governance. And then we also have the chat interface with Gemini Enterprise and Gemini chat for customers, right? So part of the reason I came here is because I saw Google as uniquely vertically integrated, and that being a strength for us.
It’s odd because, even with all the differences between companies, it seems like all three of the big labs are really close in capabilities. Is it just a race for more intelligence, or is it more complicated than that?
I see three boundaries. Models like Gemini Pro are tuned for raw intelligence. Think about writing code. You just want the best code you can get, doesn’t matter if it takes 45 minutes, because I have to maintain it, I have to put it in production. I just want the best.
Then there’s this other boundary with latency. If I’m doing customer support and I need to know how to apply a policy, you need intelligence to apply that policy. Are you allowed to transact a return? Can I upgrade my seat on an airplane? But it doesn’t matter how right you are if it took 45 minutes to get the answer. So for those cases, you want the most intelligent product within that latency budget, because more intelligence no longer matters once that person gets bored and hangs up the phone.
And then there’s this last bucket, where somebody like Reddit or Meta wants to moderate the entire internet. They have big budgets, but they can’t take an enterprise risk on something if they don’t know how it scales. They don’t know how many toxic posts there will be today or tomorrow. So they have to restrict their budget to a model at the highest intelligence they can afford, but in a scalable way to an infinite number of subjects. And for that, cost becomes very, very important.
One of the things I’ve been puzzling about is why agentic systems are taking so long to catch on. It feels like the models are there and I’ve seen incredible demos, but we’re not seeing the kind of major changes I would have expected a year ago. What do you think is holding it back?
This technology is basically two years old, and there’s still a lot of missing infrastructure. We don’t have patterns for auditing what the agents are doing. We don’t have patterns for authorization of data to an agent. There are these patterns that are going to require work to put into production. And production is always a trailing indicator of what the technology is capable of. So two years isn’t long enough to see what the intelligence supports in production, and that’s where people are struggling.
I think it’s moved uniquely quickly in software engineering because it fits well in the software development lifecycle. We have a dev environment in which it’s safe to break things, and then we promote from the dev environment to the test environment. The process of writing code at Google requires two people to audit that code and both affirm that it’s good enough to put Google’s brand behind and give to our customers. So we have a lot of these human-in-the-loop processes that make the implementation exceptionally low-risk. But we need to produce these patterns elsewhere and for other professions.

