Google Releases AI AGENT BUILDER! 🤖 Worth The Wait?

Matthew Berman
12 Apr 202434:20

Summary

TLDRGoogle Cloud's 2024 keynote introduced Vertex AI, an enterprise AI platform with a model garden offering over 130 models, including Gemini 1.5 Pro with a 1 million token context window. The platform enhances capabilities for processing vast information and audio, enabling innovative applications. Google also launched Vertex AI Agent Builder for creating customer service agents and showcased its integration with Google Workspace for employee productivity. Additionally, Google Vids, an AI-powered video creation app, and advancements in code assistance with Gemini's large context window were highlighted, demonstrating Google's commitment to AI-enhanced workplace tools.

Takeaways

  • 🚀 Google has launched an agent platform at Google Cloud Next 2024, focusing on AI advancements.
  • 🌟 The Vertex AI Model Garden offers access to over 130 models, including Gemini, Claude from Anthropic, and popular open models like LLaMA and Gemma.
  • 🔍 Google's Vertex AI allows users to choose the best AI model for their specific use case, budget, and performance needs, with the ability to switch between models seamlessly.
  • 📢 Gemini 1.5 Pro is now in public preview, offering a massive 1 million token context window, which is particularly impressive for processing large amounts of information.
  • 🎥 The platform can handle multimodal reasoning, such as analyzing text and video to identify specific items or information, as demonstrated in a shopping assistance demo.
  • 🔧 Google introduced Code Gemma, a fine-tuned, lightweight open model designed for coding, leveraging the same technology used to create Gemini.
  • 🛠️ Vertex AI Agent Builder is a tool that allows users to create powerful customer agents with human-like conversations and natural language instructions to control conversation flow.
  • 🔗 The agent platform can integrate enterprise data from operational databases and applications, enhancing its capabilities to perform tasks and provide customer service.
  • 👥 Google showcased customer agents in various industries, such as automotive, travel, and retail, emphasizing the platform's versatility in improving customer experiences.
  • 📝 The script highlights the potential of AI in the workplace, with agents capable of performing tasks, understanding complex data, and assisting with processes like benefits enrollment.
  • 🎬 Google announced 'Google Vids', an AI-powered video creation app for work, which uses Gemini to assist with video writing, production, and editing based on user prompts and context.

Q & A

  • What is the main focus of the Google Cloud Next 2024 keynote speech discussed in the script?

    -The main focus is the announcement of Google's new agent platform and the features of Vertex AI, including its model garden with over 130 models and the public preview of Gemini 1.5 Pro.

  • What are some of the models available in Google's Vertex AI model garden?

    -Some of the models include Gemini, Claude from Anthropic, Llama, Gemma, and MRR, as well as open models like Stable Diffusion and others.

  • What is the significance of the 1 million token context window in Gemini 1.5 Pro?

    -The 1 million token context window allows for processing vast amounts of information in a single stream, enabling tasks such as analyzing an hour-long video or over 30,000 lines of code in a single context window.

  • How does Google's Vertex AI platform differ from other AI platforms in terms of model offerings?

    -Vertex AI is unique in that it offers a combination of first-party, third-party, and open-source models, and provides capabilities to fine-tune, augment, manage, and monitor these models.

  • What is the role of the Vertex AI Agent Builder in creating customer agents?

    -The Vertex AI Agent Builder allows users to create powerful customer agents through three key steps: creating humanlike conversations with various inputs, controlling conversation flow with natural language instructions, and improving response quality with search and extensions to complete tasks for customers.

  • How does Google's new product, Google Vids, integrate with other Google Workspace apps?

    -Google Vids integrates with other Google Workspace apps by allowing users to use existing documents for context, suggesting a narrative outline, and creating fully animated scenes with stock media and music based on the input prompt.

  • What is the potential use case for Gemini 1.5 Pro's large context window in coding?

    -The large context window can be used to understand and reason through entire codebases, making it easier for developers to make complex changes, such as updating services or adding new features, in a fraction of the time it would normally take.

  • How does the script describe the capabilities of the Vertex AI platform for enterprise use cases?

    -The script describes the Vertex AI platform as capable of accessing and processing vast amounts of information, supporting cross-modality analysis, and integrating with enterprise data from operational databases and applications, making it suitable for complex enterprise use cases.

  • What is the role of the 'Gemini Code Assist' feature in the script?

    -Gemini Code Assist is a feature that leverages the large context window to assist developers in making code changes efficiently. It can reason through entire codebases and provide clear recommendations for edits that align with company security and compliance requirements.

  • What is the speaker's opinion on the current state of the Vertex AI agent framework?

    -The speaker expresses a desire for more sophistication in the Vertex AI agent framework, comparing it to existing products like Open AI's custom GPTs and suggesting that Google could be more future-thinking in its capabilities.

  • How does the script highlight the integration of AI with customer service and sales?

    -The script highlights the integration of AI with customer service and sales by discussing the use of customer agents that can listen, understand needs, and recommend products and services across various channels, as well as the potential for AI to enhance self-service and improve answer quality.

  • What is the speaker's view on the potential of AI agents in the workplace?

    -The speaker views AI agents in the workplace as having tremendous potential, especially with the ability to perform tasks and accomplish things, essentially acting as AI employees. They are excited about the integration of these agents with tools like Google Docs and Google Chat.

  • What is the speaker's reaction to the announcement of Google's partnership with HubSpot?

    -The speaker finds it interesting that Google is mentioning HubSpot, especially considering the rumors about Google potentially acquiring HubSpot, and they highlight the cool feature of being able to feed HubSpot CRM data into the agent.

  • What is the speaker's perspective on the use of AI in coding assistance?

    -The speaker is excited about the prospect of AI in coding assistance, particularly with Gemini's massive context window, and they express a desire to try out and make a video about the feature.

Outlines

00:00

🚀 Google Cloud's Vertex AI Platform Launch

Google Cloud unveiled its Vertex AI platform during the 2024 keynote, featuring an extensive model garden with over 130 models, including proprietary and open-source options like Claude from Anthropic and popular models such as Llama and Gemma. The platform allows users to switch between models based on use case, budget, and performance needs. Gemini 1.5 Pro, with its 1 million token context window, was highlighted for its ability to process vast amounts of information, such as hour-long videos and large codebases, offering new possibilities for AI applications.

05:02

🛠️ Introduction of Vertex AI Agent Builder and Code Gemma

Google introduced the Vertex AI Agent Builder, a tool designed to create powerful customer agents through three key steps: utilizing Gemini Pro for conversational AI, controlling conversation flow with natural language instructions, and enhancing response quality with search capabilities and task extensions. Code Gemma, a fine-tuned model for coding, was also announced, showcasing Google Cloud's commitment to providing a comprehensive AI solution for various tasks, including coding assistance.

10:02

🤖 Exploring Customer and Enterprise Agents in Vertex AI

The script discusses the types of agents being built on Google Cloud using generative AI, focusing on customer service agents that operate across various channels. Mercedes-Benz's partnership with Google Cloud for personalized customer experiences in their vehicles was highlighted, along with other brands like ADT, Verizon, and Target, which are creating agents for sales and service. The potential for more sophisticated agents beyond customer service bots was noted, expressing a desire for more innovative applications of AI.

15:04

🛒 Customer Agent Demonstration and Vertex AI Agent Builder Overview

A demonstration of a customer agent leveraging Gemini and Vector search to assist with shopping inquiries was provided. The agent's ability to analyze video and text for search queries was showcased. The Vertex AI Agent Builder was criticized for its lack of sophistication compared to platforms like Autogen or Crew AI, and the presenter expressed confusion over the tool's interface and capabilities, particularly regarding the integration and coding aspects.

20:05

🏢 Agents in the Workplace and Google Workspace Integration

The script shifts focus to agents in the workplace, emphasizing their ability to perform tasks and integrate with company and web data. Google's Workspace integration was highlighted, with the introduction of an employee agent that can handle tasks like benefits enrollment. The agent's multimodal capabilities and integration with tools like Google Chat and Google Drive were demonstrated, showcasing the potential for streamlined workflow and productivity.

25:07

📹 Google Vids: The New Video Creation App for Work

Google announced Vids, a new addition to the Google Workspace suite, designed to simplify video creation for work with AI assistance. Vids leverages Gemini to help users create videos by providing narrative outlines, customizable styles, and automated scene generation with stock media and music. The app's ability to integrate with existing documents and provide a streamlined video production process was emphasized.

30:09

🔧 Gemini Code Assist: AI-Powered Coding Assistance

The script concludes with a demonstration of Gemini Code Assist, an AI tool that helps developers make codebase changes efficiently. The tool's ability to understand and reason through entire codebases, provide clear recommendations, and ensure compliance with security requirements was showcased. The tool's integration with development environments like Visual Studio Code was also highlighted, demonstrating its potential to significantly reduce development time and effort.

Mindmap

Keywords

💡Google Cloud Next 2024

Google Cloud Next is an annual conference hosted by Google focused on its cloud computing services and products. In the context of the video, it serves as the event where Google announces new features and developments for its AI platform, Vertex AI. The script mentions 'Google Cloud Next 2024 keynote speech' as the source of the announcements being discussed.

💡Vertex AI

Vertex AI is Google's enterprise AI platform that enables businesses to build, deploy, and scale AI and machine learning models. The video's theme revolves around the updates and new features of Vertex AI, such as the model garden and the agent builder, which are discussed extensively throughout the script.

💡Model Garden

The Model Garden is a feature within Vertex AI that provides access to a variety of AI models, including both open-source and closed-source options. It is highlighted in the script as a key aspect of Vertex AI, allowing users to choose from over 130 models for different AI tasks.

💡Gemini 1.5 Pro

Gemini 1.5 Pro is a model within Vertex AI's Model Garden that offers a large context window, supporting up to 1 million tokens. The script emphasizes its capabilities, such as processing vast amounts of information and its potential use cases, like analyzing long videos or large codebases.

💡Code Gemma

Code Gemma is an open-source model designed for coding tasks, built using the same technology as Gemini. The script mentions it as a new addition to Vertex AI, indicating Google's commitment to providing tools for developers within their AI platform.

💡AI Agents

AI Agents in the context of the video refer to automated systems that can perform tasks, understand needs, and provide recommendations. The script discusses the potential of AI agents in customer service and enterprise applications, showcasing their integration into various platforms and services.

💡Customer Service Agents

Customer service agents, as discussed in the script, are AI-powered systems designed to interact with customers, understand their needs, and provide assistance across various channels. Examples from the script include Mercedes-Benz using AI for personalized customer experiences and other companies using AI for sales and service.

💡Vertex AI Agent Builder

The Vertex AI Agent Builder is a tool within Vertex AI that allows users to create customer agents with natural language processing capabilities. The script provides a demonstration of its functionality, showing how it can be used to build conversational AI for various applications.

💡Multimodal Reasoning

Multimodal reasoning refers to the ability of an AI model to understand and process information from multiple types of data, such as text, audio, and video. The script highlights this capability in the context of Gemini's advanced features, allowing it to analyze and generate responses based on various inputs.

💡Google Vids

Google Vids is a new app introduced in the script as part of Google's workspace suite. It is an AI-powered video creation tool that uses Gemini to assist in writing, production, and editing of videos for work. The script showcases its ability to generate video content based on prompts and existing documents.

💡Gemini Code Assist

Gemini Code Assist is a feature that leverages the large context window of Gemini 1.5 Pro to assist developers with coding tasks. The script demonstrates its ability to understand and recommend code changes based on business requirements and design mockups, showcasing its potential to streamline development processes.

Highlights

Google Cloud announces the launch of an agent platform with Vertex AI at Google Cloud Next 2024.

Vertex AI's Model Garden offers access to over 130 models, including Gemini, Claude, and popular open models like llama, Gemma, and mrr.

Google's Model Garden is organized by modality and task, allowing users to easily find the right model for their needs.

Gemini 1.5 Pro is taken into public preview, boasting a 1 million token context window for processing vast amounts of information.

Google is working on even larger context windows of up to 10 million tokens, expanding potential use cases significantly.

Gemini 1.5 Pro's large context window enables processing of long videos, extensive audio, and large codebases.

Google announces Code Gemma, a fine-tuned lightweight open model designed for coding, leveraging the same technology as Gemini.

Vertex AI is the only AI platform to provide a single platform for model tooling and infrastructure.

Google Cloud's customer agents are designed to work seamlessly across all channels, including web, mobile, point of sale, and call centers.

Mercedes-Benz is partnering with Google Cloud to enhance the digital experience in their cars, using AI for personalized and intuitive experiences.

Google introduces Vertex AI Agent Builder, allowing users to create powerful customer agents in three simple steps.

Vertex AI Agent Builder uses Gemini Pro for human-like conversations and natural language instructions to control conversation flow.

Google's employee agents can perform tasks, understand multimodal inputs, and connect to enterprise data for workplace assistance.

Google Workspace is integrating AI to enhance productivity, with the introduction of Google Vids, an AI-powered video creation app for work.

Gemini Code Assist is showcased, demonstrating the ability to understand and suggest code changes based on business requirements and design mockups.

Google's AI advancements aim to make developers more productive, allowing complex tasks to be accomplished in minutes instead of weeks.

The integration of AI across Google's products, such as Workspace, Docs, and now Vids, demonstrates a cohesive approach to enhancing work productivity.

Transcripts

play00:00

all right so Google finally launched an

play00:02

agent platform and we're going to take a

play00:05

look at the announcement right now so

play00:07

this is from Google Cloud next 2024

play00:09

keynote speech I did a super cut of it

play00:11

but now I want to talk more specifically

play00:13

about it and we're going to watch it

play00:14

together and in a video that I have

play00:16

planned I'm going to show you how to use

play00:18

the vertex AI agent Builder yourself and

play00:20

I've been playing around with it and

play00:22

it's pretty cool all right so let's

play00:24

start with the keynote now let's dive

play00:26

into vertex AI our fast growing

play00:29

Enterprise AI platform in our vertex AI

play00:32

model Garden you can access over 130

play00:36

models including the latest versions of

play00:39

Gemini par models like Claude from

play00:42

anthropic and popular open models

play00:44

including llama Gemma and mrr all right

play00:47

so first uh the model Garden which seems

play00:50

pretty cool they have a bunch of

play00:52

different models that you can use both

play00:53

open source and closed source and in

play00:55

fact let me just show it to you all

play00:57

right so here it is here's the model

play00:58

garden and we can see it has Gemini

play01:01

imagine Gemma chirp here's Gemini 1.5

play01:04

Pro so it is really cool that they have

play01:06

all these models in the same place

play01:08

here's stable diffusion Laura they have

play01:10

it filtered by modality so language

play01:12

Vision tabular document they also have

play01:14

it filtered by task so generation

play01:16

classification Etc so if we click into

play01:18

Gemini 1.5 Pro we can see all the

play01:21

information about it use cases

play01:23

documentation this feels like hugging

play01:26

face and it's interesting because

play01:28

hugging face actually showed up at the

play01:30

Google Cloud next keynote but I guess

play01:32

they don't see this as competitive and

play01:34

you can open in vertex AI studio and so

play01:37

that's where you can start playing

play01:38

around with it and here is all the

play01:40

models that they have as they said so

play01:42

here's llama 2 Claude 3 stable diffusion

play01:46

mixol 8 x 7B wizard coder I mean they

play01:49

really have a ton of the top models so

play01:52

really really cool all right let's keep

play01:54

watching you choose the best model for

play01:56

your use case budget and performance

play01:58

needs and switch between models as you

play02:01

need to get today we're taking Gemini

play02:05

1.5 Pro into public preview all right so

play02:09

this is pretty cool Gemini 1.5 Pro in

play02:11

public preview I've had access to it for

play02:13

a while I've been playing around with it

play02:15

having a million token context window is

play02:17

absolutely insane being able to drop an

play02:19

hourlong video into a prompt and it

play02:21

answer questions about that video is

play02:24

kind of mind-blowing there's an example

play02:26

in there where you can load up a movie

play02:27

and ask it a question about what was on

play02:30

some like note that somebody took out of

play02:31

their pocket in a scene that maybe

play02:33

lasted just a couple dozen frames really

play02:36

really impressive stuff all right let's

play02:38

keep watching Gemini offers the world's

play02:41

largest context window would support for

play02:44

up to 1 million tokens with Gemini 1.5

play02:48

Pro customers can now process vast

play02:52

amounts of information in a single

play02:54

stream all right I want to pause for a

play02:56

second they're talking about 1 million

play02:58

tokens but it has already leaked that

play03:01

they have 10 million token context

play03:03

Windows internally that they're working

play03:04

on these massive context windows are

play03:06

going to open up brand new use cases and

play03:09

I'm super excited to see how well they

play03:11

work including 1 hour video 11 hours of

play03:15

audio code basis will well over 30,000

play03:20

lines of code I mean that is a monster

play03:23

use case being able to have 30,000 lines

play03:25

of code in a single context window is

play03:28

really really impressive now of course

play03:30

most mature code bases are well over

play03:32

30,000 lines of code so there's still

play03:34

going to be a need for mapping out code

play03:37

bases using rag Solutions like pine cone

play03:40

so we're still very far away from being

play03:42

able to put an entire codebase in a

play03:44

single prompt over

play03:47

700,000 words we're enhancing Gemini 1.5

play03:51

Pro with the ability to process audio

play03:54

enabling cross modality analysis for

play03:58

instance you can use it to search

play04:00

in audio and video content for example

play04:03

find a timestamp in a baseball game

play04:06

video where a commentator says it's out

play04:09

of here we've seen some amazing examples

play04:13

of what people can do with this large

play04:14

context window Sunda mentioned a few and

play04:17

others include a university Professor is

play04:21

using it to extract data from a 3,000

play04:25

page document with texts data tables and

play04:28

charts in just a a single shot yeah

play04:31

that's probably one of the coolest use

play04:32

cases just being able to load up huge

play04:34

PDFs huge documents and being able to

play04:37

summarize them easily extract

play04:39

information from them accurately I'm

play04:41

really excited about a million token

play04:43

context window and he also mentioned

play04:46

audio which is really cool I can load up

play04:48

an hour and a half long podcast and ask

play04:51

questions about it and it'll give me

play04:53

answers based on the context of that

play04:55

podcast so very very cool okay so I'm

play04:57

just going to skip ahead a little bit

play04:58

let's keep watching we're also

play05:00

announcing the availability of code

play05:01

Gemma a fine-tuned lightweight open

play05:05

model designed for coding from the same

play05:08

technology used to create Gemini all

play05:10

right so I've used Gemma and frankly it

play05:13

was very unimpressive but I know they

play05:15

just released a new version of Gemma so

play05:17

I definitely have it on my list to test

play05:19

out and look I am appreciative of any

play05:21

company that is releasing open source

play05:23

model so thank you to Google for

play05:25

releasing Gemma and now maybe I need to

play05:27

test code Gemma because now they have a

play05:29

finetuned version of Gemma specific for

play05:32

code let's keep watching with these

play05:33

additions Google Cloud continues to be

play05:37

the only cloud provider to offer widely

play05:39

used first party third parties and open-

play05:43

Source

play05:44

models vertex AI can be used to tune

play05:47

augment manage and monitor these Models

play05:50

All right so yeah I mean Google's really

play05:52

getting in the game now I'm impressed

play05:53

with all of these announcements their

play05:55

model builder allows you to fine-tune

play05:57

allows you to do a whole bunch of stuff

play05:58

with the models but what I really want

play06:00

to know about and what I really want to

play06:02

talk about today is their agent

play06:03

framework so I'm going to skip ahead and

play06:05

we're going to take a look at that Vex

play06:07

AI is the only AI platform to provide a

play06:10

single platform for model tooling and

play06:14

infrastructure now let's look at the

play06:16

types of Agents customers are building

play06:19

on Google Cloud using generative AI all

play06:22

right so now they're going to be talking

play06:24

about customer agents and when I hear

play06:26

agent I think about autogen I think

play06:29

about crew aai I think about agents that

play06:32

are coded given tools given

play06:34

personalities given backgrounds that can

play06:36

work together to accomplish and automate

play06:38

tasks I think when Google is talking

play06:40

about agents they're mostly talking

play06:43

about customer service agents this feels

play06:45

very similar to open ai's Assistance or

play06:47

their custom gpts product it doesn't

play06:50

feel like a fully featured agent

play06:52

framework to me at least not yet but

play06:55

let's take a look and see what they say

play06:57

and I'm also going to show you a little

play06:59

bit of the interface itself first

play07:02

customer agents you know similar to

play07:04

great sales and service people customer

play07:07

agents are able to listen

play07:10

carefully understand your needs

play07:12

recommend the right products and

play07:14

services they work seamlessly across all

play07:17

your channels the web your mobile app

play07:20

your point of sale and your call center

play07:23

and they can be integrated into product

play07:27

experiences with voice and video video

play07:31

mercedesbenz is working with us on

play07:33

customer agents to help people in their

play07:36

amazing cars let's hear from their CEO

play07:39

Ola Kines at Mercedes-Benz we want to

play07:43

offer our customers an exceptional

play07:45

digital experience that's why we're

play07:48

equipping our cars with high-end

play07:51

computers each car should only get

play07:53

better over time just like a good wine

play07:56

and with the power of Google cloud and

play07:58

AI we will make the user experience even

play08:01

more personalized our partnership across

play08:04

Google helps us build more intuitive and

play08:07

customized experiences last year we

play08:10

announced our partnership with Google

play08:12

Maps and today more than 3 million

play08:15

customers are using Google places in

play08:18

their Mercedes cars and we are applying

play08:20

Google Cloud AI across a number of other

play08:23

use cases ranging from a smart sales

play08:26

assistant improving customer service in

play08:28

our call centers

play08:30

and optimizing our marketing the sales

play08:32

assistant for example helps customers to

play08:35

seamlessly interact with Mercedes when

play08:38

booking a test drive or navigating

play08:40

through mercedes's offerings to find

play08:42

their next favorite vehicle and now

play08:45

we're exploring further opportunities to

play08:48

work with Google Cloud AI such as Next

play08:51

Level navigation features in addition

play08:54

we're partnering on one of the most

play08:56

exciting technology Topics in our

play08:58

industry automated driving this

play09:01

beautiful car right here is equipped

play09:03

with a level three system for

play09:05

conditionally automated driving we were

play09:07

the first manufacturer to get it

play09:09

certified in Germany California and

play09:12

Nevada for our next Generation internal

play09:14

development and test platform we will

play09:17

use Google Cloud as the backbone helping

play09:20

us to become even more efficient and

play09:23

flexible in our product development and

play09:25

Google Cloud's expert knowledge in

play09:28

processing massive amounts of data and

play09:30

scaling AI workloads will ensure that

play09:33

our cars get even more intelligent and

play09:36

AI driven partnering with the very best

play09:39

in their respective Fields is an

play09:41

important part of our software strategy

play09:44

and Google is the perfect example of

play09:47

that with Google Cloud Mercedes-Benz is

play09:50

building new ways to deliver the most

play09:53

intelligent vehicles to our customers

play09:56

and to create personalized intuitive

play09:58

experience

play09:59

we're really excited about working

play10:02

together thank you for having me okay

play10:05

this is the biggest missed opportunity

play10:07

I've ever seen why isn't there an agent

play10:09

built into to the infotainment system in

play10:12

the Mercedes that seems like the most

play10:14

obvious use case when you're driving you

play10:16

can't use your hands to text or type or

play10:18

search or do anything you could simply

play10:21

be talking to an agent to accomplish all

play10:23

of these different things for you I

play10:25

don't know why they wouldn't have done

play10:27

that I'm very surprised to see that they

play10:29

just skipped over that super obvious and

play10:32

super valuable use case we're inspired

play10:34

by the agents that customers are

play10:35

creating using a gen generative AI

play10:39

platform and all right so a lot of good

play10:41

brands on here ADT Verizon Target

play10:44

discover Best Buy Etc and they're all

play10:47

building agents but I think they're all

play10:49

basically just customer service Bots

play10:51

which is pretty disappointing that's the

play10:54

most easy obvious simple use case and I

play10:57

really think it speaks to how safe

play11:00

Google is playing it or maybe they're

play11:02

just thinking about it at the Enterprise

play11:03

level but there's really some

play11:05

cuttingedge stuff they could be doing

play11:08

which I wish they were our models

play11:10

InterContinental Hotels group will

play11:12

launch a travel planning capability to

play11:15

help each of you their guests plan their

play11:18

next vacation ADT is building an agent

play11:21

to help customers select and set up home

play11:25

security systems Verizon gives agents

play11:27

better recommendations so these all seem

play11:30

like customer facing Bots whether it's

play11:31

customer service or sales and that's

play11:34

fine that there's definitely a lot of

play11:36

money in those use cases but that's not

play11:38

as exciting to me magalo one of Brazil's

play11:41

largest retailers has put generative AI

play11:45

right at the heart of its customer

play11:47

service ing built a chatbot to enhance

play11:51

self-service and improve answer quality

play11:55

and Target uses AI on the Target app and

play11:58

website and by the way I just want to

play12:00

point out Google has had a product that

play12:03

does all of this for a very long time my

play12:06

previous company used it it was called

play12:08

dialogue flow and it still is a product

play12:11

within the Google cloud services Suite

play12:13

but it was very brittle it was very hard

play12:15

to set up so I understand why they're

play12:17

kind of relaunching these capabilities

play12:19

but still I'm a little disappointed that

play12:21

they're not more future thinking in

play12:23

their capabilities Minnesota's

play12:25

Department of Public Safety helps

play12:28

non-english speakers get licenses and

play12:30

other services with real-time

play12:33

translation Best Buy is building an

play12:36

assistant that will help troubleshoot

play12:39

product issues reschedule or combine

play12:42

order deliveries or manage

play12:45

software discover Financial Services is

play12:49

using search and synthesis across

play12:52

detailed policies and procedures during

play12:55

customer service calls and oranges fr

play12:59

French language agent is grounded in

play13:02

support knowledge transforming their

play13:04

help and contact site and their customer

play13:07

experience Oppo and OnePlus leaders in

play13:11

smart devices are incorporating our

play13:14

Gemini models and Google Cloud AI into

play13:17

their phone to deliver Innovative

play13:19

customer experiences including news

play13:22

audio recording summaries AI toolbox and

play13:25

much much more you know the opportunity

play13:28

for customer customer agents is

play13:30

tremendous to help each of you build

play13:32

customer agents faster we're introducing

play13:35

vertex AI agent Builder you can now

play13:39

create customer agents that are

play13:42

amazingly powerful in just three key

play13:45

steps all right so this is really what

play13:48

the agent Builder is it is not to the

play13:51

level of sophistication of an autogen or

play13:53

a crew AI it's really just a product

play13:56

that seems very similar to custom gpts

play13:59

from open AI first you can use Gemini

play14:02

Pro to create free flowing humanlike

play14:07

conversations with text voice images and

play14:11

video as inputs and personalize them

play14:15

with custom voice

play14:17

models second you can use natural

play14:20

language instructions to control the

play14:23

conversation flow and guide it on

play14:26

specific topics you don't want it to

play14:28

discuss such such as current events in

play14:30

the same way that you train your human

play14:33

agents you can also control when it

play14:35

hands over to a human agent with

play14:39

transcription and summarization of its

play14:41

conversation history to make these

play14:44

transitions extremely smooth third you

play14:47

can improve response quality with

play14:49

vector-based and keyword-based search to

play14:52

connect your internal information and

play14:55

the entire web you can also use

play14:58

extensions to complete tasks for

play15:01

customers like updating contact

play15:03

information booking a flight ordering

play15:05

food and many more and you can integrate

play15:09

Enterprise data from operational

play15:12

databases like

play15:13

allb Predictive Analytics with big quy

play15:16

and SAS applications like service now

play15:20

let's take a look at an example of a

play15:22

customer agent in action please welcome

play15:26

developer Advocate Amanda Lewis

play15:30

thank you

play15:32

Thomas so last night I was watching a

play15:35

video of this band and I love the

play15:38

keyboard player shirt so I was thinking

play15:41

I'd really like to be wearing that shirt

play15:43

tomorrow night but can I find it in my

play15:45

size and in time to be rocking it at the

play15:49

concert here in

play15:51

Vegas let's head over to my favorite

play15:53

store oh this is uh so scripted and

play15:56

Polished it's a little bit cringy they

play15:59

just launched a customer agent and it

play16:01

leverages Gemini and Vector search to

play16:04

deliver a seamless shopping experience

play16:07

all right I I can't get over it I I just

play16:09

I don't want these types of products

play16:11

personally I know they're valuable but

play16:13

they're out there these have already

play16:15

existed for a while and they're talking

play16:17

about it like it's so Cutting Edge

play16:19

customer shopping assistance customer

play16:21

support agents sales agents it's not

play16:23

interesting to me so let me play the

play16:25

rest of this demo and then I'm actually

play16:27

going to show you vertex really quickly

play16:28

and and you're going to understand why

play16:30

I'm a little bit disappointed with

play16:31

Google's announcements today what can we

play16:33

help you find well I'd like that shirt

play16:37

but I guess I have a few other

play16:38

specifications as well so find me

play16:42

a checkered shirt like the keyboard

play16:49

player is

play16:52

wearing I'd like to see

play16:56

prices where to buy it

play16:59

and how

play17:00

soon can I be wearing it going to

play17:04

include the

play17:07

video now the customer all right that's

play17:10

cool I'll give him credit for that being

play17:12

able to just drop a video and say tell

play17:14

me where I can buy the shirt that that

play17:16

person's wearing that is really really

play17:19

cool although again it's just for the

play17:21

shopping use case I would have liked to

play17:23

see something a little bit more future

play17:25

thinking agent is using Gemini's

play17:28

multimodal reasoning to analyze the text

play17:30

and video to identify exactly what I'm

play17:33

looking for then Gemini turns it into a

play17:36

searchable format how cool is this it

play17:39

found the checkered shirt I'm looking

play17:41

for right and some other great options

play17:44

in no time and that's because these

play17:47

results harness Google's trusted search

play17:49

Technologies which ensures customers

play17:52

like me get the right results in record

play17:54

time the suggested products are grounded

play17:57

in Syle Fashion's inventory and

play17:59

historical performance data to make sure

play18:01

customers leave happy and with that

play18:03

purchase in hand okay so I'm going to

play18:05

pause there let me show you vertex aai

play18:07

agent Builder now all right so this is

play18:10

their agent Builder I just want to show

play18:11

it to you quickly I'm going to make a

play18:12

full video all about it but I I want to

play18:14

show it to you because it's really

play18:15

telling about how Google is thinking

play18:18

about agents and it's not how I think

play18:20

about agents so over here we can create

play18:22

a new agent I've already created one

play18:24

weather agent we'll click into it and

play18:27

you give it a name you give it goal and

play18:29

then you can give it instructions one

play18:31

thing that I really do like about it is

play18:34

that the instructions can be very simple

play18:36

and you simply can just list them like

play18:39

this ask the user for their location and

play18:41

then use and then anytime you have a

play18:43

dollar symbol right there you can easily

play18:45

insert agents or tools that interface is

play18:49

very very nice so I simply say ask the

play18:51

user for their location use tool weather

play18:53

and the tool weather is one that I've

play18:55

already created let me show you over

play18:56

here we have our tools okay so I created

play18:58

this weather tool I have it as type

play19:01

function I have no description but you

play19:03

don't need one and then you simply have

play19:05

the input parameter schema and the

play19:07

output parameter schema here's where I'm

play19:09

really confused where's the actual code

play19:11

go I don't see a place to put code

play19:13

anywhere you can put input parameters

play19:15

and output parameters but how do you

play19:17

actually say Okay I want to hit this

play19:18

third party API and this is actually one

play19:20

of the samples that they give and I just

play19:22

don't understand it if you do let me

play19:24

know in the comments but basically where

play19:26

do I actually put it so let's see how it

play19:28

responds okay so I have the weather

play19:31

agent selected right here let's test it

play19:32

out what's the weather in Los Angeles it

play19:36

formatted it properly we have the tool

play19:38

input Fahrenheit Los Angeles California

play19:40

and then the output temperature zero

play19:42

where does it actually get the

play19:43

temperature from submit function output

play19:45

I'm sorry I can't provide weather

play19:46

information this is literally the

play19:48

example that they provide in the

play19:50

dashboard it's very confusing it's

play19:52

definitely not how I think about agents

play19:54

but they're making progress and so I

play19:56

appreciate their efforts so far one

play19:58

thing that I do want to show you that's

play19:59

really cool is you can easily have all

play20:01

of these Integrations by the way here's

play20:03

dialog flow messenger which is that

play20:05

product that I just told you about which

play20:06

is kind of their previous iteration of

play20:08

their agent framework but you can

play20:10

integrate twillo Discord all of these

play20:13

really easily which is super nice but

play20:16

these are basically just tools and so

play20:18

yeah that is the entire vertex AI agent

play20:20

Builder it is essentially just custom

play20:23

gpts by open AI so we'll go create you

play20:26

can list tools and agents and it has a

play20:28

code interpreter you can also add other

play20:30

tools here but again I don't really

play20:32

understand how tools work and so I think

play20:35

this is the code you basically have to

play20:37

format it in this yaml or Json format

play20:40

rather than kind of just pasting in

play20:42

python or whatever language you're most

play20:44

familiar with which is okay it's not

play20:46

great the thing I like about it is it

play20:47

does have built-in authentication which

play20:49

is nice and makes it really easy and you

play20:52

can also have TLS certificates right

play20:54

there but definitely not straightforward

play20:56

to use and I would prefer simp simply

play20:58

just defining a method here and allowing

play21:00

the agents to call that function

play21:03

whenever they need it all right so now I

play21:06

think they're starting to get into

play21:07

something more interesting which is

play21:09

agents in the workplace meaning agents

play21:12

that can actually perform tasks and

play21:14

accomplish things essentially kind of AI

play21:17

employee so let's take a look first you

play21:20

create a custom model in the ways that

play21:22

we've shown before from there you

play21:25

connect them to all your company and web

play21:28

data

play21:29

this can also be done with translation

play21:31

so that your company information is

play21:33

available regardless of language

play21:35

similarly we support multimodal inputs

play21:38

including videos call Audio images in

play21:42

addition to text now you will want to

play21:44

ground that in Enterprise truth using

play21:48

databases like alloy DB big query and

play21:51

data from Enterprise apps like sap and

play21:54

announcing today

play21:57

HubSpot let's take a

play21:59

interesting that they're mentioning

play22:00

HubSpot because it is rumored that

play22:02

Google is going to acquire HubSpot

play22:04

although it is just a rumor right now

play22:06

and that's pretty cool that you can

play22:08

actually feed in all of your HubSpot CRM

play22:10

data into the agent so let's keep

play22:12

watching at an example of an employee

play22:15

agent in action please welcome developer

play22:18

Advocate Gabe

play22:19

Vice thanks

play22:22

Lea hi folks so I know you all want to

play22:26

hear about awesome AI stuff that's

play22:27

coming but I need to talk to you for a

play22:29

minute about my annual benefits

play22:31

enrollment see I forgot I have to finish

play22:33

signing up by today and as you can see I

play22:36

might be a little bit busy so if you

play22:38

don't mind let's go ahead and look at

play22:39

this open enrollment email together okay

play22:41

yep I've got a deadline I knew that

play22:43

thank you I've got FSA stuff I've got an

play22:46

online portal from my company okay

play22:48

there's a lot here uh H they included

play22:50

video let's see if this makes my life

play22:52

easier ah okay so it's almost an hour

play22:54

long yeah I'm not going to have time to

play22:56

review all of this stuff let's see how

play22:58

this employee agent that we've developed

play23:00

using Google workspace Gemini models and

play23:02

vertex AI might be able to help me as

play23:05

you can see it's integrated directly

play23:06

into my Google Chat so I don't have to

play23:08

context switch while I'm figuring all

play23:09

the stuff out first things first let's

play23:12

have it summarize the email and the

play23:14

video that it sent me all right that's

play23:16

awesome I have been wanting to build an

play23:17

automation using AI that can read an

play23:20

email look at all the context from that

play23:23

thread and then all of the context of

play23:25

all of my emails to try to write a draft

play23:28

that I can simply either edit or send

play23:31

and that is kind of my dream cuz I get a

play23:33

ton of emails I wish I had that and I

play23:35

think that's where they're headed with

play23:36

this product summarize the body and

play23:39

attached video from my recent email with

play23:44

subject open

play23:46

enrollment

play23:47

closing so behind the scenes the agent

play23:50

is referencing that email body and its

play23:52

attachments as context in the prompt

play23:54

using retrieval augmented generation

play23:57

that is awesome awesome okay that is

play23:59

very very cool that way its response is

play24:01

limited to the content that matters to

play24:03

me the Gemini model's multimodal

play24:05

capabilities allows the agent to

play24:08

understand and reason across text audio

play24:11

and video from a single prompt I mean

play24:13

this is a way quicker read okay good and

play24:16

I can immediately see that the medical

play24:17

plants have been completely revamped

play24:19

this year let's go ahead and jump into

play24:21

the benefits portal to see more now I've

play24:24

already done my dental and my vision but

play24:26

I procrastinate I mean save

play24:29

the most important plan for last my

play24:31

medical plan let's see how this option

play24:34

Stacks against my existing coverage

play24:36

compare these coverage all right that's

play24:39

really cool that you can basically just

play24:41

invoke a Google drive folder or a Google

play24:44

Drive agent I think and then ask it

play24:47

additional information I'm very

play24:48

impressed with that by the way I didn't

play24:50

see anywhere in the vertex AI agent

play24:53

Builder where I could accomplish

play24:54

something like this I think this is all

play24:56

just built in by Google behind the

play24:59

scenes into their products this isn't

play25:01

something that you'll be able to build

play25:02

but we'll see options to the PDF doc I

play25:07

have on the Platinum

play25:10

plan the Gemini model's long context

play25:12

window paired with vertex extensions

play25:15

enables the agent to cross reference

play25:17

large amounts of data from a variety of

play25:19

sources including unstructured data like

play25:22

PDFs leveraging Gemini's Advanced

play25:24

reasoning capabilities the agent is able

play25:27

to understand the complex details my

play25:29

current plan and compare it with the new

play25:31

options for 2025 and since the

play25:33

Enterprise grounding features links me

play25:35

to the exact data that Gemini used to

play25:38

draw its conclusions which you can see

play25:40

linked here I can confidently trust its

play25:42

recommendation that the gold plan is

play25:44

best for me and done so now let's get a

play25:49

summary of my coverage let's say my

play25:51

house is multilingual so I'd like to

play25:53

have it in Japanese also please generate

play25:57

a summary of 2025 benefits in a Google

play26:01

doc in both English and

play26:05

Japanese although my source material is

play26:07

in English the Gemini model support for

play26:09

over 40 languages enables it to

play26:12

understand and respond in Japanese and

play26:15

here we go all right this is cool again

play26:18

but again this is all stuff that's built

play26:21

into the Google workspace product so

play26:23

very cool I'll definitely be using all

play26:25

of this but I wish they kind of added a

play26:27

lot of functionality into the agent

play26:30

Builder that I could use now that I've

play26:32

officially completed enrollment my

play26:34

daughter's going to need braces this

play26:35

year I'm going to skip over this I get

play26:37

the demo fine the agent knows that I'm

play26:39

at Google Cloud next because it's

play26:41

integrated with yeah so essentially now

play26:44

you have a personal agent to do

play26:46

everything for kind of your work Gmail

play26:48

Google Docs calendar very cool I'll

play26:51

definitely be using it so the next thing

play26:52

that they're going to talk about is a

play26:53

new product in their Google Suite or

play26:55

their Google Docs Suite of products so

play26:58

they have docs spreadsheets they have

play27:00

presentations and now they're going to

play27:01

add video which is really cool let's

play27:03

take a look we believe that everyone can

play27:06

be a great Creator and a great

play27:09

Storyteller but the formats and tools

play27:11

for storytelling at work haven't really

play27:13

changed that

play27:15

much how many times have you heard

play27:17

should we start with a dock or a

play27:19

deck well we can do a lot

play27:23

better I'm absolutely thrilled to

play27:26

announce our newest workpace app Google

play27:36

vids sitting alongside Google Docs

play27:39

sheets slides Google vids is an AI

play27:42

powered video creation app for work with

play27:46

Gemini in bids you have a video writing

play27:48

production and editing assistant

play27:51

allinone let me show you how simple it

play27:54

is to get started with

play27:56

bids now after week with all of you here

play27:59

at next I'm going to want to share a

play28:01

recap video to share all the excitement

play28:04

with my

play28:05

organization when I open up vids Gemini

play28:08

helps me get started I simply type in a

play28:11

prompt using an existing document for

play28:15

context all right that's really cool

play28:17

that you can pass in context that easily

play28:19

so I'm very impressed that everything

play28:21

that they're releasing with their kind

play28:23

of workspace agents seems to be very

play28:26

integrated with itself which is to be

play28:28

expected but it is very cool now based

play28:30

on that prompt Gemini suggests a

play28:33

narrative outline for the story that I

play28:36

could easily customize and

play28:38

edit I choose an expressive style and

play28:42

vids Works its magic so wow just like

play28:45

that I get the first draft with

play28:46

beautifully designed fully animated

play28:48

scenes complete with relevant stock

play28:51

media and music and even a generated

play28:54

script yeah all right that's very cool I

play28:56

wonder where it's pulling the ated stock

play28:58

media so it's not actually creating

play29:01

video AI video but it is kind of pulling

play29:04

together different b-roll and different

play29:06

title sections uh and it's kind of

play29:08

putting the whole thing together so

play29:10

pretty impressive all right so this is

play29:12

something I'm really excited about uh

play29:14

actual agents being able to code with

play29:16

you and I'm hopeful this is going to be

play29:18

really cool because of Gemini's massive

play29:20

context window so let's watch this video

play29:23

so let's take a look at what's coming

play29:25

for code assist with Gemini 1.5 Pro

play29:28

leveraging a 1 million token context

play29:31

window I'm a new developer with symbol

play29:34

Outfitters and today we show recommended

play29:36

products to customers only after they've

play29:39

made an initial

play29:41

selection these suggestions are powered

play29:43

by our custombuilt recommendation

play29:45

service based on previous

play29:47

purchases but now the marketing

play29:50

department has asked me to move this

play29:52

feature to our homepage so that

play29:55

customers can see products that they

play29:56

might be interested in as as soon as

play29:58

they get to our

play29:59

site our design department has created a

play30:02

mockup of what they would want this

play30:04

experience to look like in figma and for

play30:07

the developers out there you know that

play30:09

this means we're going to need to add

play30:10

padding in the homepage modify some

play30:13

views make sure that the configs are

play30:15

changed for our

play30:16

microservices and typically it would

play30:19

take me a week or two to even just get

play30:20

familiarized with our company's code

play30:22

base which has over a 100,000 lines of

play30:25

code across 11 services

play30:29

but now with Gemini Cod assist as a new

play30:32

engineer on the team I can be more

play30:35

productive than ever and can accomplish

play30:37

all of this work in just a matter of

play30:40

minutes this is because Gemini's code

play30:43

Transformations with full codebase

play30:45

awareness allows us to easily reason

play30:48

through our entire

play30:50

codebase and in comparison other models

play30:53

out there can't okay so this looks like

play30:55

VSS code which is kind of interesting

play30:58

given this is Google but I guess this is

play31:00

built into V code this is some kind of

play31:02

extension I'm not sure let's keep

play31:04

watching handle anything beyond 12 to

play31:06

15,000 lines of code and even then they

play31:09

struggle to get it

play31:11

right Gemini inside of code assist is so

play31:15

intelligent that we can just give it our

play31:17

business requirements including the

play31:19

visual design so let's ask here I am

play31:24

prompting Gemini to add a for you

play31:26

recommendation section on the homepage

play31:28

all right and again very very cool that

play31:30

you can just drop a Google Drive link

play31:32

right into Gemini and it will grab that

play31:35

context so I'm impressed by their

play31:37

ability to just essentially drop any

play31:40

source of information at any time into

play31:42

Gemini along with an image of the future

play31:45

state to show the improved design almost

play31:47

immediately Gemini code assist starts by

play31:50

reasoning about the code changes that it

play31:52

needs to make and has insights an

play31:54

experience teammate would have for

play31:57

example because we asked Gemini Cod

play31:59

assist to change the recommendation

play32:02

service it was able to find the

play32:04

recommendation function and extract out

play32:07

the exact details needed to make the

play32:09

call to the recommendation

play32:11

service it highlights the files needing

play32:13

to be changed and reveals the reasoning

play32:16

behind its recommendations using our own

play32:18

codebase for

play32:20

context Gemini Cod assist doesn't just

play32:23

suggest code edits it provides clear

play32:26

recommendations and make sure that all

play32:28

of these recommendations are aligned

play32:30

with symbol Outfitter security and

play32:33

compliance

play32:34

requirements in code assist we've also

play32:37

added an option to apply the edit which

play32:41

keeps me as the developer and the driver

play32:44

seat so let's take a look at the source

play32:47

code changes that Gemini code assist has

play32:49

made in our code

play32:51

base it looks like we have multiple

play32:53

edits across two files handlers. Go

play32:58

and also

play32:59

home.html Gemini cist even applied these

play33:02

changes to the full

play33:04

repository and to put this in context no

play33:08

pun intended it would have taken me over

play33:11

70 hours nonstop to even just read

play33:14

through all of these files all right I

play33:17

think that's kind of a little bit of BS

play33:19

marketing talk because you don't

play33:21

necessarily have to read through every

play33:23

single file every single line of code to

play33:26

actually make modifications the code

play33:28

base but fine I understand what she's

play33:30

saying just like I would with any code

play33:32

change my next step is to check the

play33:34

workout by testing out the modified app

play33:37

locally so let's try it and there we go

play33:42

the for you recommendation section is

play33:44

exactly what our marketing team was

play33:46

asking for all right so very cool and

play33:49

this is a very simple marketing page

play33:51

that they're updating so it's kind of a

play33:52

simple use case but I'm excited to try

play33:54

it out anything with AI encoding you

play33:56

know I'm all about I'll definitely make

play33:58

a video about that as well so I think

play34:01

I'm going to call this video right here

play34:02

Google announced some really cool stuff

play34:04

I wish the agent Builder would have been

play34:07

more sophisticated but overall all of

play34:09

the functionality that they're adding

play34:11

into the Google workspace product is

play34:14

very welcome so if you liked this video

play34:16

please consider giving a like And

play34:17

subscribe and I'll see you in the next

play34:19

one