Accelerate the value of generative AI with three secret ingredients

Google Cloud
9 Apr 202429:05

Summary

TLDREl video ofrece una visión detallada sobre cómo las organizaciones pueden aprovechar el poder de los datos y la inteligencia artificial (IA) para transformar sus negocios. Se discuten tres componentes clave para el éxito en la implementación de soluciones generativas de IA: patrones, perfiles de usuario y controles. Los patrones se refieren a componentes comunes que promueven la reutilización y la escala, los perfiles de usuario se centran en los resultados de negocio y cómo los individuos obtienen valor de la tecnología, y los controles son esenciales para garantizar la seguridad y la confianza en la solución. Además, se exploran técnicas para la selección de modelos de IA, la ingeniería de prompts y la gestión de sesiones para mejorar la calidad de los resultados. La presentación incluye una demostración de cómo PWC ha desarrollado una plataforma para facilitar la experimentación rápida y la personalización basada en perfiles de usuario, con un enfoque en el control y la gobernanza para garantizar la seguridad y el rendimiento de las soluciones de IA a gran escala.

Takeaways

  • 📈 **Estrategias de patrones**: Los patrones son componentes comunes que impulsan la reutilización y la escala, disminuyendo costos y evitando la duplicación de esfuerzos en la organización.
  • 🧑 **Personas clave**: El enfoque en los resultados de negocio y en los individuos que utilizan la tecnología es fundamental para integrar y obtener el máximo valor de las soluciones generativas de IA.
  • 🛡 **Controles de seguridad**: La seguridad y la conformidad son cruciales para garantizar la confianza en la implementación de soluciones de IA, requiriendo un control sistemático para auditorías y garantizar la calidad de los resultados.
  • 🔍 **Búsqueda de información**: La capacidad de generar respuestas a partir de datos estructurados y no estructurados es una de las aplicaciones destacadas de la IA generativa.
  • 🤖 **Evolución de bots**: Los bots de servicio al cliente y la capacidad de responder rápidamente a consultas son ejemplos de cómo la IA generativa puede mejorar la productividad y la eficiencia.
  • 🌐 **Transformación de idiomas**: La traducción y el procesamiento de lenguaje natural son áreas en las que la IA generativa está teniendo un impacto significativo, mejorando la productividad del desarrollador y la comunicación.
  • 📚 **Creación de contenido**: La generación de contenido de marketing, documentos, código e imágenes es una aplicación en crecimiento donde la IA demuestra su versatilidad.
  • 🔢 **Modelos y tamaños**: La elección del modelo de IA correcto depende del caso de uso y del tamaño de los datos, lo que afecta el rendimiento y la precisión de la solución.
  • 📈 **Escala y producción**: El desafío principal para las empresas es la capacidad para escalar y poner en producción casos de uso exitosos de IA generativa a lo largo de múltiples funciones empresariales.
  • 👥 **Diversidad de roles**: La colaboración entre múltiples roles dentro de una organización es esencial para la implementación exitosa y escalable de soluciones de IA generativa.
  • 📝 **Gestión de cambios**: La educación y el acompañamiento durante la implementación de nuevas tecnologías son clave para su adopción y éxito a gran escala.

Q & A

  • ¿Quién es Sora Mohanti y qué papel desempeña en PWC?

    -Sora Mohanti es socio en PWC y se enfoca en la práctica de datos y AI de la empresa, pasando la mayor parte de su tiempo ayudando a organizaciones a desbloquear el poder de los datos y la inteligencia artificial para transformar sus negocios.

  • ¿Qué áreas de la empresa están experimentando una transformación con la ayuda de la IA?

    -La transformación está ocurriendo en múltiples funciones empresariales, donde las soluciones de IA están siendo implementadas con éxito y luego desafiantes a escalar y producirse para aprovechar su valor y medir el retorno de inversión (ROI).

  • ¿Cuáles son los tres ingredientes clave mencionados para soluciones generativas de IA?

    -Los tres ingredientes clave son patrones, personas y controles. Los patrones son componentes comunes que impulsan la reutilización y la escala; las personas se enfocan en los resultados de negocio y en qué obtienen de la solución; y los controles aseguran la seguridad y la confianza en la implementación de la tecnología.

  • ¿Cómo son diferentes los patrones de manejo de datos no estructurados y estructurados?

    -Los patrones de manejo de datos no estructurados y estructurados difieren en cómo se interactúa con ellos. Mientras que los datos estructurados se convierten en comandos SQL para interactuar con bases de datos, los datos no estructurados requieren un enfoque diferente para funcionar eficazmente.

  • ¿Qué es un modelo LLM y cómo se relaciona con la experiencia del usuario?

    -Un modelo LLM (Large Language Model) es un modelo de lenguaje grande que se utiliza en la generación de texto y comprensión. La experiencia del usuario está enfocada en cómo se les proporciona contexto y cómo se les permite interactuar con el modelo para lograr los resultados deseados.

  • ¿Por qué es importante la selección del modelo LLM adecuado para una tarea específica?

    -La selección adecuada del modelo LLM es crucial porque determina la eficacia de la solución generativa de IA en un caso de uso específico. Un modelo no adecuado puede generar respuestas vagas o no precisas, lo que afecta negativamente la calidad y confianza en los resultados.

  • ¿Qué es un 'prompting template' y cómo ayuda en la formación de usuarios?

    -Un 'prompting template' es una herramienta que guía a los usuarios a entender para quién están formulando un prompt, qué información buscan, quién será el receptor de la información y qué restricciones deben considerar en su prompt. Esto ayuda a los usuarios a ser más efectivos en la obtención de la información requerida y a ser más precisos en sus solicitudes a los modelos de IA.

  • ¿Cómo se aborda la gobernanza y el control de riesgos al implementar soluciones de IA generativa a gran escala?

    -La gobernanza y el control de riesgos se abordan estableciendo un mecanismo de gobernanza que incluya roles y responsabilidades definidos, políticas de uso estrictas, capacitación y un equipo responsable de regular todas las implementaciones dentro de la organización. Esto ayuda a equilibrar los riesgos y el ROI al mismo tiempo que se promueve la innovación.

  • ¿Qué es la 'chunking' y por qué es importante en la calidad de los resultados de los modelos de IA?

    -La 'chunking' se refiere a cómo se divide la información para su procesamiento por los modelos de IA. Es importante porque afecta directamente la calidad de los resultados, ya que la forma en que se 'chunkea' los datos puede influir en la precisión y la coherencia de las respuestas generadas por el modelo.

  • ¿Cómo se implementan los controles de seguridad y calidad en la plataforma demostrada por Brad Foster?

    -Los controles de seguridad y calidad se implementan a través de opciones configurables por el usuario, como el tamaño de chunking, la región de ejecución, los límites de tokens, el umbral de seguridad y la selección del modelo de IA. Esto permite a los usuarios experimentar y configurar iterativamente sin la necesidad de programación constante y asegura que se mantengan estándares de seguridad y calidad en los procesos.

  • ¿Cómo se puede mejorar la calidad de los resultados en las soluciones de IA generativa?

    -La calidad de los resultados se puede mejorar a través del modelado de versiones, pruebas de regresión, el control de la aleatoriedad en la selección de respuestas, la gestión de sesiones de chat y la actualización y control de versiones de datos. Estos aspectos son cruciales para mantener la precisión y confianza en las soluciones de IA a gran escala.

Outlines

00:00

📈 Introducción a la transformación de negocios con datos y AI

El primer párrafo presenta a Sora Mohanti y Brad Foster, socios de PwC, quienes se especializan en la práctica de datos y la inteligencia artificial. Explican que su trabajo gira en torno a ayudar a organizaciones a desbloquear el poder de los datos y la IA para transformar sus negocios. Comienzan a discutir observaciones en industrias en las que trabajan y presentan los tres ingredientes clave para soluciones generativas de IA: patrones, perfiles de usuario y controles.

05:00

🧩 Importancia de los patrones, perfiles y controles en la IA generativa

Este párrafo profundiza en los tres componentes clave para soluciones de IA generativa: patrones, perfiles de usuario y controles. Los patrones son componentes comunes que promueven la reutilización y la escala. Los perfiles de usuario se enfocan en los resultados de negocio y en qué obtienen los individuos que los utilizan. Los controles son fundamentales para garantizar la seguridad y la confianza en la implementación de estas soluciones. Se discuten ejemplos de patrones, como la diferencia entre datos estructurados y no estructurados, y cómo se aplican en diferentes contextos.

10:01

🔗 Desafíos y soluciones en la implementación de IA generativa

El tercer párrafo aborda los desafíos que enfrentan los clientes de PwC en la implementación de casos de uso de IA generativa a escala. Se destaca la importancia de comprender los patrones a nivel empresarial y la reutilización de componentes. Se menciona la formación de equipos centralizados para la implementación y la gestión de cambios. Además, se discute la importancia de la selección del modelo de IA, el contexto, el procesamiento de lenguaje y la ingeniería de consultas para garantizar la calidad y la seguridad de los resultados.

15:02

🛡️ Controles y gobernanza en la implementación de soluciones de IA

Este párrafo se enfoca en los controles y la gobernanza necesarios para la implementación segura y eficaz de soluciones de IA generativa. Se destaca la importancia de rastrear las entradas y resultados, el control de la calidad, la gestión de versiones de datos y modelos, y la necesidad de políticas de contenido y acceso basado en roles. Se recomienda establecer un mecanismo de gobernanza para equilibrar los riesgos y la rentabilidad de las inversiones en IA.

20:04

🚀 Demostración de la plataforma de IA generativa de PwC

El quinto párrafo ofrece una demostración de la plataforma de IA generativa desarrollada por PwC. Se muestra cómo la plataforma permite la interacción con diferentes tipos de datos y casos de uso, y cómo se pueden configurar controles y opciones de usuario sin requerir conocimientos de programación. Se destaca la importancia de la personalización basada en perfiles de usuario y la capacidad de la plataforma para adaptarse a diferentes necesidades y roles dentro de una organización.

25:05

📚 Conclusión y formas de involucrarse con PwC y Google

El sexto y último párrafo concluye la presentación, ofreciendo información sobre cómo los asistentes pueden seguir involucrándose con PwC y Google en eventos futuros. Se proporciona un código QR para acceder a más detalles sobre breakfasts, sesiones y eventos de networking. Además, se agradece la participación y se cierra la presentación con un mensaje musical.

Mindmap

Keywords

💡Data y AI

Datos y AI son conceptos fundamentales en el video, referidos como la clave para transformar los negocios. Se discute cómo las organizaciones pueden desbloquear el poder de los datos y la inteligencia artificial para mejorar sus operaciones. Un ejemplo en el guión es cuando Sora Mohanti menciona trabajar con organizaciones para ayudarles a transformar sus negocios mediante la utilización de datos y AI.

💡Generative AI

La Inteligencia Artificial Generativa (AI) es un tema central del video, abordado por Brad Foster como una evolución en el campo de la IA. Se destaca cómo esta tecnología está siendo utilizada con Google y sus capacidades para generar contenido, respuestas y soluciones personalizadas. En el script, se menciona que Brad ha estado involucrado en el viaje de la IA desde el inicio, y ahora se enfoca en la generativa AI y sus aplicaciones con clientes de Google.

💡Modelos de LLM

Los modelos de LLM (Large Language Models) son una parte crucial de la generative AI y son mencionados en el video como herramientas para manejar y procesar información. Son importantes para la selección del modelo correcto según el caso de uso y para garantizar la calidad de los resultados. En el guión, se destaca la importancia de elegir el modelo adecuado para cada tarea y cómo Google ofrece una variedad de modelos para elegir.

💡Seguridad y Controles

La seguridad y los controles son aspectos claves al implementar soluciones de generative AI, como se discute en el video. Son esenciales para garantizar que los procesos sean seguros y confiables, y para cumplir con los estándares de cumplimiento y calidad. En el guión, se abordan temas como el monitoreo de entradas y resultados, la trazabilidad y la aplicación de políticas de contenido.

💡Personas y Experiencia del Usuario

Las 'personas' son representaciones de los diferentes roles y usuarios finales que interactúan con la generative AI. La experiencia del usuario (UX) es fundamental para el éxito de la implementación de la tecnología. En el video, se insiste en el diseño de la solución alrededor de las necesidades y objetivos de las personas, asegurando que la tecnología sea adoptada y utilizada efectivamente en la organización.

💡Patrones y Reutilización

Los patrones son componentes comunes que promueven la reutilización y la escala en las soluciones de generative AI, como se menciona en el video. Ayudan a reducir costos y a evitar la duplicación de esfuerzos. En el guión, se habla de diferentes tipos de patrones, desde la summarización de información hasta la creación de contenido, y cómo estos son fundamentales para la eficiencia y el éxito a gran escala.

💡Desarrollo y Escalabilidad

El desarrollo y la escalabilidad son retos clave para las organizaciones que buscan implementar soluciones de generative AI. El video aborda cómo los patrones y las personas pueden ayudar a superar estos desafíos. Se destaca la importancia de la producción y la medición del ROI (Retorno de Inversión) al escalar los casos de uso y la implementación de la tecnología.

💡Gobierno y Políticas

El gobierno y las políticas son mencionados en el video como componentes esenciales para la gestión responsable de la implementación de la generative AI. Incluyen la asignación de roles y responsabilidades, la capacitación y la aplicación de políticas de uso estrictas. Estas medidas ayudan a equilibrar el riesgo y el rendimiento, y son cruciales para la implementación a gran escala.

💡Procesos de RAG

RAG (Rapid Application Generation) se refiere a los procesos de generación rápida de aplicaciones, que son importantes para la integración de la generative AI en los flujos de trabajo de la organización. En el video, se discute la importancia de adaptar estos procesos a las necesidades de cada 'persona' y caso de uso, asegurando que los resultados sean precisos y consistentes.

💡Orquestación y Experiencia del Usuario

La orquestación se refiere a la integración y coordinación de las soluciones de generative AI en los procesos reales de la organización. La experiencia del usuario es crucial para la aceptación y el éxito de la tecnología. En el video, se destaca la importancia de entender cómo los usuarios interactuarán con la tecnología y de diseñar la solución para que se integre sutilmente en sus operaciones diarias.

💡Cambio de Gestión

El cambio de gestión es un aspecto importante al implementar nuevas tecnologías como la generative AI. El video destaca la necesidad de educación y apoyo para los usuarios durante el proceso de implementación. Esto incluye la capacitación en la utilización de la tecnología y la adaptación a los cambios en los procesos y flujos de trabajo existentes.

Highlights

Sora Mohanti, a partner with PwC's data and AI practice, discusses unlocking the power of data and AI for business transformation.

Brad Foster, also a partner at PwC, shares his experience with Google's AI journey and the evolution of generative AI.

Three key ingredients for generative AI solutions are identified: patterns, personas, and controls.

Patterns refer to common components that drive reuse, scale, and cost reduction in organizations.

Personas focus on the business outcomes and the experience of individuals using AI within organizations.

Controls are essential for ensuring safety, security, and the systematic implementation of AI solutions.

Different patterns for AI solutions include summarization, deep retrieval, Q&A, transformation, augmentation, and new creation.

The importance of model selection based on the use case and the right model for the job is emphasized.

Generative AI deployment requires a combination of different business roles working together for successful scaling.

Creating centralized COEs (Centers of Excellence) or deployment teams aids in detecting patterns and managing change.

The session discusses the challenges of scaling use cases and productionizing AI across enterprise functions.

LLM (Large Language Model) selection and context grounding are crucial for accurate and consistent AI responses.

Prompt engineering is key to controlling the creativity and ensuring the right outcome from AI models.

Session history and prompt context influence AI response generation, affecting the quality and consistency of results.

Orchestration and user experience are vital when integrating AI into real processes with actual users.

Change management and education are important for proper AI adoption and usage across an organization.

Governance mechanisms are recommended for managing risks and ROI when deploying generative AI solutions at scale.

The importance of regulatory risk, data privacy, and security in AI deployments is highlighted.

PwC has developed an application to allow rapid experimentation with AI based on personas and use cases, with built-in controls and metrics.

The demo showcases how controls and configurations can be systematically implemented for different AI interactions.

Transcripts

play00:00

[Music]

play00:10

welcome everybody uh my name is Sora

play00:12

mohanti I'm a partner with pwc's uh data

play00:15

nii practice and I spend most of my time

play00:18

working with organizations helping them

play00:21

unlock the power of data and AI to

play00:24

transform their businesses um we're

play00:27

going to talk to you a little bit here

play00:29

about

play00:30

uh some of the things that we're

play00:31

observing in the industries that we deal

play00:33

with h over to you

play00:35

Brad um thanks s yeah and thanks

play00:38

everybody for spending a few minutes

play00:39

today and your busy schedules uh hanging

play00:41

out with us and listen listening to this

play00:43

presentation um Brad Foster I'm a

play00:45

partner at at PWC uh focus a lot on the

play00:48

generative AI stuff that we're doing

play00:49

with Google I've been involved with

play00:51

Google uh since really they started they

play00:54

had three beta products years ago app

play00:56

engine and cloud storage and big query

play00:58

and so been a part of AI Journey for

play01:00

quite some time and it's fun to see the

play01:02

next uh Evolution with with generative

play01:04

Ai and what we're able to do uh with

play01:06

Google customers and and and solutions

play01:08

out there so we're going to spend a

play01:09

little bit of time today talking about

play01:12

some things we've learned I think over

play01:14

the past year uh that we might uh we

play01:16

think are maybe useful for for you guys

play01:18

to hear about uh we want to spend some

play01:20

time at the end as well for Q&A so we'll

play01:22

try to leave some leave some time for

play01:23

for that and pass the microphones around

play01:26

so like the topic of this is really

play01:29

three Secret ingredients and of course

play01:30

there's more that goes into this but

play01:32

just to talk about three you know

play01:33

important things uh around these

play01:36

generative I Sol Solutions is patterns

play01:39

personas and controls and The quick

play01:42

summary is patterns are really those

play01:44

things that are the common components uh

play01:48

that drive reuse scale can drive your

play01:51

cost down reusability in the

play01:52

organization prevent people from doing

play01:55

things you know five different times uh

play01:58

personas is really the focus on the

play02:00

business outcome so the the individuals

play02:01

who are using these things in the

play02:03

organizations and what are they getting

play02:04

out of it how do you Stitch it all

play02:06

together for for ultimate value uh and

play02:08

one of the nuances of of how you build

play02:10

these things to do that and finally

play02:12

controls right none of this is going uh

play02:14

into production or going to be used

play02:16

unless everybody feels safe and secure

play02:18

about it and so not just about the

play02:20

philosophy of those controls but how do

play02:22

you start to take those controls into

play02:24

systematic implementation so as you're

play02:26

using it you can audit it you have

play02:28

assurances and it's it's not just humans

play02:30

trying to follow a paper process um but

play02:32

these are these applications have this

play02:34

infused uh these controls infused so for

play02:36

example design patterns those are things

play02:38

like Which models are using for which

play02:40

which Solutions um the model tuning and

play02:44

and chunking how do you do that

play02:45

differently and how do you save best

play02:47

practices uh data types and ingestions

play02:49

for your rag uh processes Etc uh

play02:53

personas what what do you do to get that

play02:54

right right everything starts with the

play02:55

user experience you I know we like to

play02:57

talk about llma models but what's the

play02:59

user experience that the end of the day

play03:00

how do you give that person context

play03:02

what's the agent's job what are they how

play03:04

are they trying to perform how do you

play03:05

get the prompt engineering the rag

play03:07

processes correct for that Persona and

play03:09

then controls you obviously things like

play03:12

security compliance um there's managing

play03:16

capacity finops aspects of this uh as

play03:18

well as the accuracy and the quality of

play03:20

the output so these are the things we're

play03:22

going to talk about a bit today um so

play03:25

jumping into the patterns area a lot of

play03:28

you are probably familiar with this type

play03:29

of uh designation of patterns so this

play03:33

isn't an end all be all but just six

play03:34

ways we look at patterns um being at

play03:38

summarization so of information I want a

play03:40

quick summary of that could be

play03:42

structured data tell me this year's

play03:43

sales could be unstructured data tell me

play03:46

the point of this policy uh could be you

play03:48

know deep retrieval deep retrieval and

play03:50

Q&A are similar right deep retrieval how

play03:52

you getting you know deeper into that

play03:54

semantic nature of data and what it

play03:56

means in surfacing that Q&A think of all

play03:58

of the the evolution of Bots the chat

play04:00

Bots customer service Bots how are you

play04:02

you quickly surfacing those those

play04:04

questions you have

play04:06

transformation uh types of things a

play04:07

simple one is obviously language which

play04:10

now program you hear a lot about the

play04:12

developer uh productivity and and what

play04:14

we're able to do with you know

play04:15

translating uh whether it's code or

play04:17

whether it's actual spoken language of

play04:20

augmentation uh for things like

play04:23

autocomplete or test data generation

play04:25

could be another uh great example of

play04:27

that and finally net New Creation so you

play04:29

hear things like like marketing content

play04:30

creation and you know imagin and all of

play04:32

the the the images and and ways it can

play04:35

be used for for those types of use cases

play04:37

but when you look at at a high level

play04:39

it's really too high level when you get

play04:40

into the details of these

play04:42

implementations uh these patterns start

play04:44

to come into kind of level two and level

play04:47

three types of patterns and I'm not

play04:48

going to drain this you know ey chart

play04:51

but I'll give you a couple examples and

play04:52

one simple one to start with is the

play04:54

difference between unstructured and

play04:55

structured data so when you're you know

play04:58

interacting with structure data

play05:00

converting that English base prompt into

play05:03

a SQL command that has to interact with

play05:05

the structured database is a different

play05:07

process than asking uh that same prompt

play05:09

to an unstructured data set and behind

play05:11

the scenes what you have to do to make

play05:12

that work effectively is different so

play05:14

those are two different patterns the Q&A

play05:16

seems the same up front uh but the the

play05:19

back end and how you do that is

play05:20

different uh another is in like net New

play05:22

Creation what you're trying to create

play05:24

are you trying to create documents code

play05:27

images so those again it's not created

play05:30

the same what you're trying to do uh you

play05:32

know I'm doing images so imagin is what

play05:34

I might go with versus I'm just trying

play05:36

to create a new document or you know an

play05:37

email to send to somebody it's different

play05:39

of how it works in the back end and

play05:40

another thing that we didn't put on this

play05:42

slide that I think is important is uh

play05:44

also the types and sizes of really the

play05:46

sizes of models I think you the price

play05:48

performance game is is important as well

play05:52

so you'll learn for these types of

play05:53

questions and use cases everything isn't

play05:56

going to be solved by just one huge

play05:57

model doing it all you're going to find

play05:59

that you know everything you know uh is

play06:01

has a price and so that whether it's

play06:03

slower or whether you know the the

play06:05

answers are too vague because it it

play06:06

wasn't a specific of a model or you're

play06:08

trying to do it on a mobile mobile

play06:09

device so it needs to be be condensed

play06:12

that's another thing that's that comes

play06:13

into play as as we're all implementing

play06:14

these things so I'll turn over to sarb

play06:16

now to talk about as we see these

play06:18

patterns emerge and actual the

play06:20

implementations of these things how that

play06:21

starts to scale yeah so one of the

play06:24

challenges that we are observing that

play06:27

most of the most of our clients are

play06:29

facing right now is not at the use case

play06:32

level um I think there's a lot of use

play06:34

cases that are very successfully getting

play06:36

implemented across Enterprise functions

play06:39

with the organizations that we work with

play06:41

or we consult with or even uh you know

play06:43

talk to on a regular basis where they

play06:46

have a challenge is to scale those use

play06:49

cases productionize them deploy them

play06:52

across multiple functions to harness the

play06:56

value and then measure that Roi that

play06:59

that's where most of these patterns come

play07:03

into the picture because these patterns

play07:05

help you understand reusability of of

play07:09

components like vector databases

play07:11

connectors plugins that you may have

play07:13

built any other uh you know data sets

play07:17

that you may have modernized both

play07:18

structured or unstructured the

play07:20

embeddings and so as an organization uh

play07:24

you know we often tell our clients to

play07:26

understand those patterns across the

play07:29

Enterprise and then figure out how you

play07:31

can look at reusability all the way up

play07:34

to getting the cic CDs done the

play07:36

deployments and then even productionize

play07:38

it uh at scale um I think that's been

play07:42

that's been the the biggest challenge

play07:44

that most companies face around scaling

play07:46

and then patterns actually help you

play07:48

overcome some of those challenges well

play07:51

so that's a quick uh summary of some

play07:54

patterns we're going to going to move

play07:55

into the personas a bit now and talk

play07:57

about how to view that and in the

play08:00

importance of of this gen gen Journey so

play08:03

uh the other thing that we've noticed uh

play08:05

is to ensure that generative AI is

play08:08

actually being deployed at scale and

play08:11

people are harnessing the value you

play08:13

require a multiple of personas to be

play08:16

able to use this in a seamless way but

play08:18

also contribute to the build and the

play08:20

deployment of these uh Solutions at

play08:22

scale so what you'll notice here is the

play08:24

roles of several different uh

play08:27

individuals within the businesses that

play08:29

need to come together to deploy it and

play08:33

what we've observed is there are

play08:34

organizations that are being very

play08:36

successful by creating Coes or factory

play08:40

models or um you know deployment uh

play08:43

teams that are centralized and are doing

play08:46

it in a centralized way across multiple

play08:48

different parts of the business because

play08:50

then they can detect the patterns their

play08:52

personas are working together and then

play08:53

that also helps with the change

play08:55

management because the usage and the

play08:58

user stories for each of these personas

play09:01

may be slightly different so when you're

play09:04

building this at scale when you're

play09:05

centralizing it at scale you are

play09:07

actually ensuring that your user

play09:10

experiences are getting completely tied

play09:12

into those personas enabling the users

play09:15

to adopt it much faster across multiple

play09:17

aspects of of an

play09:20

organization so this next slide talks a

play09:23

little bit about some specifics when

play09:26

you're going to do an implementation it

play09:28

all seems easy easy right when you I

play09:30

just want to upload one document and ask

play09:32

that one question that that seems to

play09:33

work every time right because you took a

play09:35

simple scenario that's not scaled that

play09:37

doesn't have you know an enterprise

play09:39

process that's been been run for decades

play09:41

wrapped around it so uh the first is llm

play09:44

model selection so what what is the use

play09:46

case what are they trying to achieve and

play09:48

then how do you choose the right model

play09:50

for for the job and that of you know

play09:51

obviously the model garden and Google's

play09:53

own models like there there's just going

play09:54

to be a continued evolution of that that

play09:56

we all want to keep an eye and take

play09:57

advantage of the next is the context so

play10:01

how do you ground that Persona in what

play10:03

their job is and make it specific so

play10:06

that it's not trying to do everything

play10:08

within you know those functions within a

play10:11

CFO function there's many different

play10:13

parts of a cfo's organization all that

play10:15

that play different roles those need to

play10:17

be embedded into the context of of that

play10:19

agent so to speak the grounding and rag

play10:22

processes um again very important I

play10:24

think this is a hot you know probably

play10:25

one of the hottest topics happening

play10:27

right now is how do you ground these

play10:28

models in your organization's data so

play10:31

that those results are coming back you

play10:33

know accurate and consistent and there's

play10:35

a process around getting that right um

play10:39

it can be things as well if you have two

play10:41

versions of the policy which one's

play10:43

correct if you upload both on accident

play10:46

your people could get confusing answers

play10:48

and it's a you know it's a it's a data

play10:50

input problem and so getting that right

play10:52

making sure you're working with those um

play10:54

personas to do that is is key and

play10:56

there's things like chunk sizing and

play10:58

overlaps so that plays a role in the

play11:00

quality of the output that you're

play11:02

getting for some of these models and

play11:04

again back to the patterns they're going

play11:05

to start to be emerge best practices for

play11:07

for these use cases prompt engineering

play11:10

we're all you used to you know prompting

play11:12

by now to hopefully to some extent and

play11:15

you know the way you ask questions is

play11:17

key uh to this and do you want to

play11:19

control those prompts we've seen some

play11:21

clients who are saying some of this I

play11:23

want the drop- down menu like here's

play11:25

what I've learned are the best ways to

play11:27

prompt the specific business process and

play11:29

I don't want let people get too creative

play11:31

with that because I I've learned what

play11:33

generates the best result response

play11:35

confidence another one do you want let

play11:37

it to be creative um and if it's not

play11:39

quite sure it's a marketing you know uh

play11:42

campaign let it be creative versus hey I

play11:44

want this to come back this is a legal

play11:46

situation or a compliance situation

play11:48

every time it should come back with the

play11:49

same exact answer based on this input of

play11:51

of data that that I provided it session

play11:54

history and prompt context uh for those

play11:57

of you use this the bit I don't know if

play11:59

you've experienced the situation where

play12:01

you're asking it 20 questions and then

play12:04

you kind of change directions a bit and

play12:06

it comes out with a response and then

play12:08

you start a new new session and you ask

play12:10

ask that same question and it generates

play12:11

a totally different response you're like

play12:13

wait a minute I ask it the same question

play12:14

it's off the same pile of of data why

play12:17

did it apparently come back with two

play12:19

different responses and usually the

play12:21

answer is well the first time it had the

play12:23

context you were building up a context

play12:25

from 20 questions and 20 answers and

play12:27

just like a human it thinks you're

play12:28

talking about directionally now and so

play12:30

it uses that as part of the response

play12:32

generation versus if you start a new

play12:33

session and that's the very first

play12:34

question you ask it you know will come

play12:36

back with you know a consistent response

play12:38

every time so those are just interesting

play12:40

things and then orchestration and user

play12:41

experience uh as you put this into a

play12:43

real process with real users are are

play12:46

using this um don't forget about you

play12:49

know their daily life sitting down with

play12:51

them how are they going to you know

play12:52

enter this copy and paste answers you

play12:54

know where how are they getting the

play12:55

output into the next stage of the the

play12:56

process they're executing so you know

play12:58

don't ignore

play12:59

uh that part of it and at the end of all

play13:01

of this too is the change management

play13:02

aspect you know this is new for

play13:04

everybody um and so how to use it

play13:06

properly uh education you know the

play13:09

handholding as these things are

play13:11

implemented are key so these are some

play13:13

things uh that we're seeing as kind of

play13:16

key to consider as you're you're getting

play13:18

into these

play13:19

implementations the the final section

play13:21

here to go over a bit around controls so

play13:23

how are we making sure we're doing this

play13:24

safely and and securely and and and

play13:26

ensuring quality results so sorry talk a

play13:29

little a little bit about what we're

play13:29

seeing here yeah uh actually before I

play13:31

jump into that one of the things that

play13:33

when you were talking about uh prompt

play13:35

engineering one of the things that I

play13:36

realized has been a very successful

play13:38

method that organizations have used is

play13:41

what we call a prompting template which

play13:44

kind of allows individuals to understand

play13:47

who they're prompting for what

play13:48

information are they looking for who's

play13:50

going to be the receiver of the

play13:51

information what are some of the

play13:53

constraints that you want to consider in

play13:55

your prompt so what that does is that

play13:57

creates a a an environment for users to

play14:01

be able to get trained in in prompting

play14:03

for the right information and that

play14:05

actually helps uh and and that's

play14:08

interesting because now when we get into

play14:10

controls one of the things that we often

play14:13

get asked is um how are how are uh

play14:17

organizations governing uh the the

play14:20

deployment of generative AI Solutions at

play14:22

scale and I think this is where some of

play14:26

these tough questions get asked around

play14:29

regulatory risk data privacy risk who's

play14:31

using it uh the solution how do we know

play14:34

that the prompts are secure how do we

play14:35

know that the data is secure or how do

play14:37

we know that you know people who

play14:39

shouldn't have access to certain

play14:40

embeddings or vector databases don't

play14:42

have access or how do we ensure that we

play14:45

using the most effective llm how do we

play14:48

know that we are picking the right

play14:50

architecture um and how do we know that

play14:52

we are doing it at the right cost with

play14:54

the right level of uh return on

play14:56

investment I think all are great

play14:57

questions uh and which is why we we and

play15:01

we've done this to ourselves we've you

play15:03

know highly recommend putting a

play15:05

governance mechanism in process when

play15:07

you're deploying generative AI solutions

play15:09

to manage the risk and the ROI because

play15:12

both need to be balanced and when these

play15:15

deplo when these deployments are

play15:16

happening at scale it's very important

play15:18

to understand how the risks are getting

play15:21

mitigated and what kind of Roi are you

play15:24

expecting versus what do you get it's

play15:26

okay to do it in the use case uh

play15:28

environment it's not that difficult you

play15:30

have you have you know you have

play15:31

contained yourselves into maybe one or

play15:34

two pods that are doing it but when you

play15:35

scale it out these things become uh

play15:38

difficult to handle and we've had a lot

play15:40

of client situations where clients have

play15:41

come back saying oh there's there's been

play15:43

an issue with us uh you know from a

play15:45

security standpoint or from a data

play15:47

standpoint uh even quality of the data

play15:50

sometimes uh is questionable so which is

play15:52

why going back to that earlier

play15:54

conversation around personas uh the CDO

play15:56

or the data organization needs to be uh

play15:59

working hand inand with the deployment

play16:01

team to figure out to ensure that the

play16:03

data is actually right and those

play16:04

patterns are right

play16:07

um the next slide yeah um some of the

play16:10

core Dimensions that we and I spoke

play16:12

about this a little bit uh in terms of

play16:14

what you need to do to ensure that your

play16:16

uh governance processes are right I

play16:19

think obviously we spoke about you know

play16:20

ensuring there's roles and

play16:21

responsibilities there's training on

play16:23

usage there are very strict usage

play16:25

policies uh people uh that people

play16:28

understand that people have been trained

play16:30

on um there is a governance team that's

play16:32

responsible for governing all of the

play16:34

deployments within the organization now

play16:36

some of them some of this may actually

play16:37

sound cumbersome and and you may think

play16:40

that it's actually stifling Innovation

play16:41

but it doesn't I think it just takes a

play16:43

little bit of time like a fly wheel when

play16:45

you push it at the beginning you need

play16:47

momentum for it to get to to for it to

play16:49

run and what we've noticed is that

play16:51

initial push is a little bit hard but

play16:52

then once you get it running like a

play16:54

machine um it actually runs really

play16:56

really well and some of the best

play16:58

organizations in the world are are doing

play17:00

a great job at governing um uh you know

play17:03

the the risk versus Roi

play17:05

components over to you Brad yeah so a

play17:08

couple examples and we're going to do a

play17:10

bit of a demo here of how this stuff

play17:12

comes to life in actuality but uh some

play17:17

examples of controls uh with compliance

play17:20

so prompt input uh results you know and

play17:24

uh the prompts inputs themselves in the

play17:26

results like are you tracking all of

play17:28

that so do you know what your

play17:30

organization is typing in do you know

play17:31

the results that are coming out and how

play17:33

do you want to make sure you're

play17:35

monitoring uh that on a continuous basis

play17:38

uh citations so I think you guys

play17:40

probably understand what citations are

play17:41

but when I get a response back how do I

play17:44

know where it came from oh there were 10

play17:46

documents I could have pulled from which

play17:48

document did the answer come from which

play17:50

paragraph in that document did it did it

play17:52

come from so that again you have that

play17:53

audit Trail and the Assurance of that

play17:55

the result is actually accurate you have

play17:58

content policies uh applied to data

play18:00

curation and and those results so I

play18:02

think data as always will play a key

play18:05

role so is the data that you're using

play18:09

the approved data sets is it been

play18:10

curated uh Etc so in in what which

play18:13

situations do you want to make sure

play18:15

you're you're forcing and locking that

play18:16

down in other situations let people you

play18:18

know be creative and interact uh with

play18:20

data on the data side you know the the

play18:23

role-based uh access to Solutions and

play18:25

features so I don't think this is going

play18:27

to be a world where every body has

play18:29

access to every feature and function in

play18:31

this landscape and it's just a

play18:33

playground you're going to give people

play18:34

uh some people more capabilities and

play18:37

ability to experiment and others are

play18:38

going to be part of a a finished process

play18:41

that they're executing that was uh that

play18:43

was built by an organization uh you have

play18:45

the the data for rag itself I talked

play18:48

about like how are you indexing that

play18:50

controlling that itself um and then

play18:52

really I think another thing that's

play18:54

going to come into the picture is the

play18:55

data versioning so over time as your

play18:59

data changes how do you go back and say

play19:02

well now I have a new policy I need to

play19:04

go reindex that and make sure that's the

play19:06

the do you know the document that's

play19:08

being used and how do I actually know

play19:10

and have a audit history that oh policy

play19:13

number one you know stop being used in

play19:15

this year in this date and this time and

play19:17

this is when policy number two is

play19:18

indexed and that's where answers are

play19:20

based so that as again you executing

play19:23

these processes at scale for especially

play19:24

regulated Industries you have you know

play19:26

that data versioning uh aspect of things

play19:29

and then quality um so model versioning

play19:32

and regression testing so we're going to

play19:34

live in a world and right the side by

play19:36

side or Google already has it like I

play19:37

want to test these two models um and

play19:39

maybe I went from Palm to Gemini and I

play19:42

want to see the results or you know

play19:44

Gemini W to 1.5 or a model Garden option

play19:47

in a in a Google version so there you

play19:49

know I think Google's been a you know

play19:50

done a great job embracing that concept

play19:53

and just in letting people explore uh

play19:55

that in a great rapid rapid way so

play19:58

that's going to be something something

play19:59

that continues uh chat I mentioned the

play20:01

chat session refresh so that can impact

play20:04

quality results again it's it's a bit of

play20:06

a user uh experience or user air

play20:08

situation versus there's something wonky

play20:11

happening on the on the back end so you

play20:12

need to be cognizant of that quality

play20:14

Randomness selection or lockdown do you

play20:16

want people to dial that temperature and

play20:18

the randomness up up up or down for the

play20:21

quality control and then chunking

play20:23

options uh as well so how are you

play20:25

chunking this data that's becoming more

play20:26

and more important as people scale uh

play20:29

and how you're chunking strategies and

play20:30

and how you look at that so it kind of

play20:33

wraps up some of the discussion parts of

play20:35

this uh did want to go into a short demo

play20:38

before we open in the session up to to

play20:40

Q&A and so what we built as we were kind

play20:43

of on our journey doing this uh at PWC

play20:47

was we learned that we don't want to

play20:50

build the same thing over and over for

play20:52

each use case we don't need a new Q&A

play20:55

front to back like let's use the same

play20:57

basic uh Q&A feature and function and

play21:00

then the data that's going in and the

play21:02

Persona that's using it is the only

play21:03

thing that's changing I don't need to

play21:04

you know build something ground up so we

play21:06

we we said let's try to work smarter not

play21:09

harder and put something in place that

play21:11

allows rapid experimentation has this

play21:13

kind of persona basis um utilizes these

play21:16

patterns over and over so we get scale

play21:19

and then has some core controls that we

play21:20

think people are going to need to start

play21:21

putting into into these platforms so

play21:24

that's what I'm going to show you a

play21:24

little bit of a demo on here it'll be

play21:26

kind of a quick run through so let me go

play21:29

ahead and you guys switch over to the

play21:31

the demo screen now so this is um the

play21:35

homepage uh of this

play21:37

application uh first thing to point out

play21:40

at the bottom you have the Chicklets of

play21:42

the experiences that you can engage with

play21:43

so you see chat explore web explore

play21:45

business insights and Gemini and so

play21:47

immediately we we decided what you're

play21:50

trying to do dictates again a different

play21:52

kind of backend and data interaction

play21:54

whether you're doing interactions with

play21:56

unstructured data in this chat EXP

play21:58

whether you want to index internet or or

play22:01

websites and that content whether you're

play22:03

looking at structured data for business

play22:04

insights so big query tables CSV EXL

play22:07

files uh or whether you want uh Gemini

play22:10

to do like image videos audio so we can

play22:13

an immediate Point here for people to

play22:15

interact with um the use case based you

play22:19

know this application based on the use

play22:20

case that they're looking at and then

play22:22

another thing that that we knew was

play22:25

important is as people come into this

play22:27

application that there's role-based

play22:29

security we know kind of what Persona

play22:31

that you are what you should you should

play22:33

have access to and what you shouldn't

play22:34

have access to and so as you look at at

play22:36

some things that that exist are right

play22:39

there's there's user

play22:41

Administration uh which is just normal

play22:43

role-based access privileges that you're

play22:44

all used to and applications we all

play22:46

interact with and then well as well as

play22:48

like personas so if I want to look at

play22:50

various personas for HR and develop

play22:52

those and this is where again you put

play22:54

the context in what is this agent trying

play22:56

to do uh I want to script that I want to

play22:58

control that or do I want to let a user

play23:01

Define that so that you know some of the

play23:03

things that we do just uh in the

play23:05

beginning as well as just some things

play23:06

around dashboards uh and metrics that we

play23:10

we look at in terms of the usage of the

play23:12

application how are people using it

play23:14

what's the interaction are they they

play23:15

liking the results so some kind of core

play23:18

uh things that we built in from an

play23:20

overall perspective and then also what

play23:22

we started to find uh was that we should

play23:25

give people more flexibility to manip at

play23:28

as opposed to having to go in the code

play23:30

all of the time and choose your chunk

play23:32

size or the the region you want to run

play23:34

this or the temperature start to have

play23:36

settings here that somebody could could

play23:37

dictate where do I want to run this

play23:39

which region Google region do I want to

play23:40

run it in what's the temperature output

play23:43

uh token limits and those obviously keep

play23:45

going up and up uh safety threshold so I

play23:47

want block you know if it doesn't know

play23:49

the answer do I want to block it um some

play23:53

few most um the model type so I want

play23:56

chat bison text bison Gemini so let

play23:58

people start choosing the the the model

play24:00

type I want to choose the model type do

play24:02

some interactions come choose a

play24:03

different model type uh do some

play24:05

different interactions this gets into

play24:06

the side by side and then again this

play24:08

gets into like chunking sizes overlap so

play24:10

let people quickly come in configure

play24:12

something iterate without having to

play24:14

necessarily be programmers all of the

play24:16

time let business people more and more

play24:18

try to to again rapidly experiment with

play24:21

this technology um going back to then

play24:25

you know really as you look at them

play24:27

jumping into the chat uh Explorer

play24:29

function here and I'll just use this

play24:31

will be an example of walking through an

play24:33

interaction with a a set of data so here

play24:36

I would start a new session for the user

play24:39

and this is going to maintain session

play24:40

IDs so same thing the history so if I

play24:42

want to go back to a previous session

play24:44

look at those results uh if I want to

play24:47

name these sessions so we're we're doing

play24:48

some processes for some companies and

play24:50

these are called you know control one

play24:52

session this date and it's going to you

play24:54

know control two so you have testers and

play24:57

and other things happen you know this

play24:58

where they really want to document this

play25:00

well uh you can then you know download

play25:03

results once you get them but this is

play25:05

here in your AR personas where you

play25:07

choose the Persona and again this would

play25:09

be dictated on the role you are so I see

play25:11

everything as I'm admin but when you log

play25:13

into this application you would only see

play25:14

the personas you have access to to be uh

play25:18

and so in this case I'm able to choose

play25:19

an HR policy it it pulls in the context

play25:21

and you see some other things here of

play25:23

whether you want to allow responses from

play25:25

you know llm itself show videos images

play25:27

randomness again all those things we

play25:29

could control based on who's logging in

play25:32

I can hide all those things say I don't

play25:33

want them to have access to do that

play25:35

actually I don't want them to even be

play25:36

able to upload a document I just want

play25:38

them to interact you'll see here these

play25:40

are indexes so these are precured data

play25:43

sets that somebody's already you know

play25:45

you know done the grounding ragged a

play25:47

process and now this is accessible um

play25:49

for that user that you this situation a

play25:51

user could upload a file so this is

play25:53

where you would actually go in choose

play25:55

the files from my local from Google

play25:56

Drive uh the doc types and this is where

play25:59

I can uh give it an index name it goes

play26:02

off does the upload executes and then

play26:04

that index is now available here in my

play26:07

my index here so in this situation I

play26:09

I'll choose a uh an index

play26:13

here and uh this was an HR situation so

play26:15

let's say I have somebody that's looking

play26:17

at HR policies and just wants to know

play26:19

hey what's the gift limit for for

play26:21

clients and so it's going to go look at

play26:23

that um index and say in this situation

play26:27

you know 100 $ per person and there's

play26:29

some

play26:30

exceptions uh with the program Etc a

play26:34

couple things here so first is the

play26:35

citation I talked about well where did

play26:37

you get this answer from like oh well it

play26:39

was from this document gifts

play26:40

entertainment policy June

play26:42

2023 uh so I know where that's coming

play26:44

from and kind of the paragraphs that

play26:46

that had got on that as well as I can

play26:48

give it some thumbs up thumbs down on

play26:50

the responses like did I like this

play26:51

response did I not like this response

play26:53

and we track all that on the back end

play26:55

again because if you're seeing people

play26:57

put a lot of thumbs down

play26:59

that could be an opportunity for

play27:00

training like they're writing bad

play27:01

prompts it could mean the data you know

play27:04

needs to be improved uh or other things

play27:06

right the context situation we talked

play27:08

about the process they're going through

play27:09

you know to do that U might need be

play27:11

improved so uh those are things that we

play27:14

look at there and again they can

play27:15

download all of these results here um

play27:18

and those similar things are available

play27:19

as you look across uh this for for these

play27:23

areas so if you were to go into the

play27:24

business Insight space you'll see that

play27:25

as opposed to necessarily documents you

play27:27

have things like files and tables that

play27:30

are that are accessible here so if I

play27:32

were to click on this you know I could

play27:33

see that maybe there's some big creery

play27:35

data sets and other things that we'll

play27:37

we'll load in here in in free form um

play27:40

and you know again same all of the same

play27:42

personas that exist and in those

play27:43

situations you're going to um you you'll

play27:46

see here like show charts I'm not going

play27:48

to Doo this but uh this is where you

play27:50

would do I want to see sales by quarter

play27:54

for the past year and again this is

play27:55

going to call the structured data

play27:57

backend um for that so we just wanted to

play27:59

spend a few minutes showing you guys

play28:01

some of like how do these controls start

play28:02

to come to life how do you start to

play28:04

configure these systematically so that

play28:06

again not everybody's not building this

play28:08

from from ground up every single time um

play28:10

and that the organization can kind of

play28:13

have some guard rails for for what

play28:15

you're letting people do or or not do um

play28:18

so anyway I think that's really what uh

play28:20

we had in terms of the demo uh let me

play28:22

switch back over to the final couple

play28:25

slides I think um

play28:32

all right so um just a wrap up session a

play28:34

couple more ways to engage with uh PWC

play28:37

folks if anybody's interested here's a

play28:39

QR code you can get information on uh on

play28:42

any of those events for for tomorrow

play28:44

breakfast and a couple sessions and a a

play28:46

happy hour tomorrow uh for those who are

play28:48

interested uh as well as um you know I

play28:52

know Google has some some other things

play28:55

uh that you can get engaged with and you

play28:57

can scan that QR code for that

play29:00

[Music]

play29:04

information

Rate This

5.0 / 5 (0 votes)

Связанные теги
Datos y IATransformación DigitalPwCGoogleGenerative AIEstrategias de NegocioSeguridad de DatosInteligencia ArtificialModelos de LLMGestión de CambiosControl de Calidad
Вам нужно краткое изложение на английском?