开启open-webui:一键集成ollama,打造革命性GPT体验

zhouhui
2 Mar 202409:39

Summary

TLDRThe video introduces the open Web UI project, a multimodal AI web client software with capabilities for document handling, local RAG integration, web browsing, voice input, AI painting, and model conversations. It demonstrates the installation process of the ollama Web version and open Web UI, including the setup of Docker and Pinokio for a seamless experience. The software's functionality is showcased through interactive examples, including chat, document analysis, and multimodal model responses to images and voice.

Takeaways

  • 💻 The script introduces 'Ollama Web UI', a project that has evolved from 'Ollama-Web UI' and is now called 'Open Web UI'.
  • 🔧 Ollama Web UI supports multimodality, including document handling, local RAG integration, web browsing, voice input, AI painting, and multiple model dialogues.
  • 📥 It also offers OpenAI API integration, external Ollama server connection, and backend reverse proxy support, making it a versatile chat client software.
  • 🛠️ The script provides a step-by-step guide on how to download and install Ollama, including dealing with antivirus software that might flag it as a virus.
  • 📈 The Ollama software has been updated to version 0.1.28, and the installation process involves navigating to a specific directory and running commands in CMD.
  • 🔗 The script mentions the need for downloading models, such as the 'gemma:7b' model, which can be initiated if not already present on the user's system.
  • 🌐 For Open Web UI, the installation is recommended to be done via Docker, with separate frontend and backend components implemented in Node.js and Python, respectively.
  • 📦 The script suggests using a software called 'Pinokio' to install Open Web UI if Docker installation is problematic.
  • 🖼️ Open Web UI supports uploading and analyzing documents, as demonstrated by the script with a project manager checklist and the generation of a CSV file.
  • 🎨 The software also supports multimodal capabilities, allowing users to analyze images and receive descriptions of their content.
  • 🔊 Additionally, Ollama Web UI has text-to-speech functionality, as showcased by the script with a sample voice output of an image description.

Q & A

  • What is the new name of the project previously known as 'ollama-Web UI'?

    -The project has been renamed to 'open Web UI'.

  • What are some of the key features of the open Web UI project?

    -Key features include multimodal support, document handling, local RAG integration, web browsing capabilities, voice input support, AI painting, multiple model dialogues, OpenAI API integration, external Ollama server connection, and backend reverse proxy support.

  • What is the current version of the Ollama software mentioned in the script?

    -The current version of the Ollama software mentioned is V0.1.28.

  • Why might the Ollama software installation be flagged as a virus by some antivirus programs?

    -The installation might be flagged as a virus due to its nature or certain components that are not recognized by the antivirus software, prompting the user to temporarily disable antivirus protection during installation.

  • What is the approximate size of the Ollama software installation?

    -The approximate size of the Ollama software installation is around 385 MB.

  • How can one start the Ollama software after installation?

    -After installation, one can start the Ollama software by launching the desktop CMD command and entering the appropriate command to initiate the software.

  • What is the purpose of the 'Pinokio' software mentioned in the script?

    -Pinokio is used to install the open Web UI project, as it provides an alternative method to install software when the user encounters issues with Docker Desktop.

  • What are the prerequisites for installing the open Web UI project using Pinokio?

    -The prerequisites include the installation of conda, git, zip, Python, and Visual Studio (VS), which are automatically installed by Pinokio before proceeding with the open Web UI installation.

  • How does the open Web UI project handle document analysis and output?

    -The open Web UI project can analyze documents and generate outputs in various formats, such as CSV, based on user prompts and the capabilities of the integrated models.

  • What is the multimodal capability of the Ollama software, and how is it demonstrated in the script?

    -The multimodal capability of the Ollama software allows it to process and understand different types of data, such as images. In the script, it is demonstrated by the software's ability to analyze a photo and describe its content.

  • What additional features does the open Web UI project support besides document analysis and multimodal capabilities?

    -In addition to document analysis and multimodal capabilities, the open Web UI project also supports features like text-to-speech (TTS) and potentially other AI-driven functionalities.

Outlines

00:00

😀 Introduction to Open Web UI and Ollama Web Version

The video script introduces the Ollama Web version, a multimodal client software that integrates various features such as document support, local RAG integration, web browsing, voice input, AI painting, and multiple model dialogues. It also supports OpenAI API integration, external Ollama server connections, and backend reverse proxy support. The project, formerly known as ollama-Web UI, has been renamed to open Web UI and is available in version V0.1.28. The script guides viewers through the installation process, which includes downloading the software, disabling antivirus software to avoid false positives, and navigating to the installation directory. It also mentions the need to download a model, such as the gemma:7b model, for the software to function properly.

05:01

😲 Exploring Open Web UI's Features and Installation Process

The second paragraph delves into the features of the Open Web UI project, highlighting its capabilities for chat, attachments, and multimodal support. It explains the process of using the project with a focus on the installation of the Pinokio software, which is used to install the open Web UI project due to issues with Docker Desktop on the presenter's computer. The script details the steps to download and install necessary software prerequisites like conda, git, zip, Python, and Visual Studio before proceeding with the installation of the open Web UI project. It also demonstrates the functionality of the web UI, including the ability to upload documents and receive responses in CSV format, as well as the integration of multimodal models for image recognition and text-to-speech capabilities. The video concludes by emphasizing the robustness of the open Web UI project compared to other similar tools like Chatbox.

Mindmap

Keywords

💡Ollama Web Version

The Ollama Web Version refers to the web-based iteration of the Ollama project, which is an AI-driven platform with enhanced capabilities. It is the central focus of the video, showcasing its evolution from 'ollama-Web UI' to 'open Web UI'. This term is integral to the video's theme as it represents the software being discussed, which includes a variety of features such as multimodal support, document handling, and AI functionalities.

💡Multimodal

In the context of the video, 'multimodal' describes the ability of the Ollama Web Version to process and interact with different types of data inputs and outputs, such as text, images, and voice. This feature is crucial as it expands the platform's capabilities beyond traditional text-based interactions, allowing for a richer user experience.

💡RAG (Retrieval-Augmented Generation)

RAG is a machine learning model that combines retrieval and generation to enhance AI's ability to provide relevant and contextual responses. In the video, RAG is mentioned as a feature that is supported and possibly enhanced within the Ollama Web Version, indicating the platform's advanced capabilities in handling complex AI tasks.

💡Web Browsing Function

The term 'web browsing function' refers to the capability of the Ollama Web Version to access and interact with web content. This is an important feature as it allows the AI to retrieve and utilize information from the internet, thus providing more comprehensive and up-to-date responses.

💡Voice Input Support

Voice Input Support is a feature that allows users to interact with the Ollama Web Version using spoken language. This feature is highlighted in the script as part of the platform's multimodal capabilities, demonstrating the AI's ability to process and respond to verbal commands.

💡AI Painting

AI Painting is a creative feature of the Ollama Web Version that enables the AI to generate visual art. The script mentions this feature as part of the platform's advanced capabilities, showcasing the AI's versatility in artistic creation.

💡Model Dialogue

Model Dialogue refers to the ability of the AI to engage in conversations using different models, which can be tailored to specific tasks or styles of communication. The video script mentions this feature to emphasize the flexibility and adaptability of the Ollama Web Version.

💡OpenAI API Integration

OpenAI API Integration indicates that the Ollama Web Version can connect and utilize the application programming interfaces provided by OpenAI, a leading company in AI research. This integration is significant as it allows the platform to leverage advanced AI models and capabilities from OpenAI.

💡External Ollama Server Connection

The ability to connect to an 'External Ollama Server' suggests that the platform can interact with other instances of Ollama or related services outside the local environment. This is an important aspect of the video's narrative, as it highlights the platform's connectivity and scalability.

💡Backend Reverse Proxy Support

Backend Reverse Proxy Support refers to the platform's capability to use a reverse proxy in its backend infrastructure. This is a technical feature that can enhance security and performance, and it is mentioned in the script to illustrate the robustness of the Ollama Web Version's architecture.

💡Pinokio

Pinokio is a software mentioned in the script used for installing the open Web UI project. It is an example of a tool that simplifies the installation process of complex software projects, such as setting up the necessary environment for the Ollama Web Version.

💡TTS (Text-to-Speech)

TTS, or Text-to-Speech, is a technology that converts written text into spoken language. In the context of the video, TTS is demonstrated as a feature of the Ollama Web Version, showcasing the platform's ability to not only process text but also to generate spoken responses.

Highlights

Ollama Web version supports multimodality, including document handling, local RAG integration, and web browsing capabilities.

The Web version of Ollama offers voice input support, AI painting, and the ability to converse with multiple models.

Integration with OpenAI API and external Ollama servers is possible, along with backend reverse proxy support.

The project, previously known as Ollama-Web UI, has been renamed to Open Web UI, featuring a chat and attachment functionality.

Ollama has been updated to version V0.1.28, with installation instructions provided for Windows users.

Users may need to disable antivirus software temporarily during the installation of Ollama to avoid false positives.

The installation directory for Ollama is typically under C:/documents/APP date, with a size of approximately 385MB.

Ollama requires the downloading of models, such as the gemma model, which can be initiated through specific commands.

The open Web UI project is hosted on GitHub, with installation instructions available for Docker.

The open Web UI is composed of a frontend implemented in Node.js and a backend in Python.

Pinokio software is demonstrated for installing the open Web UI project due to issues with Docker Desktop on the presenter's computer.

The open Web UI integrates with mainstream projects like Lobo CHAT and provides a comprehensive setup guide.

After installation, users can access the Web UI through a provided website link.

The Web UI supports uploading attachments and documents, demonstrated with a project manager checklist.

The system can analyze documents and output information in CSV file format based on user prompts.

Multimodal capabilities of the Web UI are showcased with the ability to analyze and describe the content of images.

The Web UI also supports text-to-speech functionality, as demonstrated with a description of an image.

The open Web UI project is praised for its robust features, including RAG support, document handling, and multimodality, offering more capabilities than previously discussed Chatbox.

Transcripts

play00:00

ollama web版本支持多模态

play00:03

支持文档 、本地 RAG 集成、网页浏览功能

play00:06

语音输入支持、AI绘画、多个模型对话

play00:11

OpenAI API 集成

play00:12

外部 Ollama 服务器连接

play00:14

后端反向代理支持

play00:16

基多功能一体的

play00:17

chatgptweb客户端软件来了

play00:20

今天给大家介绍一个项目

play00:21

是ollama的Web版

play00:23

它的早期项目叫ollama-Web UI

play00:26

现在项目更名为open Web UI

play00:29

它主要实现哪些功能呢

play00:30

可以实现聊天

play00:32

实现附件

play00:33

包括我们现在的多模态的附件

play00:36

就是聊天和附附件功能

play00:38

然后也支持RAG的一个增强

play00:40

那么下面给大家介绍下

play00:42

这个项目怎么去用的

play00:43

那我们首先回到上期视频当中

play00:46

我们用到了一个 ollama

play00:47

这个ollama的这个安装地址

play00:49

在这个位置

play00:50

目前他已经升到V0.1.28版本

play00:52

我们找到一个Windows版

play00:54

点击这个下载

play00:56

因为我提前已经下载完成了

play00:58

下载完成之后

play00:59

这里面有一个这个地方

play01:00

我们检查双击

play01:01

因为现在目前的话

play01:03

这个软件安装的时候

play01:04

可能会认为是一个病毒

play01:06

我们首先把杀毒软件给关闭掉

play01:10

然后把360的也去掉

play01:12

退出然后这时候我们进行点击安装

play01:16

点下一步

play01:17

它默认会装在c盘的document APP date

play01:21

这个目录下面去

play01:22

然后我们找一下安装目录

play01:27

程序安装是在这个目录下面

play01:29

然后大概是有100多兆

play01:33

哦 385M

play01:35

然后接下来

play01:35

我们就可以启动桌面板的CMD命令

play01:40

然后我们测一下

play01:45

好这个版本是0.1.28木

play01:47

已经安装好了

play01:49

接下来我们就启动这个

play01:51

ollama

play01:51

我们还是以上期的谷歌的gemma为例

play01:54

我们下载这个模型

play01:57

我们复制这个命令然后

play02:01

啊这时候我已经那个启起来了

play02:04

因为之前我在电脑上

play02:06

已经装过gemma:7b模型

play02:08

所以它直接可以启起来

play02:10

默认的情况下他没有模型的话

play02:12

可以有一个下载过程

play02:13

这个大家就稍等一下

play02:14

这个就是我们的模型的地址

play02:17

跟那个我们之前的docker文件差不多

play02:22

然后它这里面有一堆文件

play02:24

看就是这个

play02:25

大家看到了

play02:26

这就是相当于镜像

play02:28

而且模型镜像做成一个docker模式

play02:31

刚才ollama已经给大家安装过了

play02:33

接下来

play02:33

我们看一下这个open webui的安装

play02:36

open UI的项目地址是在github这个位置

play02:38

然后

play02:39

然后它的安装

play02:40

官方安装是由docker

play02:42

来方式来进行安装的

play02:44

因为它分前端和后端

play02:45

前端是nodejs

play02:47

后端是Python实现的

play02:48

然后因为我本地电脑上docker Desktop

play02:51

这个有点问题

play02:52

所以我这边就没有安装成功

play02:54

接下来我们给大家使用一个叫

play02:56

Pinokio来给大家进行安装这个东西

play03:00

给大家演示一下Pinokio这个软件

play03:02

安装这个项目

play03:03

在github的这个这个项目的这个地址

play03:06

然后我们点开这个官网

play03:08

查官网网站打开之后

play03:11

这里面有相关的说明

play03:14

比如说这是一个油管的一个视频

play03:16

然后下面呢是集成了

play03:18

就是像open Web UI Lobo CHAT就是这些主流的

play03:23

这这个github的一个项目

play03:25

我们打开这个dowload

play03:30

然后这里面会有Windows Mac

play03:33

嗯然后Linux我们选Windows

play03:36

然后点击下载

play03:37

然后按照这个路步骤

play03:39

就是安装就可以了

play03:41

我这里面已经提前安装好了

play03:43

这就不给大家演示

play03:44

安装完成之后我们桌面上会有

play03:46

打开软件之后

play03:47

这里面有一个访问这个网站

play03:51

然后这样的话

play03:52

他一个就我们看到刚才网站上

play03:56

Web,Web UI

play03:59

lobe chat这里面都有相关的这些

play04:01

我们只要在这里面安装好就行

play04:03

包括ollama

play04:05

对吧我们接下来我们下载

play04:08

呃就先选这个吧

play04:10

选中它之后

play04:11

然后这里面是先下载dowload

play04:14

这一块的话

play04:14

首先它会把目前的conda,git

play04:17

z i p

play04:18

还有py还有vs等

play04:20

这些软件先提前先安装

play04:22

安装完之后

play04:23

然后再来我们接下来看一下这个

play04:25

点开这个

play04:26

好了我们看窗口这边已经弹出来

play04:28

就是第一个已安装完了

play04:30

conda安装完

play04:31

第二个是装 git

play04:34

然后稍等片刻

play04:35

我们看到这个已经安装vs了

play04:38

然后这个弹出一个vs studio的

play04:42

应该安装这个界面

play04:43

这个稍等它一遍

play04:44

下载安装

play04:46

安装完成之后

play04:47

这里面有一个安装complete click OK

play04:50

我们点确定

play04:52

好这时候就已经完成了

play04:57

我们打开Pinokio

play04:58

然后这里面进入一个

play05:00

Pinokio的这些项目的这个地址

play05:03

这个各种网站的地址

play05:04

然后我们打开这个Web UI

play05:08

然后这里面我们点击download下载

play05:11

这时候它会保存save as 这个Web UI GT

play05:14

然后下载下来这个

play05:16

稍等片刻

play05:19

好这个是已经下载完成了

play05:22

然后它会进入到一个叫open Web UI的GT

play05:25

这个项目

play05:26

然后我们点开这个项目子模块

play05:29

这里面有一个install

play05:30

我们点开它

play05:34

然后接下来就是对那个Pinokio

play05:37

然后在github里面

play05:39

下Web open UI的这个项目

play05:41

然后只剩一个NPU的编译打包

play05:44

我们稍等片刻

play05:50

啊这一步就是安装完成

play05:52

运行完成之后

play05:53

我们点击start

play05:56

然后这时候他们需要提示我装

play05:59

安装啦ollama

play06:00

这时候我们这边已经启起来了

play06:02

我们点开这个

play06:07

这个ollama

play06:07

这个页面就已经打开了

play06:09

我们注册一下

play06:11

然后填入名字

play06:24

我们确保这边的ollama模型启动

play06:27

然后输入127.0.0.1:8000

play06:30

这时候启起来

play06:32

然后我们下拉选项

play06:33

这里面有一个那个刚才的gemma

play06:37

7b模型

play06:38

然后我们可以聊天

play06:50

好这个他已经写出来了

play06:52

然后接下来我们再可以测一下这个

play06:55

webui

play06:58

它可以支持那种附件和文档上传

play07:01

我们测一下

play07:02

我们试一下这个项目经理清单

play07:06

然后给它一个提示词比如说

play07:10

请分析一下项目

play07:11

进行清单

play07:12

并将这个文件格式输出

play07:13

CSV文件格式

play07:15

把生成出来的CSV文件

play07:16

格式与代码行输出

play07:18

我们看下效果

play07:23

好这时候他就按照我们刚才的要求

play07:25

把这边这个文档解析出来了

play07:29

然后生成姓名年龄所有代

play07:31

然后是否有PMP

play07:32

然后这是他的代码

play07:34

这个代码我个人觉得他生成的

play07:37

可能有点随机性

play07:38

但是至少他能读懂

play07:40

可能跟我们后端的那个gemma

play07:41

模型是有关系的

play07:44

这个webui还支持多模态

play07:46

我们接下来在这官方上下载这个模型

play07:49

这个是多模态的模型

play07:51

我们点开它

play07:54

然后找一个简单的1.6的这个模型吧

play07:57

我们把这个装一下

play08:02

然后我们开CMD窗口

play08:03

这个我们下一下模型

play08:04

稍等一下

play08:06

好了这个模型已经下载完成了

play08:09

我们看一下这

play08:15

play08:17

这时候就有一个ollama

play08:19

llava的多模态模型

play08:21

我们测一下

play08:21

我们选择一张照片

play08:35

我们问请这张照片的内容是什么

play08:37

让他回答一下

play08:42

好他已经回答出来

play08:43

这张照片是一个关于长发的脸部表情

play08:45

一个男姓

play08:46

正是对面一种视觉的圆

play08:48

这个就是这个Olama

play08:51

它还支持一些其他的一些功能

play08:54

比如说我们现在看到的声音

play08:56

然后它的图像

play08:57

图像可以直接AUTOMATIC1111

play09:00

因为这个时间关键就不给大家演示了

play09:02

然后声音这块我们给大家听一下

play09:06

这面有个TTS

play09:07

这张照片的内容

play09:09

play09:09

一张关于长发和脸部表情的男性画像

play09:13

他正视镜头

play09:15

给人以一种啊

play09:16

总之这个这个Web就是open Web UI

play09:21

这个项目功能还是非常强大的

play09:23

它支持一些RAG

play09:25

然后文档

play09:26

然后多模态等功能

play09:28

比之前我们说的那个Chatbox

play09:29

可能功能更强大一些啊

play09:32

本期的分享就到这里

play09:33

全部结束了

play09:35

谢谢大家的观看

Rate This

5.0 / 5 (0 votes)

Related Tags
AI ChatMultimodalWeb UIOllama WebRAG IntegrationDocument SupportVoice InputAI ArtModel DialogueOpenAI APIDocker Deployment