JeVoisBase  1.22
JeVois Smart Embedded Machine Vision Toolkit Base Modules
Share this page:
Loading...
Searching...
No Matches
PyLLM.PyLLM Class Reference

Interact with a large-language model (LLM) or vision-language model (VLM) in a chat box. More...

Public Member Functions

 __init__ (self)
 Constructor.
 
 init (self)
 JeVois optional extra init once the instance is fully constructed.
 
 uninit (self)
 JeVois optional extra un-init before destruction.
 
 setModel (self, name)
 
 runmodel (self)
 Run the LLM model asynchronously.
 
 processGUI (self, inframe, helper)
 Process function with GUI output on JeVois-Pro.
 

Public Attributes

 chatbox
 
 messages
 
 client
 
 statusmsg
 
 pc
 
 modelname
 
 setModel
 
 task
 
 generator
 
 currmsg
 

Detailed Description

Interact with a large-language model (LLM) or vision-language model (VLM) in a chat box.

This module uses the ollama framework from https://ollama.com to run a large language model (LLM) or vision language model (VLM) right inside the JeVois-Pro camera. The default model is tinydolphin, an experimental LLM (no vision) model with 1.1 Billion parameters, obtained from training the TinyLlama model on the popular Dolphin dataset by Eric Hartford.

For now, the model runs fairly slowly and on CPU (multithreaded).

Try asking questions, like "how can I make a ham and cheese sandwich?", or "why is the sky blue?", or "when does summer start?", or "how does asyncio work in Python?"

Also pre-loaded on microSD is moondream2 with 1.7 Billion parameters, a VLM that can both answer text queries, and also describe images captured by JeVois-Pro, and answer queries about them. However, this model is very slow as just sending one image to it as an input is like sending it 729 tokens... So, consider it an experimental feature for now. Hopefully smaller models will be available soon.

With moondream, you can use the special keyword /videoframe/ to pass the current frame from the live video to the model. You can also add more text to the query, for example:

user: /videoframe/ how many people? moondream: there are five people in the image.

If you only input /videoframe/ then the following query text is automatically added: "Describe this image:"

This module uses the ollama python library from https://github.com/ollama/ollama-python

More models

Other models can run as well. The main question is how slowly, and will we run out or RAM or out of space on our microSD card? Have a look at https://ollama.com for supported models. You need a working internet connection to be able to download and install new models. Installing new models may involve lengthy downloads and possible issues with the microSD getting full. Hence, we recommend that you restart JeVois-Pro to ubuntu command-line mode (see under System tab of the GUI), then login as root/jevois, then:

df -h / # check available disk space ollama list # shows instaled models ollama rm somemodel # delete some installed model if running low on disk space ollama run newmodel # download and run a new model, e.g., tinyllama (<2B parameters recommended); if you like it, exit ollama (CTRL-D), and run jevoispro.sh to try it out in the JeVois-Pro GUI.

Disclaimer

LLM research is still in early stages, despite the recent hype. Remember that these models may return statements that may be inaccurate, biased, possibly offensive, factually wrong, or complete garbage. At then end of the day, always remember that: it's just next-token prediction. You are not interacting with a sentient, intelligent being.

Author
Laurent Itti
Display Name:
PyLLM
Videomapping:
JVUI 0 0 30.0 YUYV 1920 1080 30.0 JeVois PyLLM
Email:
itti@usc.edu
Address:
University of Southern California, HNB-07A, 3641 Watt Way, Los Angeles, CA 90089-2520, USA
Main URL:
http://jevois.org
Support URL:
http://jevois.org/doc
Other URL:
http://iLab.usc.edu
License:
GPL v3
Distribution:
Unrestricted
Restrictions:
None

Definition at line 76 of file PyLLM.py.

Constructor & Destructor Documentation

◆ __init__()

PyLLM.PyLLM.__init__ (   self)

Constructor.

Definition at line 79 of file PyLLM.py.

Member Function Documentation

◆ init()

PyLLM.PyLLM.init (   self)

JeVois optional extra init once the instance is fully constructed.

Definition at line 87 of file PyLLM.py.

◆ processGUI()

PyLLM.PyLLM.processGUI (   self,
  inframe,
  helper 
)

◆ runmodel()

PyLLM.PyLLM.runmodel (   self)

Run the LLM model asynchronously.

Definition at line 122 of file PyLLM.py.

References PyLLM.PyLLM.chatbox, PyLLM.PyLLM.currmsg, and PyLLM.PyLLM.generator.

Referenced by PyLLM.PyLLM.processGUI().

◆ setModel()

PyLLM.PyLLM.setModel (   self,
  name 
)

Definition at line 112 of file PyLLM.py.

References PyLLM.PyLLM.task.

◆ uninit()

PyLLM.PyLLM.uninit (   self)

JeVois optional extra un-init before destruction.

Definition at line 106 of file PyLLM.py.

Member Data Documentation

◆ chatbox

PyLLM.PyLLM.chatbox

Definition at line 80 of file PyLLM.py.

Referenced by PyLLM.PyLLM.processGUI(), and PyLLM.PyLLM.runmodel().

◆ client

PyLLM.PyLLM.client

Definition at line 82 of file PyLLM.py.

Referenced by PyLLM.PyLLM.processGUI().

◆ currmsg

PyLLM.PyLLM.currmsg

Definition at line 130 of file PyLLM.py.

Referenced by PyLLM.PyLLM.processGUI(), and PyLLM.PyLLM.runmodel().

◆ generator

PyLLM.PyLLM.generator

Definition at line 116 of file PyLLM.py.

Referenced by PyLLM.PyLLM.processGUI(), and PyLLM.PyLLM.runmodel().

◆ messages

PyLLM.PyLLM.messages

Definition at line 81 of file PyLLM.py.

Referenced by PyLLM.PyLLM.processGUI().

◆ modelname

PyLLM.PyLLM.modelname

◆ pc

PyLLM.PyLLM.pc

Definition at line 89 of file PyLLM.py.

◆ setModel

PyLLM.PyLLM.setModel

Definition at line 102 of file PyLLM.py.

◆ statusmsg

PyLLM.PyLLM.statusmsg

Definition at line 83 of file PyLLM.py.

Referenced by PyLLM.PyLLM.processGUI().

◆ task

PyLLM.PyLLM.task

Definition at line 115 of file PyLLM.py.

Referenced by PyLLM.PyLLM.processGUI(), and PyLLM.PyLLM.setModel().


The documentation for this class was generated from the following file: