- SliceOfAI
- Posts
- Moondream 2B and 0.5B Vision model ๐
Moondream 2B and 0.5B Vision model ๐
An open source lightweight vision model for your local AI stack

I have been tinkering around with the new Moondream 2B model (small/ tiny vision model) that you can run on your local system.
Limited memory requirements, runs on CPU no GPU required. ๐ค
Common capabilities are to be able to:
Querying images
Captioning
Object detection
Get co-ordinates of items on the image.
Use cases can be:
Good for OCR, doing a POC on this!
Gaze detection
Extracting Structured data in JSON, XML and Markdown.
Good part is it so light weight you can run it on a Raspberry Pi, used for some IOT use cases also.
Checkout the release note here: