OpenMV 韌體 v5.0.0 · 基於 MicroPython v1.28 · 文件建置於 2026年6月19日

機器視覺，
化繁為簡。

即時人臉偵測、AprilTag 追蹤、QR 掃描與 YOLO，全部在裝置上以純 MicroPython 執行，無需主機電腦，無需雲端。

快速入門指南 API 參考文件找到您的開發板

最新動態閱讀 OpenMV 韌體 v5.0.0 更新日誌

開啟 IDE

下載並安裝適用於 Windows、macOS 或 Linux 的 OpenMV IDE，然後啟動 IDE。

連接相機

透過 USB 將 OpenMV Cam 插入電腦。準備就緒時，藍色心跳 LED 會閃爍。

執行您的第一個腳本

點擊 IDE 中的插頭圖示連接按鈕，然後按下綠色播放箭頭執行您的第一個腳本。

Hello world

範例

import csi
import time
import ml
from ml.postprocessing.ultralytics import YoloV8

csi0 = csi.CSI()
csi0.reset()
csi0.pixformat(csi.RGB565)
csi0.framesize(csi.VGA)
csi0.snapshot(time=2000)  # let AWB/AGC stabilize

# Built-in single-class person detector model.
model = ml.Model("/rom/yolov8n_192.tflite",
                 postprocess=YoloV8(threshold=0.4))
clock = time.clock()

while True:
    clock.tick()
    img = csi0.snapshot()
    # predict returns a list per class of ((x, y, w, h), score) tuples.
    for class_dets in model.predict([img]):
        for rect, score in class_dets:
            img.draw_rectangle(rect, color=(0, 255, 0))
    print(clock.fps(), "fps")

即時人物追蹤

內建 YOLOv8 模型為單類別人物偵測器——以 int8 量化並預載於 ROM 中。

從 /rom/yolov8n_192.tflite 載入——無需 SD 卡或下載。

在搭載 NPU 的開發板上即時運行——OpenMV N6 與 AE3。

攜帶您自己在 Roboflow 上訓練的 YOLOv8 模型，以相同方式載入。

import csi
import math
import time

csi0 = csi.CSI()
csi0.reset()
csi0.pixformat(csi.RGB565)
csi0.framesize(csi.QVGA)
csi0.snapshot(time=2000)  # let AWB/AGC stabilize
csi0.auto_gain(False)
csi0.auto_whitebal(False)

clock = time.clock()

while True:
    clock.tick()
    img = csi0.snapshot()
    for tag in img.find_apriltags():
        img.draw_detection(tag, color1=(255, 0, 0), color2=(0, 255, 0))
        deg = math.degrees(tag.rotation)
        print("ID %d  rotation %.1f deg" % (tag.id, deg))
    print(clock.fps(), "fps")

定位並辨識 AprilTags

AprilTags 是二維基準標記——對運動模糊和部分遮擋具有強健性，並可提供完整的 3D 姿態資訊。

內建偵測器——無需模型檔案或訓練。

回傳 ID 及完整 6-DoF 姿態——x/y/z 平移與 x/y/z 旋轉。

適用於機器人校準、AR 標記與室內定位。

import csi
import time
import ml
from ml.postprocessing.mediapipe import BlazeFace

csi0 = csi.CSI()
csi0.reset()
csi0.pixformat(csi.RGB565)
csi0.framesize(csi.VGA)
csi0.window((400, 400))  # square window for best results
csi0.snapshot(time=2000)  # let AWB/AGC stabilize

model = ml.Model("/rom/blazeface_front_128.tflite",
                 postprocess=BlazeFace(threshold=0.4))
clock = time.clock()

while True:
    clock.tick()
    img = csi0.snapshot()
    for rect, score, keypoints in model.predict([img]):
        img.draw_rectangle(rect, color=(0, 0, 255))
        ml.utils.draw_keypoints(img, keypoints, color=(255, 0, 0))
    print(clock.fps(), "fps")

使用 BlazeFace 偵測人臉

Google 的 BlazeFace 是一款輕量級 TensorFlow Lite 人臉偵測器，每張臉回傳邊界框及六個關鍵點。

從 /rom/blazeface_front_128.tflite 載入——已預先量化，無需下載。

每張臉六個關鍵點：眼睛、鼻子、嘴巴與耳朵。

無隱私疑慮——影像幀永遠不會離開相機。

import csi
import time

csi0 = csi.CSI()
csi0.reset()
csi0.pixformat(csi.RGB565)
csi0.framesize(csi.QVGA)
csi0.snapshot(time=2000)  # let AWB/AGC stabilize
csi0.auto_gain(False)

clock = time.clock()

while True:
    clock.tick()
    img = csi0.snapshot()
    for code in img.find_qrcodes():
        img.draw_rectangle(code.rect, color=(255, 0, 0))
        print(code.payload)
    print(clock.fps(), "fps")

從即時影像掃描 QR 碼

內建 QR 解碼器可處理傾斜、扭曲及部分遮擋的碼。

每個結果還提供版本、ECC 等級與角點座標。

數字、字母數字、二進位與漢字資料模式。

以 Python 字串回傳解碼後的內容——可直接使用。

import csi
import time

csi0 = csi.CSI()
csi0.reset()
csi0.pixformat(csi.RGB565)
csi0.framesize(csi.QVGA)
csi0.snapshot(time=2000)  # let AWB/AGC stabilize
csi0.auto_gain(False)
csi0.auto_whitebal(False)

# LAB thresholds: (L_min, L_max, A_min, A_max, B_min, B_max)
thresholds = [
    (30, 100, 15, 127, 15, 127),   # red
    (30, 100, -64, -8, -32, 32),   # green
]

clock = time.clock()

while True:
    clock.tick()
    img = csi0.snapshot()
    for blob in img.find_blobs(thresholds, pixels_threshold=200):
        img.draw_rectangle(blob.rect, color=(255, 0, 0))
        img.draw_cross((blob.cx, blob.cy))
    print(clock.fps(), "fps")

尋找顏色區塊

find_blobs 回傳符合一個或多個 LAB 閾值的連通像素區域。

針對您的光源調整閾值——請先停用自動增益與自動白平衡。

在單次呼叫中傳入多個閾值以進行多色追蹤。

pixels_threshold 篩除微小偵測結果；merge=True 合併重疊的色塊。

import csi
import time

csi0 = csi.CSI()
csi0.reset()
csi0.pixformat(csi.GRAYSCALE)
csi0.framesize(csi.VGA)
csi0.window((640, 80))  # narrow strip for fast linear scanning
csi0.snapshot(time=2000)  # let AWB/AGC stabilize
csi0.auto_gain(False)
csi0.auto_whitebal(False)

clock = time.clock()

while True:
    clock.tick()
    img = csi0.snapshot()
    for code in img.find_barcodes():
        img.draw_rectangle(code.rect, color=(0, 255, 0))
        print(code.payload, "(quality %d)" % code.quality)
    print(clock.fps(), "fps")

讀取一維條碼

在畫面中任意位置尋找一維條碼並解碼其內容。

由 ZBar 函式庫驅動——支援辨識 EAN、UPC、Code 39/93/128、Codabar、ITF、ISBN 與 DataBar。

使用灰階視窗條帶以獲得最快的線性掃描速度。

每個結果包含格式、內容、旋轉角度、角點與邊界矩形。

import csi
import time
import ml
from ml.postprocessing.mediapipe import HandLandmarks

csi0 = csi.CSI()
csi0.reset()
csi0.pixformat(csi.RGB565)
csi0.framesize(csi.VGA)
csi0.window((400, 400))  # square window for the model
csi0.snapshot(time=2000)  # let AWB/AGC stabilize

# Connections between the 21 keypoints — palm + 5 fingers.
hand_lines = ((0, 1), (1, 2), (2, 3), (3, 4), (0, 5), (5, 6),
              (6, 7), (7, 8), (5, 9), (9, 10), (10, 11), (11, 12),
              (9, 13), (13, 14), (14, 15), (15, 16), (13, 17), (17, 18),
              (18, 19), (19, 20), (0, 17))

model = ml.Model("/rom/hand_landmarks_full_224.tflite",
                 postprocess=HandLandmarks(threshold=0.4))
clock = time.clock()

while True:
    clock.tick()
    img = csi0.snapshot()
    # predict returns a list per hand: index 0 = left, index 1 = right.
    for detections in model.predict([img]):
        for rect, score, keypoints in detections:
            ml.utils.draw_skeleton(img, keypoints, hand_lines,
                                   kp_color=(255, 0, 0),
                                   line_color=(0, 255, 0))
    print(clock.fps(), "fps")