Streams Gizmos
VideoSourceGizmo
OpenCV-based video source gizmo.
Captures frames from a video source (camera, video file, etc.) and outputs them into the pipeline.
__init__(video_source=None, ...)
__init__(video_source=None, *, stop_composition_on_end=False)
Constructor.
Parameters:
video_source
int or str
A cv2.VideoCapture-compatible video source (device index as int, or file path/URL as str). Defaults to None.
None
stop_composition_on_end
bool
If True, stop the composition when the video source is exhausted. Defaults to False.
False
run
run()
Run the video capture loop.
Continuously reads frames from the video source and sends each frame (with metadata) downstream until the source is exhausted or abort is signaled.
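The capture loop can be sketched in plain Python. This is an illustrative stand-in, not the gizmo's actual implementation: `FakeCapture` mimics the `cv2.VideoCapture.read()` interface, and the `"frame_no"` metadata key is a hypothetical example.

```python
class FakeCapture:
    # Stand-in for cv2.VideoCapture: yields a fixed number of "frames"
    def __init__(self, n):
        self._frames = list(range(n))

    def read(self):
        if self._frames:
            return True, self._frames.pop(0)
        return False, None


def capture_loop(cap, send):
    # Mirrors the VideoSourceGizmo.run idea: read frames until the
    # source is exhausted, sending each frame with metadata downstream
    frame_no = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        send((frame, {"frame_no": frame_no}))
        frame_no += 1


collected = []
capture_loop(FakeCapture(3), collected.append)
```

With a real `cv2.VideoCapture`, the same loop shape applies; the gizmo additionally honors the abort signal from the composition.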
VideoDisplayGizmo
OpenCV-based video display gizmo.
Displays incoming frames in one or more OpenCV windows.
__init__(window_titles='Display', ...)
__init__(window_titles='Display', *, show_ai_overlay=False, show_fps=False, stream_depth=10, allow_drop=False, multiplex=False)
Constructor.
Parameters:
window_titles
str or List[str]
Title or list of titles for the display window(s). If a list is provided, multiple windows are opened (one per title). Defaults to "Display".
'Display'
show_ai_overlay
bool
If True, overlay AI inference results on the displayed frame (when available). Defaults to False.
False
show_fps
bool
If True, show the FPS on the display window(s). Defaults to False.
False
stream_depth
int
Depth of the input frame queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames if the input queue is full. Defaults to False.
False
multiplex
bool
If True, use a single input stream and display frames in a round-robin across multiple windows; if False, each window corresponds to its own input stream. Defaults to False.
False
Raises:
Exception
If multiplex is True while allow_drop is also True (unsupported configuration).
run
run()
Run the video display loop.
Fetches frames from the input stream(s) and shows them in the window(s) (with optional overlays and FPS display) until all inputs are exhausted or aborted.
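The multiplex mode's round-robin dispatch can be sketched with a cycling window index (the window titles here are hypothetical, and this is an illustration of the dispatch order, not the gizmo's code):

```python
from itertools import cycle

window_titles = ["cam A", "cam B", "cam C"]  # hypothetical titles
next_window = cycle(range(len(window_titles)))

# With multiplex=True, a single input stream feeds all windows in turn:
# frame 0 -> window 0, frame 1 -> window 1, frame 2 -> window 2, frame 3 -> window 0, ...
assignment = [next(next_window) for _ in range(7)]
```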
VideoSaverGizmo
OpenCV-based video saving gizmo.
Writes incoming frames to an output video file.
__init__(filename, ...)
__init__(filename, *, show_ai_overlay=False, stream_depth=10, allow_drop=False)
Constructor.
Parameters:
filename
str
Path to the output video file.
required
show_ai_overlay
bool
If True, overlay AI inference results on frames before saving (when available). Defaults to False.
False
stream_depth
int
Depth of the input frame queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames if the input queue is full. Defaults to False.
False
run
run()
Run the video saving loop.
Reads frames from the input stream and writes them to the output file until the stream is exhausted or aborted.
ResizingGizmo
OpenCV-based image resizing/padding gizmo.
Resizes incoming images to a specified width and height, using the chosen padding or cropping method.
__init__(w, ...)
__init__(w, h, pad_method='letterbox', resize_method='bilinear', stream_depth=10, allow_drop=False)
Constructor.
Parameters:
w
int
Target width for output images.
required
h
int
Target height for output images.
required
pad_method
str
Padding method to use ("stretch", "letterbox", "crop-first", "crop-last"). Defaults to "letterbox".
'letterbox'
resize_method
str
Resampling method to use ("nearest", "bilinear", "area", "bicubic", "lanczos"). Defaults to "bilinear".
'bilinear'
stream_depth
int
Depth of the input frame queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames if the input queue is full. Defaults to False.
False
run
run()
Run the resizing loop.
Resizes each input image according to the configured width, height, padding, and resizing method, then sends the result with updated metadata downstream.
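The geometry behind the "letterbox" pad method can be sketched as follows. This is a minimal illustration of the idea (scale to fit while preserving aspect ratio, pad the remainder); centering the padding is an assumption, and the actual gizmo may distribute it differently:

```python
def letterbox_geometry(src_w, src_h, dst_w, dst_h):
    # Scale the image to fit inside the target while preserving aspect
    # ratio, then pad the leftover area (the "letterbox" pad_method)
    scale = min(dst_w / src_w, dst_h / src_h)
    new_w, new_h = round(src_w * scale), round(src_h * scale)
    pad_x = (dst_w - new_w) // 2  # assumed centered padding
    pad_y = (dst_h - new_h) // 2
    return new_w, new_h, pad_x, pad_y
```

For example, fitting a 1920x1080 frame into a 640x640 target scales it to 640x360 and pads 140 pixels above and below.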
AiGizmoBase
Base class for AI model inference gizmos.
Handles loading the model and iterating over input data for inference in a background thread.
__init__(model, ...)
__init__(model, *, stream_depth=10, allow_drop=False, inp_cnt=1, **kwargs)
Constructor.
Parameters:
model
Model or str
A DeGirum model object or model name string to load. If a string is provided, the model will be loaded via degirum.load_model() using the given kwargs.
required
stream_depth
int
Depth of the input stream queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames on input overflow. Defaults to False.
False
inp_cnt
int
Number of input streams (for models requiring multiple inputs). Defaults to 1.
1
**kwargs
any
Additional parameters to pass to degirum.load_model() when loading the model (if model is given as a name).
{}
on_result(result)
on_result(result) (abstract method)
Handle a single inference result (to be implemented by subclasses).
Parameters:
result
InferenceResults
The inference result object from the model.
required
run
run()
Run the model inference loop.
Internally feeds data from the input stream(s) into the model and yields results, invoking on_result for each inference result.
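The base-class pattern (a run loop that dispatches every model result to an abstract hook) can be sketched standalone. The class and method names below, apart from on_result itself, are illustrative, and `map(model, inputs)` stands in for the SDK's streaming prediction:

```python
from abc import ABC, abstractmethod


class ResultHandler(ABC):
    # Sketch of the AiGizmoBase pattern: the run loop feeds inputs to
    # the model and routes each result through an on_result() hook
    @abstractmethod
    def on_result(self, result):
        ...

    def run(self, model, inputs):
        for result in map(model, inputs):  # stand-in for streaming predict
            self.on_result(result)


class Collector(ResultHandler):
    # Example subclass: simply accumulate results
    def __init__(self):
        self.results = []

    def on_result(self, result):
        self.results.append(result)


c = Collector()
c.run(lambda x: x * 2, [1, 2, 3])  # a trivial "model" for illustration
```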
AiSimpleGizmo
AI inference gizmo with no custom result processing.
Passes through input frames and attaches the raw inference results to each frame's metadata.
on_result(result)
on_result(result)
Append the inference result to the input frame's metadata and send it downstream.
Parameters:
result
InferenceResults
The inference result for the current frame.
required
AiObjectDetectionCroppingGizmo
Gizmo that crops detected objects from frames of an object detection model.
Each input frame with object detection results yields one or more cropped images as output.
Output
Image: The cropped portion of the original image corresponding to a detected object.
Meta-info: A dictionary containing:
original_result: Reference to the original detection result (InferenceResults) for the frame.
cropped_result: The detection result entry for this specific crop.
cropped_index: The index of this object in the original results list.
is_last_crop: True if this crop is the last one for the frame.
Note: cropped_index and is_last_crop are only present if at least one object is detected in the frame.
The validate_bbox()
method can be overridden in subclasses to filter out undesirable detections.
__init__(labels, ...)
__init__(labels, *, send_original_on_no_objects=True, crop_extent=0.0, crop_extent_option=CropExtentOptions.ASPECT_RATIO_NO_ADJUSTMENT, crop_aspect_ratio=1.0, stream_depth=10, allow_drop=False)
Constructor.
Parameters:
labels
List[str]
List of class labels to process. Only objects whose class is in this list will be cropped.
required
send_original_on_no_objects
bool
If True, when no objects are detected in a frame, the original frame is sent through. Defaults to True.
True
crop_extent
float
Extra padding around the bounding box, as a percentage of the bbox size. Defaults to 0.0.
0.0
crop_extent_option
CropExtentOptions
Method for applying the crop extent (e.g., aspect ratio adjustment). Defaults to CropExtentOptions.ASPECT_RATIO_NO_ADJUSTMENT.
ASPECT_RATIO_NO_ADJUSTMENT
crop_aspect_ratio
float
Desired aspect ratio (W/H) for the cropped images. Defaults to 1.0.
1.0
stream_depth
int
Depth of the input frame queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames on overflow. Defaults to False.
False
run
run()
Run the object cropping loop.
For each input frame, finds all detected objects (matching the specified labels and passing validation) and sends out a cropped image for each. If no objects are detected and send_original_on_no_objects is True, the original frame is forwarded.
validate_bbox(result, ...)
validate_bbox(result, idx)
Decide whether a detected object should be cropped (can be overridden in subclasses).
Parameters:
result
InferenceResults
The detection result for the frame.
required
idx
int
The index of the object in result.results to validate.
required
Returns:
bool
True if the object should be cropped; False if it should be skipped.
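A validate_bbox-style check can be sketched as a standalone function (in practice it would be an override on an AiObjectDetectionCroppingGizmo subclass). The MIN_AREA threshold and the `[x1, y1, x2, y2]` bbox layout in each result.results entry are assumptions for illustration:

```python
from types import SimpleNamespace

MIN_AREA = 32 * 32  # assumed threshold; tune per use case


def validate_bbox(result, idx):
    # Keep a detection only if its bounding box is large enough.
    # Assumes result.results[idx]["bbox"] is [x1, y1, x2, y2] in pixels.
    x1, y1, x2, y2 = result.results[idx]["bbox"]
    return (x2 - x1) * (y2 - y1) >= MIN_AREA


# Fake result object mimicking the .results list of InferenceResults
fake = SimpleNamespace(results=[{"bbox": [0, 0, 100, 100]}, {"bbox": [0, 0, 10, 10]}])
```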
CropCombiningGizmo
Gizmo to combine original frames with their after-crop results.
Expects N+1 inputs: one input stream of original frames (index 0), and N input streams of inference results from cropped images. This gizmo synchronizes and attaches the after-crop inference results back to each original frame's metadata.
__init__(crop_inputs_num=1, ...)
__init__(crop_inputs_num=1, *, stream_depth=10)
Constructor.
Parameters:
crop_inputs_num
int
Number of crop result input streams (excluding the original frame stream). Defaults to 1.
1
stream_depth
int
Depth for each crop input stream's queue. Defaults to 10.
10
_adjust_results(orig_result, ...)
_adjust_results(orig_result, bbox_idx, cropped_results)
Adjust inference results from a crop to the original image's coordinate space.
This converts the coordinates (e.g., bounding boxes, landmarks) of inference results obtained on a cropped image back to the coordinate system of the original image.
Parameters:
orig_result
InferenceResults
The original detection result (InferenceResults) for the full frame.
required
bbox_idx
int
The index of the object in the original result list.
required
cropped_results
list
A list of InferenceResults from the cropped image's inference.
required
Returns:
list
A list of adjusted InferenceResults corresponding to the original image coordinates.
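The core of this coordinate adjustment is a translation by the crop's top-left offset. A minimal sketch of that step (the function name and bbox layout `[x1, y1, x2, y2]` are illustrative; the real method also handles landmarks and other result fields):

```python
def shift_bbox(bbox, crop_origin):
    # Translate a bbox from cropped-image coordinates back to the
    # original image by adding the crop's top-left offset to each corner
    ox, oy = crop_origin
    x1, y1, x2, y2 = bbox
    return [x1 + ox, y1 + oy, x2 + ox, y2 + oy]
```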
_clone_result(result)
_clone_result(result)
Clone an inference result, deep-copying its _inference_results list.
Parameters:
result
InferenceResults
The inference result to clone.
required
Returns:
InferenceResults
A cloned inference result with a deep-copied results list.
run
run()
Run the crop combining loop.
Synchronizes original frames with their corresponding after-crop result streams, merges the inference results from crops back into the original frame's metadata, and sends the updated frame downstream.
AiResultCombiningGizmo
Gizmo to combine inference results from multiple AI gizmos of the same type.
__init__(inp_cnt, ...)
__init__(inp_cnt, *, stream_depth=10)
Constructor.
Parameters:
inp_cnt
int
Number of input result streams to combine.
required
stream_depth
int
Depth of each input stream's queue. Defaults to 10.
10
run
run()
Run the result combining loop.
Collects inference results from all input streams, merges their results into a single combined result, and sends it downstream.
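At its simplest, the merge step amounts to flattening the per-stream result lists into one. A hedged sketch of that idea (function name and list-of-dicts result shape are illustrative):

```python
def combine_results(result_lists):
    # Merge detection lists from several same-type models into one
    # flat list (the AiResultCombiningGizmo idea in miniature)
    combined = []
    for rl in result_lists:
        combined.extend(rl)
    return combined
```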
AiPreprocessGizmo
Preprocessing gizmo that applies a model's preprocessor to input images.
It generates preprocessed image data to be fed into the model.
Output
Data: Preprocessed image bytes ready for model input.
Meta-info: Dictionary including:
image_input: The original input image.
converter: A function to convert coordinates from model output back to the original image.
image_result: The preprocessed image (present only if the model is configured to provide it).
Attributes:
key_image_input
str
Metadata key for the original input image.
key_converter
str
Metadata key for the coordinate conversion function.
key_image_result
str
Metadata key for the preprocessed image.
__init__(model, ...)
__init__(model, *, stream_depth=10, allow_drop=False)
Constructor.
Parameters:
model
Model
The model object (PySDK model) whose preprocessor will be used.
required
stream_depth
int
Depth of the input frame queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames on overflow. Defaults to False.
False
run
run()
Run the preprocessing loop.
Applies the model's preprocessor to each input frame and sends the resulting data (and updated meta-info) downstream.
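The "converter" meta entry can be pictured as a closure that undoes the preprocessing geometry. This sketch assumes a letterbox-style preprocess described by a scale and padding offsets; the actual converter supplied by the SDK may have a different form:

```python
def make_converter(scale, pad_x, pad_y):
    # Map a point from the model's (letterboxed) input coordinate
    # space back to the original image: un-pad, then un-scale
    def convert(x, y):
        return (x - pad_x) / scale, (y - pad_y) / scale

    return convert


conv = make_converter(0.5, 10, 20)  # hypothetical preprocess geometry
```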
AiAnalyzerGizmo
Gizmo to apply a chain of analyzers to an inference result, with optional filtering.
Each analyzer (e.g., EventDetector, EventNotifier) processes the inference result and may add events or notifications. If filters are provided, only results that contain at least one of the specified events/notifications are passed through.
__init__(analyzers, ...)
__init__(analyzers, *, filters=None, stream_depth=10, allow_drop=False)
Constructor.
Parameters:
analyzers
List
List of analyzer objects to apply (e.g., EventDetector, EventNotifier instances).
required
filters
set
A set of event names or notification names to filter results. Only results that have at least one of these events or notifications will be forwarded (others are dropped). Defaults to None (no filtering).
None
stream_depth
int
Depth of the input frame queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames on overflow. Defaults to False.
False
run
run()
Run the analyzer processing loop.
For each input frame, clones its inference result and runs all analyzers on it (which may add events/notifications). If filters are specified, the result is dropped unless it contains at least one of the specified events or notifications. The possibly modified inference result is appended to the frame's metadata and sent downstream. After processing all frames, all analyzers are finalized.
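The filtering rule (forward a result only if it carries at least one of the requested events or notifications) can be sketched as a set intersection. Function and variable names here are illustrative, not the gizmo's API:

```python
def passes_filters(names_in_result, filters):
    # AiAnalyzerGizmo-style filtering: with no filters, everything
    # passes; otherwise require at least one matching event/notification
    return filters is None or bool(set(names_in_result) & filters)
```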
SinkGizmo
Sink gizmo that receives results and accumulates them in an internal queue.
This gizmo does not send data further down the pipeline. Instead, it stores all incoming results so they can be retrieved (for example, by iterating over the gizmo's output in the main thread).
__call__
__call__()
Retrieve the internal queue for iteration.
Returns:
Stream
The input Stream (queue) of this sink gizmo, which can be iterated to get collected results.
__init__(*, ...)
__init__(*, stream_depth=10, allow_drop=False)
Constructor.
Parameters:
stream_depth
int
Depth of the input queue. Defaults to 10.
10
allow_drop
bool
If True, allow dropping frames on overflow. Defaults to False.
False
run
run()
Run gizmo (no operation).
Immediately returns, as the sink simply collects incoming data without processing.
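The sink pattern (accumulate results in a queue, expose it for iteration in the main thread) can be sketched with the standard library; this is a minimal stand-in, not the SinkGizmo implementation:

```python
import queue


class Sink:
    # Minimal sketch of the SinkGizmo idea: upstream gizmos push
    # results into a queue; the main thread retrieves and iterates it
    def __init__(self):
        self._q = queue.SimpleQueue()

    def send(self, item):
        self._q.put(item)

    def __call__(self):
        # Return the internal queue, mirroring SinkGizmo.__call__
        return self._q


s = Sink()
s.send("result 1")
s.send("result 2")
```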