Products

Two NPU IP Editions for the
Full Edge AI Spectrum

Hand-tuned RTL delivered as Protected Code. Process-independent — from 55 nm to 2 nm class, customer chooses the foundry and node. Designed for SoC integration from day one.

CNN Edition
~0.5 TOPS INT8 ~3.3 mm² @ 28nm

Vision-first lightweight NPU.
For cost-sensitive edge devices where BOM and power matter.

  • Object detection & classification — YOLO-class including YOLOv3-tiny and YOLOv4-tiny
  • Face / scene recognition · gesture · keyword spotting
  • DDR-less option — SRAM + NOR Flash inference path
  • Ultra-low-power · BOM-friendly for emerging markets
  • 5-stage RISC-V control core
  • AXI Master · APB Slave · Synchronous FIFO interfaces
  • Clock gating · external power gating supported
  • Built-in Self-Test (BIST) — integration debug + production testing
Schedule Evaluation → View on Design-Reuse ↗
Generative AI Edition
~1 TOPS INT8 + FP16 ~6.1 mm² @ 28nm

Conversational AI on edge.
CNN + Transformer dual-engine for full speech / LLM / TTS pipeline.

  • LLM inference — Qwen 1.7B @ ~140 ms/token (MTP-ready ~47 ms/token, ~3× acceleration)
  • Whisper Small speech recognition
  • VITS text-to-speech synthesis
  • CNN + Transformer dual-engine architecture
  • Multi-model runtime switching — ASR → LLM → TTS on one chip
  • Custom AI instruction set · 5-stage RISC-V control core
  • AXI Master · APB Slave · Synchronous FIFO interfaces
  • Built-in Self-Test (BIST) — integration debug + production testing
Schedule Evaluation → View on Design-Reuse ↗

Scalable Architecture — 4 TOPS / 8 TOPS configurations available

Designed per project requirements. Talk to us about your roadmap.

Validated Workloads

Proven on Real Models,
Not Spec Sheets

Every workload below has been validated on our NPU IP with measured numbers. Reproducible during evaluation under NDA.

01 · INPUT

Whisper Small

Speech Recognition
Real-time ASR

On-device voice command, language understanding, dictation. Zero cloud dependency.

02 · REASONING

Qwen 1.7B

Light LLM
~140 ms / token

MTP-ready: ~47 ms/token (~3× acceleration). Conversational AI, intent understanding.

03 · OUTPUT

VITS TTS

Speech Synthesis
Natural voice

High-quality on-device text-to-speech for interactive voice agents.

04 · VISION

YOLOv4-tiny

Object Detection
~56 FPS (customer-pruned)

Object detection, security / intrusion detection, presence sensing.

Performance benchmark: YOLOv4-tiny at ~56 FPS after customer-side pruning — vs. ~10 FPS on industry comparable solutions at the same TOPS class.

Ready for the next step?

Schedule a technical evaluation or discuss licensing models — let's find the right fit for your project.