Products — HonChen Semiconductor

CNN Edition

~0.5 TOPS INT8 · ~3.3 mm² @ 28 nm

Vision-first lightweight NPU.
For cost-sensitive edge devices where BOM and power matter.

Object detection & classification — YOLO-class including YOLOv3-tiny and YOLOv4-tiny
Face / scene recognition · gesture · keyword spotting
DDR-less option — SRAM + NOR Flash inference path
Ultra-low-power · BOM-friendly for emerging markets
5-stage RISC-V control core
AXI Master · APB Slave · Synchronous FIFO interfaces
Clock gating · external power gating supported
Built-in Self-Test (BIST) — integration debug + production testing


Generative AI Edition

~1 TOPS INT8 + FP16 · ~6.1 mm² @ 28 nm

Conversational AI on edge.
CNN + Transformer dual-engine for full speech / LLM / TTS pipeline.

LLM inference — Qwen 1.7B @ ~140 ms/token (MTP-ready ~47 ms/token, ~3× acceleration)
Whisper Small speech recognition
VITS text-to-speech synthesis
CNN + Transformer dual-engine architecture
Multi-model runtime switching — ASR → LLM → TTS on one chip
Custom AI instruction set · 5-stage RISC-V control core
AXI Master · APB Slave · Synchronous FIFO interfaces
Built-in Self-Test (BIST) — integration debug + production testing


● Validated Workloads

Proven on Real Models, Not Spec Sheets

Every workload below has been validated on our NPU IP with measured numbers. Results are reproducible during evaluation under NDA.

01 · INPUT

Whisper Small

Speech Recognition

Real-time ASR

On-device voice command, language understanding, dictation. Zero cloud dependency.

02 · REASONING

Qwen 1.7B

Light LLM

~140 ms / token

MTP-ready: ~47 ms/token (~3× acceleration). Conversational AI, intent understanding.
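For a rough sense of what these per-token figures mean in practice, the sketch below converts them to tokens per second and estimates decode time for a short reply. The 40-token reply length and the function names are illustrative assumptions, not measured values, and the estimate ignores prompt-processing (prefill) time.

```python
def tokens_per_second(ms_per_token: float) -> float:
    """Convert a per-token latency in milliseconds to throughput."""
    return 1000.0 / ms_per_token


def reply_time_seconds(num_tokens: int, ms_per_token: float) -> float:
    """Estimate decode time for a reply, ignoring prefill."""
    return num_tokens * ms_per_token / 1000.0


BASELINE_MS = 140.0  # ~140 ms/token, as quoted above
MTP_MS = 47.0        # ~47 ms/token with MTP, as quoted above
REPLY_TOKENS = 40    # assumed reply length, for illustration only

if __name__ == "__main__":
    for label, ms in (("baseline", BASELINE_MS), ("MTP", MTP_MS)):
        print(
            f"{label}: {tokens_per_second(ms):.1f} tok/s, "
            f"{reply_time_seconds(REPLY_TOKENS, ms):.2f} s "
            f"for {REPLY_TOKENS} tokens"
        )
    print(f"speedup: {BASELINE_MS / MTP_MS:.2f}x")
```

At ~140 ms/token this works out to roughly 7 tokens per second, and at ~47 ms/token to roughly 21, which is consistent with the quoted ~3× acceleration.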

03 · OUTPUT

VITS TTS

Speech Synthesis

Natural voice

High-quality on-device text-to-speech for interactive voice agents.

04 · VISION

YOLOv4-tiny

Object Detection

~56 FPS (customer-pruned)

Object detection, security / intrusion detection, presence sensing.

Performance benchmark: YOLOv4-tiny runs at ~56 FPS after customer-side pruning, versus ~10 FPS on comparable industry solutions in the same TOPS class.
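The frame-rate figures quoted above can be restated as per-frame latency and a relative speedup. The sketch below only performs that arithmetic on the two quoted numbers; the function name is an illustrative assumption, and nothing here indicates whether the ~10 FPS baseline was also measured on a pruned model.

```python
def frame_time_ms(fps: float) -> float:
    """Convert frames per second to average time per frame in milliseconds."""
    return 1000.0 / fps


PRUNED_FPS = 56.0      # ~56 FPS, customer-pruned YOLOv4-tiny, as quoted above
COMPARABLE_FPS = 10.0  # ~10 FPS on comparable solutions, as quoted above

if __name__ == "__main__":
    print(f"pruned:     {frame_time_ms(PRUNED_FPS):.1f} ms/frame")
    print(f"comparable: {frame_time_ms(COMPARABLE_FPS):.1f} ms/frame")
    print(f"speedup:    {PRUNED_FPS / COMPARABLE_FPS:.1f}x")
```

In other words, ~56 FPS corresponds to about 18 ms per frame against 100 ms per frame at ~10 FPS, a ratio of about 5.6×.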

Two NPU IP Editions for the Full Edge AI Spectrum

Scalable Architecture — 4 TOPS / 8 TOPS configurations available
