CVT, a Computer Vision Toolkit.
Updated Aug 24, 2022 - C
Winning solution of the Mobile AI challenge (CVPRW 2021).
A header-only neural network library for microcontrollers, with partial bare-metal & native-os support.
FrostNet: Towards Quantization-Aware Network Architecture Search
Quantization Aware Training
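ATtiny85 Arduino example, running an RNN MNIST model via the (internal) 512-byte EEPROM with ~95% accuracy
A record and summary of common problems encountered when deploying on-device models, along with their solutions, in the hope that it helps others.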
Garuda: CVXIF coprocessor optimizing batch-1 attention microkernels with 7.5-9× lower p99 latency. RISC-V INT8 MAC accelerator for transformer inference.
VB.NET API wrapper for the LLM inference library chatllm.cpp
C# API wrapper for the LLM inference library chatllm.cpp
Corrects your grammar in 5 languages directly in your browser. Powered by an open-source AI model.
Generating a TensorRT model from ONNX
TinyML project. This system monitors your room or surroundings with the onboard microphone of the Arduino Nano BLE Sense. Still under development.
A fork of convert_to_quant that adds QuIP quantization for INT‑8 models.
Python ML for training a custom on-device cry model (knowledge-distilled from YAMNet, INT8, deployed on ESP32-S3)
CPU face-embedding engine: 13 ms/face ArcFace INT8, 99.65% LFW 10-fold (beats FP32), 96 KB binary, 2.4x faster than ONNX Runtime. C99 + AVX-VNNI.
Silicon-proven INT8 systolic NPU (8×8 MAC array) taped out on SkyWater 130nm via LibreLane. Features a custom 32-bit ISA, UART–APB host interface, and fused streaming datapath. Validated on chest X-ray pneumonia detection. Silicon Sprint 2026 — AUC.
gemma-2-2b-it INT8 CPU inference in one file of pure C#
Scripts and tools for optimizing deep learning models