Pinboard (jm)
https://pinboard.in/u:jm/public/
recent bookmarks from jmggerganov/whisper.cpp: Port of OpenAI's Whisper model in C/C++2022-12-14T10:01:28+00:00
https://github.com/ggerganov/whisper.cpp
jmHigh-performance inference of OpenAI's Whisper automatic speech recognition (ASR) model:
Plain C/C++ implementation without dependencies;
Apple silicon first-class citizen - optimized via Arm Neon and Accelerate framework;
AVX intrinsics support for x86 architectures;
Mixed F16 / F32 precision;
Low memory usage (Flash Attention + Flash Forward);
Zero memory allocations at runtime;
Runs on the CPU;
C-style API
]]>ai asr c++ speech-recognition whisper openai speechhttps://pinboard.in/https://pinboard.in/u:jm/b:44935afe3e19/Snowboy Hotword Detection2020-01-16T15:06:38+00:00
https://snowboy.kitt.ai/
jmaudio iot hardware hotwords speech-recognition speech deviceshttps://pinboard.in/https://pinboard.in/u:jm/b:f89297738cbc/'DolphinAttack: Inaudible Voice Commands' [pdf]2018-01-25T13:49:35+00:00
https://arxiv.org/pdf/1708.09537.pdf
jm 20 kHz) to achieve inaudibility. By leveraging the nonlinearity of the microphone circuits, the modulated low frequency audio commands can be successfully demodulated, recovered, and more importantly interpreted by the speech recognition systems. We validate DolphinAttack on popular speech recognition systems, including Siri, Google Now, Samsung S Voice, Huawei HiVoice, Cortana and Alexa. By injecting a sequence of inaudible voice commands, we show a few proof-of-concept attacks, which include activating Siri to initiate a FaceTime call on iPhone, activating Google Now to switch the phone to the airplane mode, and even manipulating the navigation system in an Audi automobile. We propose hardware and software defense solutions. We validate that it is feasible to detect DolphinAttack by classifying the audios using supported vector machine (SVM), and suggest to re-design voice controllable systems to be resilient to inaudible voice command attacks.'
via Zeynep (https://twitter.com/zeynep/status/956520320504123392)]]>alexa siri attacks security exploits google-now speech-recognition speech audio acm papers cortanahttps://pinboard.in/https://pinboard.in/u:jm/b:8d9b9d7c9782/simon listens2010-10-21T08:46:00+00:00
http://www.simon-listens.org/index.php?id=122&L=1
jmspeech-recognition floss free-software kde speech recognition linux audio accessibilityhttps://pinboard.in/u:jm/b:fb9b49baf22b/