@Otosaku

Otosaku DSP

Privacy-first AI that works anywhere. We create on-device machine learning libraries optimized for real-time inference — no cloud, no latency, no compromise.

Pinned

  1. OtosakuKWS-iOS Public

    Lightweight on-device keyword spotting engine for iOS using CoreML and real-time audio streaming.

    Swift · 12 stars · 2 forks

  2. OtosakuStreamingASR-iOS Public

    OtosakuStreamingASR-iOS is a real-time speech recognition engine for iOS, built with Swift and Core ML. It uses a fast and lightweight streaming Conformer model optimized for on-device inference. D…

    Swift · 11 stars · 4 forks

  3. OtosakuTTS-iOS Public

    Swift library for offline text-to-speech synthesis on iOS/macOS. Generate natural speech directly on device using CoreML-optimized FastPitch and HiFiGAN models. No internet required, fully private.

    Swift · 48 stars · 7 forks

  4. NeMoConformerASR-iOS Public

    On-device speech-to-text for iOS/macOS powered by NVIDIA NeMo Conformer CTC Small (13M params). Pure Swift + CoreML implementation with automatic audio padding, chunking for long audio, and real-ti…

    Swift 2

  5. NeMoSpeaker-iOS Public

    Swift library for speaker embedding extraction and verification using the NVIDIA NeMo TitaNet model converted to CoreML. Extract 192-dim speaker embeddings, verify speakers, and perform real-time speak…

    Swift 3

  6. NeMoVAD-iOS Public

    Swift library for Voice Activity Detection (VAD) using the NVIDIA NeMo MarbleNet model converted to CoreML. Detect speech segments in real time on iOS/macOS with high accuracy and low latency.

    Swift 2

Repositories

Showing 10 of 12 repositories

