FlashSR

توثق هذه الصفحة من Soniqo نموذج FlashSR كما هو منفذ في speech-swift / speech-core. روابط Hugging Face موجودة أدناه بعد ملاحظات الدمج.

الصفحة الداخلية أولا

بطاقات الصفحة الرئيسية وقوائم الوثائق تشير إلى هذه الصفحة أولا؛ وتبقى روابط النموذج والحزم داخلها.

لمحة سريعة

النموذج	FlashSR
الدور	Audio super-resolution for low-bandwidth or lossy audio
Backend	MLX int4 default; int8 available
الإخراج	48 kHz mono waveform, same length as input
اللغات	Audio-content agnostic
الرخصة	MIT
الحالة	Ready through speech upsample and the FlashSR Swift product
المصدر	FlashSR / AudioSR distillation
منتج Swift	`FlashSR`
CLI / runtime	`speech upsample`

المقتطف أدناه يطابق API أو الأمر الحالي في speech-swift.

# Upsample a low-bandwidth recording to 48 kHz mono.
.build/release/speech upsample noisy_lowres.wav \
  --variant int4 \
  -o clean_hr.wav

Download requests the single model.safetensors bundle and config explicitly; this is simple enough that byte-weighting is less critical than for sharded models.
Input is resampled to 48 kHz mono and processed in non-overlapping 5.12 second windows.
INT4 is a download-size optimization; runtime weights dequantize to FP, so memory footprint matches int8.
The model conforms to SpeechEnhancementModel, but its semantics are super-resolution rather than denoising.