Applied Deep Learning

Introduction

Applied Deep Learning

Home
Prerequisites
Previewing predictions
Image classification
Image classification
Object detection
Object detection
Instance segmentation
Instance segmentation
Image segmentation
Image segmentation
Speech-to-text
Speech-to-text

Introduction

Speech-to-text (STT; also ASR) transcribes audio files, usually short phrases, into text. These models can be used, e.g., for voice assistants that react to spoken commands.

Screenshot

What to do next: