🎯 Job Description: Freelance Speech Recognition Specialist, Deepgram

📈 Position: Freelance Speech Recognition Specialist

🏙️ Company: Deepgram

📍 Location: Remote / Work-from-anywhere

🧾 Contract: Part-time / Project-based Freelance

Hours: Flexible, depending on project scope

 

🛠 Key Responsibilities

  • Develop, tune, and improve Speech-to-Text (STT) models and pipelines using Deepgram’s APIs and tools.
  • Analyze audio data for quality, accents, dialects, background noise, and other real-world challenges; implement strategies to improve recognition accuracy.
  • Perform model training / fine-tuning on custom datasets to handle domain-specific vocabulary or specialized speech patterns.
  • Create and run evaluation benchmarks that measure metrics such as word error rate (WER), latency, and confidence scores.
  • Preprocess and manage audio: cleaning, segmenting, labeling, and, where needed, diarization (identifying who speaks when).
  • Integrate and test with Deepgram’s streaming and batch transcription services; ensure low latency and high reliability.
  • Troubleshoot speech recognition issues: mis-transcriptions, noise interference, API behavior, etc.
  • Collaborate with data engineers, ML researchers, product teams, and possibly clients to understand use cases and requirements.
  • Document your work: model changes, evaluation results, best practices, instructions for deployment.

 

Required Skills & Qualifications

  • Strong understanding of Automatic Speech Recognition (ASR) / Speech-to-Text systems.
  • Experience with Deepgram’s STT APIs (or similar: Google Speech-to-Text, Whisper, etc.).
  • Good grasp of signal processing / handling audio data: formats, sampling, noise, etc.
  • Ability to preprocess data: cleaning audio, labeling, segmentation.
  • Familiarity with evaluation metrics (WER, latency, accuracy, confidence scores) and tools to measure them.
  • Programming skills: Python or another relevant language; experience with ML frameworks or libraries.
  • Experience with streaming (real-time) and/or batch transcription pipelines.
  • Strong problem-solving ability and attention to detail.
  • Good communication skills; able to explain technical trade-offs and work with cross-functional stakeholders.

 

💡 Preferred / Nice-to-Have

  • Experience with specialized speech domains (e.g., medical, legal, technical, non-native speakers).
  • Experience with model fine-tuning or custom model adaptation.
  • Knowledge of multi-speaker diarization and speaker recognition.
  • Familiarity with cloud deployment or self-hosted environments.
  • Working knowledge of handling multilingual data.
  • Previous freelance or remote work experience and the ability to manage timelines independently.

 

What Deepgram Offers / What to Expect

  • Remote / flexible work setup.
  • Access to Deepgram’s speech-AI tools and cutting-edge models.
  • Opportunity to work on real-world speech recognition challenges.
  • Collaborative team: you’ll interact with researchers, engineers, product teams, and possibly customers.
  • Compensation based on scope and expertise.

 

📢 If you’re passionate about speech, language, and making machines understand us better, this role is for you. 💬 Let’s build systems that not only hear but truly understand voice.

#SpeechRecognition #Deepgram #ASR #VoiceAI #MachineLearning #RemoteFreelance #AIJobs #VoiceTech #SpeechToText #DataScience

 
