Pietro Passarelli

WSJ Transcription service

WSJ

Stack: node, React, Javascript, firebase, firestore, firebase hosting, firebase analytics, STT, Google Cloud Functions, custom authentication using Okta, ffmpeg, ffmpegwasm, google cloud tasks, Google Cloud STT, slate-transcript-editor,

An in house transcription service for WSJ reporters

An in house transcription service for WSJ reporters

Based on refactoring parts from BBC Digital Paper Edit - autoEdit3 open source project. Extracted only the part relevant for transcriptions.

  • Client side ffmpeg using WebAssembly ffmpegwasm
  • Serverless STT using google cloud tasks
  • Refactored to use in house design system (instead of bootstrap or material UI)
  • Uses open source slate transcript editor react component as a way to allow correcting the text and preserving time-codes.
  • etc...