David-Prelinger/hackharvard

TeleSpeech (HackHarvard 2023 Best Overall Project 🏆)

A Chrome Extension that converts Telegram messages into custom AI-generated speech, mimicking the distinct voice of each sender.

Primera Hour, Kylie Bogar, Ronan Takizawa, David Prelinger

AI voices are stale and impersonal. Chrome extensions like "Free Text To Speech Online" read text on the web out loud using default voices. These default voices excel in cadence and clarity, but they miss the nuance and emotion inherent in human speech. That emotional connection matters because it helps users feel engaged in online communication. Personalized speech also helps users with special needs who rely on text-to-speech, since hearing each message in its sender's voice makes it easier to tell who is talking.

Our project is an attempt to improve the "Free Text To Speech Online" Chrome extension, which does text-to-speech with a default voice. We tried to improve on it in two ways: by allowing customizable voices, and by having it read only the content of each text message rather than everything on the page.

Usage:

  1. Go to https://telespeech.vercel.app/
  2. Sign up for an account and upload audio files of the voices you want to use for text-to-speech. Name each audio file after the username displayed in the Telegram chat.
  3. Download this repository as a zip, unzip it, and load the "ronanchromextension" directory at chrome://extensions/ (make sure Developer mode is on, and use "Load unpacked").
  4. Open the Telegram web app, sign in to your account in the Chrome extension, and press play for any group chat. The extension will then play each person's text out loud in that person's voice 👍
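
The matching in steps 2 and 4 (voice name equals Telegram username) can be pictured with a minimal sketch like the one below. It is an illustration only, not the extension's actual code: `voicesByUsername`, `DEFAULT_VOICE_ID`, `buildPlaybackQueue`, and the placeholder voice IDs are all hypothetical names, and falling back to a default voice for unknown senders is an assumption of this sketch.

```javascript
// Hypothetical mapping from Telegram display names to uploaded voice IDs.
// The names and IDs are placeholders, not values from this project.
const voicesByUsername = new Map([
  ["Alice", "voice-id-alice"],
  ["Bob", "voice-id-bob"],
]);

// Assumed fallback for senders who have no uploaded voice.
const DEFAULT_VOICE_ID = "voice-id-default";

// Turn chat messages into an ordered playback queue, pairing each
// message's text with the voice registered under its sender's name.
// Messages without any visible text are skipped.
function buildPlaybackQueue(messages, voices, fallbackVoiceId) {
  return messages
    .filter((m) => typeof m.text === "string" && m.text.trim() !== "")
    .map((m) => ({
      voiceId: voices.get(m.sender) ?? fallbackVoiceId,
      text: m.text.trim(),
    }));
}
```

Because the lookup key is the sender's display name, a voice uploaded under a name that differs from the one shown in the chat would not be found, which is why step 2 asks for an exact match.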

Tutorial

hackharvard.2023.mp4

Tech Stack:

  • Chrome Extension (Vanilla JS)
  • Web App (NextJS, Bootstrap, Firebase)
  • API (ElevenLabs text-to-speech AI API)
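
For readers curious how a text-to-speech call might look, here is a rough sketch of building a request for one message. It assumes the ElevenLabs REST endpoint `POST /v1/text-to-speech/{voice_id}` with an `xi-api-key` header, as described in ElevenLabs' public documentation; check the current docs before relying on it. `buildSpeechRequest` is a hypothetical helper, not a function from this repository.

```javascript
// Build the URL and fetch options for one text-to-speech request.
// Assumption: endpoint path and header name follow ElevenLabs' public
// REST API; the repository's real request code may differ.
function buildSpeechRequest(voiceId, text, apiKey) {
  return {
    url: `https://api.elevenlabs.io/v1/text-to-speech/${encodeURIComponent(voiceId)}`,
    options: {
      method: "POST",
      headers: {
        "xi-api-key": apiKey,
        "Content-Type": "application/json",
        Accept: "audio/mpeg",
      },
      // Only the text is sent here; model and voice settings are left
      // at the service defaults in this sketch.
      body: JSON.stringify({ text }),
    },
  };
}
```

The returned object could then be passed to `fetch(request.url, request.options)`, with the response read as a blob and handed to an `Audio` element for playback. Keeping the API key in a backend rather than in the extension itself would avoid exposing it to users.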