e/speech.e (htmlized)

#######
#
# E-scripts on programs that do speech synthesis and speech-to-text.
#
# Note 1: use the eev command (defined in eev.el) and the
# ee alias (in my .zshrc) to execute parts of this file.
# Executing this file as a whole makes no sense.
# An introduction to eev can be found here:
#
#   (find-eev-quick-intro)
#   http://angg.twu.net/eev-intros/find-eev-quick-intro.html
#
# Note 2: be VERY careful and make sure you understand what
# you're doing.
#
# Note 3: If you use a shell other than zsh things like |&
# and the for loops may not work.
#
# Note 4: I always run as root.
#
# Note 5: some parts are too old and don't work anymore. Some
# never worked.
#
# Note 6: the definitions for the find-xxxfile commands are on my
# .emacs.
#
# Note 7: if you see a strange command check my .zshrc -- it may
# be defined there as a function or an alias.
#
# Note 8: the sections without dates are always older than the
# sections with dates.
#
# This file is at <http://angg.twu.net/e/speech.e>
#           or at <http://angg.twu.net/e/speech.e.html>.
#        See also <http://angg.twu.net/emacs.html>,
#                 <http://angg.twu.net/.emacs[.html]>,
#                 <http://angg.twu.net/.zshrc[.html]>,
#                 <http://angg.twu.net/escripts.html>,
#             and <http://angg.twu.net/>.
#
#######

# «.speech-dispatcher»		(to "speech-dispatcher")
# «.spd-say»			(to "spd-say")
# «.spd-say-ipa»		(to "spd-say-ipa")
# «.speechd-git»		(to "speechd-git")
# «.speechd-mailing-list»	(to "speechd-mailing-list")
# «.mbrola»			(to "mbrola")
# «.openai-whisper»		(to "openai-whisper")
# «.aeneas»			(to "aeneas")

# https://askubuntu.com/questions/501910/how-to-text-to-speech-output-using-command-line
# https://blends.debian.org/accessibility/tasks/speechsynthesis

#####
#
# speech-dispatcher in debian
# 2019dec17
#
#####

# «speech-dispatcher» (to ".speech-dispatcher")

# (find-zsh "installeddebs | sort | grep speech-dispatcher")
# (find-zsh "availabledebs | sort | grep speech-dispatcher")
# (find-status   "speech-dispatcher")
# (find-vldifile "speech-dispatcher.list")
# (find-udfile   "speech-dispatcher/")

* (eepitch-shell)
* (eepitch-kill)
* (eepitch-shell)
# (find-man "8 apt-cache")
apt-cache show speech-dispatcher
apt-cache search speech-dispatcher
apt-cache depends speech-dispatcher
apt-cache rdepends speech-dispatcher
apt-cache showpkg speech-dispatcher

# (find-node "(spd-say)Top")
# (find-node "(speech-dispatcher)Top")
# (find-node "(ssip)Top")

#####
#
# spd-say
# 2019dec17
#
#####

# «spd-say» (to ".spd-say")
# (find-man "1 spd-say")
# (find-man "1 spd-say" "-l, --language" "Set the language (iso code)")
# https://askubuntu.com/questions/842447/how-can-i-a-language-to-spd-say
# (find-sh "spd-say -L")
# https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes

* (eepitch-shell)
* (eepitch-kill)
* (eepitch-shell)
spd-say Hello
spd-say -l en Hello
spd-say -l pt viagem
spd-say -l fr fromage
spd-say "/zoʊˈtɛroʊ/"

#####
#
# spd-say-ipa
# 2019dec17
#
#####

# «spd-say-ipa» (to ".spd-say-ipa")
# https://mail.google.com/mail/ca/u/0/#sent/KtbxLrjCNKlqTSzjswftLsdrzpwGhXnVbq
# https://en.wikipedia.org/wiki/International_Phonetic_Alphabet

Hi, beginner question here...
Is there a way to make spd-say pronounce text in IPA?
I mean, (some hack that would behave) like this:

  spd-say -l ipa "/zoʊˈtɛroʊ/"

Thanks in advance!
  Eduardo Ochs
  http;//angg.twu.net/
  http://angg.twu.net/emacsconf2019.html

#####
#
# speechd from git
# 2019dec17
#
#####

# «speechd-git» (to ".speechd-git")
# http://devel.freebsoft.org/speechd
# https://github.com/brailcom/speechd.git

* (eepitch-shell)
* (eepitch-kill)
* (eepitch-shell)
# rm -Rfv ~/usrc/speechd/
cd      ~/usrc/
git clone https://github.com/brailcom/speechd.git
cd      ~/usrc/speechd/

git branch --list -a
git for-each-ref
PAGER=cat git log --oneline --graph --all -20

# (find-fline "~/usrc/")
# (find-fline "~/usrc/speechd/")
# (find-gitk  "~/usrc/speechd/")

# (code-c-d "speechd" "~/usrc/speechd/")
# (find-speechdfile "")
# (find-speechdgrep "grep --color -niRH --null -e languages *")

# https://freebsoft.org/speechd-el
# https://freebsoft.org/speechd-in-detail

#####
#
# speechd-mailing-list
# 2019dec17
#
#####

# «speechd-mailing-list» (to ".speechd-mailing-list")
# https://lists.nongnu.org/mailman/listinfo/speechd-discuss
# https://lists.nongnu.org/archive/html/speechd-discuss/

#####
#
# MBROLA
# 2020apr03
#
#####

# «mbrola» (to ".mbrola")
# https://english.stackexchange.com/questions/264239/is-there-any-online-tool-to-read-pronounce-ipa-and-apa-written-words
# http://www.lfsag.unito.it/ipa/converter_en.html
# https://en.wikipedia.org/wiki/MBROLA
# https://github.com/numediart/MBROLA

#####
#
# openai-whisper
# 2023aug02
#
#####

# «openai-whisper»  (to ".openai-whisper")
# (find-angg "WHISPER/test1.py")

# https://emacsconf.org/captioning/
# (find-fline "~/LOGS/2023aug02.emacs")
# (find-fline "~/LOGS/2023aug02.emacs" "openai whisper")
# (find-fline "~/LOGS/2023aug02.emacs" "Mozilla Deepspeech")

# (find-pip3-links      "assemblyai")
# (find-importlib-links "assemblyai")
# (find-pypi-links      "assemblyai")
# (find-pip3spfile      "assemblyai/__init__.py")
# https://github.com/AssemblyAI/assemblyai-python-sdk
# (find-gitk  "~/usrc/assemblyai-python-sdk/")
# (code-c-d "aaipythonsdk" "~/usrc/assemblyai-python-sdk/")
# (find-aaipythonsdkfile "")
# (find-aaipythonsdkgrep "grep --color=auto -niRH --null -e language *")
# (find-aaipythonsdkgrep "grep --color=auto -nRH --null -e LanguageCode *")
# (find-aaipythonsdkgrep "grep --color=auto -nRH --null -e language_detection *")
# (find-aaipythonsdkfile "assemblyai/types.py" "def language_code(self) ->")
# (find-aaipythonsdkfile "assemblyai/types.py" "language_detection: Optional[bool]")
# (find-aaipythonsdkfile "setup.py" "Natural Language :: English")
# (find-aaipythonsdkfile "tests/unit/test_transcriber.py" "language_code=None")
# (find-aaipythonsdkfile "tests/unit/factories.py" "language_code = aai.LanguageCode.en")

# https://www.assemblyai.com/dashboard/activation
# https://www.assemblyai.com/docs/Guides/generating_subtitles_for_videos

* (eepitch-shell)
* (eepitch-kill)
* (eepitch-shell)
cd /tmp/
scp -v 2023mateus.mp3 $LINP/tmp/m.mp3

* (eepitch-python)
* (eepitch-kill)
* (eepitch-python)
import assemblyai as aai
aai.settings.api_key = "45a318aad48d46f7a2ec55978f6674b1"
transcriber = aai.Transcriber()

print(aai.LanguageCode.en)
print(aai.LanguageCode.pt)
print(aai.LanguageCode.foo)

print(transcriber)
print(transcriber.language_code())
# transcript = transcriber.transcribe("https://storage.googleapis.com/aai-web-samples/news.mp4")
transcript = transcriber.transcribe("/tmp/2023mateus.mp3")

print(transcript.text)
print(transcript.export_subtitles_srt())
print(transcript.export_subtitles_vtt())

* (eepitch-shell)
* (eepitch-kill)
* (eepitch-shell)
pip3 install assemblyai

https://github.com/openai/whisper

https://www.google.com/search?q=how+do+I+set+the+language+in+openai+whisper&oq=how+do+I+set+the+language+in+openai+whisper

# https://huggingface.co/freds0/distil-whisper-large-v3-ptbr
# (find-fline "~/LOGS/2025feb07.sachac")

#####
#
# aeneas
# 2024dec30
#
#####

# «aeneas»  (to ".aeneas")
# https://sachachua.com/blog/category/subed/
# (find-fline "~/2024dec29.emacs" "https://github.com/readbeyond/aeneas")

# (find-sh "espeak -s 130 -vfr 'maison voiture nappe'")

https://itinerarium.github.io/phoneme-synthesis/
https://github.com/itinerarium/phoneme-synthesis/

https://en.wikipedia.org/wiki/International_Phonetic_Alphabet

https://packages.debian.org/sid/transcriber
http://trans.sourceforge.net/en/presentation.php
http://transag.sourceforge.net/

https://en.wikipedia.org/wiki/P%C3%A1draig
https://en.wikipedia.org/wiki/Help:IPA/Irish

https://lists.gnu.org/archive/html/emacs-tangents/2021-01/msg00076.html Devin Prater
https://wiki.hyperbola.info/doku.php?id=en:main:downloads&redirect=1#hypertalking_live_image

https://per.quest/
https://news.ycombinator.com/item?id=28170609 Show HN: Instantly listen to any URL (per.quest)

https://emacspeak.blogspot.com/2021/11/emacspeak-in-age-of-cloud-computing.html
http://emacspeak.sourceforge.net/raman/publications/bc-emacspeak/
https://emacspeak.blogspot.com/2022/04/announcing-emacspeak-560-agiledog.html
https://emacspeak.blogspot.com/2022/06/emacspeak-complete-audio-desktop-under.html

https://news.ycombinator.com/item?id=33278785 Buzz: Transcribe audio from your microphones in real-time using OpenAI's Whisper (github.com/chidiwilliams) - speech
https://github.com/chidiwilliams/buzz
https://github.com/openai/whisper#setup
https://openai.com/blog/whisper/

https://news.ycombinator.com/item?id=34211457 Ask HN: Are there any good open source text-to-speech tools?
https://news.ycombinator.com/item?id=34229502 Ask HN: Alternatives to Scratch for a blind child?
https://news.ycombinator.com/item?id=34361651 This Voice Doesn't Exist – Generative Voice AI (elevenlabs.io)
https://news.ycombinator.com/item?id=35365399 Universal Speech Model (research.google)
https://github.com/huggingface/distil-whisper
https://news.ycombinator.com/item?id=38093353 Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller (github.com/huggingface)
https://github.com/abus-aikorea/voice-pro
https://news.ycombinator.com/item?id=42261909 Show HN: Voice-Pro – AI Voice Cloning Magic: Transform Any Voice in 15 Seconds (github.com/abus-aikorea)
https://adrianroselli.com/2025/03/dont-use-fake-bold-or-italic-in-social-media.html
https://news.ycombinator.com/item?id=43302835 Accessibility: Don't Use Fake Bold or Italic in Social Media (adrianroselli.com)

#  Local Variables:
#  coding:               utf-8-unix
#  End: