Who’s it for
For makers, founders, and productivity nerds who want to listen to their inbox instead of reading it.
No servers, no hosting — all done with n8n, a Telegram bot, and AI/ML API (LLM + TTS).
What it does / How it works
This workflow listens for new Gmail emails, extracts the sender, subject, date, and snippet, generates a 2–3 sentence natural summary using GPT-5 via AI/ML API, then converts the text into a lifelike voice message using Inworld TTS-1-Max.
The final audio file is downloaded and instantly delivered to your Telegram via a bot.
A separate auxiliary flow captures your Telegram chat_id once, stores it in a Data Table, and the main flow reuses it every time — no hardcoding required.
High-level flow:
- Gmail Trigger detects new incoming email
- Code node builds a clean
emailData block
- AI/ML API (GPT-5) generates a natural spoken summary
- AI/ML API (Inworld TTS-1-Max) converts summary → voice
- Audio file is downloaded
- Voice message is sent to the same Telegram chat
Requirements
Set up steps
- Create a Telegram bot via @BotFather and add Telegram API credentials in n8n
- Create an AI/ML API credential using your API key
- Add Gmail OAuth2 credentials for inbox access
- Import the workflow JSON
- Open nodes and attach the correct credentials (no hardcoded tokens inside nodes)
- Run the Telegram Trigger once and send any message to your bot — this saves your
chat_id
- Activate your workflow and receive your first voice-summary
How to customize
- Replace GPT-5 with another LLM from AI/ML API (same schema)
- Change prompt style: tone, structure, language, or verbosity
- Add language selection or multi-voice support
- Add filters (e.g., only important emails, only starred emails)
- Log all activity to Google Sheets or a database
- Add IF conditions: voice only during certain hours, weekday vs weekend rules
- Add rate-limits or user allowlists
- Change the voice model for different notification styles: corporate, friendly, narrator