Workflow overview
Why this workflow matters
Relevant for managed services and support workflows. Supports knowledge capture and document intelligence use cases.
Telegram Voice Notes to Markdown Journal with Groq Whisper and Gemini

This workflow transforms Telegram voice notes and audio files into clean markdown journal entries using Groq Whisper, CloudConvert, and Google Gemini. It automatically downloads incoming Telegram audio messages, transcribes them with Groq Whisper, cleans the transcript into a readable reflective journal using Gemini, converts the result into a markdown file, and sends the journal back to Telegram.

For better audio compatibility, the workflow also includes a CloudConvert fallback path. If Groq Whisper rejects the original audio file because of container or codec issues, the workflow converts the audio to MP3 with CloudConvert and retries transcription automatically. An optional Google Drive section can also archive all generated journals automatically.

This workflow is useful for journaling, voice-based note-taking, personal reflection, idea capture, daily logging, and building a lightweight AI-assisted second brain.

Common use cases
- Convert Telegram voice notes into readable journal entries
- Convert uploaded audio files into markdown journals
- Create markdown journals automatically using AI
- Capture thoughts, reflections, and ideas on the go
- Build a voice-based personal knowledge system
- Handle more audio formats by converting failed files to MP3
- Archive AI-cleaned journals to Google Drive
- Learn how to combine Telegram, Groq Whisper, CloudConvert, Gemini, markdown generation, and Google Drive in AlekSystem

How it works
1. Starts from a Telegram Trigger
2. Detects incoming voice notes or audio files
3. Downloads the Telegram audio file
4. Sends the original audio to Groq Whisper for transcription
5. If Groq rejects the file, converts the audio to MP3 with CloudConvert
6. Retries transcription with Groq Whisper using the converted MP3 file
7. Formats the transcript with timestamps
8. Sends the transcript to Google Gemini for cleanup and readability improvements
9. Preserves the original tone and meaning while removing filler words and transcription artifacts
10. Parses the AI response into structured JSON
11. Converts the final journal into a markdown (.md) file
12. Sends the markdown journal back to Telegram
13. Optionally uploads the journal to Google Drive

Setup steps
1. Connect your Telegram credentials
2. Open the Transcribe Audio with Groq Whisper node
3. Replace the placeholder Authorization header with your Groq API key
4. Install the verified CloudConvert community node if it is not already installed
5. Connect your CloudConvert credentials for the MP3 fallback path
6. Connect your Google Gemini credentials
7. (Optional) Connect your Google Drive credentials
8. Test the workflow manually with a Telegram voice note or audio file
9. Activate the workflow
10. Send a voice note or audio file to your Telegram bot

CloudConvert Audio Compatibility Fallback
The CloudConvert path is included to improve reliability with different audio uploads. Groq Whisper supports common audio formats, but some files may still fail because of unusual containers, codecs, MIME types, or metadata. When that happens, this workflow converts the failed audio file to MP3 with CloudConvert, then sends the MP3 back to Groq Whisper for transcription.

This fallback is especially useful for some uploaded M4A, OGG, OPUS, or recording files that may not be accepted directly. If you only want to support files that Groq accepts directly, you can disconnect the CloudConvert fallback path.

Optional Google Drive Backup
The Google Drive section is fully optional. If connected, the workflow:
- Searches for a Personal Journal (AlekSystem) folder
- Creates the folder automatically if it does not exist
- Uploads generated markdown journals automatically

If you do not want cloud backups, simply disconnect the Google Drive section.
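The CloudConvert fallback described above follows a try, convert, retry pattern. The sketch below is only an illustration of that control flow, not the workflow's actual node logic: `transcribe`, `convert_to_mp3`, and `TranscriptionRejected` are hypothetical stand-ins for the Groq Whisper request, the CloudConvert job, and whatever error the transcription step raises on an unreadable file.

```python
from typing import Callable, Tuple


class TranscriptionRejected(Exception):
    """Hypothetical error: the transcriber could not read the audio file."""


def transcribe_with_fallback(
    audio: bytes,
    transcribe: Callable[[bytes], str],
    convert_to_mp3: Callable[[bytes], bytes],
) -> Tuple[str, bool]:
    """Return (transcript, used_fallback).

    First try the original audio. Only if the transcriber rejects it,
    convert to MP3 and retry once. A second rejection is not caught,
    so the caller sees the failure.
    """
    try:
        return transcribe(audio), False
    except TranscriptionRejected:
        mp3_audio = convert_to_mp3(audio)
        return transcribe(mp3_audio), True
```

Passing the two steps in as callables keeps the sketch independent of any particular HTTP client. Disconnecting the fallback path corresponds to calling `transcribe` directly.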
Notes
This workflow uses:
- Groq Whisper (whisper-large-v3) for transcription
- CloudConvert for optional audio-to-MP3 fallback conversion
- Google Gemini for transcript cleanup and journal formatting
- Telegram Bot API for message delivery
- Markdown file generation for portable journaling

The AI prompt is designed to:
- Preserve the speaker’s original meaning
- Keep the natural reflective monologue style
- Improve punctuation and readability
- Avoid summarization or hallucinated content

The generated markdown journals can easily be imported into note-taking tools such as Obsidian, Logseq, Notion, or any markdown-based system.

Need Help?
Have questions or want to connect? Reach me on LinkedIn.
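As a rough illustration of the timestamp formatting, JSON parsing, and markdown generation steps in this workflow, the sketch below shows one way they could fit together. Everything here is an assumption made for illustration: the segment shape (`start` in seconds plus `text`), the journal keys (`title`, `date`, `body`), and all function names are hypothetical and are not taken from the workflow's nodes.

```python
import json


def format_timestamp(seconds: float) -> str:
    """Render a second offset as MM:SS (minutes may exceed 59)."""
    total = int(seconds)
    return f"{total // 60:02d}:{total % 60:02d}"


def format_segments(segments: list) -> str:
    """Prefix each transcript segment with its start time."""
    return "\n".join(
        f"[{format_timestamp(seg['start'])}] {seg['text'].strip()}"
        for seg in segments
    )


def parse_ai_json(raw: str) -> dict:
    """Extract the outermost JSON object from a model reply.

    Models sometimes wrap JSON in extra prose, so this takes the text
    between the first '{' and the last '}' before parsing.
    """
    start, end = raw.find("{"), raw.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object found in model reply")
    return json.loads(raw[start : end + 1])


def render_markdown(entry: dict) -> str:
    """Build the journal text; 'title', 'date', 'body' are assumed keys."""
    return f"# {entry['title']}\n\n*{entry['date']}*\n\n{entry['body'].strip()}\n"
```

The string returned by `render_markdown` can be written to a `.md` file and opened in any markdown-based tool.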