🤖 Hackathon Submission

Meeting Insights Generator

Built for the 'From Words to Worlds — Multimodal GenAI' hackathon, this app runs a meeting recording through a complete meeting-intelligence pipeline. Upload a recording and AssemblyAI transcribes it to full text. The Facebook Bart Large CNN model on Hugging Face then summarizes the transcript. Key points from the summary drive Pexels image searches for visual highlights. Finally, Deepgram converts the summary to an audio file, and all generated content is stored persistently on Cloudinary.
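The stage order described above can be sketched as a small orchestrator. This is a minimal illustration, not the project's actual code: the `PipelineStages` interface, the `runPipeline` function, and every field name are hypothetical, and each stage is injected as a plain async function so that the real AssemblyAI, Hugging Face, Pexels, and Deepgram clients can be swapped in. The Cloudinary persistence step is left out for brevity.

```typescript
// Hypothetical stage signatures; each one would wrap a single external service.
interface PipelineStages {
  transcribe: (recordingUrl: string) => Promise<string>; // speech -> text
  summarize: (transcript: string) => Promise<string>;    // text -> summary
  findImages: (summary: string) => Promise<string[]>;    // summary -> image URLs
  synthesize: (summary: string) => Promise<string>;      // summary -> audio URL
}

// Hypothetical result shape returned to the frontend.
interface MeetingInsights {
  transcript: string;
  summary: string;
  imageUrls: string[];
  speechUrl: string;
}

// Runs the stages in the order the description gives:
// speech -> text -> summary -> image -> audio.
async function runPipeline(
  recordingUrl: string,
  stages: PipelineStages
): Promise<MeetingInsights> {
  const transcript = await stages.transcribe(recordingUrl);
  const summary = await stages.summarize(transcript);
  const imageUrls = await stages.findImages(summary);
  const speechUrl = await stages.synthesize(summary);
  return { transcript, summary, imageUrls, speechUrl };
}
```

Because the image search and the speech synthesis both depend only on the summary, they could also run concurrently with `Promise.all`. The sequential version is shown here because it mirrors the order in the description.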

Key Features

  • Audio upload → full text transcription via AssemblyAI
  • AI summarization using Facebook Bart Large CNN (Hugging Face)
  • Visual highlights — Pexels images matched to key summary points
  • Text-to-speech audio output of summary via Deepgram
  • All generated content stored persistently on Cloudinary
  • Full multimodal pipeline: speech → text → summary → image → audio
  • React + Vite frontend with Node.js/Express backend
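As one possible reading of the "visual highlights" feature, the sketch below turns a summary into key points and builds a Pexels search request for each. How the project actually selects key points is not stated, so `extractKeyPoints` (take the first few sentences) is purely an assumption, and both helper names are hypothetical. The request shape follows the public Pexels search endpoint (`/v1/search` with `query` and `per_page`, and the API key in the `Authorization` header). Nothing is sent over the network here.

```typescript
// Assumption: treat the first `max` sentences of the summary as key points.
function extractKeyPoints(summary: string, max = 3): string[] {
  return summary
    .split(/(?<=[.!?])\s+/) // split after sentence-ending punctuation
    .map((s) => s.trim())
    .filter((s) => s.length > 0)
    .slice(0, max);
}

// Hypothetical description of a request, ready to pass to fetch().
interface PexelsRequest {
  url: string;
  headers: Record<string, string>;
}

// Builds (but does not send) a Pexels photo search request for one key point.
function buildPexelsSearch(
  query: string,
  apiKey: string,
  perPage = 1
): PexelsRequest {
  const params = new URLSearchParams({ query, per_page: String(perPage) });
  return {
    url: `https://api.pexels.com/v1/search?${params.toString()}`,
    headers: { Authorization: apiKey },
  };
}
```

A caller might map `extractKeyPoints(summary)` through `buildPexelsSearch` and pass each result to `fetch(url, { headers })`. Whole sentences make fairly crude search queries, so a real implementation would probably shorten each key point to a few keywords first.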

Tech Stack

React, Vite, Node.js, Express.js, MongoDB, AssemblyAI, HuggingFace Bart CNN, Pexels API, Deepgram API, Cloudinary, JWT

Award

Prize: Submission

Event: From Words to Worlds — Multimodal GenAI

Type: Hackathon / Bounty