Voice and Large Language Models (LLMs)
This project transcribes audio input with speech recognition models and uses large language models to extract insights from the resulting transcripts.
It covers transcription, speaker diarization, and analysis of recorded meetings and voice notes, and produces actionable summaries and key points.
- AI and ML: OpenAI API, WhisperX/Faster Whisper, Hugging Face models
- Programming and Tools: Python, LangChain, Gradio
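
One step in a pipeline like the one described above is attaching speaker labels from diarization to transcribed segments before asking an LLM for a summary. The following is a minimal sketch of that step, not the project's actual code. The `Segment` and `SpeakerTurn` types, the `assign_speakers` and `build_summary_prompt` helpers, the speaker names, and the sample data are all hypothetical; it assumes the transcriber yields timed text segments and the diarizer yields timed speaker turns, and it assigns each segment to the speaker whose turn overlaps it the most.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcribed chunk of audio (hypothetical shape; times in seconds)."""
    start: float
    end: float
    text: str


@dataclass
class SpeakerTurn:
    """A diarization result: who spoke during a time span (hypothetical shape)."""
    start: float
    end: float
    speaker: str


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments, turns):
    """Label each segment with the speaker whose turn overlaps it the most."""
    labeled = []
    for seg in segments:
        best = max(
            turns,
            key=lambda t: overlap(seg.start, seg.end, t.start, t.end),
            default=None,
        )
        # Fall back to UNKNOWN when no turn overlaps the segment at all.
        if best is None or overlap(seg.start, seg.end, best.start, best.end) == 0:
            labeled.append(("UNKNOWN", seg.text))
        else:
            labeled.append((best.speaker, seg.text))
    return labeled


def build_summary_prompt(labeled):
    """Build the text that would be sent to an LLM (the call itself is omitted)."""
    transcript = "\n".join(f"{speaker}: {text}" for speaker, text in labeled)
    return (
        "Summarize the meeting transcript below. "
        "List key points and action items.\n\n" + transcript
    )


# Hypothetical sample data standing in for real transcription and diarization output.
segments = [
    Segment(0.0, 4.0, "Let's review the release plan."),
    Segment(4.2, 9.0, "I can finish the tests by Friday."),
]
turns = [
    SpeakerTurn(0.0, 4.1, "SPEAKER_00"),
    SpeakerTurn(4.1, 10.0, "SPEAKER_01"),
]

labeled = assign_speakers(segments, turns)
prompt = build_summary_prompt(labeled)
print(prompt)
```

The resulting prompt string could then be passed to whichever LLM client the project uses. Maximum overlap is only one simple assignment rule; segments that span a speaker change would need to be split at word level for a more accurate result.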