Audio Visual Meaning Extraction Pipeline
ICT & Open Learning/Innovation
Client company:CitricLabs
Uroš Čolović
Aleksa Bandić
Project description
In our project, we are tackling the challenge of extracting meaningful information from audio-visual content on platforms like YouTube. The goal is to create a pipeline that not only transcribes spoken words but also summarizes the content, allowing users to understand video material at a glance.
Context
This project is a stepping stone towards more sophisticated content analysis tools. Our current focus lies on developing an innovative pipeline for transcribing and summarizing YouTube content, which serves as a valuable asset for users needing quick insights from lengthy videos. The technology is designed to be versatile and adaptable, with potential future applications that could include sentiment analysis and other forms of media evaluation.
Results
The project has yielded a prototype that demonstrates the core functionality of video content analysis, including transcription and summarization. It has shown promising results in streamlining the consumption of video information. Our aim is to establish a foundation for future enhancements that can adapt to diverse content analysis needs.