AudioNavigator: Guide Audio Interattive per il Patrimonio

Description

AudioNavigator is an AI-based solution for navigation, consultation, and intelligent search within audio content.

It transforms audio files—often difficult to explore and rich in unannotated information—into materials that are immediately searchable, structured, and understandable.

Features

Automatic transcription of multilingual audio, with timestamps
Speaker recognition (speaker diarization)
Entity recognition (names, places, key terms)
Generation of an abstract of the content
Language detection within the audio
Full-text search and direct jump to the corresponding point in the audio
Metadata editing and export in standard formats (JSON, XML)

Architecture and Technologies

AudioNavigator acts as the consultation interface for the entire audio processing workflow. It relies on an AI system composed of:

AudioProcessor: an AI component dedicated to metadata extraction. It receives an audio file via API and generates a JSON containing the transcription and extracted metadata through speech-to-text models and LLMs
MetaAssistant: a platform that collects, stores, and organizes all metadata and makes it searchable.

As a result, the AudioNavigator interface allows users to listen to audio synchronized with the text, filter by entities, and quickly search for specific information.

Advantages

Automatic cataloging of nearly the entire audio archive
Multilingual audio transcription
Drastic reduction in consultation time
Scalability across thousands of hours of audio
Consistency of generated metadata
Much faster and more accurate search

Use Cases

Preparation of conference proceedings
Radio archives and oral history collections
Multimedia collections of museums
Educational portals and cultural platforms
Interactive digital exhibitions and installations
Digital storytelling projects