51,99 €
inkl. MwSt.
Versandkostenfrei*
Versandfertig in 6-10 Tagen
payback
26 °P sammeln
  • Broschiertes Buch

Over the last decades, the development of digitization techniques and the increasing capacity of storage media together with decreasing storage costs has led to an explosion of digital content. Huge amounts of audio, images, and videos are generated daily and are mostly stored in an unstructured repository of multimedia information, much of which can be accessed through the Internet. To deal with these huge amounts of data, one of the main imperatives are tools and standards for data search and analysis. In particular, there is a need for efficient techniques that are able to extract semantic…mehr

Produktbeschreibung
Over the last decades, the development of digitization techniques and the increasing capacity of storage media together with decreasing storage costs has led to an explosion of digital content. Huge amounts of audio, images, and videos are generated daily and are mostly stored in an unstructured repository of multimedia information, much of which can be accessed through the Internet. To deal with these huge amounts of data, one of the main imperatives are tools and standards for data search and analysis. In particular, there is a need for efficient techniques that are able to extract semantic information directly from the content. The work presented in this book proposes methods to bridge the semantic gap in the computer vision and the musical audio mining domain. We propose features that are related with human perception and methods relying on machine learning techniques to relate these features with concepts. Next, we present a system to model, disclose, and manage different types of metadata. Furthermore, the interoperability with metadata standards is also tackled.
Autorenporträt
Gaëtan Martens obtained at Ghent University a master in computer science in 1999 and a Ph.D. in computer science engineering in 2011. His research focuses on signal processing in the musical audio mining and computer vision domain, machine learning techniques, and Semantic Web technologies. Since 2010, he works as a research engineer at TomTom.