Encoding of audio information for categorizing audio samples.

Postgraduate Thesis uoadl:2779359 333 Read counter

Unit:
Κατεύθυνση Ηλεκτρονικός Αυτοματισμός (Η/Α, με πρόσθετη εξειδίκευση στην Πληροφορική και στα πληροφοριακά συστήματα)
Library of the School of Science
Deposit date:
2018-08-03
Year:
2018
Author:
Lianos Vasileios
Supervisors info:
Διονύσης Ρεΐσης, Αναπληρωτής Καθηγητής, Τμήμα Φυσικής, Σχολή Φυσικών Επιστημών
Έκτορας Νισταζάκης, Αναπληρωτής Καθηγητής, Τμήμα Φυσικής, Σχολή Φυσικών Επιστημών
Δημήτριος Φραντζεσκάκης, Καθηγητής, Τμήμα Φυσικής, Σχολή Φυσικών Επιστημών
Original Title:
Encoding of audio information for categorizing audio samples.
Languages:
English
Translated title:
Encoding of audio information for categorizing audio samples.
Summary:
With the use of a powerful tool, implemented by Numenta, this thesis tries to implement a system, simulating the mammalian auditory system, that will obtain audio samples and try to extract a comparable representation of it. These representations could then be used in various ways and for many applications. One of them could be the embedment of these representations into an Euclidean space leading to their, corresponding audio samples, catecorization.
To do so, an algorithm simulating the mammalian cochlea and its components is designed. This thesis also makes use of a time to frequency transform specially designed for this purpose.
For the context of this thesis, the system designed is partially implemented and an experiment, acting as a proof of concept, is performed. The results show that the algorithm has an effective performance and leaves many promises to future endeavours.
Main subject category:
Science
Other subject categories:
Technology - Computer science
Keywords:
Neural network, Sound, Cochlea, Encoding, Categorization
Index:
No
Number of index pages:
0
Contains images:
Yes
Number of references:
30
Number of pages:
32
Thesis_Vasileios_Lianos_2014512.pdf (1 MB) Open in new window