Loading Events

« All Events

  • This event has passed.

Hamed Mohebbi Kalkhoran’s PhD Dissertation Defense

August 22, 2022 @ 3:00 pm - 4:00 pm

“Machine learning approaches for classification of myriad underwater acoustic events over continental-shelf scale regions with passive ocean acoustic waveguide remote sensing”

Abstract:

Underwater acoustic data contain a myriad of sound sources. Among underwater acoustic events, marine mammal vocalization classification is one of the most challenging problems due to their transient broadband calls, high variation in the calls of a specie, and high similarity between the calls of some species. Here, we developed machine learning approaches for classifying marine mammal vocalizations for real-time applications. We utilize acoustic data from a 160-element coherent hydrophone array and employ the passive ocean acoustic waveguide remote sensing technique to enable sensing and detections over instantaneous wide areas more than 100 km in diameter from the array. A variety of computational accelerating approaches, combining hardware and software, that make the methods desirable for real-time applications are also developed.

The humpback whale vocalizations can be divided into two classes, song and non-song calls. Here we use wavelet signal denoising and coherent array processing to enhance the signal-to-noise ratio. To build features vector for every time sequence of the beamformed signals, we employ Bag of Words approach to time-frequency features. Finally, we apply Support Vector Machine (SVM), Neural Networks, and Naive Bayes to classify the acoustic data and compare their performances. Best results are obtained using Mel Frequency Cepstrum Coefficient (MFCC) features and SVM which leads to 94% accuracy and 72.73% F1-score for humpback whale song versus non-song vocalization classification.

To classify a large variety of whale species vocalizations, we extracted time-frequency features from Power Spectrogram Density (PSD) of the beamformed signals. Then we used these features to train three classifiers, which are SVM, Neural Networks, and Random forest to classify six whale species: Fin, Sei, Blue, Minke, Humpback, and general Odontocetes. We also trained a set of Convolutional Neural Networks (CNN) to detect and classify each of these six whale vocalization categories directly using Per-Channel Energy Normalization (PCEN) spectrograms. Best results were obtained with Random forest classifier, which achieved 95% accuracy, and 85% F1 score. To detect transient sound sources, first we applied PCEN on the PSD of the beamformed signals. We applied thresholding on the PCEN data followed by morphological image opening to find potential sound sources and reduce noisy detections. Then we applied connected component analysis to obtain the final detected sounds for each bearing. To estimate the Direction of Arrival (DoA) of detected sounds, we applied non-maximum suppression (NMS), which is widely used in object detection applications in computer vision, on the detected sounds. We used mean power of each detected sound as the scores for NMS. To speed up the data processing, we investigated a variety of accelerating approaches, such as analyzing the effect of floating point precision, applying parallel processing, and implementing fast algorithms to run on GPU. During an experiment in the U.S. Northeast coast on board the US research vessel RV Endeavor in September 2021, we utilized the software and hardware advances developed here to record underwater acoustic data using Northeastern University in-house fabricated large aperture 160- element coherent hydrophone array with sampling frequency of 100 kHz per element.

Committee:

Prof. Purnima Ratilal (Advisor)

Prof. Themistoklis Sapsis

Prof. Devesh Tiwari

Details

Date:
August 22, 2022
Time:
3:00 pm - 4:00 pm
Website:
https://northeastern.zoom.us/j/96553787068?pwd=cGVjQk80d1dsSUZNS2VBampvQWsxZz09

Other

Department
Electrical and Computer Engineering
Topics
MS/PhD Thesis Defense
Audience
Faculty, Staff