sound-event-detection

Here are 60 public repositories matching this topic...

RetroCirce / HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

python music-information-retrieval audio-classification sound-event-detection transformer-models

Updated Sep 18, 2025
Python

sharathadavanne / seld-net

Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using a convolutional recurrent neural network

tracking localization localisation sound-event-detection seld sound-event-localization sound-event-localization-detection

Updated Nov 21, 2022
Python

FireRedTeam / FireRedVAD

SOTA industrial-grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages and outperforming Silero-VAD, TEN-VAD, FunASR-VAD, and WebRTC-VAD

vad voice-activity-detection aed sound-event-detection audio-event-classification audio-event-detection

Updated Apr 4, 2026
Python

soham97 / awesome-sound_event_detection

Reading list for research topics in Sound AI

representation-learning audio-processing zero-shot-learning icassp sound-event-detection interspeech acoustic-scene-classification audio-captioning audio-generation audio-retrieval

Updated Aug 8, 2024

Audio-WestlakeU / ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

semi-supervised-learning sed sound-event-detection fine-tuning self-supervised-learning atst

Updated Apr 22, 2026
Jupyter Notebook

thomeou / SALSA

This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.

feature-extraction microphone-array sound-event-detection sound-event-localization first-order-ambisonics

Updated May 31, 2022
Python

adobe-research / openflam

OpenFLAM: Framewise Language Audio Model

audio-processing sound-event-detection audio-text-modeling

Updated Jan 14, 2026
Python

giusenso / seld-tcn

SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow

neural-network tensorflow keras convolutional-neural-networks audio-processing audio-recognition keras-tensorflow sound-event-detection direction-of-arrival seldnet seld-tcn

Updated Oct 1, 2020
Python

omine-me / LaughterSegmentation

2024 latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspeech 2024

speech laugh-detection sound-synthesis laughter sound-event-detection laughter-detection laughter-segmentaion

Updated Sep 1, 2024
Python

turpaultn / DCASE2019_task4

Baseline for DCASE 2019 Task 4

baseline sound-event-detection dcase-challenge

Updated Sep 2, 2022
Python

dr-costas / dnd-sed

Sound event detection with depthwise separable and dilated convolutions.

machine-learning deep-neural-networks deep-learning audio-signal-processing sound-event-detection depthwiseseparableconvolution machine-listening depthwise-separable-convolutions dilated-cnn dilated-convolution

Updated Mar 30, 2020
Python

koukyo1994 / kaggle-birdcall-6th-place

Training code of Cornell Birdcall Identification Challenge 6th place solution

python pytorch kaggle sound-event-detection kaggle-solution audio-tagging birdsong-recognition

Updated Oct 12, 2020
Python

jim-schwoebel / sound_event_detection

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

machine-learning acoustic-fingerprinting object-detection event-detection acoustics object-detection-pipelines audioset acoustic-model sound-event-detection acoustic-features object-detection-label common-voice common-voice-tool voice-computing object-detection-accuracy voicebook surveylex neurolex

Updated Feb 20, 2022
Python

MTG / DCASE-models

Python library for rapid prototyping of environmental sound analysis systems

python deep-learning audio-classification sound-event-detection audio-tagging

Updated May 20, 2022
Jupyter Notebook

Jungjee / DcaseNet

Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, and sound event detection. Implemented using PyTorch.

pytorch dcase sound-event-detection audio-tagging acoustic-scene-classification