
Music & Audio Technology

My Research Experience and Projects: [Performance] Data Sonification Ensemble (with Chris Chafe, Robyn Schulkowsky, Joey Baron); [Machine Learning / Audio DSP] Application of ML-Based Time Series Forecasting to Audio Dynamic Range Compression; [Media Art Performance] Sounds Like Rain: An Immersive 3D Game-Like Audio-Visual Experience in First-Person Perspective with a Haunting Horror Ambiance; [Media Art Performance] Irrational Rhythm - ..
[Computer Vision] Automatic Cheating Detection for Online Webcam Exam
▲ Full Demo Video
code: https://github.com/soohyun123/Automatic-Cheating-Detection-for-Online-Webcam-Exam
pseudocode:
While loop takes video frames:
    Face Detection (OpenCV)
    if # of faces == 0: "ALERT!! No student"
    if # of faces > 1: "ALERT!! More than one student"
    if # of faces == 1:
        Hand Detection (Deep Learning, YOLO model)
        if # of hands == 0: "ALERT!! No hands"
        else: Draw yellow boxes for det..
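The alert logic in the pseudocode above can be sketched in Python as a small decision function. This is only an illustration, not the code from the linked repository: the function name `check_frame` and its two integer arguments are hypothetical, and the detectors themselves (OpenCV face detection, YOLO hand detection) are assumed to run elsewhere and pass in their counts. The excerpt is cut off after the hand-drawing step, so the sketch stops there as well.

```python
from typing import Optional


def check_frame(num_faces: int, num_hands: int) -> Optional[str]:
    """Return an alert message for one video frame, or None if no alert applies.

    num_faces and num_hands are assumed to come from separate detectors
    (hypothetical inputs; the detectors are not implemented here).
    """
    if num_faces == 0:
        return "ALERT!! No student"
    if num_faces > 1:
        return "ALERT!! More than one student"
    # Exactly one face: hands are only checked in this case.
    if num_hands == 0:
        return "ALERT!! No hands"
    # Hands were found. The original pseudocode continues from here
    # (drawing boxes), but the excerpt is truncated, so nothing more is assumed.
    return None


if __name__ == "__main__":
    for faces, hands in [(0, 0), (2, 1), (1, 0), (1, 2)]:
        print(faces, hands, check_frame(faces, hands))
```

Keeping the decision separate from the detectors makes the alert rules easy to test without a webcam or a trained model.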
[Deep Learning] Drum sample generator using conditional GAN ▲ Kick ▲ Snare ▲ Hi-hat ▲ Tom ▲ Bass Drum ▲ The drum samples are generated by a conditional WGAN with gradient penalty (WGAN-GP), which directly generates one-dimensional time-domain waveforms. Royalty-free drum samples from iZotope's BreakTweaker were used as the training dataset. ▲ Continuous morphing of the generated audio sample according to the continuous change (linear interpolation) in th..
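For reference, the standard WGAN-GP critic objective can be written as below, here with a conditioning label added. The notation is an assumption made for illustration ($D$ for the critic, $y$ for the drum-class condition, $\tilde{x}$ for a generated waveform, $\lambda$ for the penalty weight); the excerpt does not state the exact loss or hyperparameters used in the project.

```latex
\[
L_D \;=\;
\mathbb{E}_{\tilde{x} \sim P_g}\!\left[ D(\tilde{x}, y) \right]
\;-\;
\mathbb{E}_{x \sim P_r}\!\left[ D(x, y) \right]
\;+\;
\lambda \, \mathbb{E}_{\hat{x} \sim P_{\hat{x}}}\!\left[
  \left( \left\lVert \nabla_{\hat{x}} D(\hat{x}, y) \right\rVert_2 - 1 \right)^2
\right],
\qquad
\hat{x} = \epsilon x + (1 - \epsilon)\,\tilde{x},
\quad \epsilon \sim U[0, 1].
\]
```

The last term is the gradient penalty: it pushes the norm of the critic's gradient toward 1 at points sampled on straight lines between real and generated waveforms.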
[Speech Recognition] My seminar notes: from GMM-HMM to recent DNN techniques From June 2017 to February 2018, at the Statistical Speech & Sound Computing Lab of Prof. Hoi-rin Kim, I gave monthly seminar presentations on Automatic Speech Recognition (ASR), from classical techniques (e.g. GMM, HMM) to recent DNN-based techniques, with mathematical derivations and proofs. The seminar covered their history, extended versions, strengths and drawbacks, and most importantly, th..
[Spatial Audio] Spatial Separation of Audio Language Using a Line Array Loudspeaker System * Detailed technical report on this project: [pdf link] Spatial Separation of Audio Language Using a Line Array Loudspeaker System.pdf (drive.google.com) ▲ Demo Video In this project, I built a line array loudspeaker system that can spatially separate audio by language. People on the right side in front of the line array heard lyrics in Korean and people on the left side heard lyrics in Japanese..
Room Impulse Response Measurement with MATLAB code The measured responses all showed the same initial delay: the first peak arrived at 186.6 to 186.7 ms. The distance between the speaker and the microphone was 60.0 cm, which sound travels in about 1.75 ms (at the typical speed of sound in air, 343 m/s). The remaining discrepancy of about 185 ms might be due to the latency of my laptop's sound system. This project was an assignment for the class taught by Prof. Jungwoo Choi.
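The delay arithmetic above can be checked with a few lines of Python. This is a minimal sketch, not the MATLAB code from the post: the function and variable names are made up for illustration, and 186.6 ms is taken as the measured first-peak delay.

```python
SPEED_OF_SOUND_M_PER_S = 343.0  # typical speed of sound in air, as quoted in the post


def propagation_delay_ms(distance_m: float,
                         speed_m_per_s: float = SPEED_OF_SOUND_M_PER_S) -> float:
    """Time in milliseconds for sound to travel distance_m in a straight line."""
    return distance_m / speed_m_per_s * 1000.0


measured_first_peak_ms = 186.6                # first-peak delay reported in the post
acoustic_ms = propagation_delay_ms(0.60)      # 60.0 cm speaker-to-microphone distance
unexplained_ms = measured_first_peak_ms - acoustic_ms  # attributed to system latency

print(f"acoustic delay:    {acoustic_ms:.2f} ms")     # about 1.75 ms
print(f"unexplained delay: {unexplained_ms:.1f} ms")  # about 185 ms
```

Subtracting the acoustic travel time from the measured delay isolates the part that the playback and recording chain would have to account for.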
[Media Art, Web Audio API] The Music of Gravitational Field link: https://soohyun123.github.io/ctp431/final/final.html code: https://github.com/soohyun123/soohyun123.github.io/blob/master/ctp431/final/sketch.js ▲ Demo video Beyond the rules of music according to human taste and intention, how will the notes sound when they are played while interacting with each other according to the laws of physics? There are two modes: Guitar and Oscillator mode, in wh..
[Media Art, Web Audio API] Space Full of Music link: https://soohyun123.github.io/ctp431/hw2/hw2.html code: https://github.com/soohyun123/soohyun123.github.io/blob/master/ctp431/hw2/sketch.js ▲ Demo video There are three modes: Space, Daytime, and Night mode. 1) Click the Earth. 2) Click the Sun. 3) Click the Moon or the Space Ship. Now you know how it works. This project uses the time-varying amplitude and spectrum of music as mediums for v..