Hello there all the DSP experts. As a novice with a problem, I have a humble request for guidance.
I am working with several 15-hour audio samples captured in an industrial environment of the following kind: Noisy background, constant whirring of machines, as if driving in a noisy vehicle, no major repetitive noise, some sudden sharp 'ping' (alarm signals from certain machines), and on top of that infrequent/sporadic human voice.
My main interest lies in hearing out the human communication. Currently, I am spending listening to the entire 15 hours!
I do not intend a perfect solution, just want to save as much time as possible, by trimming the portions where no human voice exists.
Is there any way I can achieve this through Signal Processing?
Thank you very much all.