尖叫检测

Scream detection

我正在从事一个需要检测某些语音模式的项目。例如"someone is screaming"：因为我不知道那个人是谁，所以child，男人，女人......各有各的声音......等等

所以，我正在寻找一种检测 "screaming" 的方法，例如，尽可能多地保存 "screaming" 的指纹，然后当我需要检查声音是否是"screaming" 语音，我可以为它创建一个指纹，然后搜索并查看我是否可以在我已有的 "screaming" 个指纹列表中找到相似之处。

我的方法是使用类似于以下项目的东西：

每个人都会给我一个特定声音的独特指纹，对吧？，我的问题是：我如何能够在 "screaming" 指纹列表中搜索相似性，是否有任何可能的方法来生成分数或每个指纹的 return % 相似性，以便我可以决定语音是否我' m 测试是按 % 还是有尖叫？

谢谢， J.B

My approach is to use something like the following projects:

这不是个好主意，尖叫通常是相当稳定的声音，而所有这些图书馆都在搜索声音中的不规则之处。他们不会检测到任何东西。最好改用简单的 DNN-LSTM 分类器。您可以使用 tensorflow 或任何其他 DNN 框架对其进行训练。您可以在此处找到算法的描述；

Deep Recurrent Neural Network-based Autoencoders for Acoustic Novelty Detection

或此处：

Deep Neural Networks for Automatic Detection of Screams and Shouted Speech In Subway Trains

How would I be able to search for a similarity on the list of "screaming" fingerprints, is there any possible way to generate score or return % of similarity to each fingerprint so I can decide if the voice i'm testing is by % or have a screaming?

在您的第一个库中，您可以使用 queryResult.BestMatch.Confidence 例如：

Confidence - returns a value between [0, 1]. A value below 0.15 is most probably a false positive. A value bigger than 0.15 is very likely to be an exact match. For good audio quality queries you can expect getting a confidence > 0.5.

尖叫检测

Scream detection

speech-recognition

similarity

audio-fingerprinting