Indexed in EI / SCOPUS / CSCD

Chinese Core Journal

The effect of sample size on the evaluation of the speaker discriminatory power of diphthongs

  • Abstract: Linear discriminant analysis (LDA) is commonly used in forensic voice comparison (FVC) to quantify the speaker discriminatory power of vowel formant parameters. This study investigates how large a sample (number of speakers) is needed for the classification rate (CR) to be robust, that is, to reliably reflect the relative forensic value of different formants. The material is the Mandarin Chinese diphthong /ei/ produced by 660 young male speakers from Beijing, from which the dynamic trajectories of the first four formants (F1-F4) are extracted. Each raw formant trajectory is time-normalized into ten equal parts and then fitted with a polynomial to reduce dimensionality. With the fitted coefficients as predictor variables, the CR of each formant parameter is analyzed systematically across sample sizes. The results show that, when F4 is excluded, at least 40 speakers are needed to stably reflect the relative speaker discriminatory power of the individual formants F1-F3. When combinations of F1-F3 are analyzed, or when F4 is included, the minimum sample size increases substantially.
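The pipeline described in the abstract (time normalization, polynomial fitting, then LDA on the fitted coefficients) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names, sampling at 11 equally spaced points to obtain ten equal parts, the cubic polynomial degree, equal class priors, leave-one-out scoring, and the synthetic F2-like tracks that stand in for real measurements.

```python
import numpy as np

def time_normalize(track, n_parts=10):
    """Resample a raw formant track at n_parts + 1 equally spaced points (assumed scheme)."""
    track = np.asarray(track, dtype=float)
    src = np.linspace(0.0, 1.0, len(track))
    dst = np.linspace(0.0, 1.0, n_parts + 1)
    return np.interp(dst, src, track)

def poly_coefficients(track, degree=3, n_parts=10):
    """Fit a polynomial to the normalized track; the coefficients are the LDA predictors."""
    y = time_normalize(track, n_parts)
    t = np.linspace(0.0, 1.0, y.size)
    return np.polyfit(t, y, degree)  # highest power first, degree + 1 values

def lda_predict(X_train, y_train, X_test):
    """Classify with linear discriminant functions (pooled covariance, equal priors)."""
    classes = np.unique(y_train)
    means = np.array([X_train[y_train == c].mean(axis=0) for c in classes])
    centered = X_train - means[np.searchsorted(classes, y_train)]
    pooled = centered.T @ centered / (len(X_train) - len(classes))
    inv = np.linalg.pinv(pooled)
    scores = X_test @ inv @ means.T - 0.5 * np.sum(means @ inv * means, axis=1)
    return classes[np.argmax(scores, axis=1)]

def loo_classification_rate(X, y):
    """Leave-one-token-out classification rate (share of tokens assigned to their speaker)."""
    hits = 0
    for i in range(len(X)):
        keep = np.arange(len(X)) != i
        hits += lda_predict(X[keep], y[keep], X[i:i + 1])[0] == y[i]
    return hits / len(X)

def synthetic_tokens(n_speakers, n_tokens=8, seed=0):
    """Hypothetical stand-in data: rising F2-like tracks with speaker-specific shape."""
    rng = np.random.default_rng(seed)
    t = np.linspace(0.0, 1.0, 37)  # arbitrary raw frame count
    X, y = [], []
    for spk in range(n_speakers):
        onset = rng.normal(1700.0, 120.0)  # speaker-specific onset (Hz)
        rise = rng.normal(500.0, 80.0)     # speaker-specific rise (Hz)
        for _ in range(n_tokens):
            track = onset + rise * t**2 + rng.normal(0.0, 20.0, t.size)
            X.append(poly_coefficients(track))
            y.append(spk)
    return np.array(X), np.array(y)

if __name__ == "__main__":
    # Classification rate on synthetic data for a few sample sizes (number of speakers).
    for n in (10, 20, 40):
        X, y = synthetic_tokens(n)
        print(n, round(loo_classification_rate(X, y), 3))
```

Repeating the last loop over many random subsets of speakers, separately for each formant, is one way to check whether the ranking of formants by CR stays stable at a given sample size, which is the question the study addresses.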

     
