1 Department of Electrical and Computer Engineering, Babol Noshirvani University of Technology, Tehran, Iran
2 Son Corporate Group, Tehran, Iran
10.22034/jsmta.2026.22764.1173
Abstract
Many anomaly detection algorithms require knowledge of the ratio of the two labels to operate. In real life, however, we may not have access to this value. As a result, we often run anomaly detection packages with default values that may differ significantly from the actual value. Experiments on multiple datasets show that determining this ratio correctly, or at least obtaining a close estimate, can make a significant difference in the final performance of the anomaly detection algorithm. In this paper, we address the problem of estimating this ratio using both theoretical and heuristic techniques. In the theoretical method, we maximize the mutual information between features and labels to find the exact ratio. In the heuristic method, we sweep the [0, 1] range in steps of 0.01 to search for the ratio. On each iteration, we run the anomaly detection algorithm with the ratio for that iteration and record the correlation coefficient between the features and the labels generated by the algorithm. After the 100th iteration, we declare the ratio that yields the maximum correlation coefficient to be our estimate of the label ratio. Our experiments on multiple datasets and several anomaly detection algorithms show that maximizing the correlation coefficient leads to the best results.
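The heuristic sweep described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `toy_detector` is a hypothetical stand-in for any anomaly detection algorithm that takes a label ratio, and `mean_abs_correlation` assumes that the feature-label correlation is aggregated as the mean absolute Pearson coefficient across features, which the abstract does not specify. Ratios that produce a constant label vector (for example 1.00) are given a score of 0 because the correlation is undefined there.

```python
import numpy as np


def toy_detector(X, ratio):
    """Hypothetical stand-in detector: flag the `ratio` fraction of points
    that lie farthest from the feature-wise median."""
    dist = np.linalg.norm(X - np.median(X, axis=0), axis=1)
    n_flag = int(round(ratio * len(X)))
    labels = np.zeros(len(X), dtype=int)
    if n_flag > 0:
        labels[np.argsort(dist)[-n_flag:]] = 1
    return labels


def mean_abs_correlation(X, labels):
    """Assumed aggregation: mean absolute Pearson correlation between each
    feature column and the generated labels."""
    if labels.min() == labels.max():
        return 0.0  # constant labels: correlation is undefined, score as 0
    corrs = []
    for j in range(X.shape[1]):
        col = X[:, j]
        if col.std() == 0:
            continue  # skip constant features
        corrs.append(abs(np.corrcoef(col, labels)[0, 1]))
    return float(np.mean(corrs)) if corrs else 0.0


def estimate_label_ratio(X, detector=toy_detector, n_steps=100):
    """Sweep candidate ratios 0.01, 0.02, ..., 1.00 and return the one whose
    generated labels correlate most strongly with the features."""
    ratios = np.arange(1, n_steps + 1) / n_steps
    scores = [mean_abs_correlation(X, detector(X, r)) for r in ratios]
    best = int(np.argmax(scores))
    return float(ratios[best]), scores


# Usage on synthetic data: 180 inliers and 20 shifted outliers.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (180, 2)), rng.normal(6, 1, (20, 2))])
ratio, scores = estimate_label_ratio(X)
print("estimated label ratio:", ratio)
```

In practice the stand-in detector would be replaced by the anomaly detection algorithm under study, with the swept value passed as its label-ratio (contamination) parameter.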
Kazemitabar, J., Rahimi, S. and Rezaei-Ghadim, A. (2025). Blind label ratio estimation. Journal of Statistical Modelling: Theory and Applications, 6(1), 105-114. doi: 10.22034/jsmta.2026.22764.1173