dqt / quality

Per-detector F1 on synthetic warehouse shapes, NAB subset, and Yahoo Webscope S5 subset. results.csv ยท algorithm docs

Updated automatically on each PyPI release. Every row is averaged over all benchmark datasets.
detectors
64
best f1
1.000 (bocpd)
avg f1
0.875
avg precision
0.867
DetectorFamilyF1 vPrecisionRecallN
bocpdtimeseries1.0001.0001.0001
100%
callable_checkrule1.0001.0001.0001
100%
cardinality_in_rangerule1.0001.0001.0001
100%
column_pair_comparisonrule1.0001.0001.0001
100%
completenessrule1.0001.0001.0001
100%
composite_uniquenessrule1.0001.0001.0001
100%
cusumtimeseries1.0001.0001.0001
100%
date_formatrule1.0001.0001.0001
100%
date_part_missing_fractionrule1.0001.0001.0001
100%
double_mad_outlier_fractionoutlier1.0001.0001.0001
100%
ecodoutlier1.0001.0001.0001
100%
freshness_seconds_behindrule1.0001.0001.0001
100%
generalized_esdoutlier1.0001.0001.0001
100%
hbosoutlier1.0001.0001.0001
100%
iqr_fenceoutlier1.0001.0001.0001
100%
js_divergencedrift1.0001.0001.0001
100%
kl_divergencedrift1.0001.0001.0001
100%
ks_pvaluedrift1.0001.0001.0001
100%
lofoutlier1.0001.0001.0001
100%
mahalanobis_distanceoutlier1.0001.0001.0001
100%
max_in_rangerule1.0001.0001.0001
100%
median_in_rangerule1.0001.0001.0001
100%
min_in_rangerule1.0001.0001.0001
100%
mmddrift1.0001.0001.0001
100%
monotonicitytimeseries1.0001.0001.0001
100%
null_fractionrule1.0001.0001.0001
100%
numeric_meanrule1.0001.0001.0001
100%
outlier_fraction_driftdrift1.0001.0001.0001
100%
page_hinkleytimeseries1.0001.0001.0001
100%
quantile_in_rangerule1.0001.0001.0001
100%
referential_integrity_raterule1.0001.0001.0001
100%
regex_matchrule1.0001.0001.0001
100%
row_count_in_rangerule1.0001.0001.0001
100%
schema_changerule1.0001.0001.0001
100%
set_exclusionrule1.0001.0001.0001
100%
set_membershiprule1.0001.0001.0001
100%
sql_assertion_violationrule1.0001.0001.0001
100%
stddev_in_rangerule1.0001.0001.0001
100%
string_case_violationrule1.0001.0001.0001
100%
string_length_rangerule1.0001.0001.0001
100%
sum_in_rangerule1.0001.0001.0001
100%
uniquenessrule1.0001.0001.0001
100%
validityrule1.0001.0001.0001
100%
value_in_rangerule1.0001.0001.0001
100%
volumerule1.0001.0001.0001
100%
wasserstein_1drift1.0001.0001.0001
100%
auto_outlieroutlier0.9520.9091.0001
95%
benford_law_fitdistribution0.9520.9091.0001
95%
grubbsoutlier0.9520.9091.0001
95%
zscore_outlier_fractionoutlier0.9520.9091.0001
95%
adjusted_boxplot_fractionoutlier0.9090.8331.0001
91%
adwindrift0.9090.8331.0001
91%
psidrift0.7690.6251.0001
77%
isolation_forest_fractionoutlier0.7410.5881.0001
74%
holt_winterstimeseries0.6670.5001.0001
67%
mutual_informationdistribution0.6670.5001.0001
67%
one_class_svmoutlier0.6670.5001.0001
67%
stl_residual_zscoretimeseries0.6670.5001.0001
67%
matrix_profiletimeseries0.1821.0000.1001
18%
chi_square_driftdrift0.0000.0000.0001
0%
cramers_vdistribution0.0000.0000.0001
0%
mad_outlier_fractionoutlier0.0000.0000.0001
0%
prophet_anomalytimeseries0.0000.0000.0001
0%
remote_checkrule0.0000.0000.0001
0%
Benchmark: synthetic warehouse shapes (lognormal / normal / Poisson / Beta), NAB subset, Yahoo Webscope S5 subset. Source: examples/benchmarks/. License: MIT.