Third-party tests · independent validation

We don't ask you to take our word for it.
We show you the papers.

Three independent tests by three separate teams. Same conclusion every time.

BENCHMARK 01 · THIRD-PARTY TESTING

Independent validation — 98.7% F1 accuracy.

01 · WILDER SENSING · THIRD-PARTY · F1 ACCURACY
02 · JAN DRACHMANN · FIELD · 5 DETECTORS · 11 SPECIES

98.7%

F1 ACCURACY

Geoff Carss & Annabel Jeffries

Wilder Sensing

"We tested a range of existing solutions using real acoustic data, and BioSonic clearly stood out, achieving 98.7% accuracy (F1 score) and giving us full confidence in the results we're bringing to the platform."
F1 accuracy: Kaleidoscope 84.2% · BioSonic 98.7%

BioSonic's AI model applies image recognition to spectrograms, which delivers a step change in accuracy.
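The general recipe is to render each recording as a spectrogram and classify it the way an image classifier would. Below is a minimal sketch of that technique; it is not BioSonic's actual pipeline, and the mel settings and tiny CNN are illustrative assumptions.

```python
# Sketch of spectrogram-based image classification for bat calls.
# Not BioSonic's model: the mel settings and tiny CNN below are illustrative.
import librosa
import numpy as np
import torch
import torch.nn as nn

def audio_to_spectrogram(path, n_mels=128):
    """Load a recording at its native (ultrasonic) sample rate and build a spectrogram 'image'."""
    y, sr = librosa.load(path, sr=None)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    db = librosa.power_to_db(mel, ref=np.max)              # log scale, like a sonogram plot
    img = (db - db.min()) / (db.max() - db.min() + 1e-9)   # normalise to [0, 1]
    return torch.tensor(img, dtype=torch.float32).unsqueeze(0)  # shape: 1 x mels x frames

class CallClassifier(nn.Module):
    """Tiny CNN that treats the spectrogram as a single-channel image (e.g. bat vs no bat)."""
    def __init__(self, n_classes=2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
        )
        self.head = nn.Linear(32, n_classes)

    def forward(self, x):          # x: batch x 1 x mels x frames
        return self.head(self.features(x).flatten(1))
```

The appeal of treating calls as images is that the network learns the shape of the whole call sequence rather than relying on a handful of hand-measured call parameters.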

Metric comparison — Precision, Recall, F1

higher is better

PRECISION · BioSonic 98.3% vs Kaleidoscope 75.7% (+22.6 pt)

RECALL · BioSonic 99.6% vs Kaleidoscope 91.7% (+7.9 pt)

F1 SCORE · BioSonic 98.9% vs Kaleidoscope 83.0% (+15.9 pt)

BIOSONIC · Confusion Matrix · Accuracy 99.7%

                 Predicted bat             Predicted no bat
Actual bat       730 (true positives)      3 (false negatives)
Actual no bat    13 (false positives)      4,579 (true negatives)

KALEIDOSCOPE PRO · Confusion Matrix · Accuracy 95.8%

                 Predicted bat             Predicted no bat
Actual bat       543 (true positives)      49 (false negatives)
Actual no bat    174 (false positives)     4,503 (true negatives)
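The precision, recall, and F1 figures above follow directly from these confusion-matrix counts; a quick sketch that recomputes them (only the standard metric formulas are added, all counts come from the benchmark):

```python
# Recompute precision, recall, and F1 from the confusion-matrix counts above.
def metrics(tp, fp, fn):
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

for name, (tp, fp, fn) in {
    "BioSonic": (730, 13, 3),
    "Kaleidoscope Pro": (543, 174, 49),
}.items():
    p, r, f1 = metrics(tp, fp, fn)
    print(f"{name}: precision {p:.1%}, recall {r:.1%}, F1 {f1:.1%}")

# BioSonic: precision 98.3%, recall 99.6%, F1 98.9%
# Kaleidoscope Pro: precision 75.7%, recall 91.7%, F1 83.0%
```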

FALSE POSITIVES (wrong "bat" calls)

Cost: investigator time wasted reviewing non-bat audio, inflated species counts

BioSonic 13 · Kaleidoscope 174 (13.4× more)

FALSE NEGATIVES (missed bats)

Cost: protected species overlooked, incomplete EIA, compliance risk

BioSonic 3 · Kaleidoscope 49 (16.3× more)

Same data, very different outcomes. BioSonic delivers 98.9% F1 with only 16 errors across 5,325 predictions. Kaleidoscope Pro misses over 8% of true bats and produces 13× more false positives.

BENCHMARK 02 · SOUTHERN DENMARK

Manual vs BioSonic.

Detector ranking

Identical

D4 > D5 > D2 > D1 > D3 — same order whether counted by hand or by AI. Same ecological conclusion about which sites had the most activity.

Detections per detector

same order, slightly higher counts

Detector    Manual (JD)    BioSonic
D4          23,527         25,514
D5          2,370          3,505
D2          1,156          1,688
D1          391            461
D3          94             118
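The "identical ranking" claim is easy to verify from the published counts; a minimal sketch:

```python
# Check that manual (JD) and BioSonic counts rank the detectors the same way.
manual   = {"D4": 23_527, "D5": 2_370, "D2": 1_156, "D1": 391, "D3": 94}
biosonic = {"D4": 25_514, "D5": 3_505, "D2": 1_688, "D1": 461, "D3": 118}

rank = lambda counts: sorted(counts, key=counts.get, reverse=True)
print(rank(manual))                    # ['D4', 'D5', 'D2', 'D1', 'D3']
print(rank(manual) == rank(biosonic))  # True: same ecological conclusion
```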

11 → 100

NATTERER'S BAT CALLS

Jan found 11. BioSonic found 100.

Going through tens of thousands of files by hand, it's easy to miss a few. On review, every one of BioSonic's 100 Natterer's bat calls was validated as correct. That's the payoff of AI on large datasets — rare species don't slip through.

Species breakdown — Common & Rare species comparison

Common Species

BioSonic detects more Pipistrel (+203%) and Trold (+16%)

Species      Manual     BioSonic
Dværg        13,556     10,669
Trold        11,433     13,248
Pipistrel    2,257      6,834

Rare & Protected Species

Separate scale — these are the ones that matter for Annex II compliance

Species                    Manual    BioSonic
Vand                       112       221
Frynse (Natterer's bat)    11        100
Dam                        0         0
Brun                       40        41
Syd                        55        67

Weather integration

Using BioSonic's automatic weather integration, Jan also found interesting patterns in bat behaviour relative to wind speed and rain.

Bat activity vs wind speed
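BioSonic does this join automatically; for anyone who wants to reproduce the analysis on exported detections, here is a rough sketch under assumed file and column names (they are illustrative, not BioSonic's schema):

```python
# Join hourly call counts to weather records for activity-vs-wind analysis.
# "detections.csv", "weather.csv", and their column names are assumptions.
import pandas as pd

detections = pd.read_csv("detections.csv", parse_dates=["timestamp"])  # one row per call
weather = pd.read_csv("weather.csv", parse_dates=["timestamp"])        # hourly wind / rain

# Count calls per hour, then attach the nearest hourly weather record.
hourly = (detections.set_index("timestamp")
          .resample("1h").size()
          .rename("calls").reset_index())
merged = pd.merge_asof(hourly.sort_values("timestamp"),
                       weather.sort_values("timestamp"),
                       on="timestamp", direction="nearest")

# Mean activity per wind-speed band (0-2, 2-4, ... m/s).
bands = pd.cut(merged["wind_speed_ms"], bins=range(0, 14, 2))
print(merged.groupby(bands, observed=True)["calls"].mean())
```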

And the human cost

2 TB of bat data, BioSonic vs Kaleidoscope.

According to a consultancy in Southern England

KALEIDOSCOPE WORKFLOW · 490 hrs · 12 weeks to client

vs

BIOSONIC WORKFLOW · 163 hrs · 4 weeks to client

✓ 327 hours back · £21,255 saved at a standard UK consultant rate (£65/hour)

Watch

BioSonic on BatAbility Club with Neil Middleton.

Josef Carlson

Run it on your own data

A 30-minute demo.
Bring a WAV, leave with the report.

josef.carlson@biosonic.se · +46 72 744 65 85