Automatic bat species identification uses machine learning to classify echolocation calls from field recordings to species level, replacing hours of manual sonogram review with results delivered in minutes.

How automatic bat species identification works

Every bat species produces echolocation calls with characteristic frequency, duration, and shape parameters. These acoustic signatures are the basis for species identification from ultrasonic recordings. The automatic identification pipeline follows four stages:

  1. Recording. Bat detectors capture ultrasonic audio in the field, either as full-spectrum WAV files (which preserve the complete acoustic signal) or as zero-crossing files (which retain frequency information but discard amplitude). Full-spectrum recordings are essential for modern deep learning classifiers because they contain the spectral detail these models rely on.
  2. Spectrogram generation. The raw audio is converted into a spectrogram — a visual representation showing frequency on the vertical axis, time on the horizontal axis, and signal intensity as colour or brightness. Each bat call appears as a distinct shape: the steep FM sweep of a Myotis, the quasi-constant-frequency tail of a pipistrelle, or the alternating FM-CF pattern of a Greater Horseshoe bat.
  3. Feature extraction. Traditional classifiers measure specific call parameters from the spectrogram: peak frequency, minimum and maximum frequency, call duration, inter-pulse interval, and bandwidth. Deep learning classifiers skip this step by learning features directly from the spectrogram image, capturing patterns that hand-crafted parameter sets may miss.
  4. Classification. The extracted features or spectrogram images are fed to a trained model that assigns a species label to each call or call sequence. The model's output typically includes a confidence score, allowing analysts to focus manual review on low-confidence identifications.

The quality of each stage affects the final result, but classification accuracy depends most heavily on the model architecture and the training data it was built on.

Why accuracy matters

Automatic species identification is not an academic exercise. The results feed directly into regulatory decisions, environmental impact assessments, and operational curtailment schedules for wind farms. Misclassifications have tangible consequences.

Consider a wind farm environmental impact assessment in the UK. If a classifier fails to identify Barbastella barbastellus (Barbastelle) — a species listed in Annex II of the Habitats Directive — the EIA will understate the site's conservation significance, potentially leading to inadequate mitigation. Conversely, if the classifier generates excessive false positives for a rare species, the developer may face unnecessary and costly curtailment requirements.

In pre-construction surveys, the species list determines the mitigation strategy. In post-construction monitoring, species identification data drives curtailment algorithms: if high-risk species are active, turbines may need to be shut down during specific conditions. A classifier that confuses common pipistrelles (low conservation concern in most of Europe) with Nathusius' pipistrelles (a migratory species with higher collision risk) will produce curtailment recommendations that are either too aggressive or dangerously lax.

The difference between 83% F1 and 98.9% F1 is not a marginal improvement. At 83%, roughly one in six classifications is wrong. At 98.9%, fewer than one in ninety is wrong. For a dataset of 10,000 bat calls, that is the difference between 1,700 errors and 110.

BioSonic's approach to automatic identification

BioSonic's classifier is built on convolutional neural networks (CNNs) that analyse spectrogram images directly. Rather than reducing each call to a handful of measured parameters, the model processes the full two-dimensional spectral representation, learning to recognise species from the complete shape, harmonic structure, and temporal pattern of each call.

The model is trained on 2.5 million verified bat calls and 3.5 million noise files. This training dataset is, to our knowledge, the largest curated collection of expert-verified bat echolocation recordings used for classifier development. The noise dataset is equally important: by training on millions of non-bat sounds — wind, rain, insects, electromagnetic interference, mechanical vibration — the model learns what is not a bat, producing 13.4 times fewer false positives than competing classifiers.

Several design decisions contribute to BioSonic's 98.9% F1 performance:

Accuracy comparison

Published and independently verified accuracy metrics for the major bat classifiers currently in use:

These figures should be interpreted carefully. F1 scores are dataset-dependent, and a classifier may perform differently on recordings from different detectors, habitats, or geographic regions. The most informative comparison is one conducted on your own data, which is why BioSonic offers a free trial that lets you process your own recordings and evaluate the results directly. For a full feature-by-feature comparison, see the comparison pages.

Species coverage

BioSonic's classifier covers the European bat fauna, including species that are notoriously difficult to separate acoustically. The Myotis genus is a particular challenge in bat acoustics: species such as M. mystacinus (Whiskered Bat), M. brandtii (Brandt's Bat), and M. alcathoe (Alcathoe Bat) produce calls with heavily overlapping frequency and duration parameters. Parametric classifiers frequently fail on these species groups because the measured call parameters fall within shared ranges.

BioSonic's deep learning approach handles Myotis separation better than parametric methods because it identifies subtle spectral features — harmonic intensity ratios, fine-scale frequency modulation patterns, pulse shape asymmetries — that are not captured by standard call measurements. While no classifier can separate every Myotis call with certainty (some calls are genuinely ambiguous even to expert human analysts), the spectrogram-based approach significantly reduces the error rate for these cryptic species pairs.

Coverage extends to rarer species that other classifiers underperform on due to limited training data. BioSonic's 2.5 million call training dataset includes verified recordings of less common species, ensuring that the classifier does not simply default to the most probable species when encountering a call it has seen fewer times during training.

Beyond species: behaviour classification

Species identification answers "what is here?" but ecological assessment often requires knowing "what is it doing?" BioSonic classifies bat behaviour as well as species, distinguishing between three primary call types:

This behavioural data adds a dimension to survey results that species lists alone cannot provide. For wind farm assessments, evidence that bats are actively foraging at nacelle height — rather than simply commuting through the area — strengthens the ecological case for targeted curtailment.

How to evaluate a bat classifier

If you are selecting a bat identification tool for professional survey work, these are the metrics and questions that matter:

For detailed benchmark methodology and per-species results, see the BioSonic benchmarks page. For guidance on choosing between specific tools, see the comparison pages and the FAQ.