HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Journal articles

Swarm v3: towards tera-scale amplicon clustering

Abstract : Motivation: Previously we presented swarm, an open-source amplicon clustering program that produces fine-scale molecular operational taxonomic units (OTUs) that are free of arbitrary global clustering thresholds. Here we present swarm v3 to address issues of contemporary datasets that are growing towards tera-byte sizes. Results: When compared to previous swarm versions, swarm v3 has modernized C ++ source code, reduced memory footprint by up to 50%, optimized CPU-usage and multithreading (more than 7 times faster with default parameters), and it has been extensively tested for its robustness and logic. Availability: Source code and binaries are available at https://github.com/torognes/swarm Supplementary information: Supplementary data are available at Bioinformatics online.
Document type :
Journal articles
Complete list of metadata

Contributor : Gestionnaire Hal-Su Connect in order to contact the contributor
Submitted on : Monday, July 12, 2021 - 1:07:03 PM
Last modification on : Friday, May 20, 2022 - 9:04:21 AM
Long-term archiving on: : Wednesday, October 13, 2021 - 6:53:12 PM


Publication funded by an institution


Distributed under a Creative Commons Attribution 4.0 International License



Frédéric Mahé, Lucas Czech, Alexandros Stamatakis, Christopher Quince, Colomban de Vargas, et al.. Swarm v3: towards tera-scale amplicon clustering. Bioinformatics, Oxford University Press (OUP), 2022, 38 (1), pp.267-269. ⟨10.1093/bioinformatics/btab493⟩. ⟨hal-03284105⟩



Record views


Files downloads