Skip to content
Snippets Groups Projects
Commit d9b562e8 authored by arq5x's avatar arq5x
Browse files

[DOC] add speed comparo to docs

parent 526119e7
No related branches found
No related tags found
No related merge requests found
docs/content/images/memory-comparo.png

46.3 KiB

docs/content/images/speed-comparo.png

45.5 KiB

......@@ -32,6 +32,44 @@ Table of contents
content/related-tools
=================
Performance
=================
As of version 2.18, ``bedtools`` is substantially more scalable thanks to improvements we have made in the algorithm used to process datasets that are pre-sorted
by chromosome and start position. As you can see in the plots below, the speed and memory consumption scale nicely
with sorted data as compared to the poor scaling for unsorted data. The current version of bedtools intersect is as fast as (or slightly faster) than the ``bedops`` package's ``bedmap`` which uses a similar algorithm for sorted data. The plots below represent counting the number of intersecting alignments from exome capture BAM files against CCDS exons.
The alignments have been converted to BED to facilitate comparisons to ``bedops``. We compare to the bedmap ``--ec`` option because similar error checking is enforced by ``bedtools``.
.. image:: content/images/speed-comparo.png
:width: 300pt
.. image:: content/images/memory-comparo.png
:width: 300pt
Commands used:
.. code-block:: bash
# bedtools unsorted
$ bedtools intersect \
-a ccds.exons.bed -b aln.bam.bed \
-c
# bedtools sorted
$ bedtools intersect \
-a ccds.exons.bed -b aln.bam.bed \
-c \
-sorted
# bedmap (no error checking)
$ bedmap --echo --count --bp-ovr 1 \
ccds.exons.bed aln.bam.bed
# bedmap (no error checking)
$ bedmap --ec --echo --count --bp-ovr 1 \
ccds.exons.bed aln.bam.bed
=================
Brief example
=================
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment