Analysis/Figure: protein clustering
Cluster all proteins, i.e. from all assemblies, together. Generate summary files and plots showing number of "shared" and unique proteins.
Can potentially replace pairwise assembly comparisons with cdhit
and diamond
.