Compare gene sequence libraries
An R function and example data for comparing gene sequence libraries using Cramér-von Mises statistics, based on the method developed by Singleton, D. R., M. A. Furlong, S. L. Rathbun, and W. B. Whitman. 2001. Quantitative comparisons of 16S rRNA gene sequence libraries from environmental samples. Appl. Environ. Microbiol. 67:4374-4376.
To compare two sequence libraries (X and Y) with the R function genelib.comp, pass four arguments in this order:
genelib.comp(Total number of sequences in X and Y, Number of sequences in X, Number of randomizations, Aligned sequence data in fasta format)
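The first argument can be checked against the input file rather than counted by hand. The following is a minimal sketch in base R, not part of genelib.comp itself: the helper names count_fasta_records and fasta_sequence_lengths are hypothetical, the demo file is made up for illustration, and the number of sequences in X is assumed to be known from how the combined file was assembled.

```r
# Hypothetical helper: count sequences by counting FASTA header lines (">").
count_fasta_records <- function(path) {
  lines <- readLines(path, warn = FALSE)
  sum(startsWith(lines, ">"))
}

# Hypothetical helper: total residue count per record, to confirm that the
# sequences are aligned (all records should have the same length).
fasta_sequence_lengths <- function(path) {
  lines <- readLines(path, warn = FALSE)
  lines <- lines[nzchar(lines)]            # drop blank lines
  is_header <- startsWith(lines, ">")
  record_id <- cumsum(is_header)           # record index for every line
  lens <- tapply(nchar(lines[!is_header]), record_id[!is_header], sum)
  as.integer(lens)
}

# Made-up demo file with three aligned sequences (illustration only).
demo <- tempfile(fileext = ".fasta")
writeLines(c(">A1", "ACGT-ACGT",
             ">A2", "ACGTTACGT",
             ">B1", "ACGA-ACGT"), demo)

n_total <- count_fasta_records(demo)       # first argument to genelib.comp
n_x     <- 2                               # assumed known: sequences in library X
lens    <- fasta_sequence_lengths(demo)

stopifnot(n_x < n_total)                   # X must be a proper subset of the total
stopifnot(length(unique(lens)) == 1)       # all sequences have equal aligned length

# With genelib.comp loaded, the call would then be:
# genelib.comp(n_total, n_x, 1000, demo)
```

Checking the counts and alignment lengths before the call catches a mismatched file early, before any randomizations are run.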
Example: compare the 46 sequences in library A with the 51 sequences in library B (97 sequences in total), using the aligned data in T0_AvB.fasta and 1000 randomizations:
genelib.comp(97,46,1000,"T0_AvB.fasta")
Output:
Delta-Cxy = 3.3403 : p-value = 0.001
Delta-Cyx = 1.0027 : p-value = 0.013
Plot: