Cd-hit
Citations
3,443 citations
3,147 citations
2,732 citations
2,454 citations
Cites methods from "Cd-hit"
...For PHASTER, we reduced the size of the bacterial sequence database by removing sequences with >70% sequence identity to any other sequence in the database, using CD-HIT (18)....
[...]
2,295 citations
Cites background from "Cd-hit"
...6 (Li and Godzik 2006; Fu et al. 2012) clustering software (with 99% similarity) to correct for potential advantage of more redundant assemblies and to retain only the longest predicted gene in a cluster....
[...]
References
17,301 citations
Additional excerpts
...221) from (Edgar, 2010)....
[...]
...1.221) from Edgar (2010)....
[...]
9,268 citations
"Cd-hit" refers background in this paper
...Different computation data buffers are allocated for different threads....
[...]
8,306 citations
"Cd-hit" refers background or methods in this paper
..., 2001) and was then extended to support clustering nucleotide sequences (Li and Godzik, 2006)....
[...]
...…computational time and noise interference in some analysis methods, etc. CD-HIT was originally developed to cluster protein sequences to create reference databases with reduced redundancy (Li et al., 2001) and was then extended to support clustering nucleotide sequences (Li and Godzik, 2006)....
[...]
6,970 citations
1,155 citations