sci-biology/cd-hit – Gentoo Packages

Version 4.8.1 is available upstream. Please consider updating!
It seems that version 4.8.1 is available upstream, while the latest version in the Gentoo tree is 4.6.6-r1.
You think this warning is false? Read more about it here.

Available Versions

Version	amd64	x86	alpha	arm	arm64	hppa	mips	ppc	ppc64	riscv	sparc
4.6.6-r1 : 0 EAPI 8	~amd64	~x86	?alpha	?arm	?arm64	?hppa	?mips	?ppc	?ppc64	?riscv	?sparc

Package Metadata

Upstream
Remote-Id https://code.google.com/archive/p/cdhit/
https://github.com/weizhongli/cdhit
Full description
CD-HIT is a very widely used program for clustering and comparing large sets of protein or nucleotide sequences. CD-HIT is very fast and can handle extremely large databases. CD-HIT helps to significantly reduce the computational and manual efforts in many sequence analysis tasks and aids in understanding the data structure and correct the bias within a dataset. The CD-HIT package has CD-HIT, CD-HIT-2D, CD-HIT-EST, CD-HIT-EST-2D, CD-HIT-454, CD-HIT-PARA, PSI-CD-HIT and over a dozen scripts. CD-HIT (CD-HIT-EST) clusters similar proteins (DNAs) into clusters that meet a user-defined similarity threshold. CD-HIT-2D (CD-HIT-EST-2D) compares 2 datasets and identifies the sequences in db2 that are similar to db1 above a threshold. CD-HIT-454 is a program to identify natural and artificial duplicates from pyrosequencing reads. The usage of other programs and scripts can be found in CD-HIT user's guide.
USE flags
Global Use Flags
- openmp
License
GPL-2
Maintainer(s)
Gentoo Biology Project

External Resources

Git repository browser

Codeberg repository browser

Git log (short)

Changes Feed

Remote-Id	https://code.google.com/archive/p/cdhit/
	https://github.com/weizhongli/cdhit