GeneViewer is very simple program for visualising the locations of various genomic features (SNPs, exons etc.)along a sequence. The user enters start and stop coordinates of the region they’re interested in, followed by any number of files. Each file must contain 2 columns of data (region start & stop) with any number of rows
sgp2 is a program to predict genes by comparing anonymous genomic sequences from two different species. It combines tblastx, a sequence similarity search program, with geneid, an “ab initio” gene prediction program. In “assymetric” mode, genes are predicted in one sequence from one species (the target sequence), using a set of sequences (maybe only one) from the other species (the reference set). Essentially, geneid is used to predict all potential exons along the target sequence. Scores of exons are computed as log-likelihood ratios, function of the splice sites defining the exon, the coding bias in composition of the exon sequence as measured by a Markov Model of order five, and of the optimal alignment at the amino acid level between the target exon sequence and the counterpart homologous sequence in the reference set. From the set of predicted exons, the gene structure is assembled (eventually multiple genes in both strands) maximizing the sum of the scores of the assembled exons.
geneid is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. In the first step, splice sites, start and stop codons are predicted and scored along the sequence using Position Weight Arrays (PWAs). In the second step, exons are built from the sites. Exons are scored as the sum of the scores of the defining sites, plus the the log-likelihood ratio of a Markov Model for coding DNA. Finally, from the set of predicted exons, the gene structure is assembled, maximizing the sum of the scores of the assembled exons. geneid offers some type of support to integrate predictions from multiple source via external gff files and the redefinition of the general gene structure or model is also feasible. The accuracy of geneid compares favorably to that of other existing tools, but geneid is likely more efficient in terms of speed and memory usage.
MGEnrichment is a web application developed both to disseminate to the community our curated database of microglia-relevant gene lists, and to allow non-programming scientists to easily conduct statistical enrichment analysis on their gene expression data.
Varclus is a perl utility to identify clusters of amino acid or nucleotide changes in a sequence that are too tightly spaced to have occurred by chance alone.
CoRe is an R package implementing existing and novel methods for the identification of core-fitness genes (at two different level of stringency) from joint analyses of multiple CRISPR-Cas9 screens.
MaxTiC (Maximum Time Consistency) is a software which takes as input a species tree and a series of (possibly inconsistent) time constraints between its internal nodes, weighted by confidence scores.
Luhmann N, Lafond M, Thevenin A, Ouangraoua A, Wittler R, Chauve C. The SCJ Small Parsimony Problem for Weighted Gene Adjacencies.
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1364-1373. doi: 10.1109/TCBB.2017.2661761. Epub 2017 Jan 31. PMID: 28166504.
AnGST performs a phylogenetic reconciliation between a given species and gene tree, positing the best scoring set of HGT, DUP, and LOS events to explain all topological incongruities between the two trees. In order to protect against phylogenetic noise, we have designed AnGST to be capable of incorporating information from dozens of bootstrap trees simultaneously. In cases of variations among bootstrap subtrees, the subtree that best accords with the reference tree is adopted.