RenBio 0.7d – Identify Gene and Protein Name in Textual Document

RenBio 0.7d

:: DESCRIPTION

RenBio is a program to identify gene and protein names in a textual document based on machine learning techniques.RenBio searches for named entities in a document according to a decision tree. The attributes of the tree nodes may be regex matches, dictionary matches or signa words.

::DEVELOPER

Robert Bossy <Robert.Bossy@jouy.inra.fr>

:: SCREENSHOTS

N/A

:: REQUIREMENTS

  • Linux

:: DOWNLOAD

  RenBio

:: MORE INFORMATION

IsotopeCalculator 2 – Compute Isotopic Distributions for Large Proteins

IsotopeCalculator 2

:: DESCRIPTION

IsotopeCalculator is a  memory efficient algorithms for accurately calculating the isotopic fine structures of molecules. Treating individual isotopic species of a molecule as different mass states, we introduce the concept of transitions between mass states and represent all mass states of the molecule in a hierarchical structure. We show that there exists a simple relationship between two different mass states at two different levels of the hierarchical structure. This allows us to efficiently and accurately compute both the mass and the abundance of every mass state of a small to medium-sized molecule, whose gross structures include small number of fine structures. A truncated calculation of this algorithm can be applied to calculate a majority of isotopic species (99.99% of cumulative abundance) of a large molecule.

::DEVELOPER

Pengyu Hong

:: SCREENSHOTS

N/A

:: REQUIREMENTS

  • Windows/Linux/MacOsx

:: DOWNLOAD

 IsotopeCalculator

:: MORE INFORMATION

Citation

Li L., Karabacaka N. M., Cobba J. S., Wang Q., Hong P*, Agara J.N.*.
Memory Efficient Calculation of the Isotopic Mass States of a Molecule.
Rapid Commun Mass Spectrom. 2010 Sep;24(18):2689-96.

imCellPhen Alpha – Interactive Mining of Cellular Phenotypes

imCellPhen Alpha

:: DESCRIPTION

imCellPhen (Interactive mining of cellular phenotypes) is an innovative computing paradigm that uses intelligent human-computer interfaces to facilitate the application of the HCS technology in biomedical research. It’s a a new computing paradigm that combines unsupervised pattern mining techniques, P-VDE interfaces, and CBIR-RF techniques to boost the exploitation capacity of the HCS technology and facilitate its application to biomedical research.

::DEVELOPER

Pengyu Hong

:: SCREENSHOTS

:: REQUIREMENTS

:: DOWNLOAD

 imCellPhen

:: MORE INFORMATION

Citation

Hong, P. (2006).
Interactive Analysis of High-Content Cellular Images via Relevant Feedback.
2006 Workshop on Multiscale Biological Imaging, Data Mining and Informatics, Santa Barbara, CA, USA.

GeneNotes 1.0 – FireFox Extension of Collecting and Managing Biological Information

GeneNotes 1.0

:: DESCRIPTION

GeneNotes extension is a gene-oriented tool and is developed to help biologists collect and manage a variety of biological information as notes from the Internet. It greatly helps biologists during the decision making processes in large-scale functional genomics studies.

::DEVELOPER

Pengyu Hong

:: SCREENSHOTS

:: REQUIREMENTS

:: DOWNLOAD

 GeneNotes

:: MORE INFORMATION

Citation

BMC Bioinformatics. 2005 Feb 1;6:20.
GeneNotes–a novel information management software for biologists.
Hong P, Wong WH.

TFdiff 0.4 – Detection of Differential Gene Expression Factors

TFdiff 0.4

:: DESCRIPTION

TFdiff  identifies the context-dependent transcription factor binding sites (TFBSs) interactions that may yield an explanation why the expression of genes is modified in different directions given a particular condition.

:: DEVELOPER

Pieter De Bleser (VIB & Ghent University) & Bart Hooghe (VIB)

:: SCREENSHOTS

N/A

:: REQUIREMENTS

:: DOWNLOAD

 TFdiff

:: MORE INFORMATION

Citation:

Genome Biol. 2007;8(5):R83.
A distance difference matrix approach to identifying transcription factors that regulate differential gene expression.
De Bleser P, Hooghe B, Vlieghe D, van Roy F.

Repbase Submitter 1.1.115 – Format & Annotate Repbase Entries

Repbase Submitter 1.1.115

:: DESCRIPTION

RepbaseSubmitteris a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position.

:: DEVELOPER

Genetic Information Research Institute

:: SCREENSHOTS

:: REQUIREMENTS

  • Linux / Windows / MacOsX
  • Java

:: DOWNLOAD

  Repbase Submitter

:: MORE INFORMATION

Citation:

Kohany O, Gentles AJ, Hankus L, Jurka J
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.
BMC Bioinformatics, 2006 Oct 25;7:474

DASS-GUI 1.4 – Pattern Search in Non-sequential Data

DASS-GUI 1.4

:: DESCRIPTION

DASS-GUI is a stand-alone program written in C++ that calculates all significant closed sets* of a given dataset containing the host sets. Some of the used algorithms are taken from Hollunder et al. (2007). DASS-GUI also allows additional analyses of the identified closed sets:  filtering, handling of synonymous names, enrichment analyses, calculation of means and standard deviations of different numerical features, extraction of the underlying closed set hierarchy and corresponding export as GML file, as well as comparison (validation) with pre-defined sets.

::DEVELOPER

J.Hollunder and T.Wilhelm

:: SCREENSHOTS

:: REQUIREMENTS

:: DOWNLOAD

  DASS-GUI

:: MORE INFORMATION

Citation:

Hollunder, J., * Friedel, M., Kuiper, M., Wilhelm, T. (2010)
DASS-GUI: a user interface for identification and analysis of significant patterns in non-sequential data.
Bioinformatics 26(7), 987-9.

Validate GTF 1.0 – Check GTF file for Correctness

Validate GTF 1.0

:: DESCRIPTION

Validate GTF is a flexible Perl script that checks a GTF file for correctness. It can detect most common syntactic errors, such as including the stop codon within the CDS annotation. It can also detect semantic errors, such as annotated coding sequence that contains stop codons spanning splice sites.

::DEVELOPER

The Brent Lab

:: SCREENSHOTS

N/A

:: REQUIREMENTS

:: DOWNLOAD

  Validate GTF

:: MORE INFORMATION

parredHMMlib 1.0 – Library for Hardware Accelerated HMM Parallelizing Analysis

parredHMMlib 1.0

:: DESCRIPTION

parredHMMlib is a C++ library implementing the parredForward and parredViterbi algorithms for multi-core CPUs, parallelizing analysis of hidden Markov models with small state spaces.

::DEVELOPER

Andreas Sand.

:: SCREENSHOTS

N/A

:: REQUIREMENTS

:: DOWNLOAD

 parredHMMlib

:: MORE INFORMATION

Citation

Nielsen, J.; Sand, A.;
Algorithms for a Parallel Implementation of Hidden Markov Models with a Small State Space
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium