FaBox is a collection of simple and intuitive web services that enable biologists and medical researchers to quickly perform typical task with sequence data. The services makes it easy to extract, edit, and replace sequence headers and join or divide data sets based on header information. Other services include collapsing a set of sequences into haplotypes and automated formatting of input files for a number of population genetics programs, such as ARLEQUIN , TCS and MRBAYES . The toolbox is expected to grow on the basis of requests for particular services and converters in the future.
The Validate Fasta File utility is a Windows command-line application that will parse a Fasta file and return the number of proteins and number of residues in the file. Additionally, it will check the validity of the fasta file looking for common, known problems.
Faster2 is an extensible C++11 framework and program for efficient access and extraction of DNA sequences from FASTA and FASTQ files. It works with large file collections of raw as well as compressed data, and is based on the set of filters that can be organized into a pipeline. Faster2 performs input data indexing in order to accelerate all supported operations. It can be easily customized and extended with new filters, and its pipeline building sub-system can be incorporated into other tools. Faster2 is not a database system nor a data analytics tool. Its sole purpose is to simplify tedious operations that are part of everyday tasks performed routinely by bioinformaticians and computational biologists, and yet often require writing specialized text-processing scripts. +
FASTA programs find regions of local or global (new) similarity between Protein or DNA sequences, either by searching Protein or DNA databases, or by identifying local duplications within a sequence. Other programs provide information on the statistical significance of an alignment. Like BLAST, FASTA can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
MFCompress is a compression tool for FASTA and multi-FASTA files. In comparison to gzip and applied to multi-FASTA files, MFCompress can provide additional average compression gains of almost 50%, i.e.,it potentially doubles the available storage, although at the cost of some more computation time.
The FASTX-Toolkit is a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing.The main processing of such FASTA/FASTQ files is mapping (aka aligning) the sequences to reference genomes or other databases using specialized programs.However,It is sometimes more productive to preprocess the FASTA/FASTQ files before mapping the sequences to the genome – manipulating the sequences to produce better mapping results.The FASTX-Toolkit tools perform some of these preprocessing tasks.
GWFASTA (Genome Wise Sequence Similarity Search using FASTA) allows user to search their sequence against sequenced genomes and their product proteome. This integrate various tools which allows analysys of FASTA search