Applications in EMBOSS (Release 2.9.0)

ALIGNMENT CONSENSUS

Program name    Description
cons            Creates a consensus from multiple alignments
megamerger      Merge two large overlapping nucleic acid sequences
merger          Merge two overlapping nucleic acid sequences

ALIGNMENT DIFFERENCES

Program name    Description
diffseq         Find differences between nearly identical sequences

ALIGNMENT DOT PLOTS

Program name    Description
dotmatcher      Displays a thresholded dotplot of two sequences
dotpath         Displays a non-overlapping wordmatch dotplot of two sequences
dottup          Displays a wordmatch dotplot of two sequences
polydot         Displays all-against-all dotplots of a set of sequences

ALIGNMENT GLOBAL

Program name    Description
est2genome      Align EST and genomic DNA sequences
needle          Needleman-Wunsch global alignment
stretcher       Finds the best global alignment between two sequences
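
As an illustration of the Needleman-Wunsch method named in the needle entry, here is a minimal Python sketch of the dynamic-programming score calculation. It is not EMBOSS code: the function name and the scoring values are hypothetical, and it assumes a simple linear gap penalty rather than the scoring matrix and gap open/extend penalties a production aligner would typically use.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the optimal global alignment score of strings a and b.

    Hypothetical helper for illustration only; uses a linear gap penalty.
    """
    # Row 0: aligning an empty prefix of a against b[:j] costs j gaps.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        # Column 0: aligning a[:i] against an empty prefix of b costs i gaps.
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            # Diagonal move: align a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Vertical/horizontal moves: insert a gap in one sequence.
            up = prev[j] + gap
            left = curr[j - 1] + gap
            curr.append(max(diag, up, left))
        prev = curr
    # The bottom-right cell holds the score of the best full-length alignment.
    return prev[-1]


if __name__ == "__main__":
    print(needleman_wunsch_score("ACGT", "AGT"))
```

Only two rows of the matrix are kept because the sketch reports the score alone; recovering the alignment itself would require the full matrix and a traceback.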

ALIGNMENT LOCAL

Program name    Description
matcher         Finds the best local alignments between two sequences
seqmatchall     Does an all-against-all comparison of a set of sequences
supermatcher    Finds a match of a large sequence against one or more sequences
water           Smith-Waterman local alignment
wordmatch       Finds all exact matches of a given size between 2 sequences

ALIGNMENT MULTIPLE

Program name    Description
emma            Multiple alignment program - interface to ClustalW program
infoalign       Information on a multiple sequence alignment
plotcon         Plots the quality of conservation of a sequence alignment
prettyplot      Displays aligned sequences, with colouring and boxing
showalign       Displays a multiple sequence alignment
tranalign       Align nucleic coding regions given the aligned proteins

DISPLAY

Program name    Description
abiview         Reads an ABI file and displays the trace
cirdna          Draws circular maps of DNA constructs
lindna          Draws linear maps of DNA constructs
pepnet          Displays proteins as a helical net
pepwheel        Shows protein sequences as helices
prettyplot      Displays aligned sequences, with colouring and boxing
prettyseq       Output sequence with translated ranges
remap           Display a sequence with restriction cut sites, translation etc
seealso         Finds programs sharing group names
showalign       Displays a multiple sequence alignment
showdb          Displays information on the currently available databases
showfeat        Show features of a sequence
showseq         Display a sequence with features, translation etc
sixpack         Display a DNA sequence with 6-frame translation and ORFs
textsearch      Search sequence documentation text. SRS and Entrez are faster!

EDIT

Program name    Description
biosed          Replace or delete sequence sections
cutseq          Removes a specified section from a sequence
degapseq        Removes gap characters from sequences
descseq         Alter the name or description of a sequence
entret          Reads and writes (returns) flatfile entries
extractfeat     Extract features from a sequence
extractseq      Extract regions from a sequence
listor          Writes a list file of the logical OR of two sets of sequences
maskfeat        Mask off features of a sequence
maskseq         Mask off regions of a sequence
newseq          Type in a short new sequence
noreturn        Removes carriage return from ASCII files
notseq          Excludes a set of sequences and writes out the remaining ones
nthseq          Writes one sequence from a multiple set of sequences
pasteseq        Insert one sequence into another
revseq          Reverse and complement a sequence
seqret          Reads and writes (returns) sequences
seqretsplit     Reads and writes (returns) sequences in individual files
skipseq         Reads and writes (returns) sequences, skipping the first few
splitter        Split a sequence into (overlapping) smaller sequences
trimest         Trim poly-A tails off EST sequences
trimseq         Trim ambiguous bits off the ends of sequences
union           Reads sequence fragments and builds one sequence
vectorstrip     Strips out DNA between a pair of vector sequences
yank            Reads a sequence range, appends the full USA to a list file

ENZYME KINETICS

Program name    Description
findkm          Find Km and Vmax for an enzyme reaction by a Hanes/Woolf plot
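
The Hanes/Woolf approach mentioned in the findkm entry rearranges the Michaelis-Menten equation to [S]/v = [S]/Vmax + Km/Vmax, so a straight-line fit of [S]/v against [S] has slope 1/Vmax and intercept Km/Vmax. The Python sketch below illustrates that calculation with an ordinary least-squares fit. It is not the findkm implementation: the function name is hypothetical and the data in the usage example are synthetic.

```python
def hanes_woolf(substrate, velocity):
    """Estimate (Km, Vmax) from paired [S] and v measurements.

    Hypothetical helper: fits [S]/v against [S] by ordinary least squares.
    """
    xs = list(substrate)
    ys = [s / v for s, v in zip(substrate, velocity)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Least-squares slope and intercept of the line y = slope * x + intercept.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope        # slope = 1 / Vmax
    km = intercept * vmax     # intercept = Km / Vmax
    return km, vmax


if __name__ == "__main__":
    # Synthetic, noise-free data generated from Km = 2 and Vmax = 10.
    s_values = [0.5, 1.0, 2.0, 4.0, 8.0]
    v_values = [10.0 * s / (2.0 + s) for s in s_values]
    print(hanes_woolf(s_values, v_values))
```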

FEATURE TABLES

Program name    Description
coderet         Extract CDS, mRNA and translations from feature tables
extractfeat     Extract features from a sequence
maskfeat        Mask off features of a sequence
showfeat        Show features of a sequence
twofeat         Finds neighbouring pairs of features in sequences

INFORMATION

Program name    Description
infoalign       Information on a multiple sequence alignment
infoseq         Displays some simple information about sequences
seealso         Finds programs sharing group names
showdb          Displays information on the currently available databases
textsearch      Search sequence documentation text. SRS and Entrez are faster!
tfm             Displays a program's help documentation manual
whichdb         Search all databases for an entry
wossname        Finds programs by keywords in their one-line documentation

NUCLEIC 2D STRUCTURE

Program name    Description
einverted       Finds DNA inverted repeats

NUCLEIC CODON USAGE

Program name    Description
cai             CAI codon adaptation index
chips           Codon usage statistics
codcmp          Codon usage table comparison
cusp            Create a codon usage table
syco            Synonymous codon usage Gribskov statistic plot

NUCLEIC COMPOSITION

Program name    Description
banana          Bending and curvature plot in B-DNA
btwisted        Calculates the twisting in a B-DNA sequence
chaos           Create a chaos game representation plot for a sequence
compseq         Counts the composition of dimer/trimer/etc words in a sequence
dan             Calculates DNA RNA/DNA melting temperature
freak           Residue/base frequency table or plot
isochore        Plots isochores in large DNA sequences
sirna           Finds siRNA duplexes in mRNA
wordcount       Counts words of a specified size in a DNA sequence

NUCLEIC CPG ISLANDS

Program name    Description
cpgplot         Plot CpG rich areas
cpgreport       Reports all CpG rich regions
geecee          Calculates the fractional GC content of nucleic acid sequences
newcpgreport    Report CpG rich areas
newcpgseek      Reports CpG rich regions
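
The fractional GC content reported by a tool such as geecee is, in its simplest form, the number of G and C bases divided by the sequence length. The Python sketch below shows that arithmetic. The function name is hypothetical, and treating ambiguity codes and gap characters as non-GC is an assumption of this sketch, not documented geecee behaviour.

```python
def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C (case-insensitive).

    Hypothetical helper: any character other than G or C, including
    ambiguity codes, counts towards the length but not towards GC.
    """
    seq = seq.upper()
    if not seq:
        return 0.0  # avoid division by zero on an empty sequence
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    print(gc_fraction("GGCCAATT"))
```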

NUCLEIC GENE FINDING

Program name    Description
getorf          Finds and extracts open reading frames (ORFs)
marscan         Finds MAR/SAR sites in nucleic sequences
plotorf         Plot potential open reading frames
showorf         Pretty output of DNA translations
sixpack         Display a DNA sequence with 6-frame translation and ORFs
syco            Synonymous codon usage Gribskov statistic plot
tcode           Fickett TESTCODE statistic to identify protein-coding DNA
wobble          Wobble base plot

NUCLEIC MOTIFS

Program name    Description
dreg            Regular expression search of a nucleotide sequence
fuzznuc         Nucleic acid pattern search
fuzztran        Protein pattern search after translation
marscan         Finds MAR/SAR sites in nucleic sequences

NUCLEIC MUTATION

Program name    Description
msbar           Mutate sequence beyond all recognition
shuffleseq      Shuffles a set of sequences maintaining composition

NUCLEIC PRIMERS

Program name    Description
eprimer3        Picks PCR primers and hybridization oligos
primersearch    Searches DNA sequences for matches with primer pairs
stssearch       Searches a DNA database for matches with a set of STS primers

NUCLEIC PROFILES

Program name    Description
profit          Scan a sequence or database with a matrix or profile
prophecy        Creates matrices/profiles from multiple alignments
prophet         Gapped alignment for profiles

NUCLEIC REPEATS

Program name    Description
einverted       Finds DNA inverted repeats
equicktandem    Finds tandem repeats
etandem         Looks for tandem repeats in a nucleotide sequence
palindrome      Looks for inverted repeats in a nucleotide sequence

NUCLEIC RESTRICTION

Program name    Description
recoder         Remove restriction sites but maintain the same translation
redata          Search REBASE for enzyme name, references, suppliers etc
remap           Display a sequence with restriction cut sites, translation etc
restover        Finds restriction enzymes that produce a specific overhang
restrict        Finds restriction enzyme cleavage sites
showseq         Display a sequence with features, translation etc
silent          Silent mutation restriction enzyme scan

NUCLEIC TRANSCRIPTION

Program name    Description
tfscan          Scans DNA sequences for transcription factors

NUCLEIC TRANSLATION

Program name    Description
backtranseq     Back translate a protein sequence
coderet         Extract CDS, mRNA and translations from feature tables
plotorf         Plot potential open reading frames
prettyseq       Output sequence with translated ranges
remap           Display a sequence with restriction cut sites, translation etc
showorf         Pretty output of DNA translations
showseq         Display a sequence with features, translation etc
sixpack         Display a DNA sequence with 6-frame translation and ORFs
transeq         Translate nucleic acid sequences

PHYLOGENY DISTANCE MATRIX

Program name    Description
distmat         Creates a distance matrix from multiple alignments

PROTEIN 2D STRUCTURE

Program name    Description
garnier         Predicts protein secondary structure
helixturnhelix  Report nucleic acid binding motifs
hmoment         Hydrophobic moment calculation
pepcoil         Predicts coiled coil regions
pepnet          Displays proteins as a helical net
pepwheel        Shows protein sequences as helices
tmap            Displays membrane spanning regions

PROTEIN COMPOSITION

Program name    Description
backtranseq     Back translate a protein sequence
charge          Protein charge plot
checktrans      Reports STOP codons and ORF statistics of a protein
compseq         Counts the composition of dimer/trimer/etc words in a sequence
emowse          Protein identification by mass spectrometry
freak           Residue/base frequency table or plot
iep             Calculates the isoelectric point of a protein
mwcontam        Shows molwts that match across a set of files
mwfilter        Filter noisy molwts from mass spec output
octanol         Displays protein hydropathy
pepinfo         Plots simple amino acid properties in parallel
pepstats        Protein statistics
pepwindow       Displays protein hydropathy
pepwindowall    Displays protein hydropathy of a set of sequences

PROTEIN MOTIFS

Program name    Description
antigenic       Finds antigenic sites in proteins
digest          Protein proteolytic enzyme or reagent cleavage digest
epestfind       Finds PEST motifs as potential proteolytic cleavage sites
fuzzpro         Protein pattern search
fuzztran        Protein pattern search after translation
helixturnhelix  Report nucleic acid binding motifs
oddcomp         Finds protein sequence regions with a biased composition
patmatdb        Search a protein sequence with a motif
patmatmotifs    Search a PROSITE motif database with a protein sequence
pepcoil         Predicts coiled coil regions
preg            Regular expression search of a protein sequence
pscan           Scans proteins using PRINTS
sigcleave       Reports protein signal cleavage sites

PROTEIN MUTATION

Program name    Description
msbar           Mutate sequence beyond all recognition
shuffleseq      Shuffles a set of sequences maintaining composition

PROTEIN PROFILES

Program name    Description
profit          Scan a sequence or database with a matrix or profile
prophecy        Creates matrices/profiles from multiple alignments
prophet         Gapped alignment for profiles

UTILS DATABASE CREATION

Program name    Description
aaindexextract  Extract data from AAINDEX
cutgextract     Extract data from CUTG
printsextract   Extract data from PRINTS
prosextract     Builds the PROSITE motif database for patmatmotifs to search
rebaseextract   Extract data from REBASE
tfextract       Extract data from TRANSFAC

UTILS DATABASE INDEXING

Program name    Description
dbiblast        Index a BLAST database
dbifasta        Index a fasta database
dbiflat         Index a flat file database
dbigcg          Index a GCG formatted database

UTILS MISC

Program name    Description
embossdata      Finds or fetches the data files read in by the EMBOSS programs
embossversion   Writes the current EMBOSS version number