Penn Bioinformatics Core
HOME | SERVICES | TOOLS | WORKSHOPS | RESOURCES | PEOPLE  

General Web Resources

This is not meant to be a complete list of all available resources. Rather, it is a list of resources that we have found useful. If you know of a resource that you find useful that is not on the list, please email the bioinformatics core and we will add it to the list.

Databases

DNA Note: GenBank and EMBL mirror data but each has its own strengths in terms of interfaces and tools.
GenBank The central repository in the United States for sequence based data. Many databases and tools are available here.
EMBL European Molecular Biology Laboratory sequence repository.
Protein
Swiss-Prot Many tools and resources for protein analyses.
PIR Protein Information Resource.
PROSITE A database of protein families and domains. It consists of biologically significant sites, patterns and profiles that help to reliably identify to which known protein family (if any) a new sequence belongs.
InterPro A useful resource for whole genome analysis and has already been used for the proteome analysis of a number of completely sequenced organisms including preliminary analyses of the mouse and human genomes.
PDB Repository for the processing and distribution of 3-D biological macromolecular structure data
Genomic
ENSEMBL Nice graphical displays of genomic assemblies (human, mouse and others)
UCSC Displays annotation on the assembled human and mouse genomes. Nice sequence based query tool and display.
The Human Genome Database International collaboration in support of the Human Genome Project
Human genome at NCBI GenBank view of the Human genome.
UCSC HGP Working Draft Public human and mouse genomic assemblies. Nice sequence based query tool with good visualization.
MGD at JAX Mouse genome database at Jackson Labs.
FlyBase Database for Drosophila genome and annotation
SGD Saccharomyces genome database
PlasmoDB Database for the Plasmodium genomic sequence. Developed here at Penn jointly by the laboratories of David Roos and Chris Stoeckert.
WormBase Genomic sequence and annotation for C. elegans.
Gene Indices  
AllGenes Integrates genomic and protein sequences for human and mouse around assembled transcripts. Powerful query interface. Developed here at Penn in CBIL.
TIGR Gene Indices Similar to AllGenes. Includes many different organisms.
Other Databases  
GeneCards Integrated database containing information from a variety of sources. Organized around official gene names.
Source Useful for obtaining information about sequences given accessions or image clone identifiers.

Tools

» top
EBI Toolbox Tools for database and homology searching, browsing and analysis including FASTA, CLUSTALW, MSA, PSA
BCM Search Launcher Tools for secondary structure prediction, multiple sequence alignment, gene finding, etc.
TIGR software Tools for splice sites detection, gene finding, genomic sequence alignment, sequence assembly, expression data visualization and analysis.
GenScan Provides access to the program Genscan for predicting the locations and exon-intron structures of genes in genomic sequences from a variety of organisms
MOTIF Tools to search protein and nucleic acid sequence Motifs
PatScan Searches protein or nucleotide (DNA, RNA, tRNA etc.) sequence archives for instances of a pattern which you input
SCOP Tools for structural classification of proteins

Pathways

KEGG Kyoto Encyclopedia of Genes and Genomes (KEGG) is an effort to computerize current knowledge of molecular and cellular biology in terms of the information pathways.
BioCarta Provides interactive graphic models of molecular and cellular pathways
MetaCyc Contains metabolic pathways from 150 different organisms

Microarray

Databases  
NCI ArrayDB A Gene Expression Database for the Molecular Pharmacology of Cancer
EBI ArrayExpress A public repository for well annotated microarray based gene expression data
ExpressDB Harvard's relational database containing yeast and E. coli RNA expression data
Stanford Microarray Database Raw and normalized data from microarray experiments with the corresponding image files. Data is released to the public at the researcher's discretion or upon publication.
ChipDB MIT's interactive database of expression data
Data Analysis Tools
EBI Expression Profiler A set of tools for clustering, analysis and visualization of gene expression and other genomic data
Sanger Center ExpressionBrowser A tool for visualizing clusters of expression data
J-Express A Java implementation of hierachical clustering, self organized maps, and principal component analysis, with several different viewing options and output formats.
Cluster/TreeView Cluster and TreeView are an integrated pair of programs for analyzing and visualizing the results of complex microarray experiments.
2HAPI A web-based microarray data analysis system to browse, visualize, and analyze data obtained from genome-wide gene expression experiments.

Statistics

StatSoft Electronic Statistics Textbook A searchable electronic textbook for general statistical concepts

Software

BioPerl Open source Perl tools for bioinformatics, genomics and life science research.
EMBOSS A package of free open source software for sequence analysis provided by the EMBL.