Each year Nucleic
Acids Research publishes a special issue describing a wide variety
of databases containing useful compilations of sequence and other information.
This year's update includes 548 databases. They was complied by Dr.
Michael Galperin. An abbreciated list of databases is shown below.
The entire list is available as a searchable, up-to-date, online resource
at:
http://nar.oupjournals.org/content/vol32/suppl_1.
| Database | URL | Description |
| Nucleotide Sequence | ||
| GenBank | www.ncbi.nlm.nih.gov | All publicly available nucleotide and protein sequences |
| EMBL Nucleotide Sequence Database | www.ebi.ac.uk/embl.html | All publicly available nucleotide and protein sequences |
| DNA Data Bank of Japan (DDBJ) | www.ddbj.nig.ac.jp | All publicly available nucleotide and protein sequences |
| DNA Sequences: Genes, Motifs and Regulatory Sites | ||
| TIGR Gene Indices | www.tigr.org/tdb/tgi.shtml | Organism-specific databases of EST and gene sequences |
| ExInt | intron.bic.nus.edu.sg/exint/exint.html | Exon-intron structure of eukaryotic genes |
| TRANSFAC | Transcription factors and binding sites | |
RDP |
rdp.cme.msu.edu | Ribosomal database project: rRNA sequences data |
|
Gene Expression |
||
PIR |
pir.georgetown.edu | A collection of protein sequence databases |
SWISS-PROT |
www.expasy.ch/sprot | Curated protein sequence databases |
| PROSITE | Biologically-significant protein patterns and profiles | |
| Pfam | Sequence alignments and profile hidden Markov models | |
Carbohydrate |
||
| CCSD | bssv01.lancs.ac.uk/gig/pages/gag/carbbank.htm | Complex carbohydrate structure databases (CarbBank) |
| Protein Structure | ||
| PDB | All available 3D structures of proteins and nucleic acids | |
| Genomics | ||
| GO | www.geneontology.org | Gene onthology consortium database |
| KEGG | www.genome.ad.jp/kegg | Databases of genes, proteins, and metabolic pathways |
| EcoCyc | ecocyc.org | E. coli K-12 genes, metabolic pathways, transporters, regulation |
| Ensembl | www.ensembl.org | Annotated information on eukaryotic genomes |