RRID | Searching in Community Resources

Newtomics

RRID:SCR_006073

http://newt-omics.mpi-bn.mpg.de/index.php

Newt-omics is a database, which enables researchers to locate, retrieve and store data sets dedicated to the molecular characterization of newts. Newt-omics is a transcript-centered database, based on an Expressed Sequence Tag (EST) data set from the newt, covering ~50,000 Sanger sequenced transcripts and a set of high-density microarray data, generated from regenerating hearts. Newt-omics also contains a large set of peptides identified by mass spectrometry, which was used to validate 13,810 ESTs as true protein coding. Newt-omics is open to implement additional high-throughput data sets without changing the database structure. Via a user-friendly interface Newt-omics allows access to a huge set of molecular data without the need for prior bioinformatical expertise. The newt Notopthalmus viridescens is the master of regeneration. This organism is known for more than 200 years for its exceptional regenerative capabilities. Newts can completely replace lost appendages like limb and tail, lens and retina and parts of the central nervous system. Moreover, after cardiac injury newts can rebuild the functional myocardium with no scar formation. To date only very limited information from public databases is available. Newt-Omics aims to provide a comprehensive platform of expressed genes during tissue regeneration, including extensive annotations, expression data and experimentally verified peptide sequences with yet no homology to other publicly available gene sequences. The goal is to obtain a detailed understanding of the molecular processes underlying tissue regeneration in the newt, that may lead to the development of approaches, efficiently stimulating regenerative pathways in mammalians. * Number of contigs: 26594 * Number of est in contigs: 48537 * Number of transcripts with verified peptide: 5291 * Number of peptides: 15169

Proper citation: Newtomics (RRID:SCR_006073) Copy

Source: SciCrunch Registry

NEMBASE

RRID:SCR_006070

http://www.nematodes.org/nembase4/

NEMBASE is a comprehensive Nematode Transcriptome Database including 63 nematode species, over 600,000 ESTs and over 250,000 proteins. Nematode parasites are of major importance in human health and agriculture, and free-living species deliver essential ecosystem services. The genomics revolution has resulted in the production of many datasets of expressed sequence tags (ESTs) from a phylogenetically wide range of nematode species, but these are not easily compared. NEMBASE4 presents a single portal into extensively functionally annotated, EST-derived transcriptomes from over 60 species of nematodes, including plant and animal parasites and free-living taxa. Using the PartiGene suite of tools, we have assembled the publicly available ESTs for each species into a high-quality set of putative transcripts. These transcripts have been translated to produce a protein sequence resource and each is annotated with functional information derived from comparison with well-studied nematode species such as Caenorhabditis elegans and other non-nematode resources. By cross-comparing the sequences within NEMBASE4, we have also generated a protein family assignment for each translation. The data are presented in an openly accessible, interactive database. An example of the utility of NEMBASE4 is that it can examine the uniqueness of the transcriptomes of major clades of parasitic nematodes, identifying lineage-restricted genes that may underpin particular parasitic phenotypes, possible viral pathogens of nematodes, and nematode-unique protein families that may be developed as drug targets.

Proper citation: NEMBASE (RRID:SCR_006070) Copy

Source: SciCrunch Registry

HFV Database

RRID:SCR_006017

http://hfv.lanl.gov/content/index

The Hemorrhagic Fever Viruses (HFV) sequence database collects and stores sequence data and provides a user-friendly search interface and a large number of sequence analysis tools, following the model of the highly regarded and widely used Los Alamos HIV database. The database uses an algorithm that aligns each sequence to a species-wide reference sequence. The NCBI RefSeq database is used for this; if a reference sequence is not available, a Blast search finds the best candidate. Using this method, sequences in each genus can be retrieved pre-aligned. Hemorrhagic fever viruses (HFVs) are a diverse set of over 80 viral species, found in 10 different genera comprising five different families: arena-, bunya-, flavi-, filo- and togaviridae. All these viruses are highly variable and evolve rapidly, making them elusive targets for the immune system and for vaccine and drug design. About 55,000 HFV sequences exist in the public domain today. A central website that provides annotated sequences and analysis tools will be helpful to HFV researchers worldwide.

Proper citation: HFV Database (RRID:SCR_006017) Copy

Source: SciCrunch Registry

DAM-Bio

RRID:SCR_006226

http://athina.biol.uoa.gr/DAM-Bio/

An integrated environment designed to support protein sequence and structure analysis on the web.

Proper citation: DAM-Bio (RRID:SCR_006226) Copy

Source: SciCrunch Registry

OMPdb

RRID:SCR_006221

http://aias.biol.uoa.gr/OMPdb/

A database of Beta-barrel outer membrane proteins from Gram-negative bacteria. The web interface of OMPdb offers the user the ability not only to view the available data, but also to submit advanced queries for text search within the database''s protein entries or run BLAST searches against the database. The most up-to-date version of the database (as well as all past versions) can be downloaded in various formats (flat text, XML format or raw FASTA sequences). For constructing OMPdb, multiple freely accessible resources were combined and a detailed literature search was performed. The classification of OMPdb''s protein entries into families is based mainly on structural and functional criteria. Information included in the database consists of sequence data, as well as annotation for structural characteristics (such as the transmembrane segments), literature references and links to other public databases, features that are unique worldwide. Along with the database, a collection of profile Hidden Markov Models that were shown to be characteristic for Beta-barrel outer membrane proteins was also compiled. This set, when used in combination with our previously developed algorithms (PRED-TMBB, MCMBB and ConBBPRED) will serve as a powerful tool in matters of discrimination and classification of novel Beta-barrel proteins and whole-genome analyses.

Proper citation: OMPdb (RRID:SCR_006221) Copy

Source: SciCrunch Registry

Dictyostelium discoideum genome database

RRID:SCR_006643

http://dictybase.org/

Model organism database for the social amoeba Dictyostelium discoideum that provides the biomedical research community with integrated, high quality data and tools for Dictyostelium discoideum and related species. dictyBase houses the complete genome sequence, ESTs, and the entire body of literature relevant to Dictyostelium. This information is curated to provide accurate gene models and functional annotations, with the goal of fully annotating the genome to provide a ''''reference genome'''' in the Amoebozoa clade. They highlight several new features in the present update: (i) new annotations; (ii) improved interface with web 2.0 functionality; (iii) the initial steps towards a genome portal for the Amoebozoa; (iv) ortholog display; and (v) the complete integration of the Dicty Stock Center with dictyBase. The Dicty Stock Center currently holds over 1500 strains targeting over 930 different genes. There are over 100 different distinct amoebozoan species. In addition, the collection contains nearly 600 plasmids and other materials such as antibodies and cDNA libraries. The strain collection includes: * strain catalog * natural isolates * MNNG chemical mutants * tester strains for parasexual genetics * auxotroph strains * null mutants * GFP-labeled strains for cell biology * plasmid catalog The Dicty Stock Center can accept Dictyostelium strains, plasmids, and other materials relevant for research using Dictyostelium such as antibodies and cDNA or genomic libraries.

Proper citation: Dictyostelium discoideum genome database (RRID:SCR_006643) Copy

Source: SciCrunch Registry

UNITE

RRID:SCR_006518

http://unite.ut.ee/index.php

A fungal rDNA internal transcribed spacer (ITS) sequence database (although additional genes and genetic markers are also welcome) to facilitate identification of environmental samples of fungal DNA. Additional important features include user annotation of INSD sequences to add metadata on, e.g., locality, habitat, soil, climate, and interacting taxa. The user can furthermore annotate INSD sequences with additional species identifications that will appear in the results of any analyses done. UNITE focuses on high-quality ITS sequences generated from fruiting bodies collected and identified by experts and deposited in public herbaria. In addition, it also holds all fungal ITS sequences in the International Nucleotide Sequence Databases (INSD: NCBI, EMBL, DDBJ). Both sets of sequences may be used in any analyses carried out. UNITE is accompanied by a project management system called PlutoF, where users can store field data, document the sequencing lab procedures, manage sequences, and make analyses. PlutoF intends to make it possible for taxonomists, ecologists, and biogeographers to use a common platform for data storage, handling, and analyses, with the intent of facilitating an integration of these disciplines. A user can have an unlimited number of projects but still make analyses across any project data available to him.

Proper citation: UNITE (RRID:SCR_006518) Copy

Source: SciCrunch Registry

GeneTack

RRID:SCR_011953

http://topaz.gatech.edu/GeneTack/cgi/print_page.cgi?fn=db_home.html&title=Frameshift%20Database

Tools for frameshift prediction and a frameshift database.

Proper citation: GeneTack (RRID:SCR_011953) Copy

Source: SciCrunch Registry

JiffyNet

RRID:SCR_011954

http://www.jiffynet.org/

Web based instant protein network modeler for newly sequenced species. Web server designed to instantly construct genome scale protein networks using protein sequence data. Provides network visualization, analysis pages and solution for instant network modeling of newly sequenced species.

Proper citation: JiffyNet (RRID:SCR_011954) Copy

Source: SciCrunch Registry

BWA

RRID:SCR_010910

http://bio-bwa.sourceforge.net/

Software for aligning sequencing reads against large reference genome. Consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. First for sequence reads up to 100bp, and other two for longer sequences ranged from 70bp to 1Mbp.

Proper citation: BWA (RRID:SCR_010910) Copy

Source: SciCrunch Registry

Phytozome

RRID:SCR_006507

http://www.phytozome.net/

A comparative platform for green plant genomics. Families of orthologous and paralogous genes that represent the modern descendents of ancestral gene sets are constructed at key phylogenetic nodes. These families allow easy access to clade specific orthology / paralogy relationships as well as clade specific genes and gene expansions. As of release v9.1, Phytozome provides access to forty-one sequenced and annotated green plant genomes which have been clustered into gene families at 20 evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable.

Proper citation: Phytozome (RRID:SCR_006507) Copy

Source: SciCrunch Registry

Descriptions of Plant Viruses

RRID:SCR_006656

http://www.dpvweb.net/

DPVweb provides a central source of information about viruses, viroids and satellites of plants, fungi and protozoa. Comprehensive taxonomic information, including brief descriptions of each family and genus, and classified lists of virus sequences are provided. The database also holds detailed, curated, information for all sequences of viruses, viroids and satellites of plants, fungi and protozoa that are complete or that contain at least one complete gene. For comparative purposes, it also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA genome. The start and end positions of each feature (gene, non-translated region and the like) have been recorded and checked for accuracy. As far as possible, nomenclature for genes and proteins are standardized within genera and families. Sequences of features (either as DNA or amino acid sequences) can be directly downloaded from the website in FASTA format. The sequence information can also be accessed via client software for PC computers (freely downloadable from the website) that enable users to make an easy selection of sequences and features of a chosen virus for further analyses. The public sequence databases contain vast amounts of data on virus genomes but accessing and comparing the data, except for relatively small sets of related viruses can be very time consuming. The procedure is made difficult because some of the sequences on these databases are incorrectly named, poorly annotated or redundant. The NCBI Reference Sequence project (1) provides a comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) and protein products, for major research organisms. This now includes curated information for a single sequence of each fully sequenced virus species. While this is a welcome development, it can only deal with complete sequences. An important feature of DPV is the opportunity to access genes (and other features) of multiple sequences quickly and accurately. Thus, for example, it is easy to obtain the nucleotide or amino acid sequences of all the available accessions of the coat protein gene of a given virus species or for a group of viruses. To increase its usefulness further, DPVweb also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA (ssDNA) genome. Sponsors: This site is supported by the Association of Applied Biologists and the Zhejiang Academy of Agricultural Sciences, Hangzhou, People''s Republic of China.

Proper citation: Descriptions of Plant Viruses (RRID:SCR_006656) Copy

Source: SciCrunch Registry

Genome Reference Consortium

RRID:SCR_006553

http://www.ncbi.nlm.nih.gov/projects/genome/assembly/grc/

Consortium that puts sequences into a chromosome context and provides the best possible reference assembly for human, mouse, and zebrafish via FTP. Tools to facilitate the curation of genome assemblies based on the sequence overlaps of long, high quality sequences.

Proper citation: Genome Reference Consortium (RRID:SCR_006553) Copy

Source: SciCrunch Registry

Psort

RRID:SCR_007038

http://www.psort.org

Portal to the PSORT family of computer programs for the prediction of protein localization sites in cells, as well as other datasets and resources relevant to localization prediction. The standalone versions are available for download for larger analyses.

Proper citation: Psort (RRID:SCR_007038) Copy

Source: SciCrunch Registry

ProDom

RRID:SCR_006969

http://prodom.prabi.fr/

Comprehensive set of protein domain families automatically generated from UniProt Knowledge Database. Automated clustering of homologous domains generated from global comparison of all available protein sequences.

Proper citation: ProDom (RRID:SCR_006969) Copy

Source: SciCrunch Registry

Tetraodon Genome Browser

RRID:SCR_007079

http://www.genoscope.cns.fr/externe/tetraodon/

The initial objective of Genoscope was to compare the genomic sequences of this fish to that of humans to help in the annotation of human genes and to estimate their number. This strategy is based on the common genetic heritage of the vertebrates: from one species of vertebrate to another, even for those as far apart as a fish and a mammal, the same genes are present for the most part. In the case of the compact genome of Tetraodon, this common complement of genes is contained in a genome eight times smaller than that of humans. Although the length of the exons is similar in these two species, the size of the introns and the intergenic sequences is greatly reduced in this fish. Furthermore, these regions, in contrast to the exons, have diverged completely since the separation of the lineages leading to humans and Tetraodon. The Exofish method, developed at Genoscope, exploits this contrast such that the conserved regions which can be identified by comparing genomic sequences of the two species, correspond only to coding regions. Using preliminary sequencing results of the genome of Tetraodon in the year 2000, Genoscope evaluated the number of human genes at about 30,000, whereas much higher estimations were current. The progress of the annotation of the human genome has since supported the Genoscope hypothesis, with values as low as 22,000 genes and a consensus of around 25,000 genes. The sequencing of the Tetraodon genome at a depth of about 8X, carried out as a collaboration between Genoscope and the Whitehead Institute Center for Genome Research (now the Broad Institute), was finished in 2002, with the production of an assembly covering 90 of the euchromatic region of the genome of the fish. This has permitted the application of Exofish at a larger scale in comparisons with the genome of humans, but also with those of the two other vertebrates sequenced at the time (Takifugu, a fish closely related to Tetraodon, and the mouse). The conserved regions detected in this way have been integrated into the annotation procedure, along with other resources (cDNA sequences from Tetraodon and ab initio predictions). Of the 28,000 genes annotated, some families were examined in detail: selenoproteins, and Type 1 cytokines and their receptors. The comparison of the proteome of Tetraodon with those of mammals has revealed some interesting differences, such as a major diversification of some hormone systems and of the collagen molecules in the fish. A search for transposable elements in the genomic sequences of Tetraodon has also revealed a high diversity (75 types), which contrasts with their scarcity; the small size of the Tetraodon genome is due to the low abundance of these elements, of which some appear to still be active. Another factor in the compactness of the Tetraodon genome, which has been confirmed by annotation, is the reduction in intron size, which approaches a lower limit of 50-60 bp, and which preferentially affects certain genes. The availability of the sequences from the genomes of humans and mice on one hand, and Takifugu and Tetraodon on the other, provide new opportunities for the study of vertebrate evolution. We have shown that the level of neutral evolution is higher in fish than in mammals. The protein sequences of fish also diverge more quickly than those of mammals. A key mechanism in evolution is gene duplication, which we have studied by taking advantage of the anchoring of the majority of the sequences from the assembly on the chromosomes. The result of this study speaks strongly in favor of a whole genome duplication event, very early in the line of ray-finned fish (Actinopterygians). An even stronger evidence came from synteny studies between the genomes of humans and Tetraodon. Using a high-resolution synteny map, we have reconstituted the genome of the vertebrate which predates this duplication - that is, the last common ancestor to all bony vertebrates (most of the vertebrates apart from cartilaginous fish and agnaths like lamprey). This ancestral karyotype contains 12 chromosomes, and the 21 Tetraodon chromosomes derive from it by the whole genome duplication and a surprisingly small number of interchromosomal rearrangements. On the contrary, exchanges between chromosomes have been much more frequent in the lineage that leads to humans. Sponsors: The project was supported by the Consortium National de Recherche en Genomique and the National Human Genome Research Institute.

Proper citation: Tetraodon Genome Browser (RRID:SCR_007079) Copy

Source: SciCrunch Registry

RTPrimerDB- The Real-Time PCR and Probe Database

RRID:SCR_007106

http://medgen.ugent.be/rtprimerdb/

Database for primer and probe sequences used in real-time PCR assays employing popular chemistries (SYBR Green I, Taqman, Hybridization Probes, Molecular Beacon) to prevent time-consuming primer design and experimental optimization, and to introduce a certain level of uniformity and standardization among different laboratories. Researchers are encouraged to submit their validated primer and probe sequence, so that other users can benefit from their expertise. The database can be queried using the official gene name or symbol, Entrez or Ensembl Gene identifier, SNP identifier, or oligonucleotide sequence. Different options make it possible to restrict a query to a particular application (Gene Expression Quantification/Detection, DNA Copy Number Quantification/Detection, SNP Detection, Mutation Analysis, Fusion Gene Quantification/Detection, Chromatin immunoprecipitation (ChIP)), organism (Human, Mouse, Rat, and others) or detection chemistry.

Proper citation: RTPrimerDB- The Real-Time PCR and Probe Database (RRID:SCR_007106) Copy

Source: SciCrunch Registry

CD-HIT

RRID:SCR_007105

http://weizhong-lab.ucsd.edu/cd-hit/

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. Software program for clustering biological sequences with many applications in various fields such as making non-redundant databases, finding duplicates, identifying protein families, filtering sequence errors and improving sequence assembly etc. It is very fast and can handle extremely large databases. CD-HIT helps to significantly reduce the computational and manual efforts in many sequence analysis tasks and aids in understanding the data structure and correct the bias within a dataset. The CD-HIT package has CD-HIT, CD-HIT-2D, CD-HIT-EST, CD-HIT-EST-2D, CD-HIT-454, CD-HIT-PARA, PSI-CD-HIT, CD-HIT-OTU and over a dozen scripts. * CD-HIT (CD-HIT-EST) clusters similar proteins (DNAs) into clusters that meet a user-defined similarity threshold. * CD-HIT-2D (CD-HIT-EST-2D) compares 2 datasets and identifies the sequences in db2 that are similar to db1 above a threshold. * CD-HIT-454 identifies natural and artificial duplicates from pyrosequencing reads. * CD-HIT-OTU cluster rRNA tags into OTUs The usage of other programs and scripts can be found in CD-HIT user''s guide. CD-HIT was originally developed by Dr. Weizhong Li at Dr. Adam Godzik''s Lab at the Burnham Institute (now Sanford-Burnham Medical Research Institute).

Proper citation: CD-HIT (RRID:SCR_007105) Copy

Source: SciCrunch Registry

Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB)

RRID:SCR_012820

http://www.rcsb.org/#Category-welcome

Collection of structural data of biological macromolecules. Database of information about 3D structures of large biological molecules, including proteins and nucleic acids. Users can perform queries on data and analyze and visualize results.

Proper citation: Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) (RRID:SCR_012820) Copy

Source: SciCrunch Registry

KEGG

RRID:SCR_012773

http://www.kegg.jp/

Integrated database resource consisting of 16 main databases, broadly categorized into systems information, genomic information, and chemical information. In particular, gene catalogs in completely sequenced genomes are linked to higher-level systemic functions of cell, organism, and ecosystem. Analysis tools are also available. KEGG may be used as reference knowledge base for biological interpretation of large-scale datasets generated by sequencing and other high-throughput experimental technologies.

Proper citation: KEGG (RRID:SCR_012773) Copy

Source: SciCrunch Registry

Searching across hundreds of databases

Our searching services are busy right now. Please try again later

Log in

Leaving Community

About

Community Resources

More Resources

Literature

Log in

Tools Select Another Resource Report Type

Options

Current Facets and Filters

Facets

Recent searches

RRID:SCR_006073

RRID:SCR_006070

RRID:SCR_006017

RRID:SCR_006226

RRID:SCR_006221

RRID:SCR_006643

RRID:SCR_006518

RRID:SCR_011953

RRID:SCR_011954

RRID:SCR_010910

RRID:SCR_006507

RRID:SCR_006656

RRID:SCR_006553

RRID:SCR_007038

RRID:SCR_006969

RRID:SCR_007079

RRID:SCR_007106

RRID:SCR_007105

RRID:SCR_012820

RRID:SCR_012773

RRID Portal Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Sources

Categories

Subcategories

Further Questions

Category Graph

About

Recent News Entries

Contact Us

SciCrunch