FlyBase:Polypeptide Report

From FlyBase Wiki
Revision as of 16:31, 21 May 2015 by Bmatthew (talk | contribs)
Jump to navigation Jump to search

Last Updated: 21 May 2015

The Polypeptide Report provides information on individual annotated polypeptides. Annotated polypeptides are derived from Annotated Transcripts by calculating the open reading frame defined by the annotated translation start and stop sites. This generally represents the largest possible open reading frame, assuming this is consistent with conservation among the Drosophila species, but there are exceptions. In over 150 cases, a downstream ATG is annotated based on the PhyloCSF exon prediction algorithm and a small number of genes have been annotated with a non-canonical translation start. These exceptions are noted in comments attached to the relevant transcripts. Since an annotated polypeptide is created for every annotated coding transcript, multiple annotated polypeptides for a given gene may be identical in amino acid sequence. Annotated polypeptides may or may not correspond exactly to polypeptides described in the literature. For more information about curated polypeptides, see the Gene Report, subsection Polypeptide Data, field Reported protein sizes.

This is a field-by-field guide to the information provided in the Polypeptide Report.

General Information

Symbol The valid symbol that is used in FlyBase for the polypeptide.

The first part of the symbol (before the '\') is the standard prefix for the species (from the Species Abbreviations list). For species other than D.melanogaster, the species prefix is displayed wherever the polypeptide symbol is used throughout FlyBase. For D.melanogaster polypeptides, the species prefix is only displayed in the GENERAL INFORMATION section at the top of a Report.

Annotation Symbol The current symbol for the annotation that represents the polypeptide.
Associated gene The gene that encodes the polypeptide.

Clicking on the gene symbol will take you to the relevant Gene Report.

Species The organism that the polypeptide originates from, with the initial letter of the genus and the full species name listed.
FlyBase ID The Primary FlyBase identifier number of the polypeptide, used to uniquely identify the polypeptide in the database.

A polypeptide may also have any number of Secondary FlyBase identifier numbers, which are listed in the SECONDARY FLYBASE IDs section of the Polypeptide Report.

Length (aa) The length in amino acid residues of the polypeptide.
Theoretical pI The theoretical pI was calculated from the predicted amino acid sequence of the annotated protein using the BioPerl pICalculator module and the EMBOSS set of pK values for individual amino acids.
Predicted MW (kD) The Predicted MW was calculated from the predicted amino acid sequence of the annotated protein using the BioPerl SeqStats module.
Map GBrowse shapshot showing gene region plus 2kb on either side of the gene. Snapshot includes polypeptides of the gene of interest, plus polypeptides of neighboring genes. The polypeptide of interest is hilighted in pink.

Sequence

The amino acid sequence of the polypeptide.

Other Products of this Gene

Transcripts

A table of transcripts encoded by the same gene, which lists each transcript symbol, its Primary FlyBase identifier number and its length in nucleotides.

The table is subdivided into the transcript that encodes this polypeptide and transcripts that encode other polypeptides encoded by the same gene.

Clicking on a transcript symbol will take you to the relevant Transcript Report.

Other Polypeptides

A table of other polypeptides encoded by the same gene, which lists each polypeptide symbol, its Primary FlyBase identifier number and its length in amino acid residues.

Clicking on a polypeptide symbol will take you to the relevant Polypeptide Report.

External Crossreferences

A table of DDBJ/EMBL/Genbank sequence accession numbers corresponding to the polypeptide.

Clicking on the accession number will take you to the appropriate entry in the GenBank database.

Synonyms

A list of symbols that have been used in the literature, or by FlyBase, to describe the polypeptide.

References

A list of publications that discuss the polypeptide, subdivided into fields by type of publication. Publications which discuss the associated gene but not this particular polypeptide can be found on the Gene Report. Only those fields containing data are displayed in an individual Polypeptide Report.