5.2.9 Statistics

The PIC Statistics tab displays statistics about the sequence(s) being viewed. If only part of the sequence/alignment or assembly is selected then the statistics displayed will correspond to the highlighted part. The length and number of sequences currently selected is shown at the top of the Statistics tab.

Several of the metrics displayed in the Statistics tab can also be displayed as columns in the document table. These include sequence length, % pairwise identity, % identical sites, mean coverage, molecular weight and several protein statistics such as extinction coefficient and isoelectric point. The value in the document table will be for the entire document, not the currently selected region.

Molecular weight and protein statistics were added to the document table in Prime 2021, and will not appear in the table for documents created in an earlier version of Geneious unless that document is edited in Prime 2021. If you wish to add these statistics to the document table for documents created in earlier versions of Geneious, select the folder containing the document and go to Tools Preferences Appearance and Behavior and select Recalculate statistics now. Note that for performance reasons these statistics are only calculated on documents comprised of less than 10,000 bp or aa (this threshold applies to the total number of residues and gaps over all sequences in a document).

General statistics

Statistics for multiple sequences (lists, alignments, assemblies)

GC content

For documents that are created or modified in Geneious 8.1 or later, the GC content can also be viewed in the %GC column in the document table.

The %GC column shows the percentage of A, C, G, T, U, S, W nucleotides that are either G, C, or S. Ambiguous bases that contain a mixture of GC and non-GC bases (e.g. R, Y, M, K) are excluded from the calculation. This field is available on all nucleotide sequences, contigs, alignments, and sequence lists that were created or had their sequences last modified in Geneious 8.1 or later. For contigs and alignments, the consensus sequence and reference sequence (if any) are excluded from the calculation.

For sequences within an alignment, contig or list, the %GC column only shows the overall value for the alignment. To see a table of GC percentages for all individual sequences within an alignment or contig, the sequences need to be extracted to stand-alone sequences. Alternatively, individual values can be viewed in the statistics panel by clicking on the name of the sequence to select it.

Sequences in a list or alignment can be sorted by GC content by right clicking in the sequence viewer and choosing Sort %GC.