cleverGO - Tutorial

Usage

Manual submission

The simplest way to execute our tool is to provide a list of gene or protein identifiers and select organism / ID format from our list of supported databases. That's all it takes! You are then taken to a holding page until your calculation finishes. Depending on the server load, the calculation should be finished in under 30 minutes.

For user's convenience, we have implemented multiple id-types per organism.

Would you like to try it yourself? Try our example. Or just start fresh.

Custom universe submission

Same submission form as with the manual submission, only in this case it is possible to upload custom universe. This allows different baseline to be used (whole proteome is used as a default) to calculate more precise enrichment results. You can access the custom-universe submission here.

Automated submission

As a part of the multiCM analysis, The GO analyser can be invoked for each of the submitted sets without the need of data re-submission. We also provide ID-type auto-detection based on the FASTA file labels. To start with the automated analysis, please click on a coloured "dot" to activate the details menu:

As each cleverMachine model contains two datasets, analysis can be launched for each of the sets. After selecting a set, user is presented with the auto-detection screen to determine which organism/ID system to execute. The ID detection system relies on either presence of the ID in a simple form, or using standardised UniProt FASTA headers.

Output description

Enrichment table

Description of the column data

precision term precision calculated from the GO graph structure
(Herrmann C., Bérard S., Tichit L., SimCT: a generic tool to visualize ontology-based relationships for biological objects, Bioinformatics (2009) 25 (23): 3197-3198)
lvl depth of the term
go_term GO term identifier, with link to term description at AmiGO website
description Textual label for the term
e/d e signifies enrichment of the term, d signifies depletion compared to the population
%_set coverage on the provided set - how much of the set is annnotated with the GO?
%_pop coverage of the same term on our population
p_bonf p-value of the enrichment. To correct for multiple testing bias, we are applying Bonferroni correction. This is especially important for GO results as individual terms are often parts of the same sub-tree.

While the main structure of the enrichment table remains the same, we allow on-the-fly filtering and sorting of the output data.

For example, clicking on a column heading provides sorting by the selected column.

Term-depth and precision filtering

Using the sliders in the upper-right corner allows filtering of terms depending on their depth in the GO graph and the term precision. This function is useful narrowing-down the list to more specific terms is required. Often the most-specific terms provide the most relevant insight. The term precision is calculated according to :
Herrmann C., Bérard S., Tichit L., SimCT: a generic tool to visualize ontology-based relationships for biological objects, Bioinformatics (2009) 25 (23): 3197-3198.

Textual search

Free-text search field allows matching of the GO terms to the user-provided string. Only the entries for which the description is matching the search term remain on the list.

GO graph

The GO graph builds on top of the annotation information and provides further insight into relationships of the individual GO terms and links them together based on their semantic similarity. Read more on how we calculate the semantic similarity strengths in the documentation.

The visualisation is interactive and reacts to the following virtual forces to show the result:

Semantic similarity force - a spring-like force connecting similar elements
Central gravity - keeping elements within our view
Entropic repulsion - elements have natural tendency to spread out when possible
Bounding box - all elements are always visible.

View modifiers

Semantic similarity slider

GO elements are connected to other nodes similar to them. This connection can have varying strenght and while we already take the spring strength into account for the visualisation, changing the similarity strenght threshold allows further pruning of the lower-score connections.

minimal lvl slider

The level_depth provides high-pass filter for the elements of the graph - we are not filtering the connections but the actually the individual entries, same functionality as in the enrichment table.

precision slider

The precision provides high-pass filter for the elements of the graph based on the term precisions - as it is the case with level_depth, we are not filtering the connections but the actually the individual entries, same functionality as in the enrichment table.

p-value cutoff slider

Another filter for the elements of the graph is p-value qualification of the entries for the graph.

Output elements

Graph

The graph represents the GO terms as nodes with semantic similarity as connections between them. Both nodes and connections are filtered based on the settings provided above.

Word-cloud side panel

For each of the clusters, we provide basic visualisation of its contents. A cluster can be selected by clicking on any of its nodes, which activates the details side-panel. Cluster is simply defined as a set of noded that are connected to the selected node, either directly or through a proxy. A word-cloud is presented for each clusters, showing important words found in the GO term descriptions.

Cluster summary

Alongside with the wordcloud, we also provide a listing of all of the entries present in the cluster.
Importantly, we also cross-reference the cluster contents with the original submission and provide a set of entries from the original submissions that can be covered by the cluster's entries. This allows downloading of the contents of each of the clusters.

Browser compatibility

We ensure that our algorithm works on latest versions of Mozilla Firefox and Google Chrome. We are relying on the latest and greatest of web technologies and we do not require any extra plugin installation.

As a consequence of the above, any other browser not mentioned above is not supported and may not provide full experience.

Also, should you experience any problems when uploading files, please ensure that your internet connection works properly and there are no proxies/firewalls blocking the transfer in between you and our servers.