May 20, 2020

Public workspaceFind Proteins of Unknown Function (PUFs) using Plantannot - Protocol F

  • 1EMBRAPA
Icon indicating open access to content
QR code linking to this content
Protocol CitationMarcos Viana, Mauricio Mudadu, Adhemar Zerlotini 2020. Find Proteins of Unknown Function (PUFs) using Plantannot - Protocol F. protocols.io https://dx.doi.org/10.17504/protocols.io.bgdkjs4w
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License,  which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
It is working
Created: May 14, 2020
Last Modified: May 20, 2020
Protocol Integer ID: 37004
Abstract
The Plantannot software provides several filters and a text search box that allows searching for molecules by its desired annotation features. These filters are needed to obtain PUFs and to try to relate them to abiotic stresses using RNA-seq expression data and co-expression networks. The Filters menu is separated in 8 fields, of those we are going to use only five: “Organism”, “Feature type”, “Orthology”, “Orthologs_coexpression” and “Analyses”. The “Feature Type” filter has three molecule types, from those the polypeptide box is the only that is going to be always checked and the others blank. By using the other 4 remaining filters, 6 protocols were created as examples of different ways to selecting PUFs. Protocol A: using lack of both homology and protein domain signatures. Protocol B: using lack of homology, presence of domain signatures - trying to select Domains of Unknown Function (DUF) from PFAM, and the text search “Unknown function”.Protocol C: using homology, lack of protein domain signatures and the text search “Unknown function”. Protocol D-F: same protocols of A-C but using ortholog groups to find homolog proteins with co-expression data related to abiotic stress.
Protocol F is intended to find PUFs from organisms that proteins are already public in the NCBI´s "nr" database and have no protein domain found by Interproscan. Proteins will be selected using the text search "Unknown function". Also, ortholog groups and co-expression networks will be used to relate proteins to abiotic stresses.
Entering application
Entering application
Enter the Plantannot Result's page, with empty filters and text box search: https://www.machado.cnptia.embrapa.br/plantannot/find/?q=
Or you can enter the https://www.machado.cnptia.embrapa.br/plantannot initial page and click on the magnifying glass with the text box empty as well.
Filtering
Filtering

Find PUFs from organisms that proteins are already public in the NCBI´s "nr" database and have no protein domain found by Interproscan. Proteins will be selected using the text search "Unknown function". Also, ortholog groups and co-expression networks will be used to relate proteins to abiotic stresses.

Visualize the "Filters" card on the left of the page from step1:




In the "Organisms" filter, select any organisms (expand the organism's list using the green arrow) or select all by leaving all boxes empty. We will use Oropetium tomaeum as example. Click "apply" to execute the filter:



Leave the "Coexpression" filter empty:



Leave the "Biomaterial" and "Treatment" filters empty:



Filters
Filters
Viewing results
Viewing results
Visualize the "Results" card on the center-right of the screen. There will be the resulting list of Oropetium's PUFs. 4 PUFs were filtered: