May 20, 2020

Public workspaceFind Proteins of Unknown Function (PUFs) using Plantannot - Protocol A

  • 1EMBRAPA
Icon indicating open access to content
QR code linking to this content
Protocol CitationMarcos Viana, Mauricio Mudadu, Adhemar Zerlotini 2020. Find Proteins of Unknown Function (PUFs) using Plantannot - Protocol A. protocols.io https://dx.doi.org/10.17504/protocols.io.bgcvjsw6
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License,  which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
It is working
Created: May 13, 2020
Last Modified: May 20, 2020
Protocol Integer ID: 36981
Abstract
The Plantannot software provides several filters and a text search box that allows searching for molecules by its desired annotation features. These filters are needed to obtain PUFs and to try to relate them to abiotic stresses using RNA-seq expression data and co-expression networks. The Filters menu is separated in 8 fields, of those we are going to use only five: “Organism”, “Feature type”, “Orthology”, “Orthologs_coexpression” and “Analyses”. The “Feature Type” filter has three molecule types, from those the polypeptide box is the only that is going to be always checked and the others blank. By using the other 4 remaining filters, 6 protocols were created as examples of different ways to selecting PUFs. Protocol A: using lack of both homology and protein domain signatures. Protocol B: using lack of homology, presence of domain signatures - trying to select Domains of Unknown Function (DUF) from PFAM, and the text search “Unknown function”.Protocol C: using homology, lack of protein domain signatures and the text search “Unknown function”. Protocol D-F: same protocols of A-C but using ortholog groups to find homolog proteins with co-expression data related to abiotic stress.
Protocol A is intended to Find PUFs from organisms whose proteins are not yet in the NCBI´s nr database and have no protein domains found by Interproscan.
Entering application
Entering application
Enter the Plantannot Result's page, with empty filters and text box search: https://www.machado.cnptia.embrapa.br/plantannot/find/?q=
Or you can enter the https://www.machado.cnptia.embrapa.br/plantannot initial page and click on the magnifying glass with the text box empty as well.
Filtering
Filtering
Find PUFs from organisms whose proteins are not yet in the NCBI´s "nr" database and have no protein domains found by InterproScan.

Visualize the "Filters" card on the left of the page from step1:




In the "Organisms" filter, select any organisms (expand the organism's list using the green arrow) or select all by leaving all boxes empty. We will use Oropetium tomaeum as example. Click "apply" to execute the filter:



Leave the "Orthology" and "Coexpression" and "Orthologs_coexpression" filters empty:



Leave the "Biomaterial" and "Treatment" filters empty:



Filters
Filters
Viewing results
Viewing results
Visualize the "Results" card on the center-right of the screen, we will have the resulting list of Oropetium's PUFs, 2,541 PUFs were filtered:


By default we have 50 results displayed of the screen,but at the bottom of the screen this number can be changed or if you prefer you can borwser of the screens to see all the results.


In addition, at the top right of the results screen you can click on the highlighted icon in the image below and download all the results in a .tsv file