NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102884

Metagenome Family F102884

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102884
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 102 residues
Representative Sequence MSFELEQLREAAEKYAYHDAGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGTAFRRICTNWLRKNLTGLQRLQGRVLYSLAPEDFQHI
Number of Associated Samples 69
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 91.30 %
% of genes near scaffold ends (potentially truncated) 12.87 %
% of genes from short scaffolds (< 2000 bps) 10.89 %
Associated GOLD sequencing projects 63
AlphaFold2 3D model prediction Yes
3D model pTM-score0.35

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (77.228 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(35.644 % of family members)
Environment Ontology (ENVO) Unclassified
(62.376 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(63.366 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 45.24%    β-sheet: 0.00%    Coil/Unstructured: 54.76%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.35
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF13570PQQ_3 2.97
PF00583Acetyltransf_1 1.98
PF00118Cpn60_TCP1 1.98
PF13243SQHop_cyclase_C 1.98
PF08592Anthrone_oxy 1.98
PF05016ParE_toxin 0.99
PF03703bPH_2 0.99
PF05977MFS_3 0.99
PF08818DUF1801 0.99
PF13360PQQ_2 0.99
PF01909NTP_transf_2 0.99
PF04967HTH_10 0.99
PF00005ABC_tran 0.99
PF10127RlaP 0.99
PF02979NHase_alpha 0.99
PF13242Hydrolase_like 0.99
PF07282OrfB_Zn_ribbon 0.99
PF07690MFS_1 0.99
PF00717Peptidase_S24 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG0459Chaperonin GroEL (HSP60 family)Posttranslational modification, protein turnover, chaperones [O] 1.98
COG2814Predicted arabinose efflux permease AraJ, MFS familyCarbohydrate transport and metabolism [G] 0.99
COG3402Uncharacterized membrane protein YdbS, contains bPH2 (bacterial pleckstrin homology) domainFunction unknown [S] 0.99
COG3428Uncharacterized membrane protein YdbT, contains bPH2 (bacterial pleckstrin homology) domainFunction unknown [S] 0.99
COG4430Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 familyFunction unknown [S] 0.99
COG5646Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis)Posttranslational modification, protein turnover, chaperones [O] 0.99
COG5649Uncharacterized conserved protein, DUF1801 domainFunction unknown [S] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A77.23 %
All OrganismsrootAll Organisms22.77 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002908|JGI25382J43887_10005582All Organisms → cellular organisms → Bacteria6031Open in IMG/M
3300005167|Ga0066672_10025713All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon3149Open in IMG/M
3300005554|Ga0066661_10043187All Organisms → cellular organisms → Archaea2553Open in IMG/M
3300005556|Ga0066707_10860023All Organisms → cellular organisms → Archaea558Open in IMG/M
3300005568|Ga0066703_10130538All Organisms → cellular organisms → Archaea1500Open in IMG/M
3300006797|Ga0066659_11375777All Organisms → cellular organisms → Archaea589Open in IMG/M
3300009038|Ga0099829_10167866All Organisms → Viruses → Predicted Viral1761Open in IMG/M
3300012199|Ga0137383_10002861All Organisms → cellular organisms → Archaea10701Open in IMG/M
3300012199|Ga0137383_10331570All Organisms → cellular organisms → Archaea1114Open in IMG/M
3300012206|Ga0137380_10017088All Organisms → cellular organisms → Archaea6635Open in IMG/M
3300012207|Ga0137381_10013878All Organisms → cellular organisms → Archaea6242Open in IMG/M
3300012349|Ga0137387_10128609All Organisms → cellular organisms → Archaea1791Open in IMG/M
3300012354|Ga0137366_10076405All Organisms → cellular organisms → Archaea2549Open in IMG/M
3300012357|Ga0137384_10342511All Organisms → cellular organisms → Archaea1239Open in IMG/M
3300012362|Ga0137361_10815493All Organisms → cellular organisms → Archaea849Open in IMG/M
3300012972|Ga0134077_10349543All Organisms → cellular organisms → Archaea630Open in IMG/M
3300015359|Ga0134085_10099550All Organisms → Viruses → Predicted Viral1206Open in IMG/M
3300017659|Ga0134083_10053586All Organisms → cellular organisms → Archaea1524Open in IMG/M
3300026295|Ga0209234_1005118All Organisms → cellular organisms → Archaea4971Open in IMG/M
3300026297|Ga0209237_1003230All Organisms → cellular organisms → Archaea9887Open in IMG/M
3300026328|Ga0209802_1050367All Organisms → cellular organisms → Archaea2065Open in IMG/M
3300026335|Ga0209804_1022401All Organisms → cellular organisms → Archaea3278Open in IMG/M
3300027862|Ga0209701_10040183All Organisms → cellular organisms → Archaea3023Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil35.64%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil34.65%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil14.85%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.88%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1014008723300002558Grasslands SoilMSFDLEQLREVAEKYAFHDASTHGVWPRLVWERRSDAPVARVDDAVEFLGKWKALRFKGGGRAFRRICSNWLRKNLTNLQRLQGRVLYSLAPED
JGI25385J37094_1020856613300002558Grasslands SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRTKSPLARLDDAVEFLGKWKALRFKGGKTGFRTICTTWLRKNLTTLERLQGRVLYSLAPD
JGI25382J43887_1000558233300002908Grasslands SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRTKSPLARLDDAVEFLGKWKALRFKGGKTGFRTICTTWLRKNLTTLERLQGRVLYSLAPDDFQHILDISTSFRDCGAPPTTYGKALHFFPP*
JGI25382J43887_1038742213300002908Grasslands SoilMSFDLEQLREVAEKYAFHDASTHGVWPRLVWERRSDAPVARVDDAVEFLGKWKALRFKGGGRAFRRICSNWLRKNLTNLQRLQGRVLYSLAPEDFQHILD
Ga0066672_1002571353300005167SoilMKNSILSFGLRQLREAADKYTYHAAGTHGVWPRLVWERRNQSPVARIDDAVEFLGKWKALRFKGGKTAFRRICAAWLRRNRTDLESLQGRVLYSLAPEDFQT
Ga0066683_1082602313300005172SoilMRLNWCSNSFTLDWTRLTFELEQLGEAARKYPYHDAGTHGVWPRLVWERRNESPVARLDDAVEFMGKWKALRFKGGGTAFRRICTTWLRKNLTDLQRLQGRVLYSLAPEDFQRI
Ga0066679_1028295123300005176SoilMSFELAQLREAAEKHTYQDASTHGVWPRLVWERRSRSSLARIDDAVEFLGKWKALRFKGGSTAFRRICTTWLRKNLTDLERLQ
Ga0066690_1002925013300005177SoilMSFELGQLREAAEKYAYHDASTHGVYPRLVWEGRSQSPVARLDDAVEFMGKWKALRFKGGKTAFRRISKTWLRKNRSDLESLQGRILYSLAPEDFQTVLKISSSFRDC
Ga0066682_1005284033300005450SoilLSFELAQLREAAEKYTYHDGGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRRICTNWLRKNLTDLQRLQGRVLYSLAPEDFQHILDLSSSFRDCGAPPTTI
Ga0066697_1009188443300005540SoilMSFELEQLRGLAEKYAYHDASTNGVYPRLIWERRNRSPVSRIDDAVEFLGKWKALRFKGGGSAFRRICTNWLRKNLTDLQHLRGRVLFSLAPEDFQRILDI
Ga0066697_1052628213300005540SoilLSFELAQLREAAEKYTYHDGGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRRICTNWLRKNLTDLQRLQGRVLY
Ga0066701_1018902823300005552SoilMSFDLEQLREVAEKYAFHDASTHGVWPRLVWERRSDAPVARVDDAVEFLGKWKALRFKGGGRAFRRICSNWLRKNLTNLQRLQGRVLYSLAPEDF
Ga0066701_1097385413300005552SoilMSFELEQLWEAAEKYAYHDAGTHGVWPRLVWERRSESPAARLDDAVEFLGKWKALRFKGGGLAFRRICTTWLRRNLSDLQRLQGRVLYGVASEDFQT
Ga0066661_1004318713300005554SoilMSFELAQLREAAEKHTYQDASTHGVWPRLVWERRSRSSLARIDDAVEFLGKWKALRFKGGSTAFRRICTTWLRKNLTDLERLQGKVL
Ga0066692_1035568713300005555SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRTKSPLARLDDAVEFLGKWKALRFKGGKTGFRTICTTWLRKNLTTLERLQGRVLYSLAPEDFQTVLKISSSFRDC
Ga0066692_1051454313300005555SoilMSFELGQLREAAEKYTYHDPGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRGICTNWLRKNLTGIQRLHGKVLYTLAPKDFQCILDISSSFRDCGAP
Ga0066707_1001946063300005556SoilMGFELEQLWEAVEKYAYHDAGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRRICTSWLRKNLGDLQRLQGGVLYSLAP
Ga0066707_1086002313300005556SoilMPNRFELKQLREAAEKYAYHDTGTHGVWPRLVWERRNEPPAARLDDAVEFLGKWKALRFKGGSTAFRRICTNWLRKNLTFLQRFQGRVLYSLAPEDFQTVLKISSSFQDCGAPPTTYGKALH*
Ga0066704_1089927613300005557SoilMPNRFELKQLKGAAEKYTYHDAGTHRIWPRLVWEERNKSPVARLDDAVEFLAKWKALRFKGGGVAFRRIYTTWLRKNLTELRR
Ga0066698_1023846413300005558SoilMSFELEQLREAAEKYAYHDAGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGTAFRRICTNWLRKNLTGLQRLQGRVLYSLAPEDFQHI
Ga0066698_1095406313300005558SoilMSFELEQLRGLAEKYAYHDASTNGVYPRLIWERRNRSPVSRIDDAVEFLGKWKALRFKGGGSAFRRICTNWLRKNLTDLQHLRGRVLFSLAPEDFQRILDITTSFRDCSAPPTTFGKA
Ga0066700_1005697313300005559SoilMSFELGQLREAAEKYAYHDASTHGVYPRLVWEGRSQSPVARLDDAVEFMGKWKALRFKGGKTAFRRISKTWLRKNRSDLESLQGRILYSLAPEDFQTVLKISSSFRDCGAP
Ga0066700_1052390613300005559SoilMSFELEQLQEAAEKYSYHDASTHGVYPRLVWEGRNQSSVARLDDAVEFMGKWKALRFKGGKTAFRRICKTWLRKNRSDLESLQGRILYSLAPEDFQTVLKISSSFRDCGAP
Ga0066703_1013053823300005568SoilMSFELGQLREAAEKYAYHDASTHGVYPRLVWEGRSQSPVARLDDAVEFMGKWKALRFKGGKTAFRRISKTWLRKNRSDLESLQGRILYSLAPEDFQTVLKISSSFRDCGAPPTTFGKALHFFLPET
Ga0066691_1002814243300005586SoilMSFDLEQLREVAEKYAFHDASTHGVWPRLVWERRSDAPVARVDDAVEFLGKWKALRFKGGGRAFRRICSNWLRKNLTNLQRLQGRVLYSL
Ga0066706_1028637013300005598SoilMSFELGQLREAAEKYTYHYAGTHGVYPRLVWEQRNKSPVARLDDATEFLGKWKALRFKGGKVAFSRICNTWLRNNLADLQRLQGKVLYSLAPEDFQHI
Ga0066706_1063426323300005598SoilMSFELGQLREAAETYTGHEAGTHGVWPRLVWKRRNESPVARFDDAVEFLGKWKALRFKGGKTAFRRICSTWLRKNLTDLQRLQGRVLYSLAPEDFQ
Ga0066656_1034860413300006034SoilMSFELEQLRGLAEKYAYHDASTNGVYPRLIWERRNRSPVSRIDDAVEFLGKWKALRFKGGGSAFRRICTNWLRKNLTDLQHLRGRVLFSLAPEDFQRILDITTSFRDCSAPPTTFGKALH
Ga0066656_1061709113300006034SoilMSFELEQLREAARKYLYHDAGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGTAFRRTCTNWLRRNLTLLQRLQGRALYSLTPEDFQHILDISSSFR
Ga0066659_1137577723300006797SoilMSFELEQLWEAAEKYAYHDAGTHGVWPRLVWERRSKSPVARLDDAIEFLGKWKALRFKGGGLAFRRTCTTWLRRNLSDLQRLQGRVLYDLAPDDFQTVLKISSSFRDCGAPPTTFGKALHFFLP
Ga0099793_1064421113300007258Vadose Zone SoilMPNRFELKQLREAAEKHAYHDAGTHGVHSRLVWERRNESPVARLDDAVKLLGKRKALRFKGGGRAFRRICTSWLRKNLTGIQRLQGRDLYSLAPENFQRILDISSSFR
Ga0066710_10050378113300009012Grasslands SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRNGSPVARLDDAVEFLGKWKALRFKGGGTAFRRICANWLRKNLIDLQRLQGRVLYSLAPEDFQDVLRISSSFRDCGAPPTTFGKALH
Ga0099829_1016786623300009038Vadose Zone SoilMSFELEQLRDAAQRYAYHDAGTHGVWPSLVWERRIESPVARLDDAVEFLGKWKALRFKGGGKAFRRICTTWLRKNLVDLQRLQGRVLYSLAPEDFQTVLKISSSFRDFGAPPTPFGKALHFFL
Ga0099829_1106535913300009038Vadose Zone SoilLEQLREAAEKYLYHNAGTHGVWARLVWERRSRSPVARLDDAVEFLGKWKALRFKGGGRAFRGICTNWLRKNLTDLPRLQGRVLYSLAPEDFQHILD
Ga0099830_1042960513300009088Vadose Zone SoilMSFELEQLREAAEKYTHHEAGTHGVYPRLVWERRSQVPVARLDDVVDFLGKWKALRFKGGKIAFRRICTTWLRKNLTDLQRLQGRNLYSLAPEDFQTVLKISSSFRDCGAPPTTF
Ga0099830_1074227513300009088Vadose Zone SoilMSNRFELKQLREAAEKHTYHDADTHGAWPRLAWERRDESPVARLDDAVEFLVKGKALRFKGGNIAFRKICTTWLRKNLTDLQRLQGR
Ga0099828_1012008013300009089Vadose Zone SoilMSFELEQLRDAAQRYAYHDAGTHGVWPRLVWDRRNQPPLSRLDDAVEFLGKWKALRFKGGKTAFRRICTTWLRKNLTDLKCLQGRVLNSLAPEDFQNILDISTS
Ga0134088_1053561013300010304Grasslands SoilMSFELAQLREAAQKYTYHDAGTHGVWTRLVWERRSESPVARLDDAVEFLGKWKALRFKGGGRAFRRICSTWLRRNLTDLQR
Ga0134063_1015028713300010335Grasslands SoilLEQLGEAARKYPYHDAGTHGVWPRLVWERRNESPVARLDDAVEFMGKWKALRFKGGGTAFRRICTTWLRKNLTDLQRLQGRVLYS
Ga0134071_1010828513300010336Grasslands SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRTKSPLARLDDAVEFLGKWKALRFKGGKTGFRTICTTWLRKNLTTLER
Ga0134062_1017764713300010337Grasslands SoilLEQLREAARKYPYHDAGTHGVCPRLVWERRSESPATRLDDAVEFLGKWKTLRFKGGGTAFRRICTTWLRKNLTDLQRLKGRVLYSLPLR
Ga0137391_1011880713300011270Vadose Zone SoilMYTKRTRMSFELDQLRQAAEKYQYHDADTHGVWPRLVWERRKESPLARLDDAVEFMGKWKALRFKGGKAAFRRFCNTWLKKNLIDLQHLQGRVLYSLAPEDFQT
Ga0137391_1104159113300011270Vadose Zone SoilMSFELEQLREAAEKYTYHDANTHGVWPRLVWDRRNEAPVARLEDAVEFLGKWKALRFKGGKTAFRRICAKWLRKNLTDLERLQGRVLYSLAPENFQHILDVSSSFRYCGAPPTTFGK
Ga0137388_1184764113300012189Vadose Zone SoilMSFELEQLREAAEKYTYHDAGTHGVWPRLVWDRRNQPPLSRLDDAVEFLGKWKALRFKGGKTAFRRICTTWLRTNLTHLQGLQGRVLCNLAPEDFQNILDISTSFRDCGAPPTTFG
Ga0137383_1000286113300012199Vadose Zone SoilMSFELEQLREAAEKYSHHDAGTHAVWPSLVWDRRSKSPVARLDDAVEFLGKWKALRFKGGGTAFRRICTNWLRKNLTDLHRLQGRVLYSLAPEDFQRVFRISSSLRDCGAPPTTFGKALHFFL
Ga0137383_1033157013300012199Vadose Zone SoilMSFELGQLREAAEKYAYHDASTHGIYPRLIWKRRNQSPVARLDDAVEFLGKWKALRFKGGKTAFRRICTKWLRQNLTDLQRLQGRVLYSLAPEDFQRVFRISSSLRDCGAPPTTFGKALHFFL
Ga0137380_1001708833300012206Vadose Zone SoilMRFELRQLREAAETYAGHDVGTRGVWPRLVWERRNESPIARLDDAVEFLGRWKALRFKGGKTAFRRICTSWLRKNLTSLQRLQGRVLYSLAPEDFLAHS*
Ga0137380_1019558613300012206Vadose Zone SoilLREAAQKYTYHDAGTHRVWPRLVRERRSRPPVARLDDAVEFLGKWKALSFKGGGTAFRRICTIWLRKNLTDLRRLQGRVLYSLAHEDFQ
Ga0137381_1001387843300012207Vadose Zone SoilMRFELRQLREAAETYTGHQAGTHGVWPRLLWERRNESPIARLDDAVEFLGRWKALRFKGGKTAFRRICTSWLRKNLTSLQRLQGRVLYSLAPEDFLAHS*
Ga0137381_1012070233300012207Vadose Zone SoilLIPVVSHIFVHERTRVSLELEQLREAAETHTGHEASTHAVWPRLVWERRNLPPVARLDDAVEFLAKWKALRFEGGKVAFRRICTTWLRKNLIDLQRLQVRVLYSLAP
Ga0137381_1096300513300012207Vadose Zone SoilMPNRFELKQLREAAEKHTYHDASTHGVYPRLIWERRNRSPVARIDDAVEFLGNWKVLRFKGGGSVFRRICTNWLRKNLTDLQRLQGRVLYSLAPEDFQHILDVTSSFRYCGAPPTTYGK
Ga0137381_1103205813300012207Vadose Zone SoilMANRFELKQLREAVEKYAYHDANTHGVYPRLVWEPRNLSPVARLDDAVEFLGKWKALRFKGGKTAFRRICTTWLRRNLTILERLRGRILYSLAPDDFQTVLKISSSFRDC
Ga0137381_1109003713300012207Vadose Zone SoilMDVHKPRLMSFELAQLREAAETYTYHDAGTHGVWPRLVWERRNESPIARLDDAVEFLGKWKALRFKGGKTAFRRVCATWLRKNLTDLERLQGRVLYSIAPEDFQTVLKISSSFQDCGS
Ga0137376_1029312033300012208Vadose Zone SoilMSFELKQLREAAEKYTYHDAGTDGVWPRLVWERRNGSPVARLDDAVEFLGKWKALRFKGGGTAFRRICANWLRKNLIDLQRLQGRVLYSLAPEDFQDVLRISS
Ga0137379_1068877623300012209Vadose Zone SoilMPNRFELTQLREAAEKHTYHDASTHGVYPRLIWERRNRSPVARIDDAVEFLGKWKALRFKGGGSVFRRICTNWLRKNLTDLQRLQGRVLYSLAPEDFQHILD
Ga0137378_1072801023300012210Vadose Zone SoilMSFELGQLREAAEKYAYHDASTHGIYPRLIWKRRNQSPVARLDDAVEFLGKWKALRFKGGKTAFRRICTKWLRQNLTDLQRLQGRVLYSLAPEDFQ
Ga0137378_1160048623300012210Vadose Zone SoilMPNRFELKQLREAAEKHTYHDASTHGVYPRLIWERRNRSPVARIDDAVEFLGKWKALRFKGGGSVFRRICTNWLRKNLTDLQRL
Ga0137377_1166605723300012211Vadose Zone SoilMDVHKPRLMSFELAQLREAAETYTYHDAGTHGVWPRLVWERRNESPIARLDDAVEFLGKWKALRFKGGKTAFRRVCATWLRKNLTDLERLQ
Ga0137370_1033674823300012285Vadose Zone SoilLSFELAQLREAAEKYTYHDGGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRRICTNWLRKNLTDLQRLQ
Ga0137387_1012860943300012349Vadose Zone SoilMRFELRQLREAAETYTVHQAGTHGVWPRLLWVRRNESPIARLDDAVEFLGRWKALRFKGGKTAFRRICTSWLRKNLTSLQRLQGRVLYSLAPEDFLAHS*
Ga0137387_1047531833300012349Vadose Zone SoilMSFELEQLREAAEKYTYHDAGTHGVWPRLVWERRNESPVARLYDAVEFLGKWKALRFKGGGTAFRRICTNWLRKNLTGLQRLQGRVLYSLAPEDFQHILEISSSFRDCGAPPTT
Ga0137387_1105708713300012349Vadose Zone SoilMPNRFELKQLREAAEKHTYHDASTHGVYPRLIWERRNRSPVARIDDAVEFLGNWKVLRFKGGGSVFRRICTNWLRKNLTDLQRLQGRVLYSLAPEDFQTVLK
Ga0137386_1021373143300012351Vadose Zone SoilMRFELRQLREAAETYTGHQAGTHGVWPRLLWERRNESPIARLDDAVEFLGRWKALRFKGGKTAFRRICTSWLRKNLTSLQRLQGRVLYSLAP
Ga0137366_1007640523300012354Vadose Zone SoilMSFELEQLREAAEKYTYHDAGTHGVWPRLVWERRNESPVARLDDAIEFLGKWKALRFKGGGAAFTRICKTWLRKNLSDLQRLQGRVLYSLA*
Ga0137384_1008799123300012357Vadose Zone SoilLEQLREAAEKYTYHDAGTHGVWPRLVWERRGESPVARLDDAVEFLGKWKALRFKGGGAAFRRICTNWLGKNLTDLQRLQGRVLHGLAPEDFQHIFDISSSFRDCGAPPTTY
Ga0137384_1034251133300012357Vadose Zone SoilMPNRFELEQLREAAEKYTHHDAGTHAIWPSLVWERRSRSPVTRLDDAVDFLGKWKALRFKGGGTALRRICTNWLRKNLAILERLQGMVLYSLAPEDFQHILDISSSFRDCGTPPTTFGK
Ga0137385_1059664613300012359Vadose Zone SoilLEQLREAARKYPHHDAGTHGVWPRLVWERRSESPVARLDDAIEFLGKWKALRFKGGKTAFRRTCTSWLRKNLADLQCLQGRVLYSLAPQDFQHILDISSSFRDCGAPPTTY
Ga0137361_1081549323300012362Vadose Zone SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRTKSPLARLDDAVEFLGKWKALRFKGGKTGFRTICTTWLRKNLTTLERLQGRVLYSLAPDDFQHILDISTSFRDCGAPTTTYGKALHLFHP*
Ga0137396_1025130913300012918Vadose Zone SoilMSFELEQLREAAKKYPYDDAGTHGIWPKLVWEQRNESPLARIDDAVKFLGKWKALRFKGGKTAFRRICTNWLRKNLTDIQRLQGNVLYSLAPEDFQHILDITSSL*
Ga0134077_1034954313300012972Grasslands SoilLEQLREAARKYPYHDAGTHGVWPRLVWERRNESPVARLDDAVEFMGKWKALRFKGGGTIFRRICTNWLRKNLTLLDRLQGRVLYSLANEDFQHILDISSSFRDCGAPPTTIGKALHFFLPET
Ga0134076_1014550733300012976Grasslands SoilMVVHRLARMSFELEQLREAAEKYAYHDAGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGTAFRRICTNWLRKNLTGLQRLQG
Ga0134085_1008369223300015359Grasslands SoilMIVHRLARMSFELEQLREAAEKYAYHDAGTHGVWPRLVWERRNESPVARLDDTIEFLGKWKALRFKGGKTAFRRICNSWLRKNLADLQRLQGRVLYSLAPEDFQRILD
Ga0134085_1009955013300015359Grasslands SoilLEQFREAARKYPYHDAGTHGFWPRLVWERRNESPVARIDDAVEFLGKWKTLRFKGGGTAFRRICTTWLRKNLTDLQRLKGRVLYSLPLRTFNVFLT*
Ga0134112_1012567913300017656Grasslands SoilMSFELKQLQEAAANYTYHDDGTHGVWPRLVWERRTESPAARLDDALEFLGKWNALRFKGGGRAFRRICTNWLRKNLTLLQRLRGRVIYSLAPED
Ga0134112_1043428313300017656Grasslands SoilMGFELEQLREAAEKYTYHDAGTHGVWPRLVWERRNESPVARLDDAVEFLGKSKALRFKGGSTAFRRICTNWLRRNQTLLQRLQGRVLYSLAPEDFQHV
Ga0134112_1046369813300017656Grasslands SoilLSFELEQLRDAARRYPYHEAGTHGVWPRLVWERRNETPVARLDDAVEFLGKWKALRFKGGGTAFRRICTNWVRKNLADLQRLQGRVLYS
Ga0134083_1005358633300017659Grasslands SoilLTFELEQLREAARKYPYHDAGTHGVWPRLVWERRTESPAARLDDALEFLGKWNALRFKGGGRAFRRICTNWLRKNLTDLQRLQGRVLYRLAPEDFQHILELSSRFRDGGAPPTTIGKALHFF
Ga0209234_100511873300026295Grasslands SoilMSFELGQLREAVEKYVYHDASTHGVYPRLVWKRRNQSPVARLDDAVEFLGKWKALRFKGGGVAFRRICKTWLRKNLTILGPLQGRV
Ga0209235_102752823300026296Grasslands SoilMSNRFELQQLREAAEKYTHYDSGKHGVWPKLVWERRNQSPVARLDDSLEFLGKGKALRFKGGGTAFRRICTNWLRKNLTDLQRPCP
Ga0209235_104825233300026296Grasslands SoilMSFELKQLQEAAANYTYHDDGTHGVWPRLVWERRTESPAARLDDALEFLGKWNALRFKGGGRAFRRICTKWLRKNLADLERLQGRVLYSLVPEDFQHILEISTSFRD
Ga0209235_105812143300026296Grasslands SoilMSFELEQLREAAEKYTYHDAGTHGVWPRLVWERRSRSPVARIDDAVEFLGKWKALRFKGGKTAFRRICTNWLRKNLTDLQDLQGRALYSLAPEDFQDIFDISSSFRDCGA
Ga0209237_100323063300026297Grasslands SoilMSNRFELQQLREAAEKYTHYDSGKHGVWPKLVWERRNQSPVARLDDSLEFLGKGKALRFKGGGTAFRRICTNWLRKNLTDLQRLRGRVLYSLAPEDFQTVLKIRRIDD
Ga0209237_121004013300026297Grasslands SoilVSFELEQLREAAEKHTYHDAGTHGVWPRLVWERRNISPVARLDDAVEFLGKWKALRFKGGDRAFRRICTTWLRKNLADLERLQGRVLYSLAPEDF
Ga0209238_110614823300026301Grasslands SoilMSFELGQLREAAEKYAYHDASTHGVYSRLVWKRRNQSPVARLDDAVEFLGKWKALRFKGGKTAFRRICTRWLRKNLTNLQGLQGRVLNSLAPEDFQTVIEISSSFRDCGAPPTTFGKALH
Ga0209761_118178913300026313Grasslands SoilMSFELKQLQEAAANYTYHDDGTHGVWPRLVWERRTESPAARLDDALEFLGKWNALRFKGGGRAFRRICTKWLRKNLADLERLQGR
Ga0209761_124342423300026313Grasslands SoilMPNRFELKQLREAAEKYTYHDAGTHGVWPRLVWERRSESPVARLDDAVEFLGKWKALRFKGGGAAFRRICTTWLRKNLADLERLQG
Ga0209761_131195023300026313Grasslands SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRNISPVARLDDAVEFLGKWKALRFKGGGRAFRRICTTWLRKNLADLERLQGRVLYSLAPEDFQHILDIS
Ga0209152_1002973213300026325SoilMKFELRQLREAAETHTSHEDATRGVWPSLVWERRSLSPAARVDDAVEFLGKWKALRFKGGGAAFRRICSTWLRKNLADLQRLQGRVLYSLAPEDFRHILDITTSFR
Ga0209802_105036733300026328SoilMSFELKQLREAAEKYTYHDAGTHGVWPRLVWERRTKSPLARLDDAVEFLGKWKALRFKGGKTGFRTICTTWLRKNLTTLERLQGRVLYSLAPDDFQHILDISTSFRDCGAPPTTYGKALHFFPP
Ga0209158_118291413300026333SoilMSFELGQLREAAEKYTYHDPGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRGICTNWLRKNLTGIQRLHGKVLYTLAPKDFQCI
Ga0209377_110221713300026334SoilMSFELGQLREAAEKYTYHDPGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRGICTNWLRKNLTGIQRLHGKVLYTLAPKDFQCILDISSSFRDCGAPSPTYGK
Ga0209804_102240113300026335SoilMSFELGQLREAAEKYAYHDASTHGVYPRLVWEGRSQSPVARLDDAVEFMGKWKALRFKGGKTAFRRISKTWLRKNRSDLESLQGRILYSLAPEDFQ
Ga0209804_110772133300026335SoilMDVHKPRLMSFELAQLREAAETYAYHDAGTHGVWPSLVWERRNRSPAARLDDAVEFLGKWKALRFKGGGTAFRSICTNWLRNNLTDLERLQGRV
Ga0257177_108841123300026480SoilMSFELEQLRESAYKYTHHEAGTHGVYPRLVWERRGESPIARIDDAVEFLGKWKALRFKGGKIAFRRICKTWLRKNLADLQHLQGRVL
Ga0209808_116138323300026523SoilLEQLREAAEKYLHHDARTHGVWPRLVWERRNESPVARLDAAVEFLGKWKALRFKGGGAAFRRICKTWLRKNLTDLERLQGKVLYSLAPEDFQAILKIS
Ga0209378_114915023300026528SoilMGFELEQLWEAVEKYAYHDAGTHGVWPRLVWERRNESPVARLDDAVEFLGKWKALRFKGGGRAFRRICTSWLRKNLGDLQRLQGGVLYS
Ga0209160_126471023300026532SoilLSFEPEQLREAAETYVGHGAGTHGVWPRLVWERRSRSPVARLDDAVEFLGKWKALRFKGGGAAFRRICSIWLRKNLTDLQRLQGRVLYRLAPEDF
Ga0209056_1029273813300026538SoilMSFELGQLREAAETYTGHEAGTHGVWPRLVWKRRNESPVARFDDAVEFLGKWKALRFKGGKTAFRRICSTWLRKNLTDLQRLQGRVLYSLAPEDFQTVLNISSSFRD
Ga0209701_1004018373300027862Vadose Zone SoilMSLELEQLREAAEKYTYHDAGTHGVWPRLVWERRKKSPVARLDDAVEFLGKWKALRFKGGGRAFRRICNTWLRKNLTDLQRLQGRVLYSLAPED
Ga0307469_1089766013300031720Hardwood Forest SoilMLGVGELSMSFELEQLREAAEKYTYHGAGTHGIWPRLVWERRSRPPVGRLDDAIEFLGKWKALRFKGGKTAFRRICTKWLRKNLTDLERLQGRVLYSLAPDDFQTVLRTSSSFRDCG
Ga0307471_10231417723300032180Hardwood Forest SoilMSFELEQLREAAEKYTYHGAGTHGIWPRLVWERRSRPPVGRLDDAIEFLGKWKALRFKGGKTAFRRICTKWLRKNLTDLERLQGRVLYSLAPDDFQTVLR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.