NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F088482

Metagenome Family F088482

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F088482
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 60 residues
Representative Sequence MQMKFWMKAFVIVPALCLAGLSAAAQQQGSSQQQTGDPVADAARKAREKKKDAAKPKKVYTDDDLK
Number of Associated Samples 79
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 3.67 %
% of genes from short scaffolds (< 2000 bps) 1.83 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction Yes
3D model pTM-score0.28

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (96.330 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(60.550 % of family members)
Environment Ontology (ENVO) Unclassified
(57.798 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(61.468 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 48.94%    β-sheet: 0.00%    Coil/Unstructured: 51.06%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.28
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 109 Family Scaffolds
PF02518HATPase_c 83.49
PF13426PAS_9 6.42
PF01478Peptidase_A24 4.59
PF08448PAS_4 3.67
PF00174Oxidored_molyb 0.92
PF00512HisKA 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 109 Family Scaffolds
COG2041Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductasesEnergy production and conversion [C] 0.92
COG3915Uncharacterized conserved proteinFunction unknown [S] 0.92


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A96.33 %
All OrganismsrootAll Organisms3.67 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300021178|Ga0210408_10126487All Organisms → cellular organisms → Bacteria2017Open in IMG/M
3300028047|Ga0209526_10111119All Organisms → cellular organisms → Bacteria1928Open in IMG/M
3300028536|Ga0137415_10000565All Organisms → cellular organisms → Bacteria → Acidobacteria35929Open in IMG/M
3300031754|Ga0307475_10152809All Organisms → cellular organisms → Bacteria1833Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil60.55%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil7.34%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil7.34%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil7.34%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil7.34%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil1.83%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.83%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.92%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.92%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.92%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.92%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.92%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006102Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026355Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-AEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10563496013300000364SoilMGMRVWMKNSIIVGALCMAGLPAPAQSQNGSQQSGSDPVADAARKAREDKKNAGKPKKVYTDDDVKPAAAA
JGI12053J15887_1039282513300001661Forest SoilMATKVWMKILIIVAVLCVAGLPAYAQSQSSWQQSGSDPVADAARKAREDKKNAAKPKKVYTDDDVKPA
JGI25615J43890_104863213300002910Grasslands SoilMKFWMKAFVIVPALCLAGLSAAAQQQGSSQQQTGDPVADAARKAREKKKDAAKPKKVYTDDDLKGSVPAPEAAATSA
JGI25390J43892_1001146213300002911Grasslands SoilMRMRLRMKAFFFVPALCLAGLSAAAQQAGDPVADAARKAREAKKDKDTTKPKKVYTDDDFKKSVPE
JGI25617J43924_1004037923300002914Grasslands SoilMRVKFWMKVFLVVPAMGLVGLSAAARPQDSSQQQSGDAVADAARKAREAKKDAPKPKRGYYGR*
Ga0066677_1064287623300005171SoilMRMRLRMKAFFFVPALCLAGLSAAAQQAGDPVADAARKAREAKK
Ga0066388_10828979213300005332Tropical Forest SoilVAALALALPAFAQSQSSSQQSTGDAVADAARKAREAKKNAPKPK
Ga0066689_1084200523300005447SoilMRMRFWTKAWMTAAAVCLAGFSATARQQGSGQQQTGDPVADAARKARETKKDAPKPKKVYTD
Ga0066704_1061611013300005557SoilMRMRLRMKAFFFVPALCLAGLSAAAQQAGDPVADAARKAREAKKDKDTTKPKKVYTDDDFKKSV
Ga0066699_1032068123300005561SoilMRMRFWTKAWMTAAAVCLAGFSATARQQGSGQQQTGDPVADAARKARETKKDAPKPKKVYTDDDLKKSTPAPVA
Ga0066694_1022575723300005574SoilMRVRLWIKASMIVAIVAALCMAWLSAAARQQDSSQQRTGDPVADAARKAREKKKDAPKPKKIYTDDDVKK
Ga0066651_1057333413300006031SoilMRVRFWIKASMIVAALCMAGLSAAARQQGSSQQPTGDPVADAARKAREKKKDAPKPKKIYTD
Ga0075015_10009161113300006102WatershedsMRTAIWVKGFVLVPALCLAGISAGARPQDATQQTGDPVADAARKARESKKDVPKPKKVWTDD
Ga0079220_1133083913300006806Agricultural SoilMKLWTKVFVALSVACLAGFATAAQSQQTGDPVADAARKAREQKKDAPKPKKVYTDDDVKKSAPEPAA
Ga0075436_10015312813300006914Populus RhizosphereMRMRLRMKAFLFVPALWLAGLSAAAQQTGDPVADAARKAREAKKDTTKPKK
Ga0099793_1029106313300007258Vadose Zone SoilMRVRFWIKAFMIVAALCMAGLSAAARQQGSSQQQTGDPVADAARKAREKKKDAPKPK
Ga0099794_1005774913300007265Vadose Zone SoilMKVFVIAPALCLAGLSAAPQQQGSSQQQTSDPVADAARKAREMKKDAPKPKKVYTD
Ga0099794_1024178323300007265Vadose Zone SoilMKASVMVAALCLVGLTAAAQQQTGDPVADAARKARESKKDAPKPKKVYTDDDFKRSAPEPAAPATA
Ga0099829_1051173013300009038Vadose Zone SoilMCLAGLSAAAQQQGSAQQQTGDPVADAARKAREMKKDAPKPKKVYTDDDVKKSVPVPEAAATSAPV
Ga0099829_1067159723300009038Vadose Zone SoilMKVFVIVPALCLAGLAAHAQQQGSSQQQTGDPVADAARKARESKKDAPKPKKVYTDDDLK
Ga0099829_1105392613300009038Vadose Zone SoilMKAFVIVPALCLAGLAGAAQQQGSSQQQTGDPVADAARKAREAKKDAPKPKKV
Ga0099830_1018339113300009088Vadose Zone SoilMKLLVFVPGLFLAGLAAAAQPQGSSQQTGDPVADAARKAREAKK
Ga0099830_1029415523300009088Vadose Zone SoilMRVRFWMNVFVVVPTMCLVGLSAAAPPQGSSQQQTGDPVADAARKAREAKK
Ga0099830_1144127423300009088Vadose Zone SoilMKISIIVPVLCVAGLPAYAQSQSSWQQSGRDPVADAARKAREDKKNAAKPKKVYTDDDVKPATAAAAT
Ga0099828_1015085733300009089Vadose Zone SoilMRVKFWMKVFLVVPSMCLVGLSAAARPQDSSQQQTGDAVADAARKARESK
Ga0099828_1127287813300009089Vadose Zone SoilMKAFVIVPALCLAGLSAAAQQQGSSQQQTGDAVADAARKARETKKDAPKPKKV
Ga0099827_1018275313300009090Vadose Zone SoilMIVAALCLAGLSAAARQQDSSQQQTGDPVADAARKAREKKKDAAKPKKIYTD
Ga0099792_1003819213300009143Vadose Zone SoilMKGCVIVPALCLVGLSAAAQQTGDPVADAARKAREAKKDTTKPKKVYTDDDVKKS
Ga0099792_1005908833300009143Vadose Zone SoilMRVKFWMKVFLVVPATGLVGLSAAARPQDSSQQQSGDAVADAARKAREAKKDAPKPKRVITDDDLKTSGPASAPVDATRAT
Ga0099792_1112853023300009143Vadose Zone SoilMRVKFWTKVFLIVPALCLAGLSAAAQQTGDPVADAA
Ga0099796_1041955423300010159Vadose Zone SoilMRVTLWTKILLFVPALFLARLSTAAQSQDASQQTGDPVADAARKARESKK
Ga0134084_1012804413300010322Grasslands SoilMIVAALCMAGLSAAARQQGSSQQQTGDPVADAARKAREKKKDAPKPKKIYTDDDVKKSAP
Ga0137392_1015280413300011269Vadose Zone SoilMKAFVIVPALCLAGLSTAAQQQGSSQQQTGDPVADAARKAREMKKDAPKPKKVYTDDDVK
Ga0137391_1018054833300011270Vadose Zone SoilMKAFVIVPALCLAGLSTAAQQQGSSQQQTGDPVADAARKAREMKKDAPKPKKVYTDDDVKKSVPV
Ga0137391_1044219223300011270Vadose Zone SoilMRFWIKASLILPALCLAGLTAYGQQQGSSQQQTSDPVADAARKAREEKKNAQKPKKVYTDDDVRHNL
Ga0137391_1135075513300011270Vadose Zone SoilMKGFVIVPALCLVGLSATAQQTGDPVADAARKAREAKKDTT
Ga0137393_1146013623300011271Vadose Zone SoilMQVTFWMKAFVIVPALCVAGLSAAAQQQGSSQQQTGDAVADAARKARETKK
Ga0137388_1014824113300012189Vadose Zone SoilMRVRFWINVSMIVAALCVTGLLAAAQQQGSSEQQTGDPVADAARKAREKKKDAPKPKKIYTDDDVKKSAPGP
Ga0137388_1142887813300012189Vadose Zone SoilMFVAALCMAGLSTAARQQGSSQQQTGDPVADAARKAREKKKDAPKPKKIYTDDDVKKSAPGP
Ga0137388_1151493623300012189Vadose Zone SoilMKGCVIVPALCLVGLSAAAQQTGDPVADAARKAREAKKDT
Ga0137363_1004709143300012202Vadose Zone SoilMKGFVIVGGICLAGLPATAQQQGSSQQQTGDPVADAARKAREAKKDAP
Ga0137363_1115006623300012202Vadose Zone SoilMRVKFWMKVFLVVPSMCLVGLSASARPQDSSQQQTGDAVADAARKAREAKKDAPKPKRVITDDDLKTSGPRSDVAPASAPVNA
Ga0137399_1007491633300012203Vadose Zone SoilMKLLVFVPALFLAGLAAAAQPQGSSQQTGDPVADAARKA
Ga0137399_1124035423300012203Vadose Zone SoilMTAAAVCLASFSATARQQGSAQQQTGDPVADAARKARETKKDAPKPKKVYTDDDLKKSA
Ga0137362_1080083513300012205Vadose Zone SoilMRFWIKASLILPALCLAGLTAYAQQQGSSQQQTSDPVADAARKAREEKKNAQKPKK
Ga0137378_1016487233300012210Vadose Zone SoilMNGSVIVAGLCLAGLPATAQQQGSSQQQTGDPVADAARKAREAKK
Ga0137378_1186046823300012210Vadose Zone SoilLCLAGLTAYAQQQGSSQQQTSDPVADAARKAREEKKNAQKPKKVYTDDDVRHNLGGPAAP
Ga0137377_1074455513300012211Vadose Zone SoilMIVAALCMAWLSAAARQQGSSQQQTGDPVADAARKAREKKKDAPKPKKIY
Ga0137384_1122880223300012357Vadose Zone SoilMRFWIKASLILPALCLAGLTAYAQQQGSSQQQTSDPVADAARKAREEKKN
Ga0137360_1016133713300012361Vadose Zone SoilMKGLALVPVLCLVGFSAAARPQAASQQTGDPVADAARKARESKKDASKPKKVYTDDDFKKAAPEPAPA
Ga0137361_1024735023300012362Vadose Zone SoilMQVRFWINVSMIVAALCVTGLLAAAQQQGSSEQQTGDPVADAARKAREKKKDAPKPKKIY
Ga0137390_1152346113300012363Vadose Zone SoilMKLFLVVPAMGLVGLSAAARPQDSSQQQSGDAVADAARKAREAKKDAPKPKRVITDDDLKTSGPRSDVAPA
Ga0137358_1033636323300012582Vadose Zone SoilMRVKLWIKASMIVAALCMAGLLAAAQQQGSSEQQTGDPVADAARKAREKKKDAPKP
Ga0137358_1072257623300012582Vadose Zone SoilMRVRFWMKASVMVAALCLVGLTAAAQQQTGDPVADAARKA
Ga0137398_1102453523300012683Vadose Zone SoilMRVKFWMKVFLVVPAMGLVGLSAAARPQDSSQQQSGDAVADAARKAREAKKDAPKPKRVITDDDLKTSGPRSDVAPASAPV
Ga0137396_1036777923300012918Vadose Zone SoilMQVRLWIKASMIVAALCMAGLSAAARQQGSSQQQTGDPVADAARKAREKKKDAPKPKKIYTDD
Ga0137396_1061710623300012918Vadose Zone SoilMNVFAVVPAMCLVGLSAAAQPQGSSQQQTGDPVADAARKAREAKKDAPKPKRVITDDDLKRSAP
Ga0137396_1107791623300012918Vadose Zone SoilMRAKSGMMAFVMGAALCLAGLSAARPQSSTPQQTGDPVADAARKAREAKKD
Ga0137396_1116909623300012918Vadose Zone SoilMQMKFWMKAFVIVPALCLAGLSAAAQQQGSSQQQTGDPVADAARKAREK
Ga0137359_1083191323300012923Vadose Zone SoilMQMKFWMKAFVIVPALCLAGLSAAAQQQGSSQQQTGDPVADAARKAREKKKDAAKPKKVYTDDDLK
Ga0137419_1069479923300012925Vadose Zone SoilMRVKFWMKVFLLVPSMCLVGLSAAARPQDSSQQQTGDPVADAARKAREAKKDAPKPKRVITDDDLKTSGPRSDVAPAS
Ga0137419_1087194613300012925Vadose Zone SoilMRVRFWIKASMIVAALCMAGLLAAARQQGSSQQQADDPVADAARKAREKKKDAPKPKKIYTDDDLKKSAP
Ga0137416_1018270823300012927Vadose Zone SoilMRVRFWIKASMIVAALCMPGLLAAAQQQGSSEQQTSDPVADAARKAREKKKDAPKPKKIYTDDDLKKSAPAPDA
Ga0137416_1173969123300012927Vadose Zone SoilMKAFVIVPALCLAGLSAAAQQTGDPVADAARKAREMKKKDAAKPKKVYTDDDLKGSVPAPEAAPASAPANA
Ga0137407_1074540323300012930Vadose Zone SoilMRVKFWMKVFLVVPAMCLVGLSAAARPQDSSQQQTGDPVADAARKAREAKKDAPKPKRVITDDDL
Ga0134110_1029177523300012975Grasslands SoilMRMRLRMKAFFFVPALCLAGLSAAAQQAGDPVADAARK
Ga0137411_102381713300015052Vadose Zone SoilMKASVMVAALCLVGLTAAAQQQTGDPVADAARKARESKKDAPKPKKVYTDDD
Ga0137420_112547223300015054Vadose Zone SoilMRVRFWIKAFMIVAALCMAGLLAAAQQQGSSEQQTGDPVADAARKAREKKKDAPKPKKIYTDDDLKKSAPAPDAAA
Ga0137420_121429713300015054Vadose Zone SoilMQMKFWMKAFVIVPALCLAGLSAAAQQQGSSQQQTGDPVADAARKAREKKKDAAKPKKVYTDEYGR*
Ga0137420_135678433300015054Vadose Zone SoilMKFWMKAFVIVPALCLAGSRQPRNSRFFPATDWRSGGGRRKEGREKKKDAAKPKKVYTDDDLKGSVPAPEAAATSARQMPVERRPRRPRR*
Ga0137412_1052917013300015242Vadose Zone SoilMRVKFWIKVFLAVPAMCLVGLSAAARPQDSSQQQTGDAVADAARKAREAKKDAPKPKRVITDDDLKT
Ga0066662_1291064113300018468Grasslands SoilMRVRFWIKASMIVAALCMAGLSAAARQQGSSQQQTGDPVADAARKAREKKKDAPKPKKIYTDDDVKKSAPEPAP
Ga0137408_101392713300019789Vadose Zone SoilMTMKAWMKILIIVPTLCVAGMVTYAQSQGSSQQSGSDPVADAARKAREDKKNAAKPKKVYTDDDVKCQACGSG
Ga0193735_101706813300020006SoilMIVAALCMAGLLAAAQQQGSSEQQTGDPVADAARKAREKKKDAPKPKKIYTDDDVKKS
Ga0179592_1040208713300020199Vadose Zone SoilMRVRFWMKAFVIVPALCLAGLSAAAEQTGDPVADAARKAREMKKKDAAKPKKVYTDDDLKGSVPAPEAAPTPAPANA
Ga0210403_1080425413300020580SoilMRVRFWMKASVMVTALCLVGLTAAAQQQTGDPVADAARKARESKKDAPKPK
Ga0210399_1020338433300020581SoilMRVRFWIKASMIFAALCVTGLLAAARQQGSSEQQTGDPVADAARKAREK
Ga0210408_1012648713300021178SoilMRAKIEIMGFVMGAALCLAWLSAAARPQSSTPQQTGDPVADAARKAREAKKDAPKPKKVYTDDDVKMSAPEPAAAPA
Ga0210384_1032428823300021432SoilMRVRLWIKASMIFSALFVTGLLAAARQQGSSHQETGDPVADAARKAREKKKDAPKPKK
Ga0137417_120456713300024330Vadose Zone SoilMRVRFWIKASMIVAALCMAGLLAAAQQQGSSEQQTGDPVADAARKAREKKKDAPKPKKIYTDDDLGR
Ga0209240_108650323300026304Grasslands SoilMKFWMKAFVIVPALCLAGLSAAAQQQGSSQQQTGDPVADAARKAREKKKDAAKPKKVYTDDDLKGSVPAPEAA
Ga0209801_100994913300026326SoilMTAAAVCLAGFSATARRQGSGQQQTGDPVADAARKARETKKDAP
Ga0209267_117845013300026331SoilMRMKFWTKAWLIVAALCLSGLSAAARQQGSTQQETGDAVADAARKARETKKDAPKPKKVYTDDDLKKS
Ga0257149_105610523300026355SoilMRVRFWMKAFVIVPALCLAGLSAAAQQTGDPVADAARKAREMKKKDAAKPKKV
Ga0257157_100714823300026496SoilMRVRFWMKAFVIVATLCLAGLSAAAQQTGDPVADAARKAREMKKKDAAKPKKV
Ga0257161_109255713300026508SoilMRVRFWMKAFVIVPALCLAGLSAAAQQTGDPVADAARKAREMKK
Ga0257168_112666923300026514SoilMRVRFWMKAFVIVPALCLAGLSAAAQQTGDPVADAARKAREMKKKDAAKPKKVYTDDDLKGSVPAPEAAPA
Ga0209648_1026591523300026551Grasslands SoilMQVTFWLKSFAIVAGICLAGLPAMAQQQVSSQQQTGDPVADAARKAREAKKDAPKPKKIY
Ga0209648_1049877813300026551Grasslands SoilMRVKFWMKVFLVVPAICLVVLSAAARPQDSSQQQTGDPVADAARKAREAKKDAPKPKRVITDDDLKTSGPRSD
Ga0209648_1073472123300026551Grasslands SoilMRVKFWMKAFVIVPAMCLAGLSASAQQQGSAQQQAGDPVADAARKARES
Ga0179593_110223313300026555Vadose Zone SoilMRVKFWIKVFLVVPAMCLVGLSAAARPQDSSQQQTGDAVADAARKAREAKKDAPKPKRVITDDDLKTSGPRSDV
Ga0209076_103482923300027643Vadose Zone SoilMRVRFWIKASMIVAALCMAGLLAAAQQQGSSEQQTSDPVADAARKAREKKKDAPKPKKIYTDDDLKKSAPAPDAAAS
Ga0209588_125055523300027671Vadose Zone SoilMRVRFWMKAFVIVPAMCLAGLSAAARQQGSAQQQTGDPVADAARKAREMKK
Ga0209180_1060402013300027846Vadose Zone SoilMQMRFWMKAFVIVPALWLAGLSAAAQQQGSSQQQSDDPVAAAARKAREAKKDASKPKKIYTDDDVKKSAPEPVPAPA
Ga0209701_1060054723300027862Vadose Zone SoilMRVNLWMKLLVFVPGLFLAGLAAAAQPQGSSQQTGDPVADAARKAREAK
Ga0209488_1079032023300027903Vadose Zone SoilMRVRFWINVSMIVATLCVTGLLAAARQQGSSEQQTGDPVADAARKAREKKKDAPKPKKIYTDDDVKKSAPE
Ga0209526_1011111933300028047Forest SoilMIFAALCVTGLLAAARQQGSSEQQTGNPVADAARKAREKKKDTPKPKKIYTDDDVKKSAPEPAAAA
Ga0137415_1000056513300028536Vadose Zone SoilMRVRFWMNVFAVVPAMCLAGLSAAAQVQGSSQQQTGDPVADAARKAREAKKDAPKAK
Ga0137415_1036217423300028536Vadose Zone SoilMRTRIWMKGLALVPVLCLVGFSAAARPQAASQQTGDPVADAARK
Ga0137415_1053458313300028536Vadose Zone SoilMRMRFWTKAWMTAAAVCLAGFSATARQQGSAQQQGDAVADAARKARETKKDAPKPKKVYTDDDLKK
Ga0307504_1037639213300028792SoilMRVRFWMKAFVIVPALCLAGLSAAAQQTGDPVADAARKAR
Ga0307474_1086999623300031718Hardwood Forest SoilMRMGFWIKAFAVVPVLCLAGLSAAAHQQGSSQQQTGDPVADAARKAREAKKDTTKPKK
Ga0307475_1015280933300031754Hardwood Forest SoilMRVRFWIKAFAVVPVLCLAGLSAAAHQQTGDPVADAARKAREAKKDTTKPKKVY
Ga0307478_1123138723300031823Hardwood Forest SoilMRVRFWMKTFVVVPALFLAGLSPAAQQQDSSQQQTGDSLADAARKARATKKD
Ga0307479_1072210113300031962Hardwood Forest SoilMQLKFFTKAFVMVPALCLVALGAAAQQQGSSQQQTGDPVADAARKARESKKDAPKPKKVYTDDDL
Ga0307479_1077120413300031962Hardwood Forest SoilMKAFMIVPFLSLLGFAAAARQQGSSQQQTGDPVADAARKARESKK
Ga0307479_1219074123300031962Hardwood Forest SoilMRMKFWMKAFLIVPVLCVAGLSAAARQQDSSHQQTGDPVADAARKAREAKKDTTKPKKVYTDDDIKRSAPDQA
Ga0307471_10034711913300032180Hardwood Forest SoilMRVRFWMKAFMIVPFLSLLGFAAAARQQGSSQQQTGDPVADAAR
Ga0307471_10078211923300032180Hardwood Forest SoilMRVRFWIKASMIFAALCVTGLLAAARQQGSSEQQTGDPVADAARKAREKKKDTPKPKKIYTDDDVKKSAPEPAAAAG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.