NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F073401

Metagenome Family F073401

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F073401
Family Type Metagenome
Number of Sequences 120
Average Sequence Length 67 residues
Representative Sequence LDGSKRSVMHWILATFLLCLSCSSRRIARELGVHGRTSYRWCWWLRNAALSYEMERQLEGTVEADDLYH
Number of Associated Samples 79
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 15.38 %
% of genes near scaffold ends (potentially truncated) 8.33 %
% of genes from short scaffolds (< 2000 bps) 9.17 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (90.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(30.833 % of family members)
Environment Ontology (ENVO) Unclassified
(32.500 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.667 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 63.92%    β-sheet: 0.00%    Coil/Unstructured: 36.08%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 120 Family Scaffolds
PF12762DDE_Tnp_IS1595 9.17
PF02371Transposase_20 5.00
PF12760Zn_Tnp_IS1595 3.33
PF13613HTH_Tnp_4 2.50
PF01609DDE_Tnp_1 1.67
PF04014MazE_antitoxin 1.67
PF04392ABC_sub_bind 0.83
PF13412HTH_24 0.83
PF00856SET 0.83
PF10370DUF2437 0.83
PF13586DDE_Tnp_1_2 0.83
PF01656CbiA 0.83
PF15738YafQ_toxin 0.83
PF13714PEP_mutase 0.83
PF13401AAA_22 0.83
PF00589Phage_integrase 0.83
PF00239Resolvase 0.83
PF00076RRM_1 0.83
PF02796HTH_7 0.83
PF03706LPG_synthase_TM 0.83
PF01610DDE_Tnp_ISL3 0.83
PF01548DEDD_Tnp_IS110 0.83
PF13340DUF4096 0.83
PF03400DDE_Tnp_IS1 0.83
PF01844HNH 0.83
PF13495Phage_int_SAM_4 0.83
PF00561Abhydrolase_1 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 120 Family Scaffolds
COG3547TransposaseMobilome: prophages, transposons [X] 5.83
COG3039Transposase and inactivated derivatives, IS5 familyMobilome: prophages, transposons [X] 1.67
COG3293TransposaseMobilome: prophages, transposons [X] 1.67
COG3385IS4 transposase InsGMobilome: prophages, transposons [X] 1.67
COG5421TransposaseMobilome: prophages, transposons [X] 1.67
COG5433Predicted transposase YbfD/YdcC associated with H repeatsMobilome: prophages, transposons [X] 1.67
COG5659SRSO17 transposaseMobilome: prophages, transposons [X] 1.67
COG0392Predicted membrane flippase AglD2/YbhN, UPF0104 familyCell wall/membrane/envelope biogenesis [M] 0.83
COG1662Transposase and inactivated derivatives, IS1 familyMobilome: prophages, transposons [X] 0.83
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 0.83
COG2452Predicted site-specific integrase-resolvaseMobilome: prophages, transposons [X] 0.83
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 0.83
COG3464TransposaseMobilome: prophages, transposons [X] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A90.00 %
All OrganismsrootAll Organisms10.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005468|Ga0070707_100666120All Organisms → cellular organisms → Bacteria1003Open in IMG/M
3300005985|Ga0081539_10316186All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria666Open in IMG/M
3300006844|Ga0075428_100037599All Organisms → cellular organisms → Bacteria5327Open in IMG/M
3300006847|Ga0075431_100882641All Organisms → cellular organisms → Bacteria864Open in IMG/M
3300009100|Ga0075418_12065538All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria621Open in IMG/M
3300010043|Ga0126380_11412210All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria612Open in IMG/M
3300010047|Ga0126382_10453933All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria1017Open in IMG/M
3300010362|Ga0126377_10513657All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria1232Open in IMG/M
3300010398|Ga0126383_10115851Not Available2430Open in IMG/M
3300012206|Ga0137380_11056684All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria693Open in IMG/M
3300012359|Ga0137385_11274406All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria597Open in IMG/M
3300012930|Ga0137407_11188348All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria724Open in IMG/M
3300012971|Ga0126369_12477598All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria604Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil30.83%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere17.50%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil16.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil5.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.50%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.50%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil2.50%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand2.50%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.67%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.67%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.67%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere1.67%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment0.83%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.83%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.83%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.83%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.83%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.83%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.83%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.83%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005842Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2Host-AssociatedOpen in IMG/M
3300005983Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S2T1R1Host-AssociatedOpen in IMG/M
3300005985Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2Host-AssociatedOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006969Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300009803Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50EnvironmentalOpen in IMG/M
3300009810Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011416Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT551_2EnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300027880Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)Host-AssociatedOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300027954Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300028592Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300032013Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D3EnvironmentalOpen in IMG/M
3300032126Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-2Host-AssociatedOpen in IMG/M
3300032397Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_0EnvironmentalOpen in IMG/M
3300032954Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J12803_10313295623300000955SoilLLHQSKRSLAHWILATFLLCLACSSRRIARELGVHIRTSYRWCWWLRNAALSYEMERQLEGTVEADD
JGI1027J12803_10350737943300000955SoilSLVHWILATFLLCLSCSSRRIARELGVHILTSYRWCWWLRNAALSYEVDRPLAFR*
JGI1027J12803_10951491023300000955SoilNDLTEPLLHRSTRPLAYWILATFLLCLACSSRCIAREVGVHIRTSSCWSWWLRNAALSYEMHRQLE
Ga0066676_1081776213300005186SoilLPYWILATFLLCLSCSSRRIARELGVHISTSYRWCWWLRNTAISYERTGCGFCKWA*
Ga0070708_10086825723300005445Corn, Switchgrass And Miscanthus RhizosphereMSEAHHTIKRDKLNQSKRPLAYWILATFLLCLACSSRRIAREVGVHVRTSYRWCWWLRNAALSYEMGRQLAGTVEADDLYHT
Ga0066686_1050629013300005446SoilLDGSKRSVMHWILATFLLCLSCASRRIAKELGVHVRTGYRWCWWLRNAALSYELGRQ*
Ga0070707_10066612013300005468Corn, Switchgrass And Miscanthus RhizosphereMSEAHHTIKRDKLNQSKRPLAYWILATFLLCLACSSRRIAREVGVHVRTSYRWCWWLRNAALSYEMGRQLAGTVEADDLYHTAGNKGQAKGGGKK
Ga0066705_1056629923300005569SoilMHWILATFLLCLSCSSRRITRELGVHMRTGYRWCWWLRNAALSYEMERQLEGTVEADDLYHTAGHKESREMLGVTAKLR*
Ga0066903_10011180843300005764Tropical Forest SoilMFLSRYSTLLHRNQRSLSHWILATFLVCLACSSRRIAREVGIHIRTSYRWCWWLRNAALSYEMQRQLD
Ga0068858_10221878113300005842Switchgrass RhizosphereDTLLHRSQRSLPYWILAAFLLCLACSSRCIARELGIHIRTSYRWCWWLRNAALSYEMERQVEGTVEADDL*
Ga0081540_1007298113300005983Tabebuia Heterophylla RhizosphereMLATFLLCLSCSSRRIAREVGVHVRTSYRWCWWLRNAALSYEIGRQLDGTVEADDLYHTAGHKGQAKQGGTKS*
Ga0081539_1031618613300005985Tabebuia Heterophylla RhizosphereLCLSCASRRVAKELGGHIRTGYQWCWWLRNAALSYEMGRQLAGTVEADDMP*
Ga0066656_1043333413300006034SoilLLDGSKRSVMHWILATFLLCLSCVSRRIAKELGVHIRTGYRWCWWLRNAALSYELGRQ*
Ga0079222_1076870623300006755Agricultural SoilLLCLSCSSRRIARELGVHIRTGYRWCWWLRNAALSYEIGRRLKGTVEADELYHTAGHKGQAKVAMLNQRSRASRA*
Ga0075428_10003759943300006844Populus RhizosphereMPWMLATFLLCLSCSSRRIAREVGVHMRTGYRWCWWLRNAALSYEMERQLGGTVEADDL*
Ga0075421_10113186213300006845Populus RhizosphereLHQSQRPLAYWILTTFLRCLACSSRRIAREVGIHIRTSYRWCWWLRNAALSYEMHRQLDGTVEADDLYHTGVG*
Ga0075430_10102762513300006846Populus RhizosphereMHDSKRSLPHWILATFLLCLACSPWRLAREVGLHDRTSYRWCWWPRNVALSYKMLRQLVGTV
Ga0075430_10172647323300006846Populus RhizosphereCLACSSRRIARELGIHIRTSYRWCWWLRNAALSYEMERQVEGTVEAVM*
Ga0075431_10008499723300006847Populus RhizosphereMPWILATFLLCLSCSSRRIAREVGVHMRTGYRWCWWLRNAALSYEMERQLGGTVEADDL*
Ga0075431_10088264133300006847Populus RhizosphereVAYWILATFLLCLSCSSRRIARELGVQSRTSYRWCWWLRNAAVSYETDRQLEGTVEADELYHTAGQKGQAKQGGTK
Ga0075433_1083458713300006852Populus RhizosphereMLATFLLCLACSSRRIAREVGVHIRTSYRWRWWLRSAALSYEMERQLEGTVEADDLYHTAGQKGQAKQ
Ga0075420_10086638433300006853Populus RhizosphereMLATFLLCLACSSRRIAREVGVHIRTSYRWRWWLRSAALSYEMERQLEGTVEADDLYHTAGQKGQA
Ga0075419_1092011233300006969Populus RhizosphereMLATFLLCLACSSRRIAREVGVHIRTSYRWCWWLRNAALSYELERQLEGTVEADDLYHTAGQKGQ
Ga0099793_1035880323300007258Vadose Zone SoilLHQSKRPLSYWILATFLLCLACSSRRIAREVGVHISTSYRWCWWLRNAALSYEMERQLEETVEADELYHTAGNK
Ga0099830_1084200823300009088Vadose Zone SoilLHQSQRPLAYWIFATFLLCLACSSRRIAREVGVHIRTSYRWCWGLRHAALSYEMERQLAGTVEADDLYHTAGQKGQAKQGGK
Ga0099827_1039451623300009090Vadose Zone SoilMRGSGQKTPHWILATFLLCLACSSRRIAREVGIHIRTSYRWCWWLRNAALSYEMQRQLEGTVE
Ga0111539_1113183123300009094Populus RhizosphereHWILATFLLCLACSSRRIAREVGLHIRTSYRWCWWLRNAALSYEMHRQLEGTVEADDLYHTARVNRTKVRKVS*
Ga0075418_1029359133300009100Populus RhizosphereLDGSKRSMMHWMLATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYE
Ga0075418_1043952733300009100Populus RhizosphereLSYWLLATFLLCLACSSRRIARELGIHIRTSYRWCWWLRNAALSYEMERQVEGTVEAVM*
Ga0075418_1206553813300009100Populus RhizosphereMHWILATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEIGRKLDGTGEADELYHTAGHKGQAKTGGTKSLGRKPR
Ga0075418_1264462613300009100Populus RhizosphereMGTLLDGSKRSVMHWILATFLLCLLCASRRIAWGLGVHICTGYRWCWWLRNAALSYEI
Ga0075418_1280259213300009100Populus RhizosphereMHWILATFLLCLSCSSRRIARELGVHRRTGYRWCWWLRNAALSYEMERQLAGTVEADDLY
Ga0066709_10115321343300009137Grasslands SoilLHQSKRSLPYWILATFLLCLSCSSRRIARELGVHIRTSYRWCWWLRNTALSYETDRQLAGTVEADDLYHT
Ga0114129_1147681623300009147Populus RhizosphereSVMHWILTTFLLCLACSSRRIARELGIRVHTSYRWCWWLRNTALPYETDRQVAGTVEADDMP*
Ga0111538_1229564023300009156Populus RhizosphereGSKRSVMHWILATFLLCLSCSSRRIARELGVHLRTSYRWCWWLRNAALSYEAHRQLAGTVEADDMP*
Ga0111538_1289145313300009156Populus RhizosphereLDGSKRSLAHWILATFLLCLACSSRRIAREVGVHVRTSYRWCWWLRNAALSYEMHRQLEGTVEADDLYHTAGNK
Ga0075423_1004514363300009162Populus RhizosphereLLHQTQRSLSHWILATFLLCLACSSRRIAREVGLHIRTSYRWCWWLRNAALSYEMHRQLEGTVEADD
Ga0075423_1158470723300009162Populus RhizosphereLDGSKRSLAHWILATFLLCLACSSRRIAREVGVHVRTSYRWCWWLRNAALSYEMERQLEGTVEADDLYHT
Ga0126374_1040213413300009792Tropical Forest SoilILATFLLCLACSSRRIARELGVHLRTGYRWCWWLRNAAMSYEMGRQVEGTVEADDL*
Ga0105065_104240123300009803Groundwater SandVFNDLTKTLLSQSKRSLPHWIVATFLLCLSCSSRRTAKELGIHVSTSYRWCWWLRNAALSYELDRQLEGCVEADELYHTAG
Ga0105088_111650113300009810Groundwater SandMLAQSKRSLPHWILATFLLCLSCSSRRIARELGVHIRTSYRWCWWLRNAAVSYETDRHVEGT
Ga0126380_1069445513300010043Tropical Forest SoilLCLACSSRRIARELGVHVRTSYRWCWWLRNAALAYEMHRQLEGTVEADDLYHTAGNKGQAKGGGKKSLGRQPCG
Ga0126380_1141221013300010043Tropical Forest SoilMRSLVYWILATFLLCLSCSSRRIARELGMHGRTSYRWCWWLRNTAISYEADRQLAGIVEADDLYHTAGQKGQAKHGGKKALG
Ga0126380_1167639513300010043Tropical Forest SoilLDGSKRSVMHWILATFLLCLSCSSRRIAREVGVHMRTGYRWCWWLRNAALSYEMTRQL
Ga0126384_1003288313300010046Tropical Forest SoilLDGSKRLVMHWILATFLLCLSCASQRIAKELGGHVRTGYRWGWWLRNAALTYELGRQLAGTVEADD
Ga0126384_1120226413300010046Tropical Forest SoilLRIEFNDLTGTLLDGSKRSLMHWILATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEIGRQLAGTVEADD
Ga0126384_1153735313300010046Tropical Forest SoilVYWILATFLLCLSCSSRRIARELGIHSRTSYRWCWWLRNTAVSYETDRQLEGTV
Ga0126384_1178301413300010046Tropical Forest SoilHWILAAFLLCLSCSSRRIARELGVHGRTSYRWCWWLRNAALSYEMERQLAGTVEADELYHTAGQKGQV*
Ga0126384_1191480913300010046Tropical Forest SoilLDGSKRSVMHWIFATFLLCLSCSSHRIAREVGVHIRTSYRWCWWLRNAALSYEMERQLAGTVEADDLYHTAGQKGQAKQG
Ga0126384_1212523113300010046Tropical Forest SoilMRSVMHWILATFLLCLACSSRRVAKELGVHIRTSYRWCWGLRNAAISYETDRQLEGTVEADDLYHTAGQKG*
Ga0126384_1220298313300010046Tropical Forest SoilLDGSKRSVMHWILATFLLCLSCSSRRIARELGVHGRTSYRWCWWLRNAALSYEMERQLEGTVEADDLYH
Ga0126382_1045393333300010047Tropical Forest SoilLDGSKRSVMHWILATFLLCLSCSSRRITKELGVHVRTGYRWCWWLRNAALSYEMERQLEGTVEADDLYHTAG
Ga0126382_1055388023300010047Tropical Forest SoilLHQSQRPLAYWILATFLVCLACSSRRIAREVGVHVRTSYRWCWWLRNAALSYEMQRQLEGTVEADD
Ga0134065_1030323623300010326Grasslands SoilMMHQSKRSLPHWVLATFLLCLACSSRRMAREVGVHIRTSSRWCWWLRNAALSYEMHRHLEGTVAADDLYHPAGNKGQAKQGGK
Ga0126370_1036566513300010358Tropical Forest SoilMHWMLATFLLCLACSSRRVAKELGVHIRTSYRWCWWLRNAALSYEMGRQLEGTVEADDLSHTAGQKGQAPQGGKKAL
Ga0126376_1025982733300010359Tropical Forest SoilLLLCLACASRRIARELGIPVRTGYRWCWWLRNAAVSYAMHRQLAGTVEADALYQTAGNKGQAKQGGKKALGHRPRGR
Ga0126376_1186930723300010359Tropical Forest SoilVYWILTTFLLCLACSSRRIAREVGVHIRTSYRWCWWLRNAALFYEMQRQVDGTVEADDLYHTAGQKGQAKHGGKK
Ga0126376_1195296713300010359Tropical Forest SoilTLFHRRQRSLADWSLATFLWCLSCSSRRIARELGGQSRTSYRWGWRLRHAAVS*
Ga0126376_1308574113300010359Tropical Forest SoilLDGSKRSVMHWILATFLLCLSCSSRRIAREVGVHMRTGYRWCWWLRNAALSYEMTRQLEGTVEADELYHTAGQKGQATQG
Ga0126378_1332663613300010361Tropical Forest SoilLLLCLACASRRIARELGIPVRTGYRWCWWLRNAAVSYAIHRQLAGTVEADA
Ga0126377_1050514713300010362Tropical Forest SoilMPTGTLLDGSKRSLMHWMLATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNATLSYEIGRQLDG
Ga0126377_1051365743300010362Tropical Forest SoilLDGSKRSVMHWILATFLLCLSCSSRRIARELGVQVRTGYRWCWWLRNAALSYEIGRKLAGTVEADDLSHTAGHKGQA
Ga0126377_1065860613300010362Tropical Forest SoilWIFATFLLCLSCSSHRIAREVGVHIRTSYRWCWWLRNASLSYEMERQLEGTVEADDMP*
Ga0126377_1085168013300010362Tropical Forest SoilMLATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEIGRQLDGTVEADDLSQTAGNKGQAKQGGKKALGR
Ga0126377_1243405623300010362Tropical Forest SoilLDGSKRSVMHWMLATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEMARQLEGTVEA
Ga0134066_1032590913300010364Grasslands SoilMMHQSKRSLPHWVLATFLLCLACSSRRMAREVGVHIRTSSRWCWWLRNAALSYEMHRHLEGTVAADDLYHPAGNKGQAK
Ga0126379_1084824613300010366Tropical Forest SoilLDGSKRSLVHWLLATFLLCLACSSRRIAREVGVHVQTSYRWCWGLRNAALSY
Ga0126379_1125159713300010366Tropical Forest SoilMLATFLLCLSCASRRIARERGVHSRSSYRWGWWLRNTAVSYEMHRQLDGTVEADDL
Ga0126381_10341110113300010376Tropical Forest SoilLDGSKRSVMHWMLATFLLCLACSSRRVAKELGVHIRTSYRWCWWLRNAALSYEMGRKLAGTVEADDLYHTAGH
Ga0126383_1011585123300010398Tropical Forest SoilLDGSNRSLRHWILATFLLCLSCASRRIAKEVGVHIRTSYRWCWWLRNAALSYEVQRQVDGTVEADDMP*
Ga0126383_1020861513300010398Tropical Forest SoilLDGSKRSVMHWILATFLLCLSCSSRRIAREVGVHGRTGYRWCWWLRNAALSYEVGRKLAGTVE
Ga0126383_1154459413300010398Tropical Forest SoilLDGSKRSVMHWILATFLLCLSCASRRIARELGVHVRTGYRWCWWLRNAA
Ga0137422_111559713300011416SoilLPHWILATFLLCLACSSRRIAREIGVHIRTSYRWCWWLRNTALSYEMHRQLEGTVEADDLYCVFHAKAATDSR*
Ga0137388_1035424533300012189Vadose Zone SoilLHWSKRPLSYWILATFLLCLACSSRRIAREVGVQSRTSYRWCWWLRNTAMSYETDRRLEGTVEADDLYHTAGSKGQA
Ga0137399_1155174013300012203Vadose Zone SoilMHRSKRPLSYWILATFLLCLSCSSRRIARELGIHSRTSYRWCWWLRNTAVSYETDRRLEGTVEADDLYHTAGSKGQAK
Ga0137380_1058043313300012206Vadose Zone SoilMLHQSRRSLPHWILATFLLCLACSSRRIAREVGVHIRTSYRWCWWLRNAALSYEMQRQLEGTVEADDLYHTAGNKGQAKQGGKKALG
Ga0137380_1105668413300012206Vadose Zone SoilLAQSKRSLGHWILATFLLCLSCSSRRIARELGVHIPTSYRWCWWLRNTAISYETDRQVEGTVEADELYHIAGSKGQAKHGGTKHLGR
Ga0137377_1192052013300012211Vadose Zone SoilMLHQSKRSLPHWILATFLLCLACSSRRIAREVGVHIRTSYRWCWWLRNAALSYEMERQLEGTVEADELYHTAGNKGQAKQGGKKAL
Ga0137384_1075432323300012357Vadose Zone SoilMLHQSKRSLPHWILATFLLCLACSSRRIAREVGVHIRTSYRWCWWLRNAALSYEMQRQLEGTVEADE
Ga0137385_1008340643300012359Vadose Zone SoilMSNDLTGTLLDGSKRSLAHWILATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEIGR
Ga0137385_1122885323300012359Vadose Zone SoilMLHQSRRSLPHWILATFLLCLACSSRRIAREVGVHIRTSYRWCWWLRNAALSYEMQRQLEGTVEADE
Ga0137385_1127440613300012359Vadose Zone SoilLCLACSSRRIARELGIHIRTSYRWCWWLRNAALSYEMQRQLDGTVEADDLYHTAGNKGQAKQGGKKLLGSRARVRKKKREP
Ga0137375_1133795013300012360Vadose Zone SoilMHRSKRPLSYWILATFLLCLSCSSRRIARELGVHSRTSYRWCWWLRNTAVSYETDRRLEGTVEADDLYHTA
Ga0137360_1174467613300012361Vadose Zone SoilLAQSKRSLGHWILATFLLCLSCASRRIARELGVHIPTSSRWCWWLRNAALSYEMERQLEGTVEA
Ga0137397_1033675813300012685Vadose Zone SoilMIETIASNTLLAQSKQSLPHWILATFLLCLSCSSRRIARELGVHIRTSYRWCWWLRNAALSYEMHRQLEGTVELDDLYH
Ga0137397_1041655913300012685Vadose Zone SoilLLAQSKRSLGHWILATFLLCLACSSRRIARELGVHIRTSYRWCWWLRNAALSYEMQRQLEGTVEADDLYHTAGQ
Ga0137359_1152037113300012923Vadose Zone SoilMLHQRKRSLSHWILGTFLLCLWCSSRRMARELGVHVRTGYRWCWWLRHAALSY
Ga0137359_1168444313300012923Vadose Zone SoilMLAQSTWALPHGILATFRLCLSCSARRIARELGVHIRTSDHWCWWLRNAAVSYETDRHLKGTVEADEL*
Ga0137407_1118834813300012930Vadose Zone SoilMHQSKRSLPHWILATFLLCLACSSRRIAREIGVHIRTSYRWCWWLRNAALSYEMERQLEGTVEADELY
Ga0126375_1040118123300012948Tropical Forest SoilLCLACSSRRIAREVGVHIRTSYRWCWWLRNAALSYEMHRQLAGMVEADDLYHTAGQKGQAKHSGKKHIPMD*
Ga0126375_1045040423300012948Tropical Forest SoilLHQSQRPLAYWILATFLLCLACSSRRIAREVGVHVRTSYRWCWWLRNAALSYEMQRQLEGTVE
Ga0126375_1098193513300012948Tropical Forest SoilTLLDGSKRSLMHWILATFLLCLSCSSRRIAREVGVHRRTGSRWCWWLRNAALSYEMERQLAGTVEAG*
Ga0126375_1210097813300012948Tropical Forest SoilTFNDLTNMLLARSKRSLPHWILATFLLCLLCSSRRIARELGVHIRTSYRWCWWLCNAAVSYETDRHLEGTIEADELYHTAG*
Ga0164302_1081550013300012961SoilMMHQSKRSLPHWVLATFLLCLACSSRRMAREVGVHIRTSYRWCWWLRNAALSYEMHRQLEGTVEADDLYHTAGNK
Ga0126369_1142755113300012971Tropical Forest SoilMLATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEIGRQLDGTVEADDLSQTAGNKGQAKQGGKKA
Ga0126369_1236915513300012971Tropical Forest SoilLDGSKRSVVHWILATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEIGRKLEGTVEADDL
Ga0126369_1247759813300012971Tropical Forest SoilMHWILATFLLCLSCSSRRIARELGVHGRTGYRWCWWLRNAALSYEMERQLEG
Ga0134110_1032257413300012975Grasslands SoilMHWILATFLLCLSCSSRRIARELGVYGRTSYRWSWWLRNTALSYETDRQLAGTVEADDLYHTA
Ga0157374_1129613613300013296Miscanthus RhizosphereLDGSKRSLAHWRLATFLLCLACSSRRIAREVGVHVRTSYRWCWWLRNAALSYEMHRQLEGTV
Ga0163162_1010646143300013306Switchgrass RhizosphereMHGMLATFLLCLSCASRRMARELGVHGRTSYRWCWWLRNAALSSERHRQLDGTVETDDL*
Ga0157379_1206265813300014968Switchgrass RhizosphereMHWMLATFLLCLSCSSRRIARELGAQTRTGYRWCWWLRNTALSYEMERQLEGTVEADELYHTAGQKGQAKQ
Ga0137403_1012809333300015264Vadose Zone SoilLLAQSKRSLAHWLLAAFLLCLSCASRRIAREVGVHIRTSYRWCWWRRNTAVSYETGRQLEGTVEADDLYHTAGNKG
Ga0132256_10015824963300015372Arabidopsis RhizosphereNDLTGTLLDGSKRSLMHWILATFLLCLSCSSRRIAREVGVHRRTGYRWCWWLRNAALSYEMERQ*
Ga0132255_10445616713300015374Arabidopsis RhizosphereMHGMLATFLLCLSCASRRMARELGVHGRTSYRWCWWLRNAALSSERHRQLDGTVEADDL*
Ga0132255_10509394913300015374Arabidopsis RhizosphereLDGSKRSVMHWILATFLLCLSCSSRRIARELGVHGRTSYRWCWWLRNAALSYEMERQVEGTVEADD
Ga0182033_1018333313300016319SoilLTGTLLDGSKRSVMHWILATFLLCLSCSSRRIARELGVHGRTSYRWCWWLRNAALSYEMERQLEGTVEADDMP
Ga0182033_1142538013300016319SoilMHWILATFLLCLACSSRRIAKELGVHIRTGYRWCWWLRNAALSYELGRQLEGTVEADDLYHTAGNKGQAQGGGKKALGRR
Ga0207677_1079116723300026023Miscanthus RhizosphereMHGMLATFLLCLSCASRRMARELGVHGRTSYRWCWWLRNAALSSERHRQLDGTVETDDL
Ga0256867_1002491623300026535SoilMMHQSKRSLLHWILATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRNAALSYEMGRKLDGTVEADDMP
Ga0209481_1001868033300027880Populus RhizosphereMPWMLATFLLCLSCSSRRIAREVGVHMRTGYRWCWWLRNAALSYEMERQLGGTVEADDL
Ga0209069_1029917123300027915WatershedsMASREHLLDGSKRSVMHGILATFLLCLSCSSRRIARELGVHVRTGYRWCWWLRHA
Ga0209859_102315633300027954Groundwater SandLHQSKRSLPHWILATFLLCLACSSRRIAREGGLHIRTSYRWCWWLRNAALSYEMHRQ
Ga0247822_1064306513300028592SoilLLCLSCSSRRIAKELGVHVRTGYRWCWWLRNAALSYEIGRQLAGTVEADDMP
Ga0299907_1039140923300030006SoilMHWIVDTFLLCLSCSSRRLEKELGVHIRTGYRWCWWLRNAALFYEMERRLEGTVEADDMP
Ga0307469_1230898613300031720Hardwood Forest SoilGTLLDGSKRSVMHGILATFLLCLSCASRRIAKELGVHVRTGYRWCWWLRKAALSYELGRQ
Ga0310900_1154020723300031908SoilCLACSSRRIARELGVHIRTGYRWCWWLRNAALSYEMKRQLEGTVEADDLYGECSGYVFMV
Ga0310906_1062303523300032013SoilDTLLHRSKRPVLHWILATFLLCLSCSSRRIARELGLHSRTSYRWCWWLRNAALSYVTGCK
Ga0307415_10168592923300032126RhizosphereTFLLCLACSSRRVAKELGVHIRTSYRWCWWLRNAALSYEMGRKLAGTVEADDLYHTAGHKGQATHGEKK
Ga0315287_1267254113300032397SedimentMCIPCSCLRIRRELGIHIKTAYRWCWWFRNVALSYEVSRQLEGIVEADE
Ga0335083_1076711323300032954SoilPSWILATFLVCLACSSRRIARALGVHGRTSYRWCWWLRNAALSSEMHRQLAGTVAADDL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.