NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F074636

Metagenome Family F074636

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F074636
Family Type Metagenome
Number of Sequences 119
Average Sequence Length 129 residues
Representative Sequence ILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Number of Associated Samples 104
Number of Associated Scaffolds 119

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(29.412 % of family members)
Environment Ontology (ENVO) Unclassified
(44.538 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.387 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 0.00%    β-sheet: 56.00%    Coil/Unstructured: 44.00%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 119 Family Scaffolds
PF00072Response_reg 9.24
PF01790LGT 7.56
PF04392ABC_sub_bind 5.04
PF06906DUF1272 3.36
PF01087GalP_UDP_transf 3.36
PF02446Glyco_hydro_77 2.52
PF00496SBP_bac_5 1.68
PF01209Ubie_methyltran 1.68
PF14376Haem_bd 1.68
PF08241Methyltransf_11 1.68
PF01139RtcB 1.68
PF00483NTP_transferase 1.68
PF02585PIG-L 0.84
PF01738DLH 0.84
PF04773FecR 0.84
PF01717Meth_synt_2 0.84
PF00248Aldo_ket_red 0.84
PF13358DDE_3 0.84
PF13424TPR_12 0.84
PF10017Methyltransf_33 0.84
PF16268DUF4921 0.84
PF13231PMT_2 0.84
PF07355GRDB 0.84
PF13557Phenol_MetA_deg 0.84
PF08447PAS_3 0.84
PF12833HTH_18 0.84

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 119 Family Scaffolds
COG0682Prolipoprotein diacylglyceryltransferaseCell wall/membrane/envelope biogenesis [M] 7.56
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 5.04
COG3813Uncharacterized conserved protein, DUF1272 domainFunction unknown [S] 3.36
COG16404-alpha-glucanotransferaseCarbohydrate transport and metabolism [G] 2.52
COG1690RNA-splicing ligase RtcB, repairs tRNA damageTranslation, ribosomal structure and biogenesis [J] 1.68
COG2226Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenGCoenzyme transport and metabolism [H] 1.68
COG22272-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol methylaseCoenzyme transport and metabolism [H] 1.68
COG0620Methionine synthase II (cobalamin-independent)Amino acid transport and metabolism [E] 0.84
COG2120N-acetylglucosaminyl deacetylase, LmbE familyCarbohydrate transport and metabolism [G] 0.84


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil29.41%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil21.85%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil10.92%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil10.92%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil5.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.20%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.20%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil4.20%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.52%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.68%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.84%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.84%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.84%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.84%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.84%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300004267Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBioEnvironmentalOpen in IMG/M
3300004268Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBioEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300018060Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300027527Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031719Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031724Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20EnvironmentalOpen in IMG/M
3300031770Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17EnvironmentalOpen in IMG/M
3300031778Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f24EnvironmentalOpen in IMG/M
3300031795Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f19EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J11758_1281407923300000789SoilMQRVALVLAMAVAGVFGPIVAVKAESPAIGAENQFRFQFEEAQRSHRGQAVEGYLYNDLPWRITNVRLRVESLDGSGWVTGXASGWVVGDVKXGGRGYXFVPVVSXAATYRAXVQSFDKVVLEAPPLQAP*
JGI1027J11758_1282662313300000789SoilQQTGGGMQRVALVLAMAVAGVFGPIVAAKAESPAIGADNQFRFQFEEAPRSHRGQAVEGYLYNDLPWRITNVRLRVESLDGSGWVTGDASGWVVGDVKAGGRGYFFVPVVSPAATYRASVQSFDKVVLEAPPLQAP*
JGI25385J37094_1000789873300002558Grasslands SoilVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
JGI25383J37093_1001977323300002560Grasslands SoilVHRILVLGMVVAGVVGPVVAGAQTPATDPQNPFRFEVEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDTAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
JGI25383J37093_1015080113300002560Grasslands SoilLIVVAILLGPLPKGTGCRRALNGAIVSGHMISRRAFVGGAVVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
JGI25382J37095_1001404623300002562Grasslands SoilVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFEVEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDTAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
JGI25382J43887_1000240213300002908Grasslands SoilVVHRILVLGMVVAGVVGPVVAGAQTPATDPQNPFRFEVEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDTAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFE
JGI25382J43887_1020324913300002908Grasslands SoilRRAFVGGAVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066396_1000368333300004267Tropical Forest SoilMQRMALVFVTVVVGVFGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP*
Ga0066398_1005449613300004268Tropical Forest SoilGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP*
Ga0066674_1000023073300005166SoilVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066672_1032516913300005167SoilVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0066677_1012060413300005171SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0066683_1007087123300005172SoilVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066680_1053107213300005174SoilGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066673_1063763913300005175SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNK
Ga0066678_1051645513300005181SoilVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066678_1095536523300005181SoilRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0066676_1001016473300005186SoilILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066675_1141412913300005187SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAAT
Ga0070708_10007404513300005445Corn, Switchgrass And Miscanthus RhizosphereMITRRSALVSLMSLLAAPLAAEAQQAAENPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVENVDPTGVVTGESYGWVVGDVKAGGRGYFFVLVSSRAATYRATVESFNKVALEAPTEAP*
Ga0066681_1000134983300005451SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066697_10000438163300005540SoilVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVDLDTPQFEAP*
Ga0066692_1025991913300005555SoilGARRRPRQSAALIVVAILLGPLPKGTGCRRALNGAIVSGHVISRRAFVGGAVVRSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0066704_1094485923300005557SoilVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRA
Ga0066698_1071570913300005558SoilAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066708_1006019233300005576SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGAGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0066691_1011475533300005586SoilVVHRILVLGMVVAGVVGPVVAGAQTPATDPQNPFRFEVEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066903_10546162813300005764Tropical Forest SoilPAIGAESQFRFQLEEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAATYRATVQSFDKVVLETPLQAP*
Ga0066651_1029157513300006031SoilGAIVSGHMISRRAFVGGAVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066652_10013558343300006046SoilGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVAGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0066665_1052747413300006796SoilVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKV
Ga0066659_1045090023300006797SoilMISRRAFVGGAVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0066659_1086712513300006797SoilRYAPVRIRPPPPFTPLGIATDMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0099791_1003941633300007255Vadose Zone SoilMERRVLEDLWRVNLWCNRILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP*
Ga0099791_1033373313300007255Vadose Zone SoilMMKKMILVFAAVVTGVLAPIVATAQTRVTDAPSQFLFELAESESHRGRAVEGYVYNGLPWGITNVRLRVESVDRSGTVTGEASGWVVGDVQAGGRGYFFVPVSSRAAAYRATVQSFDKVAREAPRVEAP*
Ga0099793_1002792523300007258Vadose Zone SoilMERRVLEDLWRVNLWCNGILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP*
Ga0126374_1120566513300009792Tropical Forest SoilMALVFVTVVVGVFGSIVAVKAESPAIGAESQFRFQLEEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFD
Ga0126384_1001572233300010046Tropical Forest SoilMALVFVTVVVGVFGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP*
Ga0126382_1043518213300010047Tropical Forest SoilMAIVFATVVVGVFGSIVAVKAESPGIGAESQFRFELADAQRSHRGQAVEGYLYSGLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAANYRATVQSFDKVVLETPLQAP*
Ga0134070_1000379563300010301Grasslands SoilMISRRAFVGGAVVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0134082_1016763123300010303Grasslands SoilMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0134088_1000210923300010304Grasslands SoilMISRRAFVGGALVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0134063_1004034123300010335Grasslands SoilMISRRAFVGGAVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0126372_1052618223300010360Tropical Forest SoilVVGVFGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP*
Ga0126378_1139252213300010361Tropical Forest SoilHIGSGLEAAMQRMALVFVTVVVGVFGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP*
Ga0126377_1000725143300010362Tropical Forest SoilMKRLTLVVAMIVAGVFGPIVTVKAESPAIDVESQFRFELAEAQHSHRGQAVEGYLYNGLPWRITNVRLRVESLDPNGRVTGQASGWVVGDVKGGDRGYFFVPIMSRATTYRATVQSFDKIVLEAPPLQAP*
Ga0126377_1058855613300010362Tropical Forest SoilMAIVFATVVVGVFGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP*
Ga0126379_1147783513300010366Tropical Forest SoilMQRMALVFAAIVVGVFGSIVAVKAESPAIGAESQFRFQLEEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAATYRATVQSFDKVVLETPLQAP*
Ga0126383_1015129223300010398Tropical Forest SoilMQRMAIVFATVVVGVFGSIVTVKAENPAIGAESQFRFELADAQRSHRGQAVEGYLYSGLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLETPLQAP*
Ga0126383_1092119113300010398Tropical Forest SoilSISVLDWRLAMQRMALVFVTVVVGVFGSIVAVKAESPAIGAESQFRFGLEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP*
Ga0137389_1142984613300012096Vadose Zone SoilIMERRVLEDLWRVNLWCNRILVFGIVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNSLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSRAATSRATVESFNKVALEAP*
Ga0137383_1051236623300012199Vadose Zone SoilMMKKMMILVSAAVVTGILGPIVAQAQTPVTDVPSQFRFELTQAESYRGRAVEGYIYNGLPWGITNVRLRVESVDASGTVSGETFGWVIGDVRAGGRGYFFVPVSSRAAAYRANVQSFDKVAREVPRIE
Ga0137382_1064330033300012200Vadose Zone SoilMISRRAFVGGAVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDGTGGVAGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPDLEAP*
Ga0137363_1043595323300012202Vadose Zone SoilMQRRVLEDLWRVKLWCNRILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPTEAP*
Ga0137399_1052168333300012203Vadose Zone SoilMIARRAFLAGSVAVLTAPCAAEAQQPAQNPFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0137379_1013993153300012209Vadose Zone SoilMISRRAFVGGAVVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLASSRAATYRATVQSFNKVALDTPQFEAP*
Ga0137384_1028283913300012357Vadose Zone SoilPVTDVPSQFRFELTQAESYRGRAVEGYIYNGLPWGITNVRLRVESVDASGTVSGETFGWVIGDVRAGGRGYFFVPVSSRAAAYRANVQSFDKVAREAPRIEAP*
Ga0137360_1054388623300012361Vadose Zone SoilMQRRVLEDLWRVNLWCNRILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP*
Ga0137361_1062939133300012362Vadose Zone SoilMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0137361_1109025613300012362Vadose Zone SoilMMKKMMILVSAAVVTGILGPIVAQAQTPVTDVPSQFRFELTQAESYRGRAVEGYIYNGLPWGITNVRLRVESVDASGTVSGETFGWVIGDVRAGGRGYFFVPVSSRAAAYRATVQSFDKVAREA
Ga0137358_1012860913300012582Vadose Zone SoilDRRVAAGRPGAPDVGNQQEATFVEEHQMSVQALRVFFTATQRYRFQRAMAGILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP*
Ga0137397_1055060323300012685Vadose Zone SoilMSVQALRVFFTATQRYRFQRAMAGILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP*
Ga0137396_1006634513300012918Vadose Zone SoilMDRRALEKLQTVNLWCNRILMFGMLVAGSVGPVVVEAQTPATDAQKPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVETVDSAGEVTGETYGWVVGDVKAGGRGYFFVLVPSRAAT
Ga0137396_1007322733300012918Vadose Zone SoilMVVAGVVGPVVVEAQTPATEAQNPFRFELEEAESPFRGRAVEGYVYNNLPWRITNVRLRVESVDPTGRATGEAYGCDVGDLKACGRGYFFVLVSSRAATYRATVQSFHKVVLEAPQFESP
Ga0137396_1014647733300012918Vadose Zone SoilMISRRAFLAGSVAVLTAPCAAEAQQPAQNPFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0137359_1012444743300012923Vadose Zone SoilAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0137404_1010568013300012929Vadose Zone SoilMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRIESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP*
Ga0126375_1061719223300012948Tropical Forest SoilRMAIVFATVVVGVFGSIVTVKAENPAIGAESQFRFELADAQRSHRGQAVEGYLYSGLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLETPLQAP*
Ga0126375_1134280413300012948Tropical Forest SoilLVVAMIVGGVFGPIVTVKAESPAIDVESQFRFELAEAQHSHRGQAVEGYLYNGLPWRITNVRLRVESLDPNGRVTGQASGWVVGDVKGGDRGYFFVPIMSRATTYRATVQSFDKIVLEAPPLQAP*
Ga0126369_1280126313300012971Tropical Forest SoilMQRMNLVFATIVVGVFGSSVAVKAESPAIGAESQFRFQLEEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGTGRVTGEASGWVVGDIKGGGRGYFYVPVTSPAATYRATVQSFDKVVLETPLQAP*
Ga0134078_1025171113300014157Grasslands SoilNGAIVSGHMISRRAFVGGAVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP*
Ga0134079_1020204513300014166Grasslands SoilMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATV
Ga0132256_10136931813300015372Arabidopsis RhizosphereVTGTHGAARAQSPAAGAENPFRFELGEAENPHRGRAVEGYVYNELRWRITNVRLRVESVDSAGTVTGQNSGWVLGDVKAGGRGYFFVLVAPGAATYRASVESYARVMLEAPRSEAP*
Ga0132257_10162834233300015373Arabidopsis RhizosphereAENPFRFELGEADNPHRGHAVEGYVYNELPWRITNVRLRVESVDSAGAVTSQNSGWVLGDVKAGGRGYFFVLVAPGAATYRASVESYDRVMLEAPRSEAP*
Ga0182041_1190503023300016294SoilVLLSGVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLCNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDKVVLEAPLQAP
Ga0134112_1000508873300017656Grasslands SoilMISRRAFVGGALVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALETPQFEAP
Ga0187765_1093704313300018060Tropical PeatlandGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDKVVLEAPLQAP
Ga0066655_1001007373300018431Grasslands SoilVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0066662_1045959833300018468Grasslands SoilMISRRAFVGGALVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0137408_136252013300019789Vadose Zone SoilGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSRAATYRATVESFNKVAVEAPQFEAP
Ga0179594_1025619813300020170Vadose Zone SoilMMKKMMILVSAAVVTGILGPIVAQAQTPVTDVPSQFRFELTQAESYRGRAVEGYIYNGLPWGITNVRLRVESVDASGTVSGETFGWVIGDVRAGGRGYFFVPVSSRAAAYRANVQSFDKVAREAPRIEAP
Ga0179592_1012961723300020199Vadose Zone SoilMERRVLEDLWRVNLWCNRILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP
Ga0137417_128666633300024330Vadose Zone SoilMERRVLEDLWRVNLWCNGILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP
Ga0209234_123943813300026295Grasslands SoilRIRPPPPFTPLGIATDMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP
Ga0209235_101560473300026296Grasslands SoilVVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0209236_100835693300026298Grasslands SoilVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFDKVALDTPQFEAP
Ga0209761_1006322103300026313Grasslands SoilVVHRILVLGMVVAGVVGPVVAGAQTPATDPQNPFRFEVEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDTAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0209761_101111723300026313Grasslands SoilVHSILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0209471_107679843300026318SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP
Ga0209472_100578293300026323SoilVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0209472_106945543300026323SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTP
Ga0209801_125421923300026326SoilRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP
Ga0209473_117233413300026330SoilRPPPPFTPLGIATDMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP
Ga0209158_102962523300026333SoilVHRILVLGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFALVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0209804_108885313300026335SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAA
Ga0209057_100047673300026342SoilVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFNKVALDTPQFEAP
Ga0209808_101260713300026523SoilRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGAGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP
Ga0209806_118628913300026529SoilMRLLTEQRGAIVSGHMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSRAATYRATVQSFDKVALDTPQFEAP
Ga0209160_120794543300026532SoilMRLLTEQRGAIVSGHMISRRAFFAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVAGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATV
Ga0209056_1012853443300026538SoilMRLLTEQRGAIVSGDMISRRAFLAGSVAVLTAPCAAEAQQPAQNSFRFELEEAQNPFRGRAIEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAPQLEAP
Ga0209684_104926923300027527Tropical Forest SoilAMQRMALVFVTVVVGVFGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP
Ga0209076_103979013300027643Vadose Zone SoilQGPRRHHPAVAVAEGRSGHRIMERRVLEDLWRVNLWCNGILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDSAGGVTGETYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP
Ga0209689_133132323300027748SoilVVHRILLWGMVVAGVVGPVVAGAQTPATDPQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDPAGRVTGEASGWVVGDVKAGGRGYFFVLVSSRAATYR
Ga0209465_1000702633300027874Tropical Forest SoilMQRMALVFVTVVVGVFGSIVAVKAESPAIGAESQFRFELEEAQRSHRGQAVEGYLYNRLPWRITNVRLRVESLDGTGRVTGEASGWVVGDVKGGGRGYFYVPVTSPAVNYRATVQSFDKVVLEIPLQAP
Ga0137415_1002239933300028536Vadose Zone SoilMERRVLEDLWRVNLWCNGILVFGMVVAGILGPVVVEAQTPATDAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVESVDGTGGVTGESYGWVVGDVKAGGRGYFFVLVSSGAATYRATVESFNKVALEAP
Ga0306917_1031688723300031719SoilMKGTVVAVAVLLSGVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDKVVLEAPLQAP
Ga0307469_1036616713300031720Hardwood Forest SoilMKKMILVSATVVTGILGPIVAQAQTPVTDVPSQFRFELTEAESYRGRAVEGYIYNGLPWGITNVRLRVESVDTSGTVSGETFGWVIGDVRAGGRGYFFVPVSSRAAAYRAAVQSFDKVAREAPRIEAP
Ga0318500_1030471913300031724SoilMKGTVVAVAVLLSGVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLCNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDKVVLEAPLQAP
Ga0318521_1028854613300031770SoilVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDKVVLEAPLQAP
Ga0318498_1039985623300031778SoilAVAVLLSGVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLCNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDKVVLEAPLQAP
Ga0318557_1022841513300031795SoilKGTVVAVAVLLSGVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLCNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDKVVLEAPLQAP
Ga0307473_1125541013300031820Hardwood Forest SoilVAQAQTLVTDAPSQFRFELTEAESYRGRAVEGYIYNGLPWGITNVRLRVESVDASGTVSGETFGWVIGDVRAGGRGYFFVPVSLRAPAYRATVQSFDKVAREAPRIEAP
Ga0306921_1204395713300031912SoilMKGTVVAVAVLLSGVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQS
Ga0310912_1090005513300031941SoilMKGTVVAVAVLLSGVFGPVVAGRAEIPDVNTEAQFRFQLAEAQRSHRGQAVEGYLYNGLPWRITNVRLRVESLDGNGRVTSEASGWVVGDVKGGGRGYFYVPVTSPAVTYRATVQSFDK
Ga0307471_10074392213300032180Hardwood Forest SoilAEAQQAAQNPFRFELEEAQSPFRGRAVEGYVYNNLPWRITNVRLRVENVDPTGVVTGESYGWVVGDVKAGGRGYFFVLVSSRAATYRATVESFNKVALEAPTEAP
Ga0307471_10282745423300032180Hardwood Forest SoilILGPIVAQAQTPVTDVPSQFRFELTEAESYRGRAVEGYIYNGLPWGITNVRLRVESVDASGAVSGETFGWVIGDVRAGGRGYFFVPVSSRAPAYRATVQSFDKVAREAPRIEAP
Ga0307472_10065003023300032205Hardwood Forest SoilMKMMILVSATVVTGILGPIVAQAQTPVTDVSSQFRFELTEAESYRGRAVEGYIYNGLPWGITNVRLRVESVDTSGTVSGETFGWVIGDVRAGGRGYFFVPVSSRAAAYRANVQSFDKVAREAPRIEAP
Ga0310812_1034323913300032421SoilAARAESPAAGAENPFRFELGEAENPHRGRAVEGYVYNELPWRITNVRLRVESVDSAGTVTGQNSGWVLGDVKAGGRGYFFVLVAPGAATYRASVESYDRVMLEAPRSEAP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.