NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F100902

Metagenome / Metatranscriptome Family F100902

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F100902
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 151 residues
Representative Sequence MVFPPLLVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIHLADFRRSMLIYPQNRRVRLYLVERRRSS
Number of Associated Samples 92
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.96 %
% of genes from short scaffolds (< 2000 bps) 1.96 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.039 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(21.569 % of family members)
Environment Ontology (ENVO) Unclassified
(33.333 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(65.686 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 38.82%    β-sheet: 22.37%    Coil/Unstructured: 38.82%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF00106adh_short 13.73
PF13231PMT_2 12.75
PF01408GFO_IDH_MocA 6.86
PF12804NTP_transf_3 2.94
PF14667Polysacc_synt_C 2.94
PF13460NAD_binding_10 1.96
PF01041DegT_DnrJ_EryC1 1.96
PF00483NTP_transferase 0.98
PF01066CDP-OH_P_transf 0.98
PF04794YdjC 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG0399dTDP-4-amino-4,6-dideoxygalactose transaminaseCell wall/membrane/envelope biogenesis [M] 1.96
COG0436Aspartate/methionine/tyrosine aminotransferaseAmino acid transport and metabolism [E] 1.96
COG0520Selenocysteine lyase/Cysteine desulfuraseAmino acid transport and metabolism [E] 1.96
COG0626Cystathionine beta-lyase/cystathionine gamma-synthaseAmino acid transport and metabolism [E] 1.96
COG1104Cysteine desulfurase/Cysteine sulfinate desulfinase IscS or related enzyme, NifS familyAmino acid transport and metabolism [E] 1.96
COG2873O-acetylhomoserine/O-acetylserine sulfhydrylase, pyridoxal phosphate-dependentAmino acid transport and metabolism [E] 1.96
COG0558Phosphatidylglycerophosphate synthaseLipid transport and metabolism [I] 0.98
COG1183Phosphatidylserine synthaseLipid transport and metabolism [I] 0.98
COG3394Chitooligosaccharide deacetylase ChbG, YdjC/CelG familyCarbohydrate transport and metabolism [G] 0.98
COG5050sn-1,2-diacylglycerol ethanolamine- and cholinephosphotranferasesLipid transport and metabolism [I] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A98.04 %
All OrganismsrootAll Organisms1.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300025929|Ga0207664_11872717All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium522Open in IMG/M
3300031945|Ga0310913_10924478All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium613Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil21.57%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil12.75%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil9.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil8.82%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil6.86%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.88%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil4.90%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil3.92%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere3.92%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.98%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.98%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.98%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.98%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.98%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.98%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.98%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.98%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere0.98%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.98%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.98%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.98%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.98%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.98%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000597Forest soil microbial communities from Amazon forest - 2010 replicate II A1EnvironmentalOpen in IMG/M
3300002915Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cmEnvironmentalOpen in IMG/M
3300004081Grasslands soil microbial communities from Hopland, California, USA - 2 (version 2)EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005290Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Rhizosphere Soil Replicate 1: eDNA_1Host-AssociatedOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005841Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2Host-AssociatedOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300010038Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012895Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S208-509C-2EnvironmentalOpen in IMG/M
3300012915Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S103-311B-2EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015201Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S014-104B-1 (version 2)EnvironmentalOpen in IMG/M
3300015262Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-113_1 MetaGHost-AssociatedOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020018Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300030945Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031719Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2)EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031833Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300031942Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10187134523300000364SoilPLLVIMLRGFLRLPRPWTLSAICVPALLLYILVPLAIENHGDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILXXVPIAEDEEXXRNAAAVYTDXMSLKXKXXYLIXLAXFRRSXLIYPQNRRVRLYFVERRRSS*
INPhiseqgaiiFebDRAFT_10187174623300000364SoilLPSPWKLSAICVPALLFYIVVPLAIENHRDEAPPVRLVRFLEKLYPPSKRGDVLLVLPTTRRSAQWYAPQFKIMDHVPVSKEDEEMLHNAAAVYTEDASLKGKDYFLIELAEFRRLMLIYPQHRRVRLYLVERRRAS*
INPhiseqgaiiFebDRAFT_10187232513300000364SoilGDQRYYLMVFAPLLVVMLRGFLQLPSPWKLSAICVPALLFYIVVPLAIENHRDESPPVRLVRYLEKLYPPSKRGNVLLVLPTTRRSAQWYAPQFKILDHVPVSQQDEEMLRNAAVVYTEDASLKGKDRYLIELAEFRRSMLIYPQHRRVRLYFVERRRSS*
AF_2010_repII_A1DRAFT_1011368213300000597Forest SoilIIFVALPGDQRYYLMVFPPLLVIMLRGFLRLPKPWTLSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGDVLLILPVAYRSAQWYAPEFKILDHFPTSEQDEEVIRNAKTVYSEVASFNRENYYFIELAEFKRSMLIYPQNRHLRLYLVERRRSS*
JGI25387J43893_103604223300002915Grasslands SoilPWNLWAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDERSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS*
Ga0063454_10031857313300004081SoilWKLLAICVPALLLYIVVPLAIENHRDEAPAVRLVRYLEKLYPPAKRGDVLLILPVVYRSAQWYAPQFKILDHLPVAEDEELLRNAAAVYTDEISFNRKDFYLIHLADFRRSVLIYPQNRRVRLYLVQRRQSS*
Ga0062589_10197383913300004156SoilVFIALPGDQRYYLMIFPPLLVIMLRGLLRLPSRWKLLAICVPALLLYIVVPLAIENHRDEAPAVRLVRYLEKLYPPSERGNVLLVLPVVYRSAQWYAPQFKILDHLPVAEDEELLRNAAAVYTDEMSLKRKDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0066677_1032097223300005171SoilMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKQRYLIELAEFRHSMLIYLQHRRVQLYLVERRRSS*
Ga0066673_1001161933300005175SoilMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0066673_1052728713300005175SoilPWNLSAICVPALLLYILVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHNPTAEDQEVLRNAAAVYTDELSLKRKDFYLIELAEFRRSMLIYPQNRRVRLYLVARRRSS*
Ga0066684_1023223213300005179SoilAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0066671_1018126513300005184SoilVVMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPPVRLVRYLEKLYPPSKRGNVLLILPVAYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKLKDFYLIQLADFRRSTLIYTQNRHVRLYLVERWRSS*
Ga0066675_1094854723300005187SoilPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0065712_1020763613300005290Miscanthus RhizosphereSAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPLKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDERSLKRKDFYLIHLADFRRSKLIYPQNRGVRLYLVVRRRSS*
Ga0066388_10192428213300005332Tropical Forest SoilRGFLRLPKPWNLSAICVPAFLLYIVVPLAIENYREEAPAVRLVRYFEKLYPPSKRGDVLLILPVAYRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKTVYSEVASFNRENYYFIDLAEFKRSMLIYPQNRHLRLYLVERRRSS*
Ga0070660_10019054013300005339Corn RhizosphereLMIFPPLLVIMLRGFLRLPRPWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0070671_10113205523300005355Switchgrass RhizosphereIMLRGVLGLPRPWKSLAIWVPALLLYIVVPLAVENHRDEAPAVRLVRYLEKLYPPSQRSNVLLILPVVYRSAQWYAPQFKILDHLPLPEDEELVRNAAAVYTDEISFKREDLYLIHLADFRRSVLIYPQNRRVRLYLVERRRSS*
Ga0070710_1026348123300005437Corn, Switchgrass And Miscanthus RhizosphereAIVFIALPGDQRYYLMVFPPLLVLMLRGVLRLPRPWKLSAICVPALLLYILVPLALENHRDEAPAVQLVRYLEKLYPPSKRGNVVLVLPVVYRSAQWYAPQFKILDHVPIAEDEAVLRNAAAVYTDDLSLKGKDFNLIELADFKRSTLIYTQNRHVRLYLVERRRSS*
Ga0066681_1000512853300005451SoilMVFPPLLVLMLRGFLRFPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0068867_10107120723300005459Miscanthus RhizospherePLLVMMLRGFLRLPRPWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0066697_1039282123300005540SoilWSTLRLFLPLFRVIRDYLMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0066695_1061937713300005553SoilLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0066705_1015459513300005569SoilMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQKDEEVLRNAAAVYTEDPSLKGKQRYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0066694_1034815313300005574SoilSLTFATDATRWIADFGNFTPHGFWSTLRLFLPLFRVIRDYLMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0066702_1005413023300005575SoilRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPIAEDEELLRNASAVYTDDMSLKGKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS*
Ga0066708_1001737933300005576SoilMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLALAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQKDEEVLRNAAAVYTEDPSLKGKQRYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0068859_10106469323300005617Switchgrass RhizosphereWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0068861_10042318913300005719Switchgrass RhizosphereLPRPWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0068863_10017427013300005841Switchgrass RhizosphereGFLRLPRPWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0066696_1079986313300006032SoilLRGFLRLPKPWNLSAICVAALLLYIVVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDERSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS*
Ga0079222_1126174513300006755Agricultural SoilLLVIMLRGLLRLPIPWKLLAICVPALLLYIVVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHLPLPEDEELLRNAAAIYTDEISFKRKDSYLIHLADFRRSVLIYPQNRRVRLYMVERRRSS*
Ga0066658_1030960923300006794SoilFPPLLVVMLRGLLGLPRLWNLSAICVPALLLYIVLPLAIENHRDESPPVRLVRYLEELYPPSKRGNVLLILPTTRRSAQWYAPQFKILDHVPTAEDEEELRNAAAVYTDDMSFKGRGFYSIQLAEFRRSMLIYPQNRRVRLYLVARRRS*
Ga0066709_10005833323300009137Grasslands SoilMVFPPLLVVMLRGFLRFPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0105242_1009992733300009176Miscanthus RhizosphereLRGFLRLPRPWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0105237_1244353423300009545Corn RhizospherePTPWKLSAICVPALLLYIIVPLAIENHRDEAPAVRVVRYLEKLYPPSQRGNVLLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0126315_1026396913300010038Serpentine SoilKPWQLSAVCVPALLLYIAVPLAIENHCEESPPVRLVRYLERLYAPSKRDDVLLILPTTRRSAQWYAPQFKLLDHLPTSEKDEETIRNAKAVYTEDTSFNRNNYYFIELAEFRRSMLIYPQHRRVRLYLVQRRPPA*
Ga0126373_1035004433300010048Tropical Forest SoilYLMVFPPLLVIMLRGFLRLPNPWSLSAICVPGLLLYIVVPLATENYREEAPPARLVRYLEKLYPPSKRGDVLLILPVAYRSAQWYAPQFKILDHVPTSEQDEQTIRNAKTVYSEVASFNPKNYYFIELAEFKRSMLIYPQNRHLRLYLVEQRRSS*
Ga0134070_1007991623300010301Grasslands SoilICVPALLLYILVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIKLAEFRRSMLIYPQNRRVRLYLVERRRSS*
Ga0134063_1011392313300010335Grasslands SoilLMVFPPLLVIMLRGFLRLPKPWNLSAICVAALLLYIVVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIKLAEFRRSMLIYPQNRRVRLYLVERRRSS*
Ga0134063_1076290813300010335Grasslands SoilFVSLPGDQRYYLMVFPLLLVVTLRGFLGLPKWWNLSAICVPALLLYIVLPLEIENRRDESPPVRLVRYLEKLYPPSKRGNLLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0134071_1009423833300010336Grasslands SoilALPGDQRYYLVVFPPLLVIILRGFLRLPKPWNLSAICVAALLLYIVVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDDLSLKRKDFYLIKLAEFRRSMLIYPQNRRVRLYLVERRRSS*
Ga0134062_1032375613300010337Grasslands SoilPWNLSAICVPALLLYILVPLAIENHRDEAPPVRLVRYLEKLYPPSKRSNVLLILPVAYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKLKDFYLIQLADFRRSTLIYTQNRHVRLYLVERWRSS*
Ga0126376_1049422933300010359Tropical Forest SoilFLRLPNPWSLSAICVPGLLLYIVVPLATENYREEAPPARLVRYLEKLYPPSKRGDVLLILPVAYRSAQWYAPQFKILDHVPTSEQDEQTIRNAKTVYSEVASFNPKNYYFIELAEFKRSMLIYPQNRHLRLYLVEQRRSS*
Ga0126378_1200828713300010361Tropical Forest SoilLLVIMLRGFLRLPKPWNLSAICVPALLLYIVVPLATENYREEAPPARLVRYLEKLYPPSKRGDVLLILPVAYRSAQWYAPQFKILDHVPTSEQDEQTIRNAKTVYSEVASFNPKNYYFIELAEFKRSMLIYPQNRHLRLYLVEQRRSS*
Ga0126377_1305537513300010362Tropical Forest SoilIFVALPGDQRYYLMVFPPLLVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGDVLLILPVAYRSAQWYAPQFKILDHVPTSEQDEQTIRNAKTVYSEVASFNPKNYYFIELAEFKRSMLIYPQNRHLRLYLVEQRRSS*
Ga0134121_1252497313300010401Terrestrial SoilMLRGFLRLPKSWNLSAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVAYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDELSLKGKDFYLIHLADFRRSMLIYPQNRRVRLYLVERRRSS*
Ga0137380_1038058123300012206Vadose Zone SoilLPGDQRYYLMIFPPLLVAMFGGLLRLPKPWDFSAICVPALLVYIVVPLAIENHREEAPPVRLVRYLEKLYSPSKRDDVLLILPTTRRSAQWYAPEFKILDHLPTSEQDEEMIRNAKAVYTEDASFNREKYHFVELAEFRRSMLIYPRHRRVRLYLVERHRSS*
Ga0137377_1000727193300012211Vadose Zone SoilMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS*
Ga0137366_1010563333300012354Vadose Zone SoilWNLSAICVPALLLYIVVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDELSLKRKDFYLIELAIFRRSMLIYPQNRRVRLYFVERRRSS*
Ga0137397_1125688823300012685Vadose Zone SoilLRLPKPWNLWAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAQDEEVLRNAAAVYTDERSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS*
Ga0157309_1007707813300012895SoilDQRYYLMIFPPLLVMMLRGFLRLPRPWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS*
Ga0157302_1021045213300012915SoilMLRGFLRLPKPWNLSAICVPALLLYILVPLALENHRDEAPPVRLVRYLEKLYPPSKRGNVLLILPVAYRSAQWYAPQFKILDHVPIAEDEELLRDAAAVYTDDLSLKLKDFYFIELADFRRSTLIYTQNRHVRLYLVERWRSS*
Ga0126375_1019256813300012948Tropical Forest SoilPWNLSAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVAYRSAQWYALQFKILDHVPIAEDEEVLRNAAAVYTDERSFKRKDFYLIHVADFRRSVLIYPQNRRLRLYLVERRRSS*
Ga0164304_1004517413300012986SoilLIAIVFIALPGDQRYYLMVFPPLLVIMLRGFLRLPKSWNLSAICVPVLLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPLKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDDRSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS*
Ga0134081_1032129423300014150Grasslands SoilRGFLGLPKPWNLSAICVAALLLYIVVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVAYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS*
Ga0134079_1005881823300014166Grasslands SoilDQRYYLMVFPPLLVIMLRGFLQLPKPWNLSAICVPALLLYILVPLAIQNHRDEAPPVRLVRYLEKLYPPSKRGNVLLILPVAYRSAQWYAPQFKILDHVPIAEDEELLRDAAAVYTDDLSLKLKDFYLIELADFRRSTLIYTQNRHVRLYLVERWRSS*
Ga0173478_1060684013300015201SoilPPLLVIMLRGVLGLPRPWKSLAICVPALLLYIVVPLAVENHRDEAPAVRLVRYLEKLYPPSQRGNVLLILPVVYRSAQWYAPQFKILDHLPVPEDEELLRNAAAVYTDEISFKREDLYLIHLADFRRSVLIYPQNRRVRLYLVERRRSS*
Ga0182007_1027151913300015262RhizosphereLLVIMLRGLLRLPNRWKLLAICVPALLLYIVVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGKVLLILPVVYRSAQWYAPQFKILDHLPLPEDEELLRNAAAVYTDEISFKRKDSYLIHLADFRRSVLIYPQNRRVRLYMVERRRSS*
Ga0182036_1170552213300016270SoilLPGDQRYYLMVFPPLLVIMLRGFLQLPKPWNLSAICVPALLLYILVPLAVENHCNEAPAVRLVRYLEKLYPPSKRGNVLLVLPVVCRSAQWYAPQFKISDHAPTVEDEEVLLNAAAVYTDELSPKWKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0182041_1102423423300016294SoilRGFLQLPKPWNLSAICVPALLLYILVPLAVENHCNEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLLNAAAVYTDELSPKWKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0182037_1084509113300016404SoilGDQRYYLMVFPPLLVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPPVRLVRYLEKLYPPSKRGNVLLILPVAYRTAEWYAPQFKILDHVPTAKDEEVLRNAAAVYTDDLTFKRKDFYLIELAHFRRSFLIFNQNHHVRLYLVERRRSS
Ga0066667_1003692923300018433Grasslands SoilMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS
Ga0066667_1066695033300018433Grasslands SoilFPPLLVVMLRGFLGLPKPWNLSAICVPALLLYILLPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDELSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0066669_1038203423300018482Grasslands SoilVVMLRGFLGLPKPWNLSAICVPALLLYILLPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDERSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0173479_1019708113300019362SoilRGFLRLPRPWKLSAICFPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS
Ga0193723_104241533300019879SoilFLRLPKPWNLSAICVPALLLYILVPLAIENHHDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPTAEDEEVVRNAAAVYTDDLSLKRKDFYLIELAIFRRSMLIYPQNRRVRLYLVERRRSS
Ga0193747_101776223300019885SoilMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVSLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELDEFRRSMLIYLQHRRVQLYLVERRRSS
Ga0193735_100056153300020006SoilMRWIADFGNFTPHGFWSTLRLFLPLFRVIRDYLMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVSLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELDEFRRSMLIYLQHRRVQLYLVERRRSS
Ga0193721_102343113300020018SoilMVFPPLLVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIHLADFRRSMLIYPQNRRVRLYLVERRRSS
Ga0193733_106994723300020022SoilMRWIADFGNFTPHGFWSTLRLFLPLFRVIRDYLMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVSLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEEPSLKGKERYVIKLVEYRRS
Ga0222623_1038437813300022694Groundwater SedimentIAIVFIALPGDQRYYLMVFPPLLVIILRGFLRLPKPWNLSAICVPALLLYILVPLAIENHHDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDELSLKRKDCYLIELAIFRRSMLIYPQNRRVRLYLVVRGRSS
Ga0207664_1187271713300025929Agricultural SoilWLLVDVAIVFIALPGDQRYYLMVFPPLLVLMLRGFLRLPRPWKLSAICVPALLLYILVPLALENHRDEAPAVRLVRYLEKLYPPSKRGNVVLVLPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKGKDFNLIELADFKRSTLIYTQNRHVRLYLVERRRSS
Ga0207648_1052819423300026089Miscanthus RhizosphereYLMIFPPLLVMMLRGFLRLPRPWKLSAICVPALLLYIVVPLAIENHRDEAPAVRVVRYLEKLYPPSKRGNVVLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS
Ga0207675_10105659713300026118Switchgrass RhizosphereGDQRYYLMIFPPLLVIMLRGFLRLPTPWKLSAICVPALLLYIIVPLAIENHRDEAPAVRVVRYLEKLYPPSQRGNVLLILPVVYRSAQWYAPEFKILDHLPVAEDEELLRNAAAVYTDEMSLKRDDLYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS
Ga0209238_107494113300026301Grasslands SoilPLLVIMLRGFLRLPKPWNLWAICVPALLLYILVPLAIENHRDEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLRNAAAVYTDERSLKRKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0209468_106245223300026306SoilTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS
Ga0209687_125361713300026322SoilVFVALPSDQRYYLMVFPPLLVIMLRGFLRLPQPWNLSAICVPALLLYILVPLAIENHRDEAPPVRLVRYLEKLYPPSKRSNVLLILPVAYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKLKDFYLIQLADFRRSTLIYTQNRHVRLYLVERWRSS
Ga0209472_100479223300026323SoilMVFPPLLVLMLRGFLRFPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS
Ga0209375_105030433300026329SoilMLRCFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIKLAEFRRSMLIYPQNRRVRLYLVERRRSS
Ga0209375_125139013300026329SoilHGFWSTLRLFLPLFRVIRDYLMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKERYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS
Ga0209057_117858423300026342SoilFIALPGDQRYYLMVFPPLFVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIKLAEFRRSMLIYPQNRRVRLYLVERRRSS
Ga0209807_102537043300026530SoilFLPLFRVIRDYLMVFPPLLVVMLRGFLRLPTPWNFSAICVPALLLYIVLPLAIENHREEAPPVRLVRYLEKLYPPSKRGNVLLILPTTRRSAQWYAPEFKIMDHVSTSQEDEEVLRNAAAVYTEDPSLKGKQRYLIELAEFRRSMLIYLQHRRVQLYLVERRRSS
Ga0307312_1015362513300028828SoilFIALPGDQRYYLMVFPPLLVIMLRGFLRLPKPWNLSAICVPVLLLYIVVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVYRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKRKDFYLIHLADFRRSMLIYPQNRRVRLYLVERRRSS
Ga0307312_1046048433300028828SoilRGFLRLPRPWNLSAICVPALLLYILVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPMAEDEEALRNAAVVYTDEMSLKRKDIYLIELAVFRRSMLIYPQNRRVRLYLVERRRSS
Ga0307304_1009234913300028885SoilPPLLVIMLRGFLQLPKPWNLSAICVPALLLYILVPLAIENHRDEAPAVRFVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDELSLKRKDCYLIELAIFRRSMLIYPQNRRVRLYLVERRRSS
Ga0075373_1143672823300030945SoilMVFPPLLVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHREEAPPVRLVRYLENLYPPSKRGNVLLILPVAYRTAQWYAPQFKILDHVPIAEDEEVLRNAAAVYTDDLSLKGRDFYLIELADFRRSTLIYIQNRHVRLYLVERRPSS
Ga0306917_1010864913300031719SoilVDIAIVFIALPGDQRYYLMVFPPLLVIMLRGFLQLPKPWNLSAICVPALLLYILVPLAVENHCNEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLLNAAAVYTDELSPKWKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0306918_1068881413300031744SoilLLVDIAIVFIALPGDQRYYLMIFPPLLVIMLRGFLRLPKPWNSSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGDVLLILPVAYRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKMVYSEVASFNRENYYFIELAEFKRSMLIYPQNRHLRLYLVEKRRSS
Ga0306918_1152173113300031744SoilSLPGDQRYYLMVFPPLLVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPPVRLVRYLEKLYPPSKRGNVLLILPVAYRTAEWYAPQFKILDHVPTAKDEEVLRNAAAVYTDDLTFKRKDFYLIELAHFRRSFLIFNQNHHVRLYLVERRRSS
Ga0310917_1064980713300031833SoilLRGFLRLPKPWNSSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVASRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKTVYSEVASFNRENYYFIELGEFKRSMLIYPQNRHIRLYLVERRRSS
Ga0306919_1100266713300031879SoilSSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVASRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKTVYSEVASFNRENYYFIELGEFKRSMLIYPQNRHIRLYLVERRRSS
Ga0306925_1124535813300031890SoilIVIVFIALPGDQRYYVMVFPPLLVIMLRGFLRLPKLWNLSAICVPALLLYILVPLAVENHCNEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLLNAAAVYTDELSPKWKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0308175_10308107923300031938SoilRGVLRLPSPWKLLAICIPALLLYIVVPLAIENHRDEAPAVRLVRYLEKLYPPSQRGNVLLILPVVYRSAQWYAPQFKILDHLPVAEDEELLRNAAAVYTDEMSLKRKDIYLIHLADFRRSVLIYPQNRRVRLYFVERRRSS
Ga0310912_1102029513300031941SoilLIILIAIIFVSLPGDQRYYLMVFPPLLVIMLRGFLRLPKPWNLSAICVPALLLYILVPLAIENHRDEAPPVRLVRYLEKLYPPSKRGNVLLILPVAYRTAEWYAPQFKILDHVPTAKDEEVLRNAAAVYTDDLTFKRKDFYLIELAHFRRSFLIFNQNHHVRLYLVERRRSS
Ga0310916_1003615943300031942SoilMLRGFLRLPKPWNLSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVASRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKTVYSEVASFNRENYYFIELGEFKRSMLIYPQNRHIRLYLVERRRSS
Ga0310916_1127345923300031942SoilQRYYLMVFPPLLVIMLRGFLRLPKPWSLSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVAYRSAQWYAPEFKILDHVPTAEDEEMLRNAATVYSEVASFNPKDYYFVELGEFKRSMLIYPQNRHLRLYLVEKRRSS
Ga0310913_1041938213300031945SoilLLVIMLSGFLRLPKPWNLSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVASRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKTVYSEVASFNRENYYFIELGEFKRSMLIYPQNRHIRLYLVERRRSS
Ga0310913_1092447813300031945SoilGPWLLIGIAITFVALPGDQRYYLMVFPPLLVIMLRGFLRLPKPWSLSAICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVAYRSAQWYAPEFKILDHVPTAEDEEMLRNAATVYSEVASFNPKDYYFIELAEFKRSMLIYPQNRHLRLYLVQRRRSL
Ga0310910_1009871623300031946SoilIVFIALPGDQRYYLMVFPPLLVIMLRGFLQLPKPWNLSAICVPALLLYILVPLAVENHCNEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAKDEEVLRNAAAVYTDDLTFKRKDFYLIELAHFRRSFLIFNQNHHVRLYLVERRRSS
Ga0310909_1069787923300031947SoilALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVASRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKTVYSEVASFNRENYYFIELGEFKRSMLIYPQNRHIRLYLVERRRSS
Ga0306922_1158812713300032001SoilALPGDQRYYLMVFPPLLVIMLRGFLQLPKPWNLSAICVPALLLYILVPLAVENHCNEAPAVRLVRYLEKLYPPSKRGNVLLILPVVCRSAQWYAPQFKILDHVPTAEDEEVLLNAAAVYTDELSPKWKDFYLIHLADFRRSMLIYPQNRGVRLYLVVRRRSS
Ga0310914_1102304613300033289SoilDIAIIFVALPGDQRYYLMVFPPLLVIMLRGFLRLPKPWSLSGICVPALLLYIVVPLAIENYREEAPPVRLVRYFEKLYPPSKRGNVLLILPVASRSAQWYAPEFKVLDHVPTSEQDEEAIRNAKTVYSEVASFNRENYYFIELGEFKRSMLIYPQNRHIRLYLVERRRSS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.