NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F044390




Overview

Basic Information
Family ID: F044390
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 154
Average Sequence Length: 75 residues
Representative Sequence: MKDRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAAEAFLDDRHGHPYK
Number of Associated Samples: 101
Number of Associated Scaffolds: 154

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 0.65 %
% of genes from short scaffolds (< 2000 bps): 0.65 %
Associated GOLD sequencing projects: 94
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.61

Hidden Markov Model (sequence logo rendered with Skylign)

Most Common Taxonomy
Group: Unclassified (100.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (45.454 % of family members)
Environment Ontology (ENVO): Unclassified (47.403 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (55.195 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 41.49%, β-sheet: 0.00%, Coil/Unstructured: 58.51%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.61

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
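
The confidence rule stated above can be written as a small check. This is an illustrative sketch only: the function name `is_low_confidence` and the constant `PTM_THRESHOLD` are hypothetical and are not part of NMPFamsDB. Only the 0.7 cutoff and the 0.61 score come from this page.

```python
# Hypothetical helper; only the 0.7 cutoff is taken from the page text.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a predicted model falls below the pTM cutoff."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM scores are expected to lie between 0 and 1")
    return ptm_score < threshold


# The F044390 model has a pTM-score of 0.61, which is below the cutoff.
print(is_low_confidence(0.61))
```

With the values on this page, the model for F044390 is flagged as low confidence, which matches the note that it was not screened against SCOPe or PDB.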



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 154 Family Scaffolds
PF13551 | HTH_29 | 2.60
PF13518 | HTH_28 | 1.95
PF07592 | DDE_Tnp_ISAZ013 | 1.30
PF03704 | BTAD | 1.30
PF01610 | DDE_Tnp_ISL3 | 1.30
PF00072 | Response_reg | 0.65
PF08267 | Meth_synt_1 | 0.65
PF02635 | DrsE | 0.65
PF01081 | Aldolase | 0.65
PF03795 | YCII | 0.65
PF00078 | RVT_1 | 0.65
PF02384 | N6_Mtase | 0.65
PF04138 | GtrA | 0.65
PF00400 | WD40 | 0.65
PF09234 | DUF1963 | 0.65
PF13565 | HTH_32 | 0.65
PF03640 | Lipoprotein_15 | 0.65
PF00872 | Transposase_mut | 0.65
PF12146 | Hydrolase_4 | 0.65
PF08533 | Glyco_hydro_42C | 0.65
PF02518 | HATPase_c | 0.65
PF13676 | TIR_2 | 0.65
PF07690 | MFS_1 | 0.65
PF01266 | DAO | 0.65
PF00782 | DSPc | 0.65
PF07366 | SnoaL | 0.65
PF07971 | Glyco_hydro_92 | 0.65
PF03811 | Zn_Tnp_IS1 | 0.65
PF13473 | Cupredoxin_1 | 0.65
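
Because the frequencies above are given relative to 154 family scaffolds, they can be converted back to approximate scaffold counts. The sketch below assumes each percentage is a scaffold count divided by 154 and rounded to two decimals. That is an inference from the numbers, not something the page states, and `scaffold_count` is a hypothetical helper name.

```python
# Assumption: each "% Frequency" value is (scaffold count / 154) * 100,
# rounded to two decimals by the website.
FAMILY_SCAFFOLDS = 154


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a percentage frequency corresponds to."""
    return round(percent / 100 * total)


# A few Pfam rows from the table above.
for name, percent in [("HTH_29", 2.60), ("HTH_28", 1.95), ("BTAD", 1.30), ("WD40", 0.65)]:
    print(name, scaffold_count(percent))
```

Under that assumption, 2.60 % corresponds to 4 scaffolds and 0.65 % to a single scaffold, so most of the listed neighbouring domains were observed only once.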

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 154 Family Scaffolds
COG3464 | Transposase | Mobilome: prophages, transposons [X] | 1.30
COG3629 | DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain | Transcription [K] | 1.30
COG3947 | Two-component response regulator, SAPR family, consists of REC, wHTH and BTAD domains | Transcription [K] | 1.30
COG0620 | Methionine synthase II (cobalamin-independent) | Amino acid transport and metabolism [E] | 0.65
COG0800 | 2-keto-3-deoxy-6-phosphogluconate aldolase | Carbohydrate transport and metabolism [G] | 0.65
COG1874 | Beta-galactosidase GanA | Carbohydrate transport and metabolism [G] | 0.65
COG2350 | YciI superfamily enzyme, includes 5-CHQ dehydrochlorinase, contains active-site pHis | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.65
COG3328 | Transposase (or an inactivated derivative) | Mobilome: prophages, transposons [X] | 0.65
COG3537 | Putative alpha-1,2-mannosidase | Carbohydrate transport and metabolism [G] | 0.65
COG3677 | Transposase InsA | Mobilome: prophages, transposons [X] | 0.65
COG3878 | Uncharacterized conserved protein YwqG, DUF1963 family | Function unknown [S] | 0.65
COG4315 | Predicted lipoprotein with conserved Yx(FWY)xxD motif (function unknown) | Function unknown [S] | 0.65



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300012210|Ga0137378_10954529 | Not Available | 771




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 45.45%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 14.29%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 8.44%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.44%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.84%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 4.55%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.60%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.30%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 1.30%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 1.30%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 0.65%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.65%
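
Each habitat above is an arrow-separated GOLD ecosystem path, so the distribution can be rolled up to any level of the hierarchy. The sketch below sums the listed percentages by one path component. The `aggregate` helper and the `HABITATS` list are hypothetical names used for illustration, and the values are copied from the table above.

```python
from collections import defaultdict

# (GOLD ecosystem path, % of family members) pairs copied from the table above.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 45.45),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 14.29),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 8.44),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 8.44),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 5.84),
    ("Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment", 4.55),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil", 3.90),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 2.60),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil", 1.30),
    ("Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost", 1.30),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil", 1.30),
    ("Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring", 0.65),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 0.65),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 0.65),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 0.65),
]


def aggregate(rows, level):
    """Sum percentages by the path component at `level` (0 is the root)."""
    totals = defaultdict(float)
    for path, percent in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[parts[level]] += percent
    return dict(totals)


# Roll the distribution up to the second level (Terrestrial vs. Aquatic).
for category, percent in aggregate(HABITATS, level=1).items():
    print(f"{category}: {percent:.2f} %")
```

At the second level, the listed values sum to about 94.16 % Terrestrial and 5.85 % Aquatic. The total is slightly over 100 % because the page rounds each row.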




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300002906 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm | Environmental
3300002915 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm | Environmental
3300002917 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300007982 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010114 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012391 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300013764 | Permafrost microbial communities from Nunavut, Canada - A28_35cm_6M | Environmental
3300014056 | Permafrost microbial communities from Nunavut, Canada - A20_5cm_0M | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300017926 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_2 | Environmental
3300017928 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_1 | Environmental
3300017943 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300018006 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4 | Environmental
3300018007 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025929 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes) | Environmental
3300026498 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-A | Environmental
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental
3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_1049103342 | 3300000364 | Soil | MKDHPERKGRLQLXEXMFAGHSWQXAXAXSQXHISRSTAYRLRQLARDDDKAELAFLDDRHGHPYKLIEPARTWLAQFCTTHPQLAS
INPhiseqgaiiFebDRAFT_1054138431 | 3300000364 | Soil | MKDHRXRRARLXLIEXMFAGXSWQEAVAQSQLKISRAMAYRLRQMACDEEKAERAFLDDRHGHPSKLETIHFWVLAIPLYPF*
JGI25614J43888_100061731 | 3300002906 | Grasslands Soil | MKDHPQRKGRLQLIEAMFAGHSWQSAVAQSKLNVSRATAYRLRQLARDEQKAASAF
JGI25614J43888_101825352 | 3300002906 | Grasslands Soil | MKDHPERKGRLQLIESMFAGHSWQTAVAQSQLHVSRATAYRLRQLARDEEKAALAFLDDRHG
JGI25387J43893_10302582 | 3300002915 | Grasslands Soil | MKDRPDRAARLQLIESMFAGHSWQAAAAQSQLKISRSTAYRLVQLARDEEKAAEAFLDDRHGHP
JGI25616J43925_103599881 | 3300002917 | Grasslands Soil | MKDHRERTARLQLIEHMSAGHSWQTAAAQSQLKVSRSTAYRLLKLVRDEEKAERAFLDDR
Ga0066674_101206043 | 3300005166 | Soil | MKDHRERTGRLQLIEYMFAGHSWQSAVAQSQLNISRSTAYRLRQLARSEDKAGLAFLDDR
Ga0066680_109239371 | 3300005174 | Soil | MKDRPDRAARLQLIESMCAGHSWRAAAAQSQLKISRSTAYRLLQLARDEEKAAAAFLDDR
Ga0066690_107632722 | 3300005177 | Soil | MKDRPDRAARLQLIESMFAGHSWQAAAAQSQLKISRSTAYRLVQLARDEEKAALAFLDDRHGHPYKLTEPVQAWLVEVCTRDPQIPSSRIQAELK
Ga0066688_106488342 | 3300005178 | Soil | MKDRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQL
Ga0066678_104567652 | 3300005181 | Soil | MKDHPERTARLQLLEHMFAGHSWQTAVSQSGLHISRSTAYRLRQLARDEDKAAVVFLDDRHGHPYK
Ga0070703_101574862 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDRPDRAARLQLIGRMFAGQSWQTVVAQSQLHISRSTAYRLVQRARDEDKAPLVFLDDRHG
Ga0070709_108601201 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDHPERKGRLQLIEAMFAGHSWQEALALSQLHISRSTAYRLRQLARSEDKAELAFLDDRHGHPYKLIEPARTWLAEFCTTHPLLASSRVQ
Ga0070714_1008976391 | 3300005435 | Agricultural Soil | MKDRPDRTARLQLIEYMFMGQSWQAAVTQSQLKISRSTTYRLIQLARNEEKATSAFLDDRHGHPYKITERVRLGEVCTRDPLIPSSPTGWATR*
Ga0070713_1012898521 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDRSDRVARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRIVQLARDE
Ga0066686_111086921 | 3300005446 | Soil | MKDHRERTGRLQLIEYMFAGHSWQSAVAQSQLNISRSTAYRLRQLARSEDKAGLAFLDDRHGH
Ga0066681_108181162 | 3300005451 | Soil | MKDRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAALAFLDDRHGHPYKLTEPVQGWLVDVCTRDPQIPSSRIQAEL
Ga0070706_1011152531 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLNISRSTAYRLVKLARDEDKVAKAFLDDRHGHPYKVTEPVRVWLVDVCTQNPQLASSRV*
Ga0070707_1002589912 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDRPDRAARLQLIESMCAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAASAFLDDRHGHPYKLTEPVQAWLV
Ga0070698_1005489331 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLNISRSTAYRLVKLARDEDKVAKAFLDDRHGHPYKVTEPVRVWLVDVCTQNPQMASSRVQ
Ga0066701_103841083 | 3300005552 | Soil | MKNHPDRAARLHLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEK
Ga0066661_109437351 | 3300005554 | Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLSMSRSTAYRLVQQARNEDKAAQVFLDDRHGHPYKLTEPVQVWINELCSDDLQMPSSRVQ
Ga0066707_108090472 | 3300005556 | Soil | MKDHPDRTARLQVMGRMFAGQSWQTAVSQSQLHISRSTAYRLVKLARDEDKVAKAFLDDRHGHPYKVTEPVRVWLVDVCTQNPQMASSRVQAELQ
Ga0066707_109192472 | 3300005556 | Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLNISRSTAYRLVQQARNEDKA
Ga0066700_105293922 | 3300005559 | Soil | MKDRPDRAARLQLIESMCAGHSWRAAAAQSQLKISRSTAYRLLQLARNEEKAAAAFLDDRHGHAYKLTEPVQAWLIDVCSKDPQIPSSRIQAELKTSFGI
Ga0066691_104291151 | 3300005586 | Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDKAPLVFLDDRHGHPYKLTEPMQIWINELCSDDLQMPSSRVQRELKSRFSVAVSVS*
Ga0070717_100017831 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDRPDRAARLQLIASMCAGHSWQAAAAQSQLKISRSTAYRLVQLARDEEKAGLAFLDDRHGHPYKLTEPVQAWLVDVCSKDPQIPSSRIQ
Ga0070717_100535971 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MERTEKMKDRPDRAARLQLIGRMFAGQSWQTAVAQSQLNISRSTAYRLVQRARDEDKAPLVFLDDRHGHPYKLTEPMQIWI
Ga0070717_116818561 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDKAPLVFLDDRHGHPYKLTEPMQIWINE
Ga0075017_1008144611 | 3300006059 | Watersheds | MKDHPERKGRLQLLEAMFAGHSWQSAVALSQLHISRSTAYRLRQLARDEDKAELVFLDDRHGHPYKLIEPARTWLAEFCTTHPQLASSRVQAELQTTFGVTV
Ga0066653_104764511 | 3300006791 | Soil | MKDRPDRAARLQLIESMFAGQSWPAAAAQSQLKISRSTAYRLVQLARDEEKAAE
Ga0066653_107009951 | 3300006791 | Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDK
Ga0066658_107488142 | 3300006794 | Soil | MKDRPDRAARLQLITCMFAGQSWQEAVTESQLNVSRSTAYRLVQLARDEDKAPLAFLDD
Ga0066665_109709211 | 3300006796 | Soil | MKDHPERSARLRLLEHMFTGHSWQTAVSQSGLHISRSTAYRLRQLARDEDKAALVFLDDRHGHPYKLTEPVCLWMVEFCTNNPQVASSRVQAELKSTF
Ga0102924_12202391 | 3300007982 | Iron-Sulfur Acid Spring | MKDRPDRAARLQLIGRMFAGQSWQTAVAQSQANISRSTAYRLVKLARDEDKAPLVFLDDRHGHPYKLTEPV
Ga0066710_1021239741 | 3300009012 | Grasslands Soil | MKDHPERKGRLQLIESMFAGHSWQEALAQSQLHVSRATAYRLRQLARDEEKAERAFL
Ga0066710_1034908542 | 3300009012 | Grasslands Soil | MKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLHISRSTAYRLVKLARDEDKVAKAFLDDRHGHPYKLTEPMQIWI
Ga0099829_101471651 | 3300009038 | Vadose Zone Soil | MHGMKDHRERTARLQLIEHMFAGHSWQTAAAQSQLKVSCSTAYRLLKLVRDEEKAERAFLDDRHGHPYKITEPVRV
Ga0099830_115742891 | 3300009088 | Vadose Zone Soil | MKDRSDRTARLHLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAALAFLDDRHGHPYKL
Ga0099830_116122401 | 3300009088 | Vadose Zone Soil | MKDHRERTARLELIEHMSAGHSWQTAAAQSQLKVSRSTAYRLLKLVRDEEKAERAFLDDRHGHPYKITEPVRVWMTEFCTSNPQLPSSRVQRELK
Ga0099830_118439032 | 3300009088 | Vadose Zone Soil | MKDHPDRSARLQLIESMFGGQPWQVAAAQSNLRVSRTTAYRLVQLARDEEKVASAFLDDRHGHPYKMTEPVQMWLVEVCTTDPQIPSSRLQ
Ga0099828_100527374 | 3300009089 | Vadose Zone Soil | MHGMKDHRERTARLQLIEHMFAGHSWQTAAAQSQLKVSCSTAYRLLKLVRDEEKAERAFLDDRHGHPYKITEPVRVWMTEFCTSNPQLPSSRVQRELK
Ga0099828_108084161 | 3300009089 | Vadose Zone Soil | MKDRSDRAARLQLIESLFAGQSWRTAAAQSQLKISRSTAYRLVQLARDEEKAAEAFLDDRHGHPYKLTDKVASVAR*
Ga0099828_112796793 | 3300009089 | Vadose Zone Soil | MKDHPERKGRLQLIEAMFAGHSWQEALAQSQLHVSRATAYRLRQLARNEEKAERAFL
Ga0099828_116177321 | 3300009089 | Vadose Zone Soil | MKDHAERTARLQLLEYMFAGYSWQTAVTQSGLHISRSTAYRLRQLARDEDKAALAFLDDRHGHPYKLTEPVRGWMIEFCMNHPQ
Ga0099828_119051791 | 3300009089 | Vadose Zone Soil | MKDHPERTARLQLLEHMCAGHSWQTAVSQSGLHISRSTAYRLRQLARDEDKAALVFQDDRHGHPYKLTEPVRVWMVEFCTTNLAS*
Ga0099827_111022212 | 3300009090 | Vadose Zone Soil | MKDHRERTARLELIEHMFAGHSWQTAAAQSQLKVSRSTAYRLLKLVRDEEKAERAFLDN
Ga0099827_113854512 | 3300009090 | Vadose Zone Soil | MKDHPDRSARLQLIESMFAGQPWQVAAAQSNLRVSRTTAYRLVQLARDEEKAASAFLDDRHGHPYKMTEPVQVWLVEVCTRDPQIPSSRLQAEL
Ga0066709_1002987211 | 3300009137 | Grasslands Soil | MKDHPERTARLHLLEHMFAGHSWQAAVSQSGLHISRSTAYRLRKLARDEDKAALVFLDDRHGHPYK
Ga0126373_102971951 | 3300010048 | Tropical Forest Soil | MKDRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLLQLARD
Ga0127460_10078601 | 3300010114 | Grasslands Soil | MKDRPDRAARLQLIGRMFAGQSWQTARAESQLNISRSTAYRLVQQARNEDKAAQVFLDDRHGHPYKLTEPVQVW
Ga0134082_104848621 | 3300010303 | Grasslands Soil | MKDHPERKGRLQLIEAMFAGHSWQSAVAQSQLNISRSTAYRLRQLARDDDKAGLAFLDDRHGQPYKLIEPARTWLAEFCTIHPQVASSRVQAELKTAFGVTVSV
Ga0134082_105654331 | 3300010303 | Grasslands Soil | MKDRPDRAARLQLIESMFAGHSWQAAAAQSQLKISRSTAYRLVQLARDEEKAALAFLDDR
Ga0134067_104015212 | 3300010321 | Grasslands Soil | MKGMKDHPDRSARFQLIESMFAGQPWQVAAAQSNLHVSRTTAYRLAQLARDEEKAASAFL
Ga0134086_102467752 | 3300010323 | Grasslands Soil | LQLLEHLFAGQSWQATVSQSGLHISRSAAYRLRQQARDEDKAALVFLDDRHGHPYKLTKLVCLWMVEFCTNNPQVASSRVQAELKSTFGIEVSVRQINRV
Ga0134062_101755372 | 3300010337 | Grasslands Soil | MKGMKDRPDRAARLQLIESMFAGHSWQAAAAQSQLKISRSTAYRLVQLARDEEKAALAFVDDRHGHPYKLTEPVQAWLVEVCTRDPQIPSSRI
Ga0126370_112081181 | 3300010358 | Tropical Forest Soil | MKDHPDRTARLHLLEHMFAGHSWQTAVSQSGLHISRSTAYRLRKLARDEDKAALVFLDDRHGHPYKLTEPVCMWMVQFCTTNRAS*
Ga0126378_114278572 | 3300010361 | Tropical Forest Soil | MKDRLDRAARLQLIESMCAGQSWRAAAAQSQLKISRSTVYRLLQLVR
Ga0126379_134096171 | 3300010366 | Tropical Forest Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDKAPLVFLDDRHGHPYKLTEPMQIWINELCSDDLQIPSSRVQRE
Ga0137389_100869254 | 3300012096 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDKAPLVFLDDRHGHPYKLTE
Ga0137389_109434892 | 3300012096 | Vadose Zone Soil | MKDRSDRAARLQLIESLFAGQSWRTAAAQSQLKISRSTAYRLVQLARDEERAAEAFLDDRHGHPYKLTEPVQVWLVEV
Ga0137389_110480461 | 3300012096 | Vadose Zone Soil | MKDHPDRKRRLQLIEAMFAGHSWQSAVAQSQLHISRSTAYRLRQLARDDEKAELAFLDDRHGHPYKLIEPARRWLAEFCASNPQLASSRVQAELKTTFGV
Ga0137389_110707551 | 3300012096 | Vadose Zone Soil | MKGMKDHPDRSARLHLIESMFAGQPWQVAAAQSNLRVSRTTAYRLIQLARDEEKAASAFLDERHGHPYKMTEPVHVWLV
Ga0137389_112718781 | 3300012096 | Vadose Zone Soil | MKDHPERTARLRLLEHMFAGQSWQTAVSLSGLHIARSTAYRLRQRARDEDKAALVFLDDRHGHP
Ga0137389_117232492 | 3300012096 | Vadose Zone Soil | MKDRPDRAARLHLIEYMFAGQSWQAAAAQSQLKISRSTAYRLVQLARNEEKAAEAFLDDRHGHP
Ga0137388_100284761 | 3300012189 | Vadose Zone Soil | MKDRPDRAARLQLIESIFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAAEAFLDDRHGHPYK
Ga0137388_102706102 | 3300012189 | Vadose Zone Soil | MKDRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARHEEKAAEAFLDDRHGHPYKVTESV*
Ga0137388_108972411 | 3300012189 | Vadose Zone Soil | MHGMKDHRERTARLQLIEHMFAGHSWQTAAAQSQLKVSCSTAYRLLKLVRDEEKAERAFLDDRHGHPYKITEPVR
Ga0137388_109783712 | 3300012189 | Vadose Zone Soil | MKDHPDRKRRLQLIEAMFAGHSWQSAVAQSQLHISRSTAYRLRQLARDDEKAELAFLDDR
Ga0137364_101819943 | 3300012198 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAVAQNQLNISRSTAYRLVQYARDEDKVTMPFLDDRHSHPYKLTEPVQVWINEVCSEKRQ
Ga0137383_100852033 | 3300012199 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDKAPLVFLDDRHGH
Ga0137383_100951661 | 3300012199 | Vadose Zone Soil | MKDHPQRKGRLQLIEAMFAGHSWQSAVAQSKLNVSRATAYRLRQLARDEQKAASAFLDD
Ga0137365_104455582 | 3300012201 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAAAQSQANISRSTAYRLVKLAHDEDKAPLVFLDDRHGHPYKMTEPVQRWISEVCINTPQIPASRVQSELKSRFGV
Ga0137363_103670001 | 3300012202 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAIAESQLNISRSTAYRLLQRARDEDKAPLVFLDDRHGHPYKLTEPMQIWINELCSDNLQIPSSRVQRELKS
Ga0137363_110114792 | 3300012202 | Vadose Zone Soil | MKDHPERKGRLQLIEAMFAGHSWQEALAQSQLHVSRATAYRLRQLACDEEKAER
Ga0137362_106220811 | 3300012205 | Vadose Zone Soil | MKDRPDRAARLQLIESMCAGHSWRAAAAQSQLKISRSTAYRLLQLARNEEKAAAAFLDDRHGHAYKLTEPV
Ga0137380_103453913 | 3300012206 | Vadose Zone Soil | MKDRPDRAARLQLISSMFTGQSWQEALAQSQLNLSRSTVYRLVQLARDEQKAASAFLDDRHGHPYKL
Ga0137381_108173501 | 3300012207 | Vadose Zone Soil | MKDRSDRTARLHLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEE
Ga0137381_112120351 | 3300012207 | Vadose Zone Soil | MKGMKNHPDRAARLHLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEE
Ga0137376_114074311 | 3300012208 | Vadose Zone Soil | MKGMKDRPDRAARLQLIESMFAGHSWQNAAAQSQLKISRSTAYRLVQFARNEEKAGLAFLDDRHG
Ga0137376_115031821 | 3300012208 | Vadose Zone Soil | MKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLHISRSTAYRLVKLARDEDKVAKVFLDDRHGHPYKVTEPVRVWLVDVCTQNPQMASS
Ga0137379_111105262 | 3300012209 | Vadose Zone Soil | MKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLHISRSTAYRLVKLARDEDKVA
Ga0137379_111463432 | 3300012209 | Vadose Zone Soil | MKDRPDRAARLQLIGRLFAGQSWQTAIAESQMSISRSTAYRLVQQARNEDKAAQVFLDDRHGH
Ga0137379_113169821 | 3300012209 | Vadose Zone Soil | MKDHPDRTARLQVMGRLFAGQSWQTAVSQSQLNISRSTAYRLVKLARDEDRAARAFLDDRHGHPYKVTEPVRMWLVDVCTQNPQMASSRVQAELQSRFGVA
Ga0137379_113491231 | 3300012209 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAVAQSQLNISRSTAYRLVQQARNEDKAPQIFLDDRHGHPYKV
Ga0137379_113626271 | 3300012209 | Vadose Zone Soil | MKDRSDRAARLHLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAALAFLDD
Ga0137379_116090771 | 3300012209 | Vadose Zone Soil | MKDRSDRAARLQLIENMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEKKAARAFLDDRHGHPYKLTEPVQAWLVEVCTRDPQIPSSRIQAELKTSFG
Ga0137378_106027241 | 3300012210 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAVAQNQLNISRSTAYRLVQQARNEDKAAQVFLDDR
Ga0137378_106513192 | 3300012210 | Vadose Zone Soil | LKDRADRAARLQLIGRMLAGQSWQTAAAESQLPISRATAYRLVQYARDEGKATKPFL
Ga0137378_106677032 | 3300012210 | Vadose Zone Soil | MKDHPDRKGRLQLIEATFAGHSWQETLAQSQLHISRSTAYRLRQMARDDEKAELAFLDDRHGHPYKLIEPARTWLAEFCTIHPQVASSRV*
Ga0137378_109545292 | 3300012210 | Vadose Zone Soil | MKGMKNHPDRAARLHLIESMFAGQSWRAAAAQSQLKISRSTAYR
Ga0137378_110609951 | 3300012210 | Vadose Zone Soil | MKDHRDRKRRLQLIEAMFAGHTWQEALAQSQLHISRSTAYRLRQLARDDEKAALAFLDDRHGHPYKLIEPARTWLAEFCTSNPQLASSRVQAELKTTFGVTVSV
Ga0137377_104191501 | 3300012211 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTALAESQLNISRSTAYRLVQQARNEDKAAQVFLDDRHGHPYK
Ga0137377_104820081 | 3300012211 | Vadose Zone Soil | MKDHPQRKGRLQLIEAMFAGHSWQSAVAQSQLNVSRASAYRLRQLARDEQKAASAFLDDRHGHPYKVTEPVQAWISEFCTEQPQVASSRVQSELQSRFGVA
Ga0137377_114789621 | 3300012211 | Vadose Zone Soil | MKDHRERKERLQLIESMFAGHSWQSAVTQSQLNVSRATAYRLKQLARDEEKAERAFLDDR
Ga0137387_105538752 | 3300012349 | Vadose Zone Soil | MKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLHISRSTAYRLVKLARDEDKVAKVFLDDRHGHPYKV
Ga0137372_101979101 | 3300012350 | Vadose Zone Soil | MKDRPDRAARLQVIGGMFAGQSWQTAVAESQLPISRATAYRLLQLARDD
Ga0137372_105198492 | 3300012350 | Vadose Zone Soil | MKDRPDRAARLQLIGRMFAGQSWQTAVAQSQANISRSTAYRLVKLARDEDKAPVVFLDDRHGHPYK
Ga0137386_110923041 | 3300012351 | Vadose Zone Soil | MKDHRDRKRRLQLIEAMFAGHTWQEALAQSQLHISRSTAYRLRQLARDDEKAALAFLDDRHGHPYKLIEPARTWLAEFCTSNPQLASSRVQAELKTT
Ga0137366_112208021 | 3300012354 | Vadose Zone Soil | MKDRSDRAARLQLIENMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEKKAARAFLDDRHGHPYKLTEPVQAWLVEVCTRDPQISSSRIQAELKT
Ga0137385_105118302 | 3300012359 | Vadose Zone Soil | MKDRSDRAARLHLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLA
Ga0137385_105196552 | 3300012359 | Vadose Zone Soil | MKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLHISRSTAYRLVKLARDEDKVAKAFLDDRHGHPYKVTEPVRVWLVDVCTQ
Ga0137385_109676942 | 3300012359 | Vadose Zone Soil | MKGMKDHPDRSARLHLIESMFAGQPWQAAAAQSNLRVSRTTAYRLLQLARDEEKAASAFLDDRHGHPYKMTEPVQVWLVDLCTTVP
Ga0137385_1141258013300012359Vadose Zone SoilMFAGHSWQSAVAQSQLKVSRTTAYRLVQLARDEEKAERAFLDDRHGHPYKVTAEVQVWLVDVCTKDPHIPSSQIQAELKSTFGRAVSISHI
Ga0137385_1141263813300012359Vadose Zone SoilMKDHPDRKRRLQLIEAMFAGHSWQSAVAQSQLNISRSTAYRLRQLARSDEKAELAFLDDR
Ga0137385_1151884813300012359Vadose Zone SoilMKDHPDRTARLQVMGRLFAGQSWQTAVSQSQLNISRSTAYRLVKLARDEDRAARAFLDDRHGHPYKLTEPVCVWMVDLCTQNPQIASSHVQSELQSRFGVAVSVSQ
Ga0137385_1152781413300012359Vadose Zone SoilMKDRPDRAARLQLIESMFAGHSWQAAAAQSQLKISRSTAYRLVQLARDEERAGLAFLDDRHGHPYKLTEPVQEWLVDVCSKDPQMPSS
Ga0137360_1152174413300012361Vadose Zone SoilMKDRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAAEAFLDDRHGHPYK
Ga0137361_1093645823300012362Vadose Zone SoilMKDRPDRAARLQLIESMCAGHSWRAAAAQSQLKISRSTAYRLLQLARDEEKAAAAFLDDRHGHAYKLTEPVQAWLIDVCSKDPQIPSSRI
Ga0134035_128151023300012391Grasslands SoilMKDHPDRTARLQVIGRMFAGQSWQTAVSQSQLHISRSTAYRLVKLARDEDKVAKAFLDDR
Ga0137398_1059814423300012683Vadose Zone SoilMKDHPERSARLQLIEYMFAGHSWQSAVAQSQLHISRSSAYRLRQLATIQKAAWAFLDGRHGHPYKLTEAARA
Ga0137398_1061446123300012683Vadose Zone SoilMKDHLERKGRLQLIESMFAGHSWQSAVAQSQLNVSRATAYRLRQLARDEQKAASAFLDDRHCKAASESPSV*
Ga0137396_1130063913300012918Vadose Zone SoilMKDHPERTARLHLLEHMFAGHSWQSAVSQSGLHISRSTAYRLRQQARDENKAALVFLDDRHGHSYQLTEPGCTWMVEFGTNTLQVASSRVQAELKSPFGIAV
Ga0137359_1081583413300012923Vadose Zone SoilMKDHPQRKGRLQLIEAMFAGHSWQSAVAQSKLNVSRATAYRLRQLARDEQKAASAFL
Ga0126369_1285203413300012971Tropical Forest SoilMKDHRERTARLHLLEYMFAGHSWQSAVSQSGLRISRSTAYRLRQQARDEDKAALVFLDDRHGHPYKLTEPVRVWMVEFCTNKPQVASSRVQAELKTTFGVEV
Ga0120111_115791823300013764PermafrostMKDHPERKGRMQLIEAMFAGHSWQSAVVQSQLNVSRATAYRLRQLASDEGEGALAFLD
Ga0120125_119272913300014056PermafrostMKDRPDRAARLQLIGRMFAGQSWQTAVAQSQLNISRSTAYRLVQHARD
Ga0134078_1068186513300014157Grasslands SoilMKDRPDRAARLQLIGRMFAGQSWQTAVAQSQANISRSTAYRLVKLARDEDKAPLVFLDDRHGHPYKLTEPVQRWISEVCTTTPQIPASR
Ga0182033_1035488123300016319SoilMKDHPQRKARLQLLEAMFAGHSWQSAVSQSQLNISRSTAYRLRQLARDEDKAELAFLDDRHGHPYKLIEPAR
Ga0182033_1167265813300016319SoilMKDRPDRAARLQLIGRMFAGQAWQTAVAQSQLKISRAPASRLVKLARDEDKAPLVFLDDRHGHPSKV
Ga0134069_132539513300017654Grasslands SoilMKDRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLLQLARDEEKAAEAFLDDRHGHPYKLTEPVQAW
Ga0187807_117213013300017926Freshwater SedimentMKDRPDRAARLQLIELMFTGQSWRAAAARSQLKISRSTAYRLVQFARDEEKAALAFLDDRHGHPYKLTEPVQAWLVDVCTRDPQIPSSRIQAELKTSFGI
Ga0187806_127676113300017928Freshwater SedimentMKDHRERTARLRLIEQMFAGHSWQTAAAQSQLHVSRSTAYRLLKLARDEEKAERAFLDDRHGHPYKITEPVRRWMTEFCATNPQV
Ga0187819_1079904613300017943Freshwater SedimentMKDHRERTARLRLIEQMFAGHSWQTAAAQSQLHVSRSTAYRLLKLARDEEKAERAFLDDRHGHPYKITEPVRRWMTEF
Ga0187817_1065323013300017955Freshwater SedimentMKDRPDRAARLQLIELMFTGQSWRAAAARSQLKISRSTAYRLVQFARDEEKAALAFLD
Ga0187804_1007226913300018006Freshwater SedimentRPDRAARLQLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQFARDEEKAARGIPG
Ga0187804_1037307513300018006Freshwater SedimentMKDHRERTARLRLIEQMFAGHSWQTAAAQSQLHVSRSTAYRLLKLARDEEKAERAFLDDR
Ga0187805_1025866013300018007Freshwater SedimentMKDRPDRAARLQLIEYMFAGQSWRAAAAQSQLKISRSTAYRLVQLACDEKKAAQAFLDDRHGHPYKLTEPVQAWLVEVCRVDPQIPSNRIQAE
Ga0066655_1124660813300018431Grasslands SoilMKDRPDRTARLQVIGRMFAGQTWQTAVIQSQLNISLSTASRLVKLARDEDKVARAFLDDRHGHPYKVTDPVRVWMVELC
Ga0066667_1060667613300018433Grasslands SoilMKDHPERSARLQLIEYMAAGQSWQAAVAQSKLNVSRSTAYRLAQLGRCEDKAVLALLDGRQGHPSKLTEPVQT
Ga0066662_1184569713300018468Grasslands SoilMKDHRDRTARLQLIESMFAGHSWQTAVAQSQLHVSRSTAYRLVQLARDEEKAERAFLDDRHGHVYKVTEPVQAWMIEFCTDNPQVASSRV
Ga0066662_1282749323300018468Grasslands SoilMKDRPDRAARLQLIGRMFAGQSWQTAVAQSQLNISRSTAYRLVQQARNEDKAPQIFLDDRHGHPYKVTEPVQNWINEMCTDHGQIPSSKVQRELQSRFGVTV
Ga0215015_1060208513300021046SoilMKDRPDRAARLQLIEYMFAGQSWRAAAAQSQLKISRSTAYRLVQFARDEEKAALAFLDDRHGHPYKLTEPVQEWLVEVLSLIHI
Ga0126371_1278495223300021560Tropical Forest SoilMKDHPERKGRLQLIEAMFAGHSWQEALAQSQLHISRSTAYRLRQLARDDEKAELAFLDDRHGHPY
Ga0207684_1004455383300025910Corn, Switchgrass And Miscanthus RhizosphereMKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDKAPLV
Ga0207693_1145344213300025915Corn, Switchgrass And Miscanthus RhizosphereMKDRPDRAARLHLIEYMFAGHSWQTAAAQSQLKISRSTAYRLVQLARNEEKAGLAFLDDRHGHPYKLTEPVQAWLVD
Ga0207646_1001715913300025922Corn, Switchgrass And Miscanthus RhizosphereMKDRPDRAARLQLIESMCAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAASAFLDDRHGHPYKLTEPVQAWLVDVCTRDP
Ga0207646_1026017913300025922Corn, Switchgrass And Miscanthus RhizosphereMKDRPDRAARLQLIASMCAGHSWQAAAAQSQLKISRSTAYRLVQLARDEEKAGLAFLDDRHGHPYKLTEPVQAWLVDVCSKDPQLPSSRIQAELKTGFGIQVS
Ga0207664_1075323623300025929Agricultural SoilMKDRPDRTARLQLIEYMFMGQSWQAAVTQSQLKISRSTTYRLIQLARNEEKATSAFLDDRHGHPYKITERVRLGEVCTRDPLIPSSPTGWATR
Ga0257156_112237213300026498SoilMKDHPERTARLHLLEHMFAGHSWQTAVSQSGLHMSRSTAYRLRQQARDEDKAALVFLDDRHGHPYKLTEPVRVWMVEFCTNNPQVASSRIQ
Ga0209156_1023122323300026547SoilMKDRPDRAARLQLIGRMFAGQSWQTAIAESQLHISRSTAYRLLQRARDEDKAP
Ga0209648_1022360723300026551Grasslands SoilMKDHPQRQGRLQLLEAMFAGHSWQEALAQSQLHISRSTAYRLRQLARDEDKAELAFLDDRHGIWHN
Ga0209648_1033249213300026551Grasslands SoilMKDHPEHTARLHLLEYMFAGHSWQAAVSQSGLHISRSTAYRLRQRAGDEDKAAGVFLDDRHGHAYKLTESVRAWMVEFCTDTPRVASSRIQTELKTTFGIEVSVSQMNRV
Ga0209577_1023122733300026552SoilMKDRSDRTARLHLIESMFAGQSWRAAAAQSQLKISRSTAYRLVQLARDEEKAALAFLDDRHGHPYKLT
Ga0209689_128020413300027748SoilMKDHPERTARLQLLEHMFAGHSWQAAVSQSGLHISRSTAYRLRQQARDEDKAALVFLDDRHGHPYKLTEPVCLWMVEFCT
Ga0209580_1068463013300027842Surface SoilMKDHPERKGRLQLIEAMFAGHSWQEALAQSQLHISRSTAYRLRQLARDEDKAALAFLDDRHGHPYKLLEPARTWLAQFCTTHPQVASSRV
Ga0209166_1066387723300027857Surface SoilMKDRPDRSARLQLIESMFAGQSWQAAVAQSQLKVSRSTAYRLVQMARAEEKAVEAF
Ga0209701_1040658623300027862Vadose Zone SoilMKDHPERTARLQLIESMFAGHSWQTAAAQSQLKVSRSTAYRLIQLARDEEKVALAFLDGRHGHPYKMTKPVQV
Ga0209283_1003343743300027875Vadose Zone SoilMKDHRERTARLQLIESMFTGHSWQTAVALSQLHVSRSTAYRLVKLARDEQKAERAFLDDRHGHPYKITEPVRVWMTEFCTNHP
Ga0209283_1042417623300027875Vadose Zone SoilMKDHRERTARLQLIEHMFAGHSWQTAAAQSQLKVSCSTAYRLLKLVRDEEKAERAFLDDRHGHPYKITEPVRVWMTEFCT
Ga0209283_1084581913300027875Vadose Zone SoilMKDRSDRAARLQLIESLFAGQSWRTAAAQSQLKISRSTAYRLVQLARDEEKAAEAFLDDRHGHPYKLTDKVASVAR
Ga0209590_1089612923300027882Vadose Zone SoilMKDHPDRSARLQLIESMFAGQPWQVAAAQSNLRVSRTTAYRLVQLARDEEKAAPAFWMIATAILLSWCHI
Ga0306926_1066098313300031954SoilMKDHPDRTARLHLLEYLFAGHSWQSAVSQSGLRMSRSTAYRLRQQARDEDQAALVFLDDRHGHPYKLTE
Ga0306920_10383978313300032261SoilMKDRPDRAARLQLIGRMFAGQAWQTAVAQSQLKISRATAYRLVKLARDEDKAPLVFLDDRHGHP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice