NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F098274


Overview

Basic Information
Family ID: F098274
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 104
Average Sequence Length: 168 residues
Representative Sequence: MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAK
Number of Associated Samples: 88
Number of Associated Scaffolds: 104

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 0.00 %
% of genes from short scaffolds (< 2000 bps): 0.00 %
Associated GOLD sequencing projects: 81
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.79

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group: Unclassified (100.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (29.808 % of family members)
Environment Ontology (ENVO): Unclassified (27.885 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (50.962 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 56.28%, β-sheet: 5.03%, Coil/Unstructured: 38.69%

Predicted 3D Structure

pTM-score: 0.79

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
a.126.1.0: automated matches | d6hn1a1 | 6hn1 | 0.52222
a.126.1.0: automated matches | d6hn1a2 | 6hn1 | 0.50733



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF01979 | Amidohydro_1 | 5.77
PF00221 | Lyase_aromatic | 5.77
PF02585 | PIG-L | 4.81
PF07110 | EthD | 4.81
PF01370 | Epimerase | 4.81
PF05721 | PhyH | 3.85
PF13561 | adh_short_C2 | 2.88
PF07277 | SapC | 2.88
PF06411 | HdeA | 1.92
PF05013 | FGase | 1.92
PF00072 | Response_reg | 0.96
PF00873 | ACR_tran | 0.96
PF02384 | N6_Mtase | 0.96
PF04820 | Trp_halogenase | 0.96
PF16363 | GDP_Man_Dehyd | 0.96
PF00180 | Iso_dh | 0.96
PF07702 | UTRA | 0.96
PF13649 | Methyltransf_25 | 0.96
PF02518 | HATPase_c | 0.96
PF09351 | DUF1993 | 0.96
PF13537 | GATase_7 | 0.96
PF00015 | MCPsignal | 0.96
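
The frequencies in this table are percentages of the 104 family scaffolds, so each value can be turned back into an approximate scaffold count. The following is a minimal Python sketch, assuming each listed percentage is simply count / 104 rounded to two decimals; the function name `pct_to_count` is hypothetical and not part of NMPFamsDB.

```python
# Number of Associated Scaffolds, taken from the Overview section of this page.
FAMILY_SCAFFOLDS = 104


def pct_to_count(pct, total=FAMILY_SCAFFOLDS):
    """Convert a percentage frequency to the nearest whole number of scaffolds.

    Assumption: the page reports count / total * 100, rounded to two decimals.
    """
    return round(pct * total / 100)


# Distinct frequency values that appear in the neighboring Pfam domain table.
for pct in (5.77, 4.81, 3.85, 2.88, 1.92, 0.96):
    print(f"{pct:5.2f} % -> {pct_to_count(pct)} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under that assumption, a frequency of 0.96 corresponds to a single scaffold, so most of the listed domains neighbor only one family member.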

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG2986 | Histidine ammonia-lyase | Amino acid transport and metabolism [E] | 5.77
COG2120 | N-acetylglucosaminyl deacetylase, LmbE family | Carbohydrate transport and metabolism [G] | 4.81
COG5285 | Ectoine hydroxylase-related dioxygenase, phytanoyl-CoA dioxygenase (PhyH) family | Secondary metabolites biosynthesis, transport and catabolism [Q] | 3.85
COG0840 | Methyl-accepting chemotaxis protein (MCP) | Signal transduction mechanisms [T] | 1.92
COG3741 | N-formylglutamate amidohydrolase | Amino acid transport and metabolism [E] | 1.92
COG3931 | Predicted N-formylglutamate amidohydrolase | Amino acid transport and metabolism [E] | 1.92



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 29.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 15.38%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 9.62%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.85%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 3.85%
Bog | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Bog | 3.85%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 2.88%
Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil | 2.88%
Host-Associated | Host-Associated → Plants → Peat Moss → Unclassified → Unclassified → Host-Associated | 2.88%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.92%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.92%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.92%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.92%
Bog | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Bog | 1.92%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.92%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.92%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 0.96%
Permafrost Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Permafrost Soil | 0.96%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.96%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 0.96%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.96%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.96%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.96%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.96%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.96%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001405 | Arctic peat soil from Barrow, Alaska - NGEE Surface sample 53-1 shallow-072012 | Environmental
3300001546 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002917 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm | Environmental
3300004082 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3 | Environmental
3300004091 | Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300005944 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 2 DNA2013-048 | Environmental
3300005994 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049 | Environmental
3300005995 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050 | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300006086 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300007819 | Permafrost core soil microbial communities from Svalbard, Norway - sample 2-1-2 Soapdenovo | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009500 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MG | Host-Associated
3300009638 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_10_10 | Environmental
3300009709 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fb - Sphagnum magellanicum MG | Host-Associated
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010877 | Boreal forest soil eukaryotic communities from Alaska, USA - W3-2 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300013832 | Permafrost microbial communities from Nunavut, Canada - A3_5cm_0M | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300014838 | Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly) | Environmental
3300015052 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300020140 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300022508 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-19-M (Metagenome Metatranscriptome) | Environmental
3300024222 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK32 | Environmental
3300024288 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal | Environmental
3300025905 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes) | Environmental
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300026482 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027439 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M1 (SPAdes) | Environmental
3300027512 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 (SPAdes) | Environmental
3300027559 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_O3 (SPAdes) | Environmental
3300027605 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M3 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027867 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028552 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Bog_N1_1 | Environmental
3300029883 | I_Bog_E2 coassembly | Environmental
3300029911 | III_Bog_N2 coassembly | Environmental
3300029913 | III_Bog_N3 coassembly | Environmental
3300029951 | III_Palsa_N1 coassembly | Environmental
3300030399 | II_Palsa_E2 coassembly | Environmental
3300030580 | II_Palsa_N1 coassembly | Environmental
3300030862 | Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300033405 | Lab enriched peat soil microbial communities from McLean, Ithaca, NY, United States - MB29MY | Environmental
3300033982 | Lab enriched peat soil microbial communities from McLean, Ithaca, NY, United States - MB22AY SIP fraction | Environmental
3300034163 | Peat soil microbial communities from wetlands in Alaska, United States - Goldstream_04D_14 | Environmental

Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI20186J14852_10079261 | 3300001405 | Arctic Peat Soil | MKSYMLIGCLVFAAVPCACRAAPLNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVEALLEALTIVKPEAKNPMDLSTLECASFAARHTEFMRTDSESATTIMMWLFGFSVARSGSHIFDAD
JGI12659J15293_100443382 | 3300001546 | Forest Soil | VKILTGIVLTVCLAPCMSRAAQLNIATLSCEKYENEVLPASVTNPTADNINTVMWLFGYSVAKAGGYVMYPEALTAFGFALDGECKSNPAESVLDALAVVKPEPKNPLNLSTLECSTFAPRHMELARTDPESATTIMMWLFGFSTAKSGSRIFDAD
JGI12659J15293_101309151 | 3300001546 | Forest Soil | MRFLAAITLAICLIPCVSRSAELNLATLTCGRYENEVLPAAATNPIADSLNTVMWLFGYSVAKSGGHVMYSEALAPFGFALDNECKSNPGEVMLEALTIVKPETKEPLDLANVECGSFASRHAEFARTDTESANTIMMWLFGFSVA
JGIcombinedJ26739_1007647481 | 3300002245 | Forest Soil | VKILTGIVLTVCLAPCMSRAAQLNIATLSCEKYENEVLPASVTNPTADNINTVMWLFGYSVAKAGGXVMYPEALTAFGFALDGECKSNPAESVLDALAVVKPEPKNPLNLSTLECSTFAPRHMELARTDPESATTIMMWLFGFSTAKSGSRIFDAD
JGIcombinedJ26739_1008059551 | 3300002245 | Forest Soil | MRFLAAITLAICLIPCVSRSAELNLATLTCGRYENEVLPAAATNPIADSLNTVMWLFGYSVAKSGGHVMYSEALAPFGFALDNECKSNPGEGMLEALAIVKPETKEPLDLAHVECGSFASRHAEFARTDAESANTIMMWLFGFSVAKSGSHIFDADSLQSFA
JGIcombinedJ26739_1011354132 | 3300002245 | Forest Soil | MKSYMLIGCLVFAAVPCACRAAPLNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVETLLEALTIVKPEARNPMDLSTLECASFAARHTEFMRTDPESATTIMMWLFGFSVARSGSHLFDADA
JGI25616J43925_103274771 | 3300002917 | Grasslands Soil | MLRVSTMKSFALFVLALSLIPCIGSAGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDNECKSNPDESLLNALAIVKPDAKNPMDLSALECAAFAARHVELARTDPESATTIMMWLFGFSVGRSGSRIFDANSLSLFQTALL
Ga0062384_1005875722 | 3300004082 | Bog Forest Soil | MRFLAAITLAVCLIPCVSRSAELNLATLTCGKYENEVLPAAATNPVADSINTVMWLFGYSVAKSGGHVMYSQALAPFGFALDNECKSNPGEVMLEALTIVKPETKEPLDLANVECGSFASRHAEFARSDTESANTIMMWLFGFSVAKSGSHIFDADSLQSFAATLLAECGK
Ga0062387_1011218381 | 3300004091 | Bog Forest Soil | GRCRRAANSSAGYHVRYSVRPIHRPDPRVPIVKNIAAIAYIICLCPCVSHADPLNLATLTCDKYENEILPAAVSNPTADTINTVMWLFGYSVAKSGAHVMYPDALTPFGFALDGECKSNPAESMLEALAIVKPETKSPKNLATLDCAVFAPRHVELARTDPESATTIMMWLFGFSVARSGSHIFDAELLRPFQTALLAYCAKH
Ga0062389_1003907632 | 3300004092 | Bog Forest Soil | MTKPLALIALSLCFALSSRAAELNLATVSCDKYENEVLPAAAVSPTADSIDTVMWLFGYSVARSGAHVLYPDALAPFGFALDGECKANPHENLLDALSVVKPESKNPMDLATLGCSLFASRHVALQLADHESADTIMMWLFGFSTARSASRLFNPDAVGSFRTALLAECAKHPDLSLFDALTGVKMRSKTPRPAQRP*
Ga0070733_103465191 | 3300005541 | Surface Soil | MVKILIGIVLTVCLAPCMSRAAQLNLATLSCAKYENEVLPASVTNPIADNINTVMWLFGYSVAKSGGYVMYPEALTAFGFALDGECKSNPADSVLDALAIVKPETKNPLDLTTLECSRFAPRHIELVRTDPESATTIMMW
Ga0070696_1005019991 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MKFAAPIALFALCLVPCLSRAAVLDLATLTCAQYENQVLPAAASNPDADSINTVMWLLGFSVAKSGGHEMYPEALSPFGFGLDNECKSNPTETMLNALAAVKPESKTPMDLAALQCSAFASRHAQFASSDPESADTIMMWLFGFAVARSGSHLFDASARKAFEAALLAECAKR
Ga0070766_111331371 | 3300005921 | Soil | MKTVFALAVVLGLLPCAGRAATLNLATLSCGKYENEVLPAAAVNPTADSINTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESMLDALAIVKPEPKNPMNLTSLECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSRIFDADSLSSFQAALLADC
Ga0066788_100202501 | 3300005944 | Soil | MKTLAAIAVFVIFVPCMSRAGELNLATVTCAKYENEVLPAAATNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDGECKSNPIESLLDALAIVKPEAKNPMNLTALECAAFASRHIELARTDPESANTIMMWLFGFSVARSGSHLFDADSLAAFQTALLA
Ga0066789_100764412 | 3300005994 | Soil | MALSGLLPFASRAAELDLAALTCAKYEGEVLPAAATNPTADSLNTVMWLLGYAVAKSGARVMYPEALAPFGFALDGECKTNPAETLLDALAIAKPEAKNPLNLSALECPAFATRHMELARTDPESATTIMMGLFGFSVARSGSHSFDAQLLKAFEGKLLNYCAQHPAASLFDALSAVKISKTAK*
Ga0066790_102936181 | 3300005995 | Soil | MKLFAGIVLAACLIPCMSRAAELNLATLTCGKYEYEVLPAAATNPTVDSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDGECKSNPAESLLDALGSVKPEPKNPMNLTTLECALFASRHVELARTDPESANTIMMWL
Ga0075017_1016492211 | 3300006059 | Watersheds | VKIIAAIACLICLFPCVSRAAALNLATLTCAKYENEILPAAVSNPTADTINTVMWLFGYSVAKSGGHVMYPDALRPFGFALDGECKSNPAESMLDALAIVKPETKNPRNLATLDCAVFAPRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADLLGPFQTALLAY
Ga0075019_109119571 | 3300006086 | Watersheds | MKLFCVIAVSLCLFAGIGRAAELNLATLNCGKYENEILPAAAANPIADPINTVMWLFGYSVAKSGAHVMYSDALVPFGYALDGECKSNPLESMLDALAIVKPESKNPMDLSTLECALFASRHIELARTDPESATTIMMWLFGFS
Ga0099795_100229052 | 3300007788 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK*
Ga0104322_1107923 | 3300007819 | Permafrost Soil | MLEQVAVHSNDMRCHTARMRFLAATTLAVCLIPCVSRSADLNLATLTCDKYENEVLPAAATNPTADSIDTVMWLFGYSVAKSGKHVMYSEALAPFGFALDGECKSNPGEVILEALTIVKPETKDPLDLANVECESFASRHTEFARSDTESANTIMMWLFGFSVASSGGHIFEADSLQ
Ga0099829_103840202 | 3300009038 | Vadose Zone Soil | LATLTCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDSECKSNPNESLLEALALIKPDPKNPMDLTALECAAFAPRHLGLARTDPESATTIMMWLFGFSVARSGSHIFDANSLSLFQSALLADCAKYPSRTLFDALTAVKTPRPAKTPDR*
Ga0116229_101979321 | 3300009500 | Host-Associated | MILGVCALSGVSGAAELNLATLSCDKYENEVLPAAATNPTIDSLNTVMWLMGYAVAKSGQHVLYPEALSPFGFALDDECKSNPRETLLEALAIVKPQPKNPMDLNSLPCATFAQRHVDLARGDPDSATTIMMWLFGFSVAKSGSHLFDADALG
Ga0116113_11278372 | 3300009638 | Peatland | VKTVVALAFVLVLLPCAGRAGTLNLATLSCGKYENEVLPAAATNPTADSINTVMWLFGYSVAKSGAHVMYPDALAPFGFALDNECKSNPAESMLDALNIVKPEAKNPMDLSAVECSSFAARHAEFRRSDPESATTIAMWLYGFSVATSGSHLFNADAVGAFETSLLA
Ga0116227_100391435 | 3300009709 | Host-Associated | MKCGAATVLGLSCCVAASMSRAAELNLATLSCDKYETEILPAAAAAVDPAADSINTVMWLFGYAVAKSGAHVLYPDALAPFGFALDGECKSNPAENLLDALSIVKPETKNPMDLATLECATFTSRHVRTASADPESAHTIMMWLFGFSVAKAGGHLFNAEGLNGFQSSLFAECANHPGMSVFDALSAKKSAPRGARPAPKPSITSP*
Ga0116227_102195471 | 3300009709 | Host-Associated | LKFFTLMMLGIWLLPVASGAAELNLATLTCDKYENEVLPAAASNPTMDSLNTVMWLFGYAVAKSGAHVMYPEALAPFGFALDGECKSNPRESLLEALAIVKPEAKNPMDIGTLTCATFASRHVDLARGDPESATTIMMWLFGFSVAKSGGHLLDAD
Ga0134125_110979342 | 3300010371 | Terrestrial Soil | MKSPAPIALFSLCVIFCLVSRPSRAAVLDLARLTCAQYENQVLPAAASNPDADSINTVMWLLGFSVAKSGGHDMYPEALSPFGFGLDNECKSNPGENMLSALAAVKPESKTPMDLSALECSAFASRHAQFASSDPESADTIMMWLFGFAVARSGSHLFDASARKAFEATLLADC
Ga0134125_112933641 | 3300010371 | Terrestrial Soil | MKFPALIALLARCLIFCLIPCLGRAAVLDLATLTCAQYEDQVLPAAASNPDADSINTVMWLLGYSVAKSGGHDMYPEALSPFGFGLDNECKSNPAENMLNALAAVKPESKTPMDLSALECSAFASRHAQFARSDPESADTIMMWLF
Ga0126356_110671431 | 3300010877 | Boreal Forest Soil | VRFLCAVACCLCIVPCASRAAPLNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPLESLLEALTIVKPEAKNPMDLSRLECASFAARHAQFLGTDPDSATTIMMWLFGFSVARSGSHIFDADAFGSFQ
Ga0137393_101247863 | 3300011271 | Vadose Zone Soil | MKFVAPVALGLCLIPCISRAGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDSECKSNPNESLLEALALIKPVPKNPMDLTALECAAFAPRHLGLARTDPESATTIMMWLFGFSVARSGSHIFDANSLSLFQSALL
Ga0137388_109487051 | 3300012189 | Vadose Zone Soil | AGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPNALAPFGFALDSECKSNPNESLLEALALVKPDPKNPMDLTALECAAFAPRHLELARTDPESATTIMMWLFGFSVARSGSHIFDANSLSLFQSALLADCAKYPSRTLFDALTAVKTPKPAKTPG*
Ga0137388_110793211 | 3300012189 | Vadose Zone Soil | MKLFAAIALGVCLAPCMSPAAELNLATLTCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPGESMLGALAIVKPEPKNPMNLTLLECAAFASRPLELTRPDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCGK
Ga0137363_101985162 | 3300012202 | Vadose Zone Soil | MKFVAPVALGLCLIPCISRAGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDSECKSNPNESLLEALALIKPDPKNPMDLTALECAAFAPRHLGLARTDPESATTIMMWLFGFSVARSGSHIFDANSLSLFQSALLADCAKYPSRTLFDALTAVKTPRPAKTPDR*
Ga0137399_104121483 | 3300012203 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESMLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK*
Ga0137362_101071743 | 3300012205 | Vadose Zone Soil | SEMKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK*
Ga0137385_101175013 | 3300012359 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCEKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK*
Ga0137360_100475113 | 3300012361 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCEKYESEVLPDAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK*
Ga0137361_106419511 | 3300012362 | Vadose Zone Soil | MKPFAAIALGLCLTPCISRAGALNLATLTCEKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESILDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLF
Ga0137358_100380063 | 3300012582 | Vadose Zone Soil | VSRAAEINLATLTCGKYENEVLPAAASNPTADSLNTVMWLLGYSVAKSGAHVMYPGALAPFGFALDGECKTNPAESLLDALGIVKAEPKNPMDLTALECAAFASRHVELARTDAESATTIMMWLFGFSVARSG
Ga0137358_100692771 | 3300012582 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESMLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFS
Ga0137398_104126801 | 3300012683 | Vadose Zone Soil | MKLFAAIALGICFAPRMSPAAELNLATLTCGKYENEVLPAAASNPAVDSLNTVMWLFGYSVAKSGAHVMYPAALGPFGFALDNECKSNPAESLLAALAVVKPEVKHPMDLTTVECAAFASRHIELARTDPQSATTIMMWLFGFSVARSGSHIFDADSVSSFQAA
Ga0137359_116092701 | 3300012923 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMM
Ga0137416_100845872 | 3300012927 | Vadose Zone Soil | MKFVAPVALGLCLIPCISRAGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDSECKSNPNESLLEALALVKPDPKNPMDLTALECAAFAPRHLELARTDPESATTIMMWLFGFSVARSGSHIFDANSLSLFQSALLADCAKYPSRTLFDALTAVKTPKPAKTPG*
Ga0137407_101892041 | 3300012930 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMHPEALGPFGFALDNECKSNPAESMLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK*
Ga0120132_10376371 | 3300013832 | Permafrost | MKSYMLICCLVFAAVPCRAAPLNLATLTCAKYENEVLPAAAANPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPAEALLEALTIVKPEAKNPIDLSTLECASLAARHTEFMRTDAESATTIMMWLFGFSVARSGSHIFDPDALVTFQTGLLADCAGHPGITLLDALTPFGLHGLRIEHSNDAVGVTHR*
Ga0182024_110982441 | 3300014501 | Permafrost | VRFLAAIALCLCIVPGACRAAPLNLATLTCSKYENDVLPAAATNPTADSLNTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVESLLEALAIVKPEAKNPMDISHLECASFAARHARFISTDPDSATTIMMWLFGFSVA
Ga0182030_1004075010 | 3300014838 | Bog | MKTGAAFALSLLVLPCLGRAGALNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYAVAKSGGHVMYPDALAPFGFALDGECKSNPSESMLDALSIVKPEAKSPMDLSAVECASFAARHAEFARSDPESATTIM
Ga0182030_101035024 | 3300014838 | Bog | MRILASIALGLCVFPCMSRGGELNLATLTCAKYENEVLPAAATNPAADSINTVMWLFGFSVAKSGGHVMYPDALAPFGFALDNECKSNPIESLLDALASVKPEARNPMNLATLDCGVFAQRHMELARTDPESANTIMMWLFGFSVARSGSHLFDADSLAAFQTALLADCAKRPNVSLFDSLSSVKTPK
Ga0137411_10203391 | 3300015052 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAK
Ga0137411_11590631 | 3300015052 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCEKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESMLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYP
Ga0137411_11590643 | 3300015052 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESMLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYP
Ga0137411_11840381 | 3300015052 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYP
Ga0137411_12467752 | 3300015052 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESMLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAK
Ga0137409_101268792 | 3300015245 | Vadose Zone Soil | MKSFALFVLALSLIPCIGSAGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDSECKSNPNESLLEALALIKPDPKNPMDLTALECAAFAPRHLGLARTDPESATTIMMWLFGFSVARSGSHIFDANSLSLFQSALLADCAKYPSRTLFDALTAVKTPKPAKTPG*
Ga0182034_120747961 | 3300016371 | Soil | MKLLPAIILAAGFSPCLGLAAELNLATLSCEKYENEILPNAVSEPVADPINIVMWLFGYSVAKSGAHVLYADALTAFGFALDGECKSNPRESVLDALSVVKPETKNPMDLTTLECTPFASRHNELARSDPESSTTIMMWLL
Ga0179590_10480891 | 3300020140 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIF
Ga0179592_101328341 | 3300020199 | Vadose Zone Soil | MKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALAIVKPEPKNPMNLTTLECATFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYP
Ga0210403_101489012 | 3300020580 | Soil | MKLFAAIALGLCLTPCMSGAGALNLATVSCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARAGSHIFDADSLNSFQAALLADCGKYPKRPLFDALSSVKPPKPAK
Ga0210403_109357651 | 3300020580 | Soil | MVPCACRAAPLNLATLTCGKYENEVLPAAAANPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVEALLEALTIVKPEAKNPMDLSTLECASFAARHTEFMRTDSESATTIMMWLFGFSV
Ga0210399_109171271 | 3300020581 | Soil | MKLFAAIALGLCLTPCISGAGALNLAAVSCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLSSFQAALLADCGKYPKRPLFDALSSVKPPKPAK
Ga0210406_103713532 | 3300021168 | Soil | MKLFAAIALGLCLTPCMSGAGALNLATVSCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLS
Ga0210406_105599131 | 3300021168 | Soil | MKFFAAIALSFSLFPCMSHAAELNLATLSCEKYENEVLPAAATNPKADSINTVMWLFGYSVARSGGHVMYPDALAPFGFALDNECKSNPRENMLDALAAVKLESKNPMDLGDVECASFAAKHVEFVKTDAESANTIMMWLFGFSVARSGSHLFNADSLNSFQTTMLAY
Ga0210396_106463551 | 3300021180 | Soil | MKFLAAITLAICLAPCLCRSAELNLATLTCGKYENEVLPAAAANPVADSINTVMWLFGYSVAKSGAHVLYSDALAPFGFALDNECKNNPAETMLEALAIVKPETKNPMDLANVECGSFASRHAEFARSDTDSANTIMMWLFGFSVARSGSHIFDADSVQSFA
Ga0210397_112468961 | 3300021403 | Soil | MSGAGALNLATVSCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGS
Ga0210389_102088182 | 3300021404 | Soil | MKTFPGILLAACLLPCLSRAAELNLATLSCGKYENEVLPAAVSNPVADSINTVMWLFGYSVAKSGAHVMYPDALTAFGYALDGECKSNPVESLLDALAVVKPDAKNPMDLTALECATFAPRHIELARTDSESANTIMMWLFGFSVARSGSHLFNADLLSSFQTALLADCAKHPKMALFDALGSVKIAKPAK
Ga0210386_100161931 | 3300021406 | Soil | MKLFAAVALGVCMAPCLSRAAELNLATLSCGKYENEVLPAAASNPTADSLNTVMWMFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESMLDALAIVKPEPKNPMDLASLDCAAFASRHMELARTDPESATTIMMWLFGFSVARSGSHIFNADSLSSFQTALLADCAKQPKQ
Ga0210394_100783952 | 3300021420 | Soil | MKFIAGIALIVCLFPGMSRATELNLATLTCAKYENEVLPAAAATNTVADSINTVMWLFGYSIAKSGAHVMYPDALTPFGFALDGECKSNPPESLLDALGFVKPETQNPMNLTTLECAVFASRHVELARTDPESATTIMMWLFGFSVARSGSHMFDADSVGSFQSALLADCAKHPHISLFDALTAASRPKAADKPKATATPNGVKSPAHP
Ga0210384_102817902 | 3300021432 | Soil | MKLFAAIALGLCLTPCMSGAGALNLATVSCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLSSFQAALLADCGKYPKRPLFDALSSVKPPKPAK
Ga0210402_1022396513300021478SoilPRADTQSEMKLFAAIALGLCLTPCMSGAGALNLATVSCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARAGSHIFDADSLNSFQAALLADCGKYPKRPLFDALSSVKPPKPAK
Ga0210410_1006568113300021479SoilMKLFAAIALGLCLTPCMSSAAELNLATLTCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGGHVMYPEALAPFGFALDGECKSNPAESLLDALGIVKPEPKNPMDLTALDCGTFAPRHVELARTDPESATTIMMW
Ga0222728_110519513300022508SoilEKYENEVLPAAASNPTADSLNTVMWLLGYSVAKSGAHVMYPEALAPFGFALDNECKSNPRESLLDALAIVKPEPKNPMDLTLLECSAFASRHVALTRTDPESATTIMMWLFGFSVARSDSHIFAANSVSLFQTALLAECAKHPKQPLFGALTAVKMSKPAK
Ga0247691_105901313300024222SoilMKFLLGIVLGACLFPCAGRAAELNLATLSCGKYENEVLPAAVSSPTADTINTVMWLFGYSVAKSGARVMYPEALTAFGFALDGECKSNPVESLLDALAIVKPEAKSPMDLTSLECATFASRHIELARTDPESATTIMMW
Ga0179589_1047141713300024288Vadose Zone SoilLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPA
Ga0207685_1082855113300025905Corn, Switchgrass And Miscanthus RhizosphereMKFLLGIVLGACLFPCVGRAAELNLATLSCGKYENEVLPAAVSSPTADTINTVMWLFGYSVAKSGARVMYPEALTAFGFALDGECKSNPVESLLDALAIVKPEAKSPMDLTSLECATFASRHIELARTDPESATTIMM
Ga0209131_115613523300026320Grasslands SoilMLRVSTMKSFALFVLALSLIPCIGSAGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDNECKSNPDESLLNALAIVKPDAKNPMDLSALARHVELARTDPESATTIMMWLFGFSVGRSGSRIFDANSLSLFQTALLADCAKHPNQPLFDALTAAVKTSKPAK
Ga0257172_108805013300026482SoilMETDRVLIDQEHFPPVDLGSRLILKMKLFAAIALGLCLTPCISRAGVLNLATLTCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLAALECAAFASRHVELARTDPESA
Ga0179587_1014802213300026557Vadose Zone SoilMKLFAAIAFGLCLTPCISRAGALNLATLTCEKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK
Ga0209332_101388223300027439Forest SoilAIIPCASRAAELNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYSVAKSGGHVMYPDALAPFGFALDNECKSNPVESLLEALTIVKPEAKNPMDLSVLECASFASRHAQFLLSDPESATTVMMWLFGFSVARSGSHIFDANALGSFQTGLLADCAKRPGITLFDALSAVKTPKAAK
Ga0209179_100274813300027512Vadose Zone SoilMKLFAAIAFGLCLTPCISRAGALNLATLTCQKYESEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALGPFGFALDNECKSNPAESLLDALGIVKPEPKNPMDLTALECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSLNSFQAALLADCAKYPKRPLFDALSTPKPPRPAK
Ga0209222_102198623300027559Forest SoilMRFLAAITLAVCLIPCVSRSAELNLATLTCGKYENEVLPAAAANPIADSINTVMWLFGYSVAKSGGHVMYSEALAPFGFALDNECKSNPGEGMLEALAIVKPETKDPLDLANVECGSFASRHAEFARSDTESANTIMMWLFGFSVARSGSHIFDADSLQSFAAA
Ga0209329_112517013300027605Forest SoilMKSYMLIGCLVFAAVPCACRAAPLNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVETLLEALTIVKPEARNPMDLSTLECASFAARHTEFMRTDPESATTIMMWLFGFSVARSGSHLFDADALGAF
Ga0209701_1023374113300027862Vadose Zone SoilMETDRVLIDQEHFPPVDLGSRLILKMKLFAAIALGLCLTPCISRAGVLNLATLTCGKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPEALAPFGFALDNECKSNPGESLLDALAIVKPEPKNPMNLTTLECAAFASRHVELARTDPESATTIMMWLFGFSVARSGSHIFDADSL
Ga0209167_1029413923300027867Surface SoilMVKILIGIVLTVCLAPCMSRAAQLNLATLSCAKYENEVLPASVTNPIADNINTVMWLFGYSVAKSGGYVMYPEALTAFGFALDGECKSNPAETVLDALAIVKPEPKNPMNLATLECSTFAPRHIELARTDPESATTIMMWLFGFSTAKSGSYIFD
Ga0209006_1011601823300027908Forest SoilMKSYMLIGCLVFAAVPCACRAAPLNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVETLLEALTIVKPEARNPMDLSTLECASFAARHTEFMRTDPESATTIMMWLFGFSVARSGSHLFDADALGAFQTGLLADCASHPGITLLEALSPAKTKKPAGKPPNRS
Ga0209006_1074250213300027908Forest SoilMKTFPGILLAACLLPCLSRAAELNLATLSCGKYENEVLPAAVSNPVADSINTVMWLFGYSVAKSGAHVMYPDALTAFGYALDGECKSNPVESLLDALAVVKPDPKNPMDLTSLECATFAPRHIELARTDPESANTIMMWLFGFSVARSGSHLFNADLLGSFQTALLADCAKHPKLALFDALGSVKIPKPAK
Ga0137415_1002169823300028536Vadose Zone SoilMKISAAIALSLFLLPCMSRAGELNLATLTCGKYENEVLPAAAVNPSADSLDTVMWLFGYSVARSGAHVMYSDALAPFGFALDNECKSNPAESLLDALAIVKPESKHPMDLTNVECGPFASRHVELARTDAESAKTIMMWLFGFSVGRSGSHIFDANSLSLFQTALLADCAKHPNQPLLGALSAVKTSKPAK
Ga0137415_1022142723300028536Vadose Zone SoilMKFVAPVALGLCLIPCISRAGELNLATLTCAKYENEVLPAAASNPTADSLNTVMWLFGYSVAKSGAHVMYPDALAPFGFALDSECKSNPNESLLEALALVKPDPKNPMDLTALECAAFAPRHLELARTDPESATTIMMWLFGFSVARSGSHIFDANSLSLFQSALLADCAKYPSRTLFDALTAVKTPKPAKTPG
Ga0302149_120941713300028552BogMKTGAAFALSLLVLPCLGRAGALNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYAVAKSGGHVMYPDALAPFGFALDGECKSNPSESMLDALSIVKPEAKSPMDLSAVECASFAARHAEFARSDPESATTIMMWLNGFSVAASGSHMFDADSVHSFETSL
Ga0311327_1037853623300029883BogLKFVAAIALLITLLPCVSRAGELNLATLTCQKYENEVLPAAATNPDADSINTVMWLFGYSVARSRAHVMYPDALSPFGFALDGECKSNPTETLLDALAIVKPESKNPMDLTDVECAKFAARHVEFERSDPESANTIMMWLFG
Ga0311361_1015528143300029911BogMKTGAAFALSLLVLPCLGRAGALNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYAVAKSGGHVMYPDALAPFGFALDGECKSNPSESMLDALSIVKPEAKSPMDLSAVECASFAARHAEFARSDPESATTIMMWLNGFSVAASG
Ga0311362_1096641723300029913BogMKTGAAFALSLLVLPCLGRAGALNLATLTCGKYENEVLPAAATNPTADSINTVMWLFGYAVAKSGGHVMYPDALAPFGFALDGECKSNPSESMLDALSIVKPEAKSPMDLSAVECASFAARHAEFARSDPESATTIMMWLNGFSVAASGSHMFDADSVHS
Ga0311371_1188320823300029951PalsaLKTPAAIALGLLLIPCFGHAAALNLSSLTCDKYENEVLPAAATNPTADSINTVMWLFGYAVAKSGAHVMFPDALAPFGFALDNECKSNPAENMLAALSIVKPEAKNPLDISAVECASFAARHAEFARSDTESATTIMMWLYGFSVA
Ga0311353_1125816423300030399PalsaMKLLAATALVVCLSPCMSRAAALNLATLTCGKYENEVLPAAAAAANPNADSINTVMWLFGYSVARSGAHVLYPDALAPFGFALDNECKTNPRESMLDALAAVKIESRNPMDLSNLESATFASRHVEMSKTDPESANTIMMWLFGF
Ga0311355_1105621523300030580PalsaMIRSAGIPLVLCLVAGMSHAAALNLATVTCGKYQNEILPAAASNPTADPINTVMWLFGFAVAKSGSHFLYPEGLEPFGFALDGECRNNPAESMLDALAAVKIEAKSPMDLANLASSAFASRHVEMARTDPDSANTIMMWLFGFSVARSGGHMFDADSLNAFQTSLLADCAKHPATSLFDTLTSIKIAKPAK
Ga0265753_114797213300030862SoilLIGCLVFALVPCACRAAPLNLATLTCGKYENEVLPAAAVDPTADSINIVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVEALLEALTIVKPEAKNPMDLSTLECASFAARHTEFMRTDSESATTIMMWLFGFSVARSGSHIFDADALGAFQTELLADCAGH
Ga0170824_11043777313300031231Forest SoilMKFLTGIVLGACLFPWTSRAAELNLATLSCGKYENEVLPAAVSNPTADTINTVMWLFGYSVAKSGAHVMYPEALTAFGFALDGECKSNPVESLLDALAIVKPEAKNPIDLTALECATFASRHIELARTDPESATTIMMWLFGFWVARSGSHIFDADSL
Ga0302324_10353858913300031236PalsaTGNLDRVQRRGPGIRLPGEQNDSAAERLLKMKLLAAIALCSSLLPCMSRAAVLNLATLTCAKYENEVLPAAASNPDADSINTVMWLFGFSVAKAGAHVLYPDALSPFGFALDNECKSNPAESMMDALAIVKPESKNPMDLANVDCSAFASRHAQFVSSDPESATTIMM
Ga0310686_11129765123300031708SoilMKFVAAIALLLSLLPCVSRAGELNLATLTCQKYENEVLPAAATNPDADSINTVMWLFGYSVARSRAHVMYPDALSPFGFALDGECKSNPAESLLDALAVVKPESKNPMDLANVECAAFTSRHSEFERSDPESANTIMMWLFGFSVARSGSHIF
Ga0307476_1096259513300031715Hardwood Forest SoilMRCLAAILLAICLIPCVSRSAELNLATLTCGKYENEVLPAAATNPIADSLNTVMWLFGYSVAKSGGHVMYSEALAPFGFALDNECKSNPGEVMLEALAIVKPETKEPLDLANVECGSFASRHAEFARTDTESANTIMMW
Ga0307477_1022744523300031753Hardwood Forest SoilMKSYLLIGCLIFALVPCACRAAPLNLATLTCGKYENEVLPAAAANPTADSINTVMWLFGYSVAKSGGHVLYPDALAPFGFALDNECKSNPVEALLEALTIVKPEAKNPMDLSTLECASFAARHTEFMRTDSESATTIMMWLFGFSVARSGSHTFDADALGAFQTELLA
Ga0307475_1114338113300031754Hardwood Forest SoilMFRPPHELTMKFLTGIVLAACLLPCMSRAAELNLATLRCERYENEVLPASVSNPTADNINTVMWLFGYSVAKSGAHVMYPEALTAFGFALDGECKSNPAESVLDALAIVKPEPKNPLNLAALDCAAFASRHIALASTDAESATTIMMWLFGFSVARSGSHIF
Ga0307470_1164797613300032174Hardwood Forest SoilMKMLAAIAVVLWVLIPGVSRAAELNLATQTCGKYEGEVLPAAATNPTADSLNTVMWLLGFAVAKSGARVMYPDALAPFGFALDNECKSNPAETLLDALAIAKPDAKNPLNLTTLECAAFASRHVDLERTDPESATTIMMWLFGFSVARSGSHLFDADLLGAFEGKLL
Ga0326727_1005140873300033405Peat SoilMKSFAAIALAACVLPCMGRAAELNLATLTCGKYENEVLPAAATNPSADSINTVMWLFGYSVAKSGGHFMYSDALAPFGFALDNECKSNPAETMLDALASVRLDLKNPMDLSVLESSTFASRHVEFARSDPESANSIMMWLFGFSTARSGSHVFDADQLGAFQTALLAYCG
Ga0371487_0317555_2_4603300033982Peat SoilMKSFAAIALAACVLPCMGRAAELNLATLTCGKYENEVLPAAATNPSADSINTVMWLFGYSVAKSGGHFMYSDALAPFGFALDNECKSNPAETMLDALASVRLDLKNPMDLSVLESSTFASRHVEFARSDPESANSIMMWLFGFSTARSGSHVF
Ga0370515_0120585_59_5863300034163Untreated Peat SoilMSHAAALNLATVTCGKYQNEILPAAASNPTADPINTVMWLFGFAVAKSGSHFLYPEGLEPFGFALDGECRNNPAESMLDALAAVKIEAKSPMDLANLASSAFASRHVEMARTDPDSANTIMMWLFGFSVARSGGHMFDADSLNAFQTSLLADCAKHPATSLFDTLTSIKIAKPAK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice