NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F050582


Overview

Basic Information
Family ID F050582
Family Type Metagenome / Metatranscriptome
Number of Sequences 145
Average Sequence Length 107 residues
Representative Sequence MATASIGGLEGLRLPDLLVLIKQTAEGARSRRVLDPRAEIVPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIRSPLSMKAKNPTTTGNPG
Number of Associated Samples 93
Number of Associated Scaffolds 145

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 84.83 %
% of genes near scaffold ends (potentially truncated) 21.38 %
% of genes from short scaffolds (< 2000 bps) 76.55 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.61
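
The percentages above can be turned back into approximate gene counts. The following is a minimal sketch, assuming each percentage is computed over all 145 family sequences (the page does not state the denominator explicitly); the helper name `pct_to_count` is hypothetical.

```python
FAMILY_SIZE = 145  # "Number of Sequences" reported in Basic Information


def pct_to_count(pct: float, total: int) -> int:
    """Convert a percentage of `total` to the nearest whole count."""
    return round(pct * total / 100)


# Percentages copied from the Quality Assessment block.
quality_pct = {
    "valid RBS motif": 84.83,
    "near scaffold end": 21.38,
    "short scaffold (< 2000 bp)": 76.55,
}

# Assumption: every percentage uses FAMILY_SIZE as its denominator.
quality_counts = {
    label: pct_to_count(pct, FAMILY_SIZE) for label, pct in quality_pct.items()
}

for label, n in quality_counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE} genes")
```

Under that assumption, each value maps to a near-integer count, which suggests the percentages are simple fractions of the family size.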


Most Common Taxonomy
Group Unclassified (53.793 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil (25.517 % of family members)
Environment Ontology (ENVO) Unclassified (41.379 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (80.000 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 50.72%, β-sheet: 0.00%, Coil/Unstructured: 49.28%

Predicted 3D Structure

pTM-score: 0.61

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 145 Family Scaffolds
PF01565	FAD_binding_4	4.83
PF12704	MacB_PCD	2.76
PF02687	FtsX	2.76
PF04030	ALO	1.38
PF05532	CsbD	1.38
PF05239	PRC	1.38
PF01850	PIN	1.38
PF00691	OmpA	1.38
PF00773	RNB	0.69
PF13802	Gal_mutarotas_2	0.69
PF08843	AbiEii	0.69
PF06210	DUF1003	0.69
PF13483	Lactamase_B_3	0.69
PF00375	SDF	0.69
PF13545	HTH_Crp_2	0.69
PF00027	cNMP_binding	0.69
PF00486	Trans_reg_C	0.69
PF13247	Fer4_11	0.69
PF14067	LssY_C	0.69
PF00982	Glyco_transf_20	0.69
PF04972	BON	0.69
PF01740	STAS	0.69
PF00890	FAD_binding_2	0.69
PF00005	ABC_tran	0.69
PF13633	Obsolete Pfam Family	0.69
PF09095	AmyA-gluTrfs_C	0.69
PF12681	Glyoxalase_2	0.69
PF10282	Lactonase	0.69
PF00881	Nitroreductase	0.69
PF04366	Ysc84	0.69
PF13188	PAS_8	0.69
PF00575	S1	0.69
PF13618	Gluconate_2-dh3	0.69
PF06283	ThuA	0.69
PF07366	SnoaL	0.69
PF08282	Hydrolase_3	0.69

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 145 Family Scaffolds
COG0277	FAD/FMN-containing lactate dehydrogenase/glycolate oxidase	Energy production and conversion [C]	1.38
COG3237	Uncharacterized conserved protein YjbJ, UPF0337 family	Function unknown [S]	1.38
COG0380	Trehalose-6-phosphate synthase, GT20 family	Carbohydrate transport and metabolism [G]	0.69
COG0557	Exoribonuclease R	Transcription [K]	0.69
COG0560	Phosphoserine phosphatase	Amino acid transport and metabolism [E]	0.69
COG0561	Hydroxymethylpyrimidine pyrophosphatase and other HAD family phosphatases	Coenzyme transport and metabolism [H]	0.69
COG1877	Trehalose-6-phosphate phosphatase	Carbohydrate transport and metabolism [G]	0.69
COG2217	Cation-transporting P-type ATPase	Inorganic ion transport and metabolism [P]	0.69
COG2253	Predicted nucleotidyltransferase component of viral defense system	Defense mechanisms [V]	0.69
COG2930	Lipid-binding SYLF domain, Ysc84/FYVE family	Lipid transport and metabolism [I]	0.69
COG3769	Mannosyl-3-phosphoglycerate phosphatase YedP/MpgP, HAD superfamily	Carbohydrate transport and metabolism [G]	0.69
COG4420	Uncharacterized membrane protein	Function unknown [S]	0.69
COG4776	Exoribonuclease II	Transcription [K]	0.69
COG4813	Trehalose utilization protein	Carbohydrate transport and metabolism [G]	0.69



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	53.79 %
All Organisms	root	All Organisms	46.21 %


Associated Scaffolds


Scaffold	Taxonomy	Length
2088090014|GPIPI_17190438	All Organisms → cellular organisms → Bacteria	1717
3300000550|F24TB_15841631	Not Available	506
3300000559|F14TC_100258445	Not Available	867
3300001431|F14TB_100891083	Not Available	856
3300002245|JGIcombinedJ26739_100006712	All Organisms → cellular organisms → Bacteria	9001
3300002245|JGIcombinedJ26739_101120893	Not Available	674
3300003372|JGI26336J50218_1014834	Not Available	563
3300004080|Ga0062385_10005765	All Organisms → cellular organisms → Bacteria → Acidobacteria	3884
3300004080|Ga0062385_10472202	Not Available	767
3300004082|Ga0062384_100197803	All Organisms → cellular organisms → Bacteria	1183
3300004104|Ga0058891_1430805	Not Available	557
3300004104|Ga0058891_1482623	Not Available	538
3300004114|Ga0062593_100000183	All Organisms → cellular organisms → Bacteria	19153
3300004114|Ga0062593_101727172	Not Available	685
3300004120|Ga0058901_1559604	Not Available	732
3300004135|Ga0058884_1371319	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	933
3300004157|Ga0062590_101199763	Not Available	740
3300004157|Ga0062590_101440405	Not Available	687
3300004631|Ga0058899_10028797	Not Available	677
3300004631|Ga0058899_10229574	All Organisms → cellular organisms → Bacteria → Acidobacteria	1347
3300004631|Ga0058899_11998284	Not Available	568
3300004631|Ga0058899_12008671	Not Available	760
3300004631|Ga0058899_12190112	Not Available	672
3300004631|Ga0058899_12266511	Not Available	623
3300004631|Ga0058899_12288535	Not Available	650
3300004631|Ga0058899_12302138	Not Available	516
3300005468|Ga0070707_100169718	All Organisms → cellular organisms → Bacteria	2126
3300005471|Ga0070698_100037546	All Organisms → cellular organisms → Bacteria → Acidobacteria	4994
3300005526|Ga0073909_10008304	All Organisms → cellular organisms → Bacteria	3106
3300005536|Ga0070697_100325298	Not Available	1324
3300005537|Ga0070730_10009132	All Organisms → cellular organisms → Bacteria	8262
3300005537|Ga0070730_10010130	All Organisms → cellular organisms → Bacteria	7757
3300005541|Ga0070733_10008037	All Organisms → cellular organisms → Bacteria	6766
3300005549|Ga0070704_100003574	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	8936
3300005591|Ga0070761_10004365	All Organisms → cellular organisms → Bacteria	8372
3300005602|Ga0070762_10473960	Not Available	817
3300005602|Ga0070762_10766373	All Organisms → cellular organisms → Bacteria	651
3300005610|Ga0070763_10905339	Not Available	525
3300005610|Ga0070763_10941754	Not Available	515
3300005921|Ga0070766_10944293	Not Available	592
3300006102|Ga0075015_100769167	Not Available	576
3300006174|Ga0075014_100075530	All Organisms → cellular organisms → Bacteria → Acidobacteria	1516
3300006174|Ga0075014_100743996	Not Available	574
3300006176|Ga0070765_100576103	Not Available	1061
3300006176|Ga0070765_101162211	Not Available	729
3300006176|Ga0070765_101550709	Not Available	623
3300006806|Ga0079220_10236395	Not Available	1083
3300007982|Ga0102924_1000065	All Organisms → cellular organisms → Bacteria	118867
3300011120|Ga0150983_10903096	Not Available	958
3300011120|Ga0150983_11103766	Not Available	675
3300011120|Ga0150983_11554161	Not Available	605
3300011120|Ga0150983_12020869	Not Available	756
3300011120|Ga0150983_12127505	Not Available	582
3300011120|Ga0150983_12610540	Not Available	617
3300011120|Ga0150983_13342186	Not Available	975
3300011120|Ga0150983_13473165	Not Available	606
3300011120|Ga0150983_13833369	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidipila → unclassified Acidipila → Acidipila sp.	969
3300011120|Ga0150983_13979694	Not Available	692
3300011120|Ga0150983_14735542	Not Available	913
3300011120|Ga0150983_15205245	Not Available	553
3300011120|Ga0150983_15283247	Not Available	610
3300011120|Ga0150983_15311077	Not Available	517
3300011120|Ga0150983_15554038	Not Available	645
3300011120|Ga0150983_15684459	Not Available	508
3300011120|Ga0150983_16276609	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis	858
3300011120|Ga0150983_16570268	Not Available	542
3300012096|Ga0137389_10544161	Not Available	997
3300012683|Ga0137398_10884803	Not Available	623
3300012931|Ga0153915_10095290	All Organisms → cellular organisms → Bacteria	3148
3300012960|Ga0164301_10054182	All Organisms → cellular organisms → Bacteria	2085
3300012986|Ga0164304_10028204	All Organisms → cellular organisms → Bacteria	2831
3300019185|Ga0184587_136702	Not Available	531
3300019361|Ga0173482_10054257	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1314
3300019887|Ga0193729_1004290	All Organisms → cellular organisms → Bacteria	6925
3300019887|Ga0193729_1115738	Not Available	1004
3300020579|Ga0210407_10521328	All Organisms → cellular organisms → Bacteria	928
3300020580|Ga0210403_10001256	All Organisms → cellular organisms → Bacteria	24428
3300020580|Ga0210403_10040773	All Organisms → cellular organisms → Bacteria	3711
3300020580|Ga0210403_10631985	Not Available	862
3300020581|Ga0210399_10138102	All Organisms → cellular organisms → Bacteria	2010
3300020581|Ga0210399_10222731	All Organisms → cellular organisms → Bacteria	1571
3300020582|Ga0210395_10000095	All Organisms → cellular organisms → Bacteria → Acidobacteria	70962
3300020583|Ga0210401_10223486	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1742
3300021168|Ga0210406_10297518	Not Available	1313
3300021170|Ga0210400_11040782	Not Available	664
3300021171|Ga0210405_10668916	Not Available	804
3300021171|Ga0210405_10791300	All Organisms → cellular organisms → Bacteria	727
3300021171|Ga0210405_11265490	Not Available	543
3300021178|Ga0210408_10230181	All Organisms → cellular organisms → Bacteria	1476
3300021401|Ga0210393_10321923	All Organisms → cellular organisms → Bacteria	1257
3300021401|Ga0210393_11012549	Not Available	672
3300021403|Ga0210397_11305898	Not Available	564
3300021405|Ga0210387_10338394	All Organisms → cellular organisms → Bacteria	1328
3300021411|Ga0193709_1057487	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	897
3300021420|Ga0210394_10004973	All Organisms → cellular organisms → Bacteria	15152
3300021420|Ga0210394_11783166	Not Available	513
3300021432|Ga0210384_10000297	All Organisms → cellular organisms → Bacteria	77961
3300021432|Ga0210384_10054530	All Organisms → cellular organisms → Bacteria	3622
3300021433|Ga0210391_10094315	All Organisms → cellular organisms → Bacteria	2361
3300021478|Ga0210402_10049389	All Organisms → cellular organisms → Bacteria	3676
3300021478|Ga0210402_10312684	Not Available	1455
3300021478|Ga0210402_11185143	Not Available	691
3300021559|Ga0210409_11697791	Not Available	508
3300022557|Ga0212123_10000537	All Organisms → cellular organisms → Bacteria → Acidobacteria	122288
3300024179|Ga0247695_1006843	All Organisms → cellular organisms → Bacteria → Acidobacteria	1625
3300024182|Ga0247669_1017512	All Organisms → cellular organisms → Bacteria → Acidobacteria	1255
3300024271|Ga0224564_1031394	All Organisms → cellular organisms → Bacteria	998
3300024271|Ga0224564_1034904	Not Available	954
3300024284|Ga0247671_1043772	Not Available	698
3300025915|Ga0207693_10760941	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	748
3300025922|Ga0207646_10076473	All Organisms → cellular organisms → Bacteria	2991
3300026285|Ga0209438_1072687	All Organisms → cellular organisms → Bacteria → Acidobacteria	1119
3300027073|Ga0208366_1018199	All Organisms → cellular organisms → Bacteria	766
3300027562|Ga0209735_1027676	All Organisms → cellular organisms → Bacteria	1180
3300027575|Ga0209525_1012689	Not Available	2029
3300027635|Ga0209625_1011949	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1891
3300027645|Ga0209117_1077639	Not Available	934
3300027768|Ga0209772_10021257	All Organisms → cellular organisms → Bacteria → Acidobacteria	1844
3300027821|Ga0209811_10019216	All Organisms → cellular organisms → Bacteria → Acidobacteria	2226
3300027857|Ga0209166_10000968	All Organisms → cellular organisms → Bacteria	26218
3300027857|Ga0209166_10028165	All Organisms → cellular organisms → Bacteria	3433
3300027867|Ga0209167_10026878	All Organisms → cellular organisms → Bacteria	2752
3300027889|Ga0209380_10172415	All Organisms → cellular organisms → Bacteria	1267
3300027889|Ga0209380_10808266	Not Available	531
3300028906|Ga0308309_10499265	Not Available	1052
3300028906|Ga0308309_11077146	Not Available	695
3300029636|Ga0222749_10004162	All Organisms → cellular organisms → Bacteria → Proteobacteria	6109
3300029636|Ga0222749_10155227	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis	1118
3300030803|Ga0074037_1640229	Not Available	559
3300030884|Ga0265758_106135	Not Available	587
3300030950|Ga0074034_10934917	Not Available	539
3300031043|Ga0265779_105431	Not Available	698
3300031057|Ga0170834_100328723	All Organisms → cellular organisms → Bacteria	1011
3300031128|Ga0170823_13523842	Not Available	565
3300031231|Ga0170824_101757959	All Organisms → cellular organisms → Bacteria	1212
3300031247|Ga0265340_10121284	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1203
3300031708|Ga0310686_109682150	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1030
3300031718|Ga0307474_10534938	All Organisms → cellular organisms → Bacteria → Acidobacteria	920
3300031754|Ga0307475_10103323	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	2227
3300031866|Ga0316049_124777	Not Available	501
3300031962|Ga0307479_10901731	Not Available	856
3300032119|Ga0316051_1033856	Not Available	509
3300032174|Ga0307470_11303846	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	595
3300032515|Ga0348332_10952698	Not Available	569
3300032515|Ga0348332_14459363	Not Available	516
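
Each scaffold identifier above appears to combine a numeric prefix with a scaffold name, separated by `|`; the prefix matches the Taxon OID values listed under Associated Samples. The sketch below, using four rows copied from the table, splits the identifier, parses the lineage, and flags scaffolds under the 2000 bp cutoff quoted in the Quality Assessment block. The names `Scaffold` and `parse_row` are hypothetical, and reading the prefix as a Taxon OID is an inference from this page, not a documented rule.

```python
from typing import NamedTuple, Tuple

SHORT_SCAFFOLD_BP = 2000  # cutoff quoted in the Quality Assessment block


class Scaffold(NamedTuple):
    taxon_oid: str             # assumed: prefix before "|" is the IMG/M Taxon OID
    name: str                  # scaffold name after "|"
    lineage: Tuple[str, ...]   # empty when taxonomy is "Not Available"
    length: int                # scaffold length in bp


def parse_row(scaffold_id: str, taxonomy: str, length: int) -> Scaffold:
    """Split one table row into its components."""
    taxon_oid, name = scaffold_id.split("|", 1)
    if taxonomy == "Not Available":
        lineage: Tuple[str, ...] = ()
    else:
        lineage = tuple(part.strip() for part in taxonomy.split("→"))
    return Scaffold(taxon_oid, name, lineage, length)


# Four rows copied from the Associated Scaffolds table (an excerpt, not the full list).
rows = [
    ("2088090014|GPIPI_17190438", "All Organisms → cellular organisms → Bacteria", 1717),
    ("3300000550|F24TB_15841631", "Not Available", 506),
    ("3300004080|Ga0062385_10005765",
     "All Organisms → cellular organisms → Bacteria → Acidobacteria", 3884),
    ("3300007982|Ga0102924_1000065", "All Organisms → cellular organisms → Bacteria", 118867),
]

scaffolds = [parse_row(*row) for row in rows]
short = [s for s in scaffolds if s.length < SHORT_SCAFFOLD_BP]
short_fraction = len(short) / len(scaffolds)

print(f"{len(short)} of {len(scaffolds)} excerpt scaffolds are shorter than "
      f"{SHORT_SCAFFOLD_BP} bp")
```

Running the same filter over all 145 rows would give a figure comparable to the "% of genes from short scaffolds" value in the overview; the excerpt fraction here is not representative of the whole family.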




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	25.52%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	23.45%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	9.66%
Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil	8.97%
Surface Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil	5.52%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	4.14%
Bog Forest Soil	Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil	3.45%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	2.76%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	2.76%
Watersheds	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds	2.07%
Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil	2.07%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	2.07%
Iron-Sulfur Acid Spring	Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring	1.38%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	1.38%
Plant Litter	Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter	1.38%
Freshwater Wetlands	Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands	0.69%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	0.69%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	0.69%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	0.69%
Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere	0.69%
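
The habitat paths above are hierarchical, so the distribution can be rolled up to a coarser level by truncating each path and summing the percentages. The following is a minimal sketch on four rows copied from the table; `aggregate_by_level` is a hypothetical helper, and the assumption is that `→` always separates levels.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def aggregate_by_level(
    rows: Iterable[Tuple[str, float]], depth: int
) -> Dict[Tuple[str, ...], float]:
    """Sum percentages over the first `depth` levels of each habitat path."""
    totals: Dict[Tuple[str, ...], float] = defaultdict(float)
    for path, pct in rows:
        levels = tuple(part.strip() for part in path.split("→"))
        totals[levels[:depth]] += pct
    # Round to two decimals to match the precision shown in the table.
    return {key: round(value, 2) for key, value in totals.items()}


# Four rows copied from the habitat table (an excerpt, not the full list).
excerpt = [
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 25.52),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 23.45),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil", 3.45),
    ("Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere", 0.69),
]

by_level2 = aggregate_by_level(excerpt, depth=2)
for key, pct in by_level2.items():
    print(" → ".join(key), f"{pct}%")
```

Because only an excerpt is used, the totals do not sum to 100%; feeding in all twenty rows would give the full breakdown at the chosen depth.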




Associated Samples

Taxon OID	Sample Name	Habitat Type
2088090014	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300000550	Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly	Environmental
3300000559	Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly	Environmental
3300001431	Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemly	Environmental
3300002245	Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)	Environmental
3300003372	Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM1	Environmental
3300004080	Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3	Environmental
3300004082	Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3	Environmental
3300004104	Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF218 (Metagenome Metatranscriptome)	Environmental
3300004114	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5	Environmental
3300004120	Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF238 (Metagenome Metatranscriptome)	Environmental
3300004135	Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF204 (Metagenome Metatranscriptome)	Environmental
3300004157	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2	Environmental
3300004631	Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)	Environmental
3300005468	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG	Environmental
3300005471	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG	Environmental
3300005526	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005537	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1	Environmental
3300005541	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1	Environmental
3300005549	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG	Environmental
3300005591	Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1	Environmental
3300005602	Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2	Environmental
3300005610	Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3	Environmental
3300005921	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6	Environmental
3300006102	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013	Environmental
3300006174	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014	Environmental
3300006176	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5	Environmental
3300006806	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100	Environmental
3300007982	Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG	Environmental
3300011120	Combined assembly of Microbial Forest Soil metaT	Environmental
3300012096	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG	Environmental
3300012683	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG	Environmental
3300012931	Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG	Environmental
3300012960	Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG	Environmental
3300012986	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG	Environmental
3300019185	Soil microbial communities from Bohemian Forest, Czech Republic - CSE2 metaT (Metagenome Metatranscriptome)	Environmental
3300019361	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2)	Environmental
3300019887	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2	Environmental
3300020579	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M	Environmental
3300020580	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M	Environmental
3300020581	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M	Environmental
3300020582	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O	Environmental
3300020583	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M	Environmental
3300021168	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M	Environmental
3300021170	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M	Environmental
3300021171	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M	Environmental
3300021178	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M	Environmental
3300021401	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O	Environmental
3300021403	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O	Environmental
3300021405	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O	Environmental
3300021411	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3c2	Environmental
3300021420	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M	Environmental
3300021432	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M	Environmental
3300021433	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O	Environmental
3300021478	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M	Environmental
3300021559	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M	Environmental
3300022557	Paint Pots_combined assembly	Environmental
3300024179	Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK36	Environmental
3300024182	Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10	Environmental
3300024271	Soil microbial communities from Bohemian Forest, Czech Republic - CSU5	Environmental
3300024284	Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK12	Environmental
3300025915	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)	Environmental
3300025922	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)	Environmental
3300026285	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)	Environmental
3300027073	Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF010 (SPAdes)	Environmental
3300027562	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes)	Environmental
3300027575	Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O1 (SPAdes)	Environmental
3300027635	Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes)	Environmental
3300027645	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)	Environmental
3300027768	Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM1 (SPAdes)	Environmental
3300027821	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)	Environmental
3300027857	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)	Environmental
3300027867	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes)	Environmental
3300027889	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)	Environmental
3300028906	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)	Environmental
3300029636	Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)	Environmental
3300030803	Metatranscriptome of forest soil microbial communities from Dalarna County, Sweden - Site 2 - Mineral C3 (Eukaryote Community Metatranscriptome)	Environmental
3300030884	Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSA5 (Metagenome Metatranscriptome)	Environmental
3300030950	Metatranscriptome of forest soil microbial communities from Dalarna County, Sweden - Site 2 - Mineral N3 (Eukaryote Community Metatranscriptome)	Environmental
3300031043	Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZA6 (Metagenome Metatranscriptome)	Environmental
3300031057	Oak Coassembly Site 11 - Champenoux / Amance forest	Environmental
3300031128	Oak Summer Coassembly Site 11 - Champenoux / Amance forest	Environmental
3300031231	Coassembly Site 11 (all samples) - Champenoux / Amance forest	Environmental
3300031247	Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-CB2-25 metaG	Host-Associated
3300031708	FICUS49499 Metagenome Czech Republic combined assembly	Environmental
3300031718	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05	Environmental
3300031754	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515	Environmental
3300031866	Metatranscriptome of soil microbial communities from Bohemian Forest, Czech Republic - CSU5 (Metagenome Metatranscriptome) (v2)	Environmental
3300031962	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515	Environmental
3300032119	Metatranscriptome of soil microbial communities from Bohemian Forest, Czech Republic - CSE5 (Metagenome Metatranscriptome) (v2)	Environmental
3300032174	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05	Environmental
3300032515	FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)	Environmental

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_034611502088090014SoilMATTSLNGLEVLRLPELLVLIKRTAEWARNRRVLNPREAMVADEVKEAKVLYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDRLRSKMSVRTLAVNEKNPTPTSNRI
F24TB_1584163113300000550SoilMATTSLSGLEVSRLPELLVLIKRTAEWARNRRVLNPREAMVPAEVKEAKILYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDRLRSKMSVRTRREREDPSAD*
F14TC_10025844513300000559SoilMATTSLSGLEVLRLPELLVLIKRTAEWARNLRVLNPSETMVADEVKEAKVLYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDRLRSKMSVRTRREREDPSAD*
F14TB_10089108313300001431SoilMATTSLSGLEVSRLPELLVLIKRTAEWARNRRVLNPREAMVPAEVKEAKILYARARAWPLGEPPRRAEPKWQKPGDVEAAFFALYDALRSKMRVRTRREREEPNSD*
JGIcombinedJ26739_100006712103300002245Forest SoilMATASIGSLEGLRLPDLLVLVKQAAERARICRMLDPRAAMVPDEVMEAKVLYSWARAWPLGEPQRRVEPNWRRPGDVEAAFFALYDGLRSSAAQLPWEA*
JGIcombinedJ26739_10112089313300002245Forest SoilMATASIGGLEGLRLSDLLVLIKQTAECARNSRVRDPRAAIGPDEVLQAKVLYTRARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQT
JGI26336J50218_101483413300003372Bog Forest SoilMATATLSGLEVLRLPDLLVLVKQTAERARSRRVLDPSAVMIPDEVMEAKILYARARAWIVGETQRRVEPRWRGLGDVEATFFALYDVLRSQISPYSSRTRSTQRPPAS*
Ga0062385_1000576543300004080Bog Forest SoilMATASPGGLEGLRLADLLVLVKQTAEQARNLRVHNPCAAMVPEEVLKATVLYARARVWLLGKSYRRAEPKWRRPGDIEAAFFALYDALGSQISLTSVLKGQMHGESYI*
Ga0062385_1047220213300004080Bog Forest SoilMATASISGLDGVSLQDLLVLVKQTAERARKCRVLNPQAAMVPDDVMEAKVLYARARTWLLGEPQRRAEPKWRRPGDVEATFFVLFDALRSHARSAFVVNEKNPGPTNTISNF*
Ga0062384_10019780333300004082Bog Forest SoilPMATASPGGLEGLRLADLLVLVKQTAEQARNLRVHNPCAAMVPEEVLKATVLYARARVWLLGKSYRRAEPKWRRPGDIEAAFFALYDALGSQISLTSVLKGQMHGESYI*
Ga0058891_143080513300004104Forest SoilMATASFGSLESLRLPDMLVLIKQTAECARSRRVIDPRATIEADEGLQAKVLYTRARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRLQIRSPLLVNAKNPTTTGNHS*
Ga0058891_148262313300004104Forest SoilMATASIGGLEGSRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIPAMGEREHSNGNGNPS*
Ga0062593_100000183113300004114SoilMATTSLSGLEVLCLPELLVLIKRTAEWARNRRVLNPREAMVANEVNEAKVLYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDRLRSKMSVRTRREREEPNAD*
Ga0062593_10172717213300004114SoilMATASLGSLESLRLPDLLVLIKQTAERARSRRVIDPRAAIEADEGLQAKVLYTRARAWLLGGSPRRVEPKWRRPGDVEAAFFALYDALRSQIGSPLLVDAKNPTTTGHYS*
Ga0058901_155960413300004120Forest SoilMATASIGGQEGLHLPDLLVLIKRTAECARNRRVLDPRAAIVPDEVQQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQNRSLLLMNAKNPTTAGHPG*
Ga0058884_137131913300004135Forest SoilMATTSIGGLDGLGLQDLLVLVKQTAERARNCRVLNPRAAMVPDELVEAKVLCARARAWPLGEPQRRAEPKWRRPGDVEATFFALYDALRFHITSAFAVNEKNPAPADTISDY*
Ga0062590_10119976323300004157SoilMATTSLSGLEVLCLPELLVLIKRTAEWARNRRVLNPREAMVANEVNEAKVLYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDRLRSKMSVRTRRER
Ga0062590_10144040513300004157SoilMATASLGSLESLRLPDLLVLIKQTAERARSRRVIDPRAAIEADEGLQAKVLYTRARAWLLGGSPRRVEPKWRRPGDVEAAFFALYDALRSQIG
Ga0058899_1002879713300004631Forest SoilMATASIGGQEGLRLPDLLVLIKHTAECARNRRVLDPRAAIVPDEVQQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQNRSLLLMNAKNPTTAGHPS*
Ga0058899_1022957413300004631Forest SoilLGLQDLLVLVKQTAERARNSRVLDPQAAMAPDEVMEAKFLYTRARAWPLREPQRRAEPKWRRPGDVEATFFALYDALRSHIRSAFVVKEKNSAPTSAISSF*
Ga0058899_1199828413300004631Forest SoilMATASIGGLESLRLPDLLVLIKQTAEGARSRRVLDPRAEIVPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIRPPLSMKAKNPTTTGNPG*
Ga0058899_1200867113300004631Forest SoilMATASIRGLEGLRLPELLVLIKQTAEYARSRRVLDPRARIVPEEVRQAKVLYARARAWLSGASPRRAEPKWRGLGDVEAAFFALYEVLRSQIRSPLLVNAKYRTATGDHS*
Ga0058899_1219011213300004631Forest SoilMATASIGGLEGSRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGDSPRRAEPKWRRPGDVEATFFALYDALRSQIPAMGEREQPNGSLS*
Ga0058899_1226651113300004631Forest SoilMATAPMGGLEGLRLPDLLVLIKQTAEWARNRRVLDPRAAIVPAEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP*
Ga0058899_1228853523300004631Forest SoilMATASIGGQEGLHLPDLLVLIKRTAECARNRRVLDPRAAIVPDEVLEAKVLYTRARAWLLGESPRRAEPKWRRLGDVEAAFFALYDALRSQATSSHS*
Ga0058899_1230213823300004631Forest SoilMATAPMGGLEGMRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNTENHRQLATIAHS*
Ga0070707_10016971833300005468Corn, Switchgrass And Miscanthus RhizosphereMATTSRSGVAVLRLPELLGLIKRTAEWARNRRVLNPREAMVPDEVKDAEILYARARAWPLGEPLRRAEPKWRKPGDVEAAFFALYDALRSRMYVRTRREREEPNPD*
Ga0070698_10003754633300005471Corn, Switchgrass And Miscanthus RhizosphereMATTSRSGVEVLRLPELLGLIKRTAEGARNRRVLNPRQAMVPDEVKDAEILYARARAWPLGEALRRAEPKWRKPGDVEASFFALYDALRSRIYVRTRLEREEPNPD*
Ga0073909_1000830443300005526Surface SoilMATASLGSLESLRLPDLLVLIKQTAERARSRRVIDPRAAIEADEGLQAKVLYTRARAWLLGESPRRVEPKWRRPGDVEAAFFALYDALRSQIGSPLLVDAKNPTTTGNYS*
Ga0070697_10032529813300005536Corn, Switchgrass And Miscanthus RhizosphereMATTSLGGLEVLRLPELLVLIKRTAEWARNRRVLNPREAMVPNEVKEAKVLYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDALRSKTWVRTAVNEKHPTPISTRSHFDSL*
Ga0070730_1000913243300005537Surface SoilMATASIISVDGVGLQDLLVLVKQTAERARNYRVRNPQASMAPEEVTEATVLYARARAWPLRDPQRRAEPKWRRPGDVEAAFFALYDALRFHISSAFVVNDKNPAPTSTISSF*
Ga0070730_1001013033300005537Surface SoilMATASLRGLPVLRLPDLLVLIKRTAEWARNLRLQNLREAMIPDEVQQAKALFARARAWPLNEPLLRAEPKWRKPGDVEAAFFALYDALRSQMRACTLGKPDVLKLGLPESKS*
Ga0070733_1000803743300005541Surface SoilMATASLSGLPVLRLPDLLVLIKRTAEWARNLRLQNLREAMIPDEVQQAKALFARARAWPLNEPLLRAEPKWRKPGDVEAAFFALYDALRSQMRACTLGKPDVLKLGLPESKS*
Ga0070704_10000357443300005549Corn, Switchgrass And Miscanthus RhizosphereMATASLGSLESLRLPDLLDLIKQTAERARSRRVIDPRAAIEAEEGLQAKVLYTRARAWLLGGSPRRVEPKWRRPGDVEAAFFALYDALRSQIGSPLLVDAKNPTTTGHYS*
Ga0070761_1000436573300005591SoilMATAPMGGLEGLRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILEAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP*
Ga0070762_1047396023300005602SoilMATAVIGGLEGLRLADLLVLIKQTAECARVRRVLDPRAAIVPDEVLEATVLYTRARAWLLGESPRRVEPKWRRPGDVEAAFFALYDALRRQTRSPLFMNAKNPTVAGDYS*
Ga0070762_1076637323300005602SoilAECARNRRVLDPRAAIVPDEVLEAKVLYTRARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIRSPLSMKAKNPTTTGNPG*
Ga0070763_1090533923300005610SoilMATAPMGGLEGMRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGEGPRRAEPKWRRPGDVEAAFFALYDALRSQIKSPLSMNAKNPTTTGNLS*
Ga0070763_1094175423300005610SoilMTTTSIGSLDGLGLHDLLLLVKQTAERARNSRVLNPRAAMVPDELMEAKVLCARARAWPLGEPQRRAEPRWRRPGDVEATFFALYDALR
Ga0070766_1094429313300005921SoilMATASIGGQEGLHLPDLLVLLKQTADRARNRRVLDPRAAIVADEVLQAKVLYTRARAWLLGECPRRAEPKWRRLGDVEAAFFALYDALRSQATSSHS*
Ga0075015_10076916713300006102WatershedsGLRLPDLLVLIKQTAECARSQRVLNPRATIVNDEVLQAKVLFTRARAWLLGISPRRAEPKWRRPGDVEAAFFALYDALRSQIRAQLFVNAKNATTTSNRT*
Ga0075014_10007553023300006174WatershedsVATASMGGLEGLRLPDLLVLIKQTAECARSQRVLNPRATIVNDEVLQAKVLFTRARAWLLGISPRRAEPKWRRPGDVEAAFFALYDALRSQIRAQLFVNAKNATTTSNGT*
Ga0075014_10074399623300006174WatershedsMATATLSGLPVLRLPDLLDLIKRTADWARSLRLQNLSAAMIPDEVQQAKALFARARAWPLNEPLLRAEPKWQKPGDVEAAFFALYDALRSQMRVCTRGTSDAFRLALPESKF*
Ga0070765_10057610323300006176SoilMATASIGGQEGLHLPDLLVLIKRTAECARNRRVLDPRAAIVPDEVLEAKVLYTRARAWLLGESPRRAEPKWRRLGDVEAAFFALYDVLRSQTRSPHS*
Ga0070765_10116221113300006176SoilMATAPMGGLEGMRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILQATVLYTRARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQITSPLLMNAKNIDGWQP*
Ga0070765_10155070923300006176SoilMATASIGSLQGLHLPDLLVLVKQAAERARICRMLNPRAAMVPDEVMEAKVLYSRARAWPLGEPQRRVEPKWRRPGDVEAAFFALYDGLRSSAVQPLGKRDG*
Ga0079220_1023639513300006806Agricultural SoilMATASINGLDGVGLQDLLVLVKQTAERARKCRVLNPRAVMAPDEVIEAKVLYAQARAWPLGEPQWRAEPKWRRPGDVEATFFVLFDALRSYASSAFVVSEKNPAPTSTISNS*
Ga0102924_1000065343300007982Iron-Sulfur Acid SpringMAIASINGPEVLRLPDLLLLVKQTAERARNCRVLNPRALMVPDEVLEAKVLYAQARAWPLREPQRRAEPKWRRPGDVEAAFFALYDALRSHIKSAFVV*
Ga0150983_1090309613300011120Forest SoilMATASIGVQEGLHLPDLLVLLKQTAERARKRRVLDPRAAIVPDEVLQAKVLYTRARAWLLAETPRRAEPKWRRLGDVEAAFFALYDALRSQATSPPPLMNAKNPRTAGNPS*
Ga0150983_1110376613300011120Forest SoilMATASIGGQEGLRFSDLLVLIKQTAERARNRRVLDPRATILPDEVLQAKVLYTRARAWLLGESPRRVEPKWRRLGDVEAAFFALYDALRSQTGSSPS*
Ga0150983_1155416113300011120Forest SoilMSTASIGGLEGLRLPDLLVLIKQTAECARNRRVLDPCAAIVPDEVLQAKVLYTRARAWLLGESPRRVEPKWRRPGDVEAAFFALYEALRSQITPPLFVRAKNSSMTGNQS*
Ga0150983_1202086913300011120Forest SoilMATASIAGKQGLHLPDLLVLIKQTAEWARNRRVLDPRAAIVPHEVLEAKVLYKRARAWLWGEGPRRAEPKWRRLGEVEAAFFALYDALRLQNRSPLLMNAKNPGTAGNHS*
Ga0150983_1212750513300011120Forest SoilMGGLEGLRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILEAKVLYTRARAWLLGESPRRSEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP*
Ga0150983_1261054013300011120Forest SoilMATASFGSLESLRLPDMLVLIKQTAECARSRRVIDPRATIEADEGLQAKVLYTRARAWLLGESPRRAEPKWRRPGDVEAAFFALYDELRLQIRSPLLVNAKNPTTTGNYS*
Ga0150983_1334218613300011120Forest SoilMATASIGSLEGLRLPDLLVLVRQAAERARICRMLNPRAAMVPDEVMEAKVLYSWARAWPLGEPQRRVEPKWRRPGDVEAAFFALYDGLRSSTVQTPGKHDGH*
Ga0150983_1347316513300011120Forest SoilMATTSISGLDGVGLQDLLVLVKQTAERARKCRMLNPQAVMIPDELIEAKVLYARARAWPLGEPQRRAEPKWRRPGDVEATFFVLFDALRSYARSAFVVNERNPAPTSTLSNF*
Ga0150983_1383336913300011120Forest SoilLAHGAGSFLNGGQPMATASISGLDDLGLQDLLVLVKQTAERARNSRVLDPQAAMAPDEVMEAKFLYTRARAWPLREPQRRAEPKWRRPGDVEATFFALYDALRSHIRSAFVVKEKNSAPTSAISSF*
Ga0150983_1397969413300011120Forest SoilMATASIGGLEGSRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGDSPRRAEPKWRRPGDVEAAFFALYDALRSQIPAMGEREHSNGNGNPS*
Ga0150983_1473554223300011120Forest SoilMTTTSIGSLDGLGLHDLLLLVKQTAERARNSRVLNPRAAMVPDELMEAKVLCARARAWPLGEPQRRAEPRWRRPGDVEATFFALYDALRFHISSAFVVNGKKNSAG*
Ga0150983_1520524513300011120Forest SoilMATASIGGQEGLRLPDLLVLIKQTAEGARSRRVLDPRAEIVPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIKSPLSMNAKNPTTTGNLS*
Ga0150983_1528324713300011120Forest SoilMATASIGGLEGLRLPDLLVLIKQTAEGARSRRVLDPRAEIVPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIRSPLSMKAKNPTTTGNPG*
Ga0150983_1531107713300011120Forest SoilMATASIGGLEGSRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDVLRSQISAMGEREQPN
Ga0150983_1555403813300011120Forest SoilMATASIGGLEGLRLPDLLVLIKETAESARYLRVLDPRAAIAPDEVQRAKVLYSRARAWLLGESSRRAEPKWRRPGDVEAAFFALYDALRRQTSYS*
Ga0150983_1568445913300011120Forest SoilLRLPDLLVLIKQTAEWARNRRVLDPRAAIVPAEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP*
Ga0150983_1627660923300011120Forest SoilMATASIGGQEGLRLPDLLVLIKHTAECARNRRVLDPRAAIVPDEVLEAKVLYTRARAWLLGESPRRAEPNWRKLGQVEAAFFALYDALRSQSRSPLLGNAKNPRRLAIIADS*
Ga0150983_1657026813300011120Forest SoilHLPDLLVLIKQTAECARNRRVLDPCAAIVPDEVLAAKVLYTRTRAWLLGESPRRAEPKWRRLADVEAAFFALYDALRSQTRSPHS*
Ga0137389_1054416113300012096Vadose Zone SoilMATASIGSLQGLRLPDLLVLVKQAAERARICRMLNPRAAMVPDEVMEAKVLYSRARAWPLGKPQRRAEPKWRRPGDVEAAFFALYDALRSTPVQHSWEA*
Ga0137398_1088480313300012683Vadose Zone SoilMATTSGSGLEVLRLPELLVLIKRPAEWARNRRVLNPREAMVPDEVKDAEILYARARAWPLGEPLRRAEPKWRKPGDVEAAFFALNDALRSRMYVRACRERKEPDAG*
Ga0153915_1009529033300012931Freshwater WetlandsMATASISGLEGSRLPDLLLLVKQTAECARNRRVLNPRAAMAPDEVLEAKVLYARARAWPLGKPQWRAEPRWRWPGEVEAAFFALYDALRYKLGPHSS*
Ga0164301_1005418223300012960SoilMATASFGSLESLRLPDLLVLIKQTAECARSRRVIDPRAAIEADEGLQAKVLYTRARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRLQIRSPLLVNAKNPTTTGNYS*
Ga0164304_1002820423300012986SoilMATGSFGNLESLRLPDLLVLIKQTAEYARSRRVIDARAAIEADEGLQAKVLYARARAWLQGKSPRRAEPKWRRPGDVEAAFFALYDALRSQIRSPLLVKAKNPTTTSNIS*
Ga0184587_13670213300019185SoilMATASLTGLEALRLPDLLVLVKQTAEGARSRRVLDPSAIMIPDELTKAKILYARARAWIVGENQRRAEPKWRGLGDVEAAFFALYDALRSQIRPHSSRT
Ga0173482_1005425713300019361SoilMATTSLSGLEVLCLPELLVLIKRTAEWARNRRVLNPREAMVANEVNEAKVLYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDRLRSKMSVRTRREREEPNAD
Ga0193729_100429093300019887SoilMATTSLRALEVLRLPELLVLIKRTAEWARNRRVLNPREAMVPDEVKEAKVLYARARAWPLREPLRRAEPKWQKPGDVEAAFFALYDALRSKTRVVSVNEKHPSPISTISHF
Ga0193729_111573813300019887SoilMATTSLSGLEVLRLPELLVLIKRTAESARNRRVLNPRQEIVADEVKEAEVLYARARAWPLEEPLRRAEPKWQKPGDVEAAFFALYDALGSKMCVRNRGGEEPNRD
Ga0210407_1052132813300020579SoilMATAPIGGLEGLCLPDLLVLIKQTAEWARNRRVLDPRAAIVPAEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP
Ga0210403_10001256153300020580SoilMATAPMGGLEGMRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNTENHRQLATIAHS
Ga0210403_1004077333300020580SoilMATATLSSLQVLRLPDLLVLVKRTAEWARNLRQQNLSAAMIPDEVQQAKALFERARAWPLNEPLLRAEPQWQKPGDVEAAFFALYEALRSQMRAWTLRAPDVFKLGLPESKF
Ga0210403_1063198513300020580SoilMAAAAIGGLEASRLPDLLVLIKQTAECARSRRVLDPRATIVSDEILQAKVLFTRARAWLLGVSTRRAEPKWRRLGDVEAAFFALYDALRSQIRSPLLVTAKHPTTTGDHS
Ga0210399_1013810223300020581SoilMATASIGGQEDLHLPDLLVLIKQTAECARNRRVLDPCAAIVPDEVLAAKVLYTRTRAWLLGESPRRAEPKWRRLADVEAAFFALYDALRSQTRSPHS
Ga0210399_1022273123300020581SoilMATASIAGKQGLHLPDLLVLIKQTAEWARNRRVLDPRAAIVPHEVLEAKVLYTRARAWLLGEGPRRAEPKWRRLAEVEAAFFALYDALRLQNRSPLLMNARNPRTAGSHS
Ga0210395_10000095353300020582SoilMATAPMGGLEGLRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILEAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP
Ga0210401_1022348623300020583SoilMATTSIGGLDGLGLQDLLVLVKQTAERARNCRVLNPRAAMVPDELVEAKVLCARARAWPLGEPQRRAEPKWRRPGDVEATFFALYDALRFHITSAFAVNEKNPAPADTISDY
Ga0210406_1029751823300021168SoilMATTSLGGLEVLRLPELLVLVKRTAEWARNRRVLNPREAMVPDEVKEAKDLYARARAWPLGEPLRRAEPKWQKPGDVEAAFFALYDALRSKMWVRTAVNEKHPTPISTISHFDSL
Ga0210400_1104078213300021170SoilRVNRLFLGENSNVNYIDHSSRSVFLRRRLECPVIFKRRLVMAAAAIGGLEALRLPDLLVLIKQTAECARSRRVLDPDATIVSDEILQAKVLFTRARAWLLGVSPRRAEPKWRRPGDVEAAFFALYDALRSQIRPPLLVTAKPPTTTGDHS
Ga0210405_1066891613300021171SoilMATASIGGQEDLRLPDLLVLIKQTAECARNRRVLDPCAAIVPDEVLAAKVLYTRTRAWLLGESPRRAEPKWRRLADVEAAFFALYDALRSQTRSPHS
Ga0210405_1079130013300021171SoilMATASIGVQEGLHLPDLLVLLKQTAERARKRRVLDPRAAIVPDEVLQAKVLYTRARAWLLAETPRRAEPKWRRLGDVEAAFFALYDALRSQATSPPPLMNAKNPRTAGNPS
Ga0210405_1126549013300021171SoilMAAAATGGLEASRLPDLLVLIKQTAECARSRRVLDRRATIVSDEILQAKVLFTRARTWLLGVSPRRAEPKWRRLGDVEAAFFALYDALRSQIRSPLSVTAKHPTTGDHS
Ga0210408_1023018123300021178SoilMATASIGGQEDLHLPDLLVLIKQTAECARNRRVLDPSAAIVPDEVLAAKVLYTRTRAWLLGESPRRAEPKWRRLGDVEA
Ga0210393_1032192323300021401SoilMATAPIGGLEGMRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP
Ga0210393_1101254913300021401SoilCPVIFKRRLVMAAAAIGGLEALRLPDLLVLIKQTAECARSRRVLDRRATIVSDEILQAKVLFTRARTWLLGVSPRRAEPKWRRLGDVEAAFFALYDALRSQIRSPLSVTAKHPTTADDHS
Ga0210397_1130589813300021403SoilMATASIGGLEGSRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGDSPRRAEPKWRRPGDVEATFFALYDALRSQIP
Ga0210387_1033839423300021405SoilMATAPIGGLEGMRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNSENHRQLATIAHS
Ga0193709_105748723300021411SoilLRGLEVLCLPELLVLIKRTAEWARNRRVLNPREAMVADEVKEAKVLYARARAWPLGAPLRRAEPKWQKPGDVEAAFFALYDALRSKTRVRTRLEREASNPD
Ga0210394_1000497363300021420SoilMATASLSGLEVLRLPDLLILVRQTAESARSRRVLDPSARMIADEVIEAKILYARARTWIAGETQRRAEPKWRRLGEVEAAFFALYDALRSQTTRPFVSNVKNSTPIGNHI
Ga0210394_1178316613300021420SoilMTTTSIGSLDGLGLHDLLLLVKQTAERARNSRVLNPQAAMVPDELMEAKVLCARARAWPLGEPQRRAEPRWRRPGDVEATFFALYDALRFHISSAFVVNGKKNSAG
Ga0210384_10000297783300021432SoilMATTSISGLDGLGLQDLLALVKQTAERARNWRLCNPRAAMVPDELMEARILCARARAWPLEAPPRRAEPKWTRPGDVEATFFALYDALRFHISSAFVVNDKNPSPADTISNNG
Ga0210384_1005453023300021432SoilMATASIGGLEGSRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGDSPRRAEPKWRRPGDVEAAFFALYDALRSQIPAMGEREHSNGNGNPS
Ga0210391_1009431523300021433SoilMGGLEGLRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILEAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP
Ga0210402_1004938923300021478SoilMATAPIGGLEGLRLPDLLVLIKQTAEWARNRRVLDPCAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNAKNIDGWQP
Ga0210402_1031268423300021478SoilMATASMSGLDGLGLEDLLVLVKQTAERARKCRVLNPQAPMVPDEVMEAKALYARARAWPLGEPERRAEPKWRRPGDVEATFFVLFDALRSYARSAFVVNERNPAPTSAISVFDSP
Ga0210402_1118514313300021478SoilMATAPMGGLEGMRLPDLLVLIKQTAEWARNRRVLDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLLMNTENHRQL
Ga0210409_1169779113300021559SoilPMATASIGGQEGLRLPDLLVLIKQTAEGARSRRVLDPRAEIVPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIKSPLSMNAKNPTTTGNLS
Ga0212123_10000537383300022557Iron-Sulfur Acid SpringMAIASINGPEVLRLPDLLLLVKQTAERARNCRVLNPRALMVPDEVLEAKVLYAQARAWPLREPQRRAEPKWRRPGDVEAAFFALYDALRSHIKSAFVV
Ga0247695_100684323300024179SoilSFPNGGVQMATGSFGNLESLRLPDLLVLIKQTAEYARSRRVIDARAAIEADEGLQAKVLYARARAWLQGKSPRRAEPKWRRPGDVEAAFFALYDALRSQIRSPLLVKAKNPTTTSNIS
Ga0247669_101751213300024182SoilMATGSFGNLESLRLPDLLVLIKQTAEYARSRRVIDARAAIEADEGLQAKVLYARARAWLQGKSPRRAEPKWRRPGDVEAAFFALYDALRSQIRSPLLVKAKNPTTTSNIS
Ga0224564_103139413300024271SoilMATAPIGGLEGMRLPDLLVLIKQTAEWARNRRVRDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITS
Ga0224564_103490413300024271SoilMATASLTGLEALRLPDLLVLVKQTAEGARSRRVLDPSARMIPDEVMEAKILYARARTWIAGETQRRAEPKWRRPGEVEAAFFALYDALRSQIRPPFVSNVKNSTPIGNHI
Ga0247671_104377213300024284SoilMATASLGSLESLRLPDLLVLIKQTAERARSRRVIDPRAAIEADEGLQAKVLYTRARAWLLGGSPRRVEPKWRRPGDVEAAFFALYDALRSQIGSPLLGN
Ga0207693_1076094123300025915Corn, Switchgrass And Miscanthus RhizosphereMATTSLSGLEVLRLPELLVLIKRTAEWARNRRVLNPHEAMVPDEVKEAKVLYARARAWPLGEALRRAEPKWRKPGDVEAAFFALYDALRSRMSVRTRREREEPNPG
Ga0207646_1007647343300025922Corn, Switchgrass And Miscanthus RhizosphereMATTSRSGVAVLRLPELLGLIKRTAEWARNRRVLNPREAMVPDEVKDAEILYARARAWPLGEPLRRAEPKWRKPGDVEAAFFALYDALRSRMYVRTRREREEPNPD
Ga0209438_107268723300026285Grasslands SoilPLMATTSRSGLEVLHLPELLVSIKRTAEWARNRRVLNPREAMVPDEVKEAEILYAGAKAWPLGEPLRRAEPKWRKPGDVEAAFFALYEALRSRMYVRARRERKGPGAD
Ga0208366_101819923300027073Forest SoilMATASIGGQEGLHLPDLLVLIKRTAECARNRRVLDPRAEIVPDEVLQAKVLYARARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQIRPPLSMKAKNPTTTGNPG
Ga0209735_102767623300027562Forest SoilMATASIGDLEGLRLPDLLVLIKQTAECARSRRVLDPRAAVVPDEVLQAKVLYTRARAWLLGESPRRVEPKWRRPGDVEAAFFALYDALRSQIKAQLFVNAKNATTTSNLS
Ga0209525_101268933300027575Forest SoilMATASIGSLEGLRLPDLLVLVKQAAERARICRMLDPRAAMVPDEVMEAKVLYSWARAWPLGEPQRRVEPNWRRPGDVEAAFFALYDGLRSSAAQLPWEA
Ga0209625_101194923300027635Forest SoilMATTSLAGLEILRLPELLVLIKRTAEWARNRRVLNSREAMVPDEVKEAEVLYARARAWPLEEPLRRAEPKWRKPGDVEAAFFALYDALDSKMYGPNRGQEEPNGD
Ga0209117_107763923300027645Forest SoilMATTSLRGLEVLRLPELLVLIKRTAEWARNRRVLNPREAMVPDEVKEAKVLYARARAWPLGEPLRRAEPKWRRPGDVEGAFFALYDALRSQITSALVMKANKPNAD
Ga0209772_1002125723300027768Bog Forest SoilMATATLSGLEVLRLPDLLVLVKQTAERARSRRVLDPSAVMIPDEVMEAKILYARARAWIVGETQRRVEPRWRGLGDVEATFFALYDVLRSQISPYSSRTRSTQRPPAS
Ga0209811_1001921613300027821Surface SoilMATASLGSLESLRLPDLLVLIKQTAERARSRRVIDPRAAIEADEGLQAKVLYTRARAWLLGESPRRVEPKWRRPGDVEAAFFALYDALRSQIGSPLLVDAKNPTTTGNYS
Ga0209166_1000096873300027857Surface SoilMATASIISVDGVGLQDLLVLVKQTAERARNYRVRNPQASMAPEEVTEATVLYARARAWPLRDPQRRAEPKWRRPGDVEAAFFALYDALRFHISSAFVVNDKNPAPTSTISSF
Ga0209166_1002816533300027857Surface SoilMATASLRGLPVLRLPDLLVLIKRTAEWARNLRLQNLREAMIPDEVQQAKALFARARAWPLNEPLLRAEPKWRKPGDVEAAFFALYDALRSQMRACTLGKPDVLKLGLPESKS
Ga0209167_1002687843300027867Surface SoilMATASLSGLPVLRLPDLLVLIKRTAEWARNLRLQNLREAMIPDEVQQAKALFARARAWPLNEPLLRAEPKWRKPGDVEAAFFALYDALRSQMRACTLGKPDVLKLGLPESKS
Ga0209380_1017241523300027889SoilMATASIGGLESLRLPDLLVLIKQTAEGARSRRVLDPRAEIVPDEVLQAKVLYARARAWLLGKSPRRAEPKWRRPGDVEAAFFALYDALRSQIKSPLSMNAKNPTTTGNLS
Ga0209380_1080826613300027889SoilATAVIGGLEGLRLADLLVLIKQTAECARVRRVLDPRAAIVPDEVLEATVLYTRARAWLLGESPRRVEPKWRRPGDVEAAFFALYDALRRQTRSPLFMNAKNPTVAGDYS
Ga0308309_1049926523300028906SoilMATASIGGQEGLHLPDLLVLIKRTAECARNRRVLDPRAAIVPDEVLEAKVLYTRARAWLLGESPRRAEPKWRRLGDVEAAFFALYDVLRSQTRSPHS
Ga0308309_1107714613300028906SoilMATASIGGLEGSRLPDLLVLIKQTAECARSRRVHDPRAAILPDEVLQAKVLYARARAWLLGDSPRRAEPKWQRPGDVEATFFALYDALRSQIPAMGEREQPNGSLS
Ga0222749_1000416293300029636SoilLGLQDLLALVKQTAERARNWRLCNPRAAMVPDELMEARILCARARAWPLEAPPRRAEPKWTRPGDVEATFFALYDALRFHISSAFVVNDKNPSPADTISNNG
Ga0222749_1015522723300029636SoilMAAASIGGQEGLRLSDLLVLIKQTAERARNRRVLDPRATILPDEVLQAKVLYTRARAWLLGESPRRVEPKWRRLGDVEAAFFALYDALRSQTRSSHS
Ga0074037_164022913300030803SoilMATASIGGLEGLRLPDLLVLIKQTAERARSQRVLNPRAIIVNDEIQQAKALFTRARAWLLGVSPRRAEPKWQRPGDVEAAFFALYDALRSQIKAQLFVNAKNPTTISNRT
Ga0265758_10613513300030884SoilMATASIGGLEGLHLSDLLVLIKQTAEGARNRRVLDPRATIVPDEVLQAKVLYTRARAWLLGESPRRAEPKWRRPGDVEAAFFALYDALRSQTSYHAKNPTMVGNHS
Ga0074034_1093491713300030950SoilVIQRRDFSLDLQINRLMAAAAMGGLEASRLPDLLVLIKQTAECARSRRVLDPRATIVSDEILQAKVLFTRARAWLLGVSPRRVEPKWRRLGDVEAAFFALYDALRSQIRSPLLVTAKQPTTTGDHS
Ga0265779_10543113300031043SoilMATASISGLEGLRLPDLLVLIKQTAECARSQRVLNPRATIVNDEIQQAKVLFTRARAWLLGVSPRRVEPKWRRPGDVEAAFFALYDALRSQIKAQLFVNAKNATTTSNLS
Ga0170834_10032872313300031057Forest SoilMATASRSGLQALRLPDLLALIKRTAERARNLRVQNPREAMVPDEVQQAKTLYAQARAWPLNEALQRAEPKWRKPGDVEAAFFALYDGLRSLMK
Ga0170823_1352384213300031128Forest SoilMATTSLGGLEVLRLPELLVLIKRTAEWARNRRVLNPREAMVPDEVKDAEILYATARSLPLGEPLRRAEPKWRKPGDVEAAFFALYDALRSRMYVRTRREREEPNPD
Ga0170824_10175795923300031231Forest SoilMATASRSGLQALRLPDLLALIKRTAERARNLRVQNPREAMVPDEVQQAKTLYAQARAWPLNEALQRAEPKWRKPGDVEAAFFALYDGLRSLTKARTRLDLREPNAD
Ga0265340_1012128443300031247RhizosphereMATASLCALEVLRLPDLLVLVKRTAESARSRRVLDPSAIMIPDEVMEAKILYARVRARIVGETQRRAEPKWRRLGDVEAAFFALNDALRS
Ga0310686_10968215023300031708SoilLAQGLARSIRVKPRKVPSFQNGRLPMATASLSGLEVLRLPDLLILVRQTAESARSRRVLDPSARMIPDEVMEAKILYARARTWIAGETQRRAEPKWRRPGEVEAAFFALYDALRSQIRPPFVSNVKNSTPIGNHI
Ga0307474_1053493823300031718Hardwood Forest SoilMETVSIGDLEGLRLRELLVLIKQTAECARTRRVLDPRAAVVPDEVRQAKVLYTRARAWLLGESPRRAEPKWRGPGDVEAAFFALYEVLRSQISAPLLVNAKNPTATASHS
Ga0307475_1010332333300031754Hardwood Forest SoilMATAALSGVQAVRLPDLLVLIKRTAEWARNLRVQNPREALVADEVQQAEALYARARAWPLNEPLQRAEPKWRKPGDVEAAFFALYDGLRSLPETRTRLDRSHTPISSYI
Ga0316049_12477713300031866SoilMATAPIGGLEGMRLPDLLVLIKQTAEWARNRRVRDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLSMNSENHRRLATIAHS
Ga0307479_1090173113300031962Hardwood Forest SoilMATAALSGVQAVRLPDLLVLIKRTAEWARNLRVQNPREALVADEVQQAEALYARARAWPLNEPLQRAEPKWRKPGDVEAAFFALYDGLRSLPEALTRLDQREPYAD
Ga0316051_103385613300032119SoilGISSADEFERRSVMAAAAMGGLEASRLPDLLVLIKQTAECARSRRVLDPRATIVSDEILQAKVLFTRARAWLLGVSPRRVEPKWRRLGDVEAAFFALYDALRSQIRSPLLVTAKQPTTTGDHS
Ga0307470_1130384623300032174Hardwood Forest SoilMATASLSGLPVLRLPDLLVLIKRTAERAHNLRLQSLTAAMIPDEVQQAKELFARARAWPLHEPLLRAEPKWQKPGDVEAAFFALYDALRSQMRAWTLRAPDVFKRSWPESKF
Ga0348332_1095269813300032515Plant LitterMATAPIGGMEGMRLPDLLVLIKQTAEWARNRRVRDPRAAIVPDEILQAKVLYTRARAWLLGESPRRAEPKWRRLGEVEAAFFALYDALRSQITSPLSMNSENHRRLATIAHS
Ga0348332_1445936313300032515Plant LitterSKGISSADEFERRSVMAAAAMGGLEASRLPDLLVLIKQTAECARSRRVLDPRATIVSDEILQAKVLFTRARAWLLGVSPRRVEPKWRRLGDVEAAFFALYDALRSQIRSPLLVTAKQPTTTGDHS
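
Each row of the Family Sequences listing above runs its four fields together without separators. The following is a minimal Python sketch of how such a row might be split back into fields. It assumes a layout inferred from the rows themselves and not documented by NMPFamsDB: a protein ID, then a 10-digit sample taxon ID, then a habitat label that ends in a lowercase letter, then an uppercase amino-acid sequence with an optional trailing `*`. The names `FamilyRow`, `parse_row`, and `to_fasta` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Assumed row layout (inferred from the listing, not an official format):
#   <protein ID><10-digit taxon ID><habitat label><SEQUENCE>[*]
ROW_RE = re.compile(
    r"^(?P<protein_id>.+?)"                  # shortest prefix that lets the rest match
    r"(?P<taxon_id>\d{10})"                  # assumption: taxon IDs are always 10 digits
    r"(?P<habitat>[A-Z][A-Za-z,\- ]*[a-z])"  # label ends at its last lowercase letter
    r"(?P<sequence>[A-Z]+\*?)$"              # residues, optional '*' terminator
)


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one flattened row into fields; return None if it does not fit."""
    match = ROW_RE.match(line.strip())
    return FamilyRow(**match.groupdict()) if match else None


def to_fasta(row: FamilyRow) -> str:
    """Render a parsed row as a FASTA record, dropping any trailing '*'."""
    header = f">{row.protein_id} taxon={row.taxon_id} habitat={row.habitat}"
    return f"{header}\n{row.sequence.rstrip('*')}\n"


# One row copied from the listing above.
example = (
    "Ga0058899_1228853523300004631Forest Soil"
    "MATASIGGQEGLHLPDLLVLIKRTAECARNRRVLDPRAAIVPDEVLEAKVLYTRARAWLLGESPRRAEPKW"
    "RRLGDVEAAFFALYDALRSQATSSHS*"
)

if __name__ == "__main__":
    parsed = parse_row(example)
    if parsed is not None:
        print(to_fasta(parsed), end="")
```

The split relies on the sequence being entirely uppercase and the habitat label ending in a lowercase letter. A habitat label that ended in a capital letter or digit would be split incorrectly, so a parsed listing is worth spot-checking against the site's own download files.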




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice