NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F052771

Metagenome / Metatranscriptome Family F052771

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F052771
Family Type Metagenome / Metatranscriptome
Number of Sequences 142
Average Sequence Length 137 residues
Representative Sequence MNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAMENGDGPALYLKPLVEREKKTGPAAIPRATGSATHKP
Number of Associated Samples 81
Number of Associated Scaffolds 142

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 87.30 %
% of genes near scaffold ends (potentially truncated) 21.13 %
% of genes from short scaffolds (< 2000 bps) 50.70 %
Associated GOLD sequencing projects 73
AlphaFold2 3D model prediction Yes
3D model pTM-score0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (88.028 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(40.141 % of family members)
Environment Ontology (ENVO) Unclassified
(57.746 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(85.915 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 33.73%    β-sheet: 0.00%    Coil/Unstructured: 66.27%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 142 Family Scaffolds
PF01425Amidase 27.46
PF00072Response_reg 22.54
PF14332DUF4388 6.34
PF01584CheW 2.82
PF01739CheR 2.82
PF02518HATPase_c 2.82
PF00202Aminotran_3 1.41
PF14235DUF4337 1.41
PF05201GlutR_N 1.41
PF00032Cytochrom_B_C 0.70
PF02633Creatininase 0.70
PF12697Abhydrolase_6 0.70
PF01339CheB_methylest 0.70
PF07969Amidohydro_3 0.70
PF10047DUF2281 0.70
PF03705CheR_N 0.70

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 142 Family Scaffolds
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 27.46
COG1352Methylase of chemotaxis methyl-accepting proteinsSignal transduction mechanisms [T] 7.04
COG2226Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenGCoenzyme transport and metabolism [H] 2.82
COG0373Glutamyl-tRNA reductaseCoenzyme transport and metabolism [H] 1.41
COG2201Chemotaxis response regulator CheB, contains REC and protein-glutamate methylesterase domainsSignal transduction mechanisms [T] 1.41
COG1290Cytochrome b subunit of the bc complexEnergy production and conversion [C] 0.70
COG1402Creatinine amidohydrolase/Fe(II)-dependent FAPy formamide hydrolase (riboflavin and F420 biosynthesis)Coenzyme transport and metabolism [H] 0.70


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms88.03 %
UnclassifiedrootN/A11.97 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001593|JGI12635J15846_10021687All Organisms → cellular organisms → Bacteria5260Open in IMG/M
3300002245|JGIcombinedJ26739_100356042All Organisms → cellular organisms → Bacteria → Acidobacteria1344Open in IMG/M
3300003351|JGI26346J50198_1000099All Organisms → cellular organisms → Bacteria4128Open in IMG/M
3300003505|JGIcombinedJ51221_10311035All Organisms → cellular organisms → Bacteria → Acidobacteria641Open in IMG/M
3300004080|Ga0062385_10064507All Organisms → cellular organisms → Bacteria1645Open in IMG/M
3300004091|Ga0062387_101625869All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium522Open in IMG/M
3300004092|Ga0062389_101459001All Organisms → cellular organisms → Bacteria → Acidobacteria868Open in IMG/M
3300004092|Ga0062389_102261800All Organisms → cellular organisms → Bacteria → Acidobacteria716Open in IMG/M
3300004117|Ga0058893_1298584All Organisms → cellular organisms → Bacteria → Acidobacteria547Open in IMG/M
3300004120|Ga0058901_1038314All Organisms → cellular organisms → Bacteria → Acidobacteria1501Open in IMG/M
3300004152|Ga0062386_100747992All Organisms → cellular organisms → Bacteria → Acidobacteria804Open in IMG/M
3300004631|Ga0058899_10078031All Organisms → cellular organisms → Bacteria2091Open in IMG/M
3300004631|Ga0058899_10128942All Organisms → cellular organisms → Bacteria → Acidobacteria1513Open in IMG/M
3300004631|Ga0058899_10163063All Organisms → cellular organisms → Bacteria → Acidobacteria1560Open in IMG/M
3300004635|Ga0062388_100382634All Organisms → cellular organisms → Bacteria1215Open in IMG/M
3300004635|Ga0062388_100852311All Organisms → cellular organisms → Bacteria → Acidobacteria870Open in IMG/M
3300005534|Ga0070735_10019341All Organisms → cellular organisms → Bacteria4942Open in IMG/M
3300005541|Ga0070733_10038743All Organisms → cellular organisms → Bacteria2973Open in IMG/M
3300005591|Ga0070761_10002312All Organisms → cellular organisms → Bacteria11703Open in IMG/M
3300005591|Ga0070761_10024455All Organisms → cellular organisms → Bacteria → Acidobacteria3368Open in IMG/M
3300005591|Ga0070761_10195079All Organisms → cellular organisms → Bacteria → Acidobacteria1198Open in IMG/M
3300005602|Ga0070762_10188417All Organisms → cellular organisms → Bacteria → Acidobacteria1256Open in IMG/M
3300005602|Ga0070762_10199493All Organisms → cellular organisms → Bacteria → Acidobacteria1224Open in IMG/M
3300005610|Ga0070763_10161104All Organisms → cellular organisms → Bacteria → Acidobacteria1178Open in IMG/M
3300005712|Ga0070764_10292711All Organisms → cellular organisms → Bacteria → Acidobacteria938Open in IMG/M
3300006174|Ga0075014_100516419All Organisms → cellular organisms → Bacteria → Acidobacteria671Open in IMG/M
3300006176|Ga0070765_100025687All Organisms → cellular organisms → Bacteria4490Open in IMG/M
3300006176|Ga0070765_100049112All Organisms → cellular organisms → Bacteria3431Open in IMG/M
3300006176|Ga0070765_100347986All Organisms → cellular organisms → Bacteria1377Open in IMG/M
3300006804|Ga0079221_10710560All Organisms → cellular organisms → Bacteria702Open in IMG/M
3300006893|Ga0073928_10000531All Organisms → cellular organisms → Bacteria88275Open in IMG/M
3300007982|Ga0102924_1000012All Organisms → cellular organisms → Bacteria231435Open in IMG/M
3300010343|Ga0074044_10367430All Organisms → cellular organisms → Bacteria → Acidobacteria943Open in IMG/M
3300010343|Ga0074044_10474610All Organisms → cellular organisms → Bacteria → Acidobacteria818Open in IMG/M
3300010379|Ga0136449_100017506All Organisms → cellular organisms → Bacteria18669Open in IMG/M
3300010858|Ga0126345_1140906All Organisms → cellular organisms → Bacteria → Acidobacteria679Open in IMG/M
3300011120|Ga0150983_12169607All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium959Open in IMG/M
3300011120|Ga0150983_15339521All Organisms → cellular organisms → Bacteria → Acidobacteria595Open in IMG/M
3300011120|Ga0150983_16128974All Organisms → cellular organisms → Bacteria → Acidobacteria1504Open in IMG/M
3300012202|Ga0137363_10008877All Organisms → cellular organisms → Bacteria → Acidobacteria6357Open in IMG/M
3300017924|Ga0187820_1002471All Organisms → cellular organisms → Bacteria4211Open in IMG/M
3300017930|Ga0187825_10147667All Organisms → cellular organisms → Bacteria → Acidobacteria830Open in IMG/M
3300017936|Ga0187821_10512871All Organisms → cellular organisms → Bacteria → Acidobacteria501Open in IMG/M
3300018007|Ga0187805_10004643All Organisms → cellular organisms → Bacteria5461Open in IMG/M
3300020579|Ga0210407_10013799All Organisms → cellular organisms → Bacteria6010Open in IMG/M
3300020579|Ga0210407_10019295All Organisms → cellular organisms → Bacteria5063Open in IMG/M
3300020579|Ga0210407_10054998All Organisms → cellular organisms → Bacteria → Acidobacteria2974Open in IMG/M
3300020579|Ga0210407_10244691All Organisms → cellular organisms → Bacteria → Acidobacteria1397Open in IMG/M
3300020579|Ga0210407_10297014All Organisms → cellular organisms → Bacteria → Acidobacteria1261Open in IMG/M
3300020579|Ga0210407_10549803All Organisms → cellular organisms → Bacteria → Acidobacteria901Open in IMG/M
3300020579|Ga0210407_10895864All Organisms → cellular organisms → Bacteria → Acidobacteria680Open in IMG/M
3300020580|Ga0210403_10209625All Organisms → cellular organisms → Bacteria → Acidobacteria1599Open in IMG/M
3300020581|Ga0210399_10008478All Organisms → cellular organisms → Bacteria → Acidobacteria8028Open in IMG/M
3300020581|Ga0210399_10232692All Organisms → cellular organisms → Bacteria → Acidobacteria1535Open in IMG/M
3300020581|Ga0210399_10376538All Organisms → cellular organisms → Bacteria → Acidobacteria1185Open in IMG/M
3300020581|Ga0210399_10429632All Organisms → cellular organisms → Bacteria → Acidobacteria1101Open in IMG/M
3300020582|Ga0210395_10000019All Organisms → cellular organisms → Bacteria246945Open in IMG/M
3300020582|Ga0210395_10004103All Organisms → cellular organisms → Bacteria11195Open in IMG/M
3300020582|Ga0210395_10093917All Organisms → cellular organisms → Bacteria → Acidobacteria2215Open in IMG/M
3300020582|Ga0210395_10244897All Organisms → cellular organisms → Bacteria → Acidobacteria1347Open in IMG/M
3300020583|Ga0210401_10269596All Organisms → cellular organisms → Bacteria1562Open in IMG/M
3300021168|Ga0210406_10228612All Organisms → cellular organisms → Bacteria1534Open in IMG/M
3300021171|Ga0210405_10002600All Organisms → cellular organisms → Bacteria18580Open in IMG/M
3300021171|Ga0210405_10003107All Organisms → cellular organisms → Bacteria16572Open in IMG/M
3300021171|Ga0210405_10134880All Organisms → cellular organisms → Bacteria1953Open in IMG/M
3300021178|Ga0210408_10057815All Organisms → cellular organisms → Bacteria3030Open in IMG/M
3300021181|Ga0210388_10058703All Organisms → cellular organisms → Bacteria3222Open in IMG/M
3300021181|Ga0210388_10217493All Organisms → cellular organisms → Bacteria → Acidobacteria1673Open in IMG/M
3300021181|Ga0210388_11552954All Organisms → cellular organisms → Bacteria → Acidobacteria551Open in IMG/M
3300021401|Ga0210393_10246292All Organisms → cellular organisms → Bacteria → Acidobacteria1449Open in IMG/M
3300021401|Ga0210393_10281433All Organisms → cellular organisms → Bacteria → Acidobacteria1350Open in IMG/M
3300021401|Ga0210393_10948339All Organisms → cellular organisms → Bacteria → Acidobacteria697Open in IMG/M
3300021407|Ga0210383_10000008All Organisms → cellular organisms → Bacteria → Acidobacteria286884Open in IMG/M
3300021407|Ga0210383_10054789All Organisms → cellular organisms → Bacteria3325Open in IMG/M
3300021407|Ga0210383_10763473All Organisms → cellular organisms → Bacteria → Acidobacteria829Open in IMG/M
3300021420|Ga0210394_10011867All Organisms → cellular organisms → Bacteria8518Open in IMG/M
3300021420|Ga0210394_10600232All Organisms → cellular organisms → Bacteria → Acidobacteria967Open in IMG/M
3300021432|Ga0210384_10021433All Organisms → cellular organisms → Bacteria6200Open in IMG/M
3300021432|Ga0210384_10060092All Organisms → cellular organisms → Bacteria → Acidobacteria3436Open in IMG/M
3300021433|Ga0210391_10175580All Organisms → cellular organisms → Bacteria → Acidobacteria1686Open in IMG/M
3300021474|Ga0210390_10082403All Organisms → cellular organisms → Bacteria2669Open in IMG/M
3300021474|Ga0210390_10507528All Organisms → cellular organisms → Bacteria → Acidobacteria1015Open in IMG/M
3300021475|Ga0210392_11119237All Organisms → cellular organisms → Bacteria → Acidobacteria590Open in IMG/M
3300021478|Ga0210402_10127304All Organisms → cellular organisms → Bacteria → Acidobacteria2306Open in IMG/M
3300021478|Ga0210402_10496304All Organisms → cellular organisms → Bacteria → Acidobacteria1134Open in IMG/M
3300021478|Ga0210402_11111534Not Available718Open in IMG/M
3300021478|Ga0210402_11210893All Organisms → cellular organisms → Bacteria → Acidobacteria682Open in IMG/M
3300021479|Ga0210410_10006420All Organisms → cellular organisms → Bacteria10225Open in IMG/M
3300021479|Ga0210410_10068249All Organisms → cellular organisms → Bacteria3115Open in IMG/M
3300021479|Ga0210410_10403537All Organisms → cellular organisms → Bacteria → Acidobacteria1224Open in IMG/M
3300021559|Ga0210409_10022423All Organisms → cellular organisms → Bacteria6122Open in IMG/M
3300022533|Ga0242662_10324129All Organisms → cellular organisms → Bacteria → Acidobacteria519Open in IMG/M
3300022557|Ga0212123_10000270All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia176536Open in IMG/M
3300022557|Ga0212123_10011277All Organisms → cellular organisms → Bacteria12325Open in IMG/M
3300024227|Ga0228598_1004240All Organisms → cellular organisms → Bacteria → Acidobacteria3038Open in IMG/M
3300024271|Ga0224564_1016608All Organisms → cellular organisms → Bacteria → Acidobacteria1304Open in IMG/M
3300026551|Ga0209648_10237207All Organisms → cellular organisms → Bacteria → Acidobacteria1361Open in IMG/M
3300027069|Ga0208859_1046120All Organisms → cellular organisms → Bacteria → Acidobacteria510Open in IMG/M
3300027660|Ga0209736_1005717All Organisms → cellular organisms → Bacteria4081Open in IMG/M
3300027698|Ga0209446_1000094All Organisms → cellular organisms → Bacteria24035Open in IMG/M
3300027729|Ga0209248_10231244All Organisms → cellular organisms → Bacteria → Acidobacteria541Open in IMG/M
3300027853|Ga0209274_10025164All Organisms → cellular organisms → Bacteria → Acidobacteria2730Open in IMG/M
3300027853|Ga0209274_10031789All Organisms → cellular organisms → Bacteria2451Open in IMG/M
3300027884|Ga0209275_10331651All Organisms → cellular organisms → Bacteria → Acidobacteria850Open in IMG/M
3300027905|Ga0209415_10003353All Organisms → cellular organisms → Bacteria29788Open in IMG/M
3300027908|Ga0209006_10046799All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3893Open in IMG/M
3300027986|Ga0209168_10013857All Organisms → cellular organisms → Bacteria → Acidobacteria4703Open in IMG/M
3300028906|Ga0308309_10002058All Organisms → cellular organisms → Bacteria11973Open in IMG/M
3300028906|Ga0308309_10075709All Organisms → cellular organisms → Bacteria → Acidobacteria2526Open in IMG/M
3300028906|Ga0308309_10473586All Organisms → cellular organisms → Bacteria → Acidobacteria1080Open in IMG/M
3300028906|Ga0308309_11044687All Organisms → cellular organisms → Bacteria → Acidobacteria707Open in IMG/M
3300029636|Ga0222749_10029468All Organisms → cellular organisms → Bacteria2323Open in IMG/M
3300030743|Ga0265461_12102060All Organisms → cellular organisms → Bacteria → Acidobacteria649Open in IMG/M
3300031240|Ga0265320_10230349All Organisms → cellular organisms → Bacteria → Acidobacteria824Open in IMG/M
3300031708|Ga0310686_102043763All Organisms → cellular organisms → Bacteria9629Open in IMG/M
3300031708|Ga0310686_102134238All Organisms → cellular organisms → Bacteria → Acidobacteria699Open in IMG/M
3300031708|Ga0310686_104863660All Organisms → cellular organisms → Bacteria → Acidobacteria1999Open in IMG/M
3300031708|Ga0310686_108390857All Organisms → cellular organisms → Bacteria → Acidobacteria2182Open in IMG/M
3300031708|Ga0310686_115208803All Organisms → cellular organisms → Bacteria → Acidobacteria3219Open in IMG/M
3300031715|Ga0307476_10023302All Organisms → cellular organisms → Bacteria4049Open in IMG/M
3300031718|Ga0307474_10026798All Organisms → cellular organisms → Bacteria4213Open in IMG/M
3300031718|Ga0307474_10167337All Organisms → cellular organisms → Bacteria → Acidobacteria1664Open in IMG/M
3300031720|Ga0307469_10919088All Organisms → cellular organisms → Bacteria → Acidobacteria812Open in IMG/M
3300031823|Ga0307478_10597966All Organisms → cellular organisms → Bacteria → Acidobacteria923Open in IMG/M
3300031823|Ga0307478_11405602All Organisms → cellular organisms → Bacteria → Acidobacteria579Open in IMG/M
3300032180|Ga0307471_100886809All Organisms → cellular organisms → Bacteria → Acidobacteria1059Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil40.14%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil14.08%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil11.27%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil9.15%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.23%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment2.82%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring2.82%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.11%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil1.41%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil1.41%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.70%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil0.70%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.70%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.70%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.70%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.70%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere0.70%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil0.70%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300003351Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM2EnvironmentalOpen in IMG/M
3300003505Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)EnvironmentalOpen in IMG/M
3300004080Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004117Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF222 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004120Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF238 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004152Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3EnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004635Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300006174Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300007982Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaGEnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300010858Boreal forest soil eukaryotic communities from Alaska, USA - C3-2 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300017924Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_5EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300018007Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021402Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021433Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-OEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022533Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300024227Spruce rhizosphere microbial communities from Bohemian Forest, Czech Republic - CZU4Host-AssociatedOpen in IMG/M
3300024271Soil microbial communities from Bohemian Forest, Czech Republic ? CSU5EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027069Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF002 (SPAdes)EnvironmentalOpen in IMG/M
3300027096Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF043 (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027698Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027729Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1 (SPAdes)EnvironmentalOpen in IMG/M
3300027795Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027853Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027905Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030743Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VCO Co-assemblyEnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031240Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-8-27 metaGHost-AssociatedOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1002168743300001593Forest SoilMNGKRSSPISRREFARRAAIVSAVSMVPTSALPARPSIEEPPLIQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDAQKSDLRRLCFAAQPSLDHLRVYEIENGDGPALYLKPLVEREKKTGPPVIPRAAGAAIQKP*
JGIcombinedJ26739_10035604223300002245Forest SoilMNGKNSSPISRREFARRAAIVSAVSIVPAGALPLHSSTPESAQQQTSATSSLSIESQAEAEVRYQAILAVYGSRFSDTQKADLRRLCSAAQPSLDRLRAYSIENGDGPALYLKPLVEREKKTAAIPRAANPAVKKP*
JGI26346J50198_100009933300003351Bog Forest SoilMNGKSGTSLSRREFARRAAIVSAASMVPARALPADSSSADPRLTQAPGTPALSPEGQTEAQTRYQAILALYGSRFSEAQKTDLRRLCFLAQEPLDHLRSYKIENGDDPALYFKPLVEKEKKPEAAATSRAAGQAAAKP*
JGIcombinedJ51221_1002171943300003505Forest SoilMDGKSSSPITRREFARRAAMVSAVTMVPAGALAVHSPSAGPPVTQTPDSPSLSAEGKAEAEARYQAILAVYGARFSETQKADLRRLSYQAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEPVA
JGIcombinedJ51221_1031103513300003505Forest SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYKIENGDGPALYFKPLVEREKKPEA
Ga0062385_1006450723300004080Bog Forest SoilMDGKNGSSISRREFARRAAIVSAVSLVPARGLPTEPSNVEPGLAQVTGTPALSPEGQAEAQARYQTILATYGSRFSDAQKIELRRLCFLAQEPLDHLRAYPIENGDGPALYLKPLMEREKKPEAVTAPHPVGQAAAKP*
Ga0062387_10017642023300004091Bog Forest SoilMNGKSSSSISRREFARRAAIVSAATMVPPNALAVPSPGAVPPTTQTPDSPSLSAEGKAEAEARYQAILAAYGPRFSETQKADLRRLSYEAQEPLDRLRAYSITNGDGPALYLKPLVEREKKTEPAVIPHAASAATTKP*
Ga0062387_10162205713300004091Bog Forest SoilMDGKTNSSISRREFARRVAIVSAATMVPADALAVPSPSDLSAITQTPDSPSLSAEGKAESEARYQAILAVYGARFSETQKADLRRLSYEAQEPLDRLRAYPIANGDGPALYLKPLLEREKKTEPAVMPHVASAATTKP*
Ga0062387_10162586913300004091Bog Forest SoilAIASAVSMVPTRALPTDSLSADPLPAQAPGTPALSRESQAEAEARYQAILAVYGPRFTDTQKTDLRRLCSLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAVVTPRTAGQAATKP
Ga0062389_10145900123300004092Bog Forest SoilMNGKSGLSISRREFARRAAIVSAASMVPASALPVHPPIAEAALKQLPDTPSLSPQSQAEAEARYQAILADYSSRFSDAQKSELRRLSFAAQPSLDHLRAYAIENSDGPALYLKPLLEREKKTEPPVIP
Ga0062389_10226180013300004092Bog Forest SoilMNGKHGSSISRREFARRAAIASAVSMVPTRALPTDSLSADPLPAQAPGTPALSRESQAEAEARYQAILAVYGPRFTDTQKTDLRRLCSLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAVVTPRTAGQAATKP*
Ga0058893_129858413300004117Forest SoilMNGKSGSPFSRREFARRAAIVSAASLVPRSALPAPPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENGDCPALYLKPLVEREKKTGSAAIPRAAGSAAQKP*
Ga0058901_103831423300004120Forest SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENGDGPALYLKPLVERERKAAPAAVPRAAGSATQKP*
Ga0062386_10074799223300004152Bog Forest SoilMDGKSGSSISRREFARGAAIVSAVSMVPESALRVETSSAEPRLTQAPGTPSLSPAGQAESEARFQAILAVYGSRFSEAQKDDLRKLCFSAQEPLDHLRAYPIENSDAPALYLKPLVEREKQPEAAAALRATGQAAAKP*
Ga0058899_1007803133300004631Forest SoilMNGKSGSPISRREFARRAAIVSTVSMVPRSALPAPPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENSDCPALYLKPLVEREKKTGPAAIPRAAGSATQKP*
Ga0058899_1012894223300004631Forest SoilMNGKGGFPISRREFARRAAIVSAVSMVPTSALPARPSMEDPSFDQSSDAPSLSTESKAEAEARYQTILGVYGARFSDTQKSDLRRLCFAAQPSLDRLRAYPIENGDCPALYLKPLVEREKKSGPAAVPRAAGSATQKP*
Ga0058899_1016306323300004631Forest SoilMNGKSGSPISRREFARRAAIVSAVSMVRASALPARPSIVDPSFDQSSDTPSRSTESKAEAEARYQTILGVYGTRFTDTQKSDLRRLCFAAQSSLDHLRAYTIENGDGPALYLKPLVERERKAAPAAVPRAAGSATQKP*
Ga0062388_10038263423300004635Bog Forest SoilMNGKSGLSISRREFARRAAIVSAASMVPASALPVHPPIAEAALKQLPDTPSLSPQSQAEAEARYQAILADYSSRFSDAQKSELRRLSFAAQPSLDHLRAYAIENSDGPALYLKPLLEREKKTEPPVIPHAATPVTKKP*
Ga0062388_10085231113300004635Bog Forest SoilMDGKNGSSISRREFARRAAIVSAVSLVPARGLPTEPSNVEPGLAQVTGTPALSPEGQAEAQARYQTILATYGSRFSDAQKIELRRLCFLAQEPLDHLRAYPIENGDGPALYLKPLMEREKKPEAVTAPHPVGQAAAKP*GNPAMPAEEIF
Ga0070735_1001934133300005534Surface SoilMNGMSGSPISRREFARRAAIVSAASMVPVTGLPVHAATPESPRQQSADTHSLSPESQAEAEARYQTILNVYGSRFSEAQKADLRRLCFSAQAPLDRLRAYTLENGDGSALYLKPLVERDKKPGFAPAPRSASPVAKKP*
Ga0070733_1003874323300005541Surface SoilMNGKNSSHISRREFARRAAIVSAVSIVPANALPDHSLIAPAVSQSPNASSLSPESQSEAEARYQAILGAYGSRFSDEKKADIRRLCFAAQPSLDRLRAYPLENGDSPALYLKPLVEREKKTQSVMIPKAADSTAKKP*
Ga0070761_1000231223300005591SoilMNGKSGFPISRREFARRAAIVSAVSMVPTSALPARRLMEDPSFDQSSDTPSLSTESKAEAEARYETILGIYGTRFTDTQKSDLRRLCFAAQPSLDHLRAYTIENGDCPALYLKPLVEREKKTGPVAVPRAAGSATQKP*
Ga0070761_1002445523300005591SoilMNGKSGSSISRREFARRAAIVSAVSMVPAGAIPARTTISDLSGEQSSGLPALSPESQTEAEARYQSILAVYGARFSEAQKADLRRLCFSAQEPLDHLRAYTVENGDAPALYFKPLVEREKKPEPAAIAHAAQPTPKL*
Ga0070761_1019507923300005591SoilMNGNSNAPISRREFARRAAIVSAVSIVPAGKLPLNSPLPESAQPQTPAASSLTPESQADAEARYQAILAVYGARFSDAQKNELRRLCSAAQPTLDRLRAYSIENGDGPALYLKPLVEREKKSEPAVLVREASPAAKKP*
Ga0070762_1018841723300005602SoilMNGKSSSSISRREFARRAAIASAVSIVPAGALPLHSSIPQSAQQQPSGTLSLSPESQAEAEARYQAILAVYGSRFSDAQKADLRRLCFAAQPSLDRLRAYPIENDDGPALYLKPLVEREKKTEPAAIPRAASPATKKP*
Ga0070762_1019949313300005602SoilSMNGKSGSPISRREFARRAAIVSAVSMVPTNPLAARPSMEHPSLDQSSDTPSLSTESKAEAEARYETILGIYGTRFTDTQKSDLRRLCFAAQPSLDHLRAYTIENGDCPALYLKPLVEREKKTGPVAVPRAAGSATQKP*
Ga0070763_1004383233300005610SoilMDGKSSLPITRREFARRAAMVSAVTMVPAGALAVDSPSAVPPLTQTPDSPSLSVEGKAEAEARYQAILAVYGARFSETQKADLRRLSYEAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEAAAVPRAAETAPKQP*
Ga0070763_1016110423300005610SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAMENGDGPALYLKPLVEREKKTGPAAIARATGSATHKP*
Ga0070764_1029271123300005712SoilMNGKSGFPISRREFARRAAIVSAVSMVPTSALPGRPSMEDPSFDQSSDAPSLSTESKAEAEARYQTILGVYGTRFSDTQKSDLRRLCFAAQPSLDRLRAYTIENGDCPALYLKPLVEREKKTGPVAVPRAAGSATQKP*
Ga0075014_10051641913300006174WatershedsMNGESISHISRREFARRAAIVSAVSMVPATALPGHSSISAPDVGQSPAASSLSPESQAEAEARYQAILGVYGSRFSDVKKTDIRRLCFAAQPSLDRLRAYPLENGDSPALYLKPLVEREKKSQPGMIPQAQNSAAKKP*
Ga0070765_10002568723300006176SoilMNGKSSSSISRREFARRAAIASAVSIVPAGALPLHSSIPQSAQQQPSGTLSLSPESQAEAEARYQAILAVYGSRFSDAQKADLRRLCFAAQPSLDRLRAYPIENGDAPALYLKALVEREKKTEPAAIPRAASPAAKKP*
Ga0070765_10004911233300006176SoilMNGKNSSHISRREFARRAAIVSAVSMVPASTLPGHSLISAPAVAQSPDASSLSPQSQAEAEARYQAILGAYGSRFSDVKKADIRRLCFAAQPSLDRLRAYPLENGDSPALYLKPLVEREKKTQSVMIPKAADSTAKKP*
Ga0070765_10034798623300006176SoilMDGKRDTSNGTSISRREFARRAAIVSAVSMVPASALPVETSSAEPRLTQAPAAPSLSHEGQAESEARYQAILAVYGSRFSDPQKDDLRKLCFSAQDPLDHLRAYPIENSDAPALYLKPLVEREKQPGAATASHAAGQTAVKP*
Ga0079221_1071056013300006804Agricultural SoilRREFARRTAIVSALSFAPSGASSLYSEVGQSAAQQPSNTPSLSPEGQAEAEARFQAILAAYGSRFSDAQKPELRRLCFLSQPPIDRLRAYPIENGDGPALYLKPLVERAKPPAPAAASHKARQSARKSQE*
Ga0073928_10000531253300006893Iron-Sulfur Acid SpringMNGKGGSSISRREFAWRAAIVSAVSMVPTNALPARPSMEGPPPNQSPDTASLSTESKAEAETRYQAILGVYGSRFSDTQKTDLRRLCFAAQPSLDHLRAYAIENGDGPALYLKPLVEREKKTGPAVVPRTTGSATQKP*
Ga0102924_1000012523300007982Iron-Sulfur Acid SpringMNGKSSSAISRREFARRAAIVSAVSLVPPGTSPMYSAVGQPAPQQPSDTPSLSPEGQAEAEARFQAILAVYGSRFSDAQKPELRRLCFTAQAPLSHLRAYAIENGDGLALYLKPLMEREKKPEPAAIPRKASQPAKKP*
Ga0074044_1036743023300010343Bog Forest SoilMNGNSGSSISRREFARRAAIASAVSIVPVGGLTVPASILDKTPEQTPDTPSLSPESQAEAEARFQSILAAYGSRFSDTQKSELRRLSFAAQPSLDHLRAYTIENGDGPALYLKPLLEREKKTGTAAIPHASSPAPNTP*
Ga0074044_1047461023300010343Bog Forest SoilMNGKSSSPISRREFARRAAIVSAVSMVPNTALTAHLPIPESFLKQMPDTFSLSPESQSEAEARYQAILGVYGSRFSDAQKVDLRRLCFAAQPSLDRLRAYAIENGDSPALYLKPLVEREKKTKPASIPQVPSPATKKP*
Ga0136449_100017506143300010379Peatlands SoilMNGKSGSHISRREFARRAAIVSAVSMVPATTLPGHSLTSALAVAQSPDASSLSPESQSEAEARYQAILGVYGSRFSDEKKADLRRLCFAAQPSLDRLRAYPLENGDSPALYLKPLVEREKKTQPGMIPQAADSAAKKP*
Ga0126345_114090623300010858Boreal Forest SoilMNGKSSSPISRREFARRAAIASAVSIVPVGALPLHSSIPESAQQQTSATSSLSPESQAEVEARYQAILAVYGSRFSDAQKTDLRRLCSSAQPSLDRLRAYSIENGDGPALYLKPLV
Ga0150983_1216960713300011120Forest SoilAAIVSAASLVPRSALPAPASMGDPSPEQSSDTPSLSTASKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENADGPALYLKPLVEREKRSGPAAIPRAASSATQKP*
Ga0150983_1533952113300011120Forest SoilISRREFARRAAIVSAVSMVRASALPARPSIADPSFDQSSDTPSRSTESKAEAEARYQTILGVYGTRFTDTQKSDLRRLCFAAQSSLDHLRAYTIENGDGPALYLKPLVERERKAAPAAVPRAAGSATQKP*
Ga0150983_1612897423300011120Forest SoilMNGKGGFPISRREFARRAAIVSAVSMVPTSALPARPSMEDPSFDQSSDAPSLSTESKAEAEARYQTILGVYGARFSDTQKSDLRRLCFAAQPSLDRLRAYPIENCDCPALYLKPLVEREKKSGPAAVPRAAGSATQKP*
Ga0137363_1000887743300012202Vadose Zone SoilMNGKGGSPISRREFAQRAAIVSVASMIPASALPMQSSAAKAPLRQTPDTPSISPESQAEAEARYQAILTVYGSRFSDEQKAELRRLCFAAQPSLDRLRTYTVENSDAPALYLKPLVEREKKIAPAATPHSASPTPKRP*
Ga0187820_100247143300017924Freshwater SedimentLTLANEFVSFALCSASIRFLIWGGSVNSKSGSPISRREFARRAAIASAVALAPPGAPTMYSAVGRPLPQQPSEIPSLSPEGQAESEARYQAILARYGSRFSDSQKSELRRLSFLAQSPLDHLRAYPIENGEGSALYLKPLVEREKNPEHATVSHQSTEPAKKR
Ga0187825_1014766723300017930Freshwater SedimentMNGKSNSAISRREFARRAAIVSAVSLVPPSTLPVRSEAAMPVAQQPADTPLLPPESQAEAEARYQAILAVYGSRFSDKQKPDLRRLCFTAQPPLAHLRAYEIENGDGPALYLKPLMERDRKPEPAANLRKASQSSKKP
Ga0187821_1051287123300017936Freshwater SedimentAVSIVPATALPCHSSISAPDVGQSPDASSLSPESQAEAEARYQAILGVYGSRFSDVKKADVRRLCFAAQPSLDRLRAYPLENGDSPALYLKPLVEREKKSQPGMIPQAQNSAAKKP
Ga0187805_1000464333300018007Freshwater SedimentLTLTNEFVSFALCSASIRFLIWGGSVNSKSGSPISRREFARRAAIASAVALAPPGASTMYSAVGRPLPQQPSEIPSLSPEGQAESEARYQAILARYGSRFSDSQKSELRRLSFLAQSPLDHLRAYPIENGEGSALYLKPLVEREKNPEHATVSHQSTEPAKKR
Ga0210407_1001379973300020579SoilMNGKGGSPISRREFARRAAIVSAVSMVPTSALPARPSTEEPSLEQSSDRPSLSAESQAEAEARYQTILGIYGSRFSDTQKSDLRRLCFAAQASLDHLRAYTIENGDCPALYLKPLVEREKKTGPAAIPRSAGSAIQKP
Ga0210407_1001929533300020579SoilMNNKSASPISRREFARRAAIVSAVSIVPASALPVHSAIAEPALVQSLDTPSLSPESQAEAAARHQAILAVYGSRFSDTQKSELRRLCFAAQPSLDRLRAYSLENGDAPSLYLKPLVEREKKSEPAVIPRSASPATKKP
Ga0210407_1005499833300020579SoilMNGKSSSYISRREFARRAAIVSAASMVPATALPGHSLIAPAVSQSPNASSLSPESQSEAEARYQAILGVYGSRFSDEKKADIRRLCFAAQPSLDRLRAYPLENSDSPALYLKPLVEREKKTQPGMIPQVANSAVKKP
Ga0210407_1024469123300020579SoilMNGKSGTPISRREFARRAAFVSAVSMVPTSALAARPSMGDPSLDQSSDTSSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQASLDHLRAYTIENGDCPALYLKPLVEREKKTGPAAIPRAAGSATQKP
Ga0210407_1029701423300020579SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAMENGDGPALYLKPLVEREKKTGPAAIARATGSATHKP
Ga0210407_1054980323300020579SoilMNGKSGSPISRREFARRAAIVSAASLVPRSALPAPASMGDPSPEQSSDTPSLSTASKAEAEARYQTILGVYGSRFSDAQKSDLRRLCFAAQPSLDQLRAHTIENGDCPALYLKPLVEREKKTGPAAISRVAASATQKP
Ga0210407_1089586413300020579SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYKIENGDGPALYFKPLVEREKKPEAAATSRTAGQAATKP
Ga0210403_1002298643300020580SoilRRAAIVSAVSIVPANALPMHSSVAELSPKDTTDLSSLSPENQAEAEARYQAILGTYGSRFSNAEKSELLRLCLLAQPPLENLRKYAIENSDGPALYLKPLVEREKKPGSTAIPRTAGQAAKKL
Ga0210403_1020962523300020580SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARGLPAEPSHAEPPLAQVPGTPALTPEGQAEAQARYQTILAVYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPAAAATPRAPGQAATRP
Ga0210399_1000847873300020581SoilMNSKSASPISRREFARRAAIVSAVSIVPASALPVPSAIAEPALEQSSDTPSLSPESQAEAASRLQAILAVYGSRFSDTQKSELRRLCFAAQPSLDRLRAYSLENGDAPALYLKPLVEREKKIEPAVIPRSASPATKKP
Ga0210399_1023269223300020581SoilMNGKSGSPISRREFARRAAIVSTVSMVPRSALPAPPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENSDCPALYLKPLVEREKKTGPAAIPRAAGSATQKP
Ga0210399_1037653813300020581SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYKIENGDGPALYFK
Ga0210399_1042963223300020581SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARGLPAEPSHAEPPLAQVPGTPALTPEGQAEAQARYQAILAVYGSRFSDPQKTDLRRLCFLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAAATPRAPGQAATRP
Ga0210395_10000019393300020582SoilMDGKSSSPITRREFARRAAMVSAVTMVPAGALAVHSPSAGPPVTQTPDSPSLSAEGKAEAEARYQAILAVYGARFSETQKADLRRLSYQAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEPVAVPRAAETAPKQP
Ga0210395_1000410323300020582SoilMNGKSGFPISRREFARRAAIVSAVSMVPTSALPARRLMEDPSFDQSSDTPSLSTESKAEAEARYETILGIYGTRFTDTQKSDLRRLCFAAQPSLDHLRAYTIENGDCPALYLKPLVEREKKTGPVAVPRAAGSATQKP
Ga0210395_1009391723300020582SoilMNGKNGTSLSRREFARRAAIVSAASMVPAQALQADSLSPEPRLTQAPDTPALSPKGQAEAEARYQAILAVYGTRFSDAQKTDLRRLCFLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAAATPRAPGQAATRP
Ga0210395_1024489723300020582SoilMDGKNGSSISRREFARRAAIVSAVSMVPATAFPAEVTNTEPWLSQAPGAPSLSPAGQAESEARYQAILAVYGSRFSDAQKTDLRRLCFSAQEPLDHLRAYPIENGDAPALFLKPLVEREKQPGAATAPHAAGQAAAKP
Ga0210401_1026959623300020583SoilMNGKSGSSISRREFARRAAIVSAVSMVPASALPARPSVEDPSPGQSSDTPSLSIESQAEAEARYQAILVVYGSRFSDTQKSDLRRLCFAAQASLDHLRAYTIENGDCPALYLKPLVEREKKTGPAVIPRAARSSTQKP
Ga0210406_1022861223300021168SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENGDGPALYLKPLVEREKKTGPAAVPRATGSATHQP
Ga0210405_10002600143300021171SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYKIENGDGPALYFKPLVEREKKPEAVATSRTAGQAATKP
Ga0210405_1000310723300021171SoilMNGKSGSPISRREFARRAAIVSAVSMVPPAALPARPSTENPPLNQSSDTPSLSTESKAEAEARYQAILAIYGSRFSDMQKADLRRLCYVAQPSLDHLRAYSIENGDAPALYLKPLVEREKKTGPAAIPRAAGSAAQKP
Ga0210405_1013488023300021171SoilMNSESGSHISRREFARRAAIVSAVSIVPASALPMHSSIAEPALEQSPDTPSLSRESQAEAEARYQAILGVYGSRFSVTQKADLRRLCFAAQPSLDRLRAYTIENGDAPALYLKPLVEREKKIAPAAIPHPASPATKKP
Ga0210408_1005781523300021178SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYKIENGDGPALYFKPLVEREKKPEAAVTSRTAGQAATKP
Ga0210388_1005870333300021181SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHFRAYKIENGDGPALYFKPLVEREKKPEAAATSRTAGQAATKP
Ga0210388_1021749313300021181SoilLFRCPVSGTGALMNGKNGTSLSRREFARRAAIVSAASMVPAQALQADSLSPEPRLTQAPDTPALSPKGQAEAEARYQAILAVYGTRFSDAQKTDLRRLCFLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAAATPRAPGQAATRP
Ga0210388_1025020413300021181SoilRGQFLTLTLGDEFGSFALCSSSIHFSTLGASMDGKSSSPITRREFARRAAMVSAVTMVPAGALAVHSPSAGPPVTQTPDSPSLSAEGKAEAEARYQAILAVYGARFSETQKADLRRLSYQAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEPVAVPRAAETAPKQP
Ga0210388_1155295413300021181SoilSIVSLRALLGSILQTGASMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHRRAYAMENGDGPALYLKPLVEREKKTGPAAIARATGSATHKP
Ga0210393_1024629223300021401SoilMNGKSGSPISRREFARRAAIVSAVSLVPASALPAHPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENGDGPALYLKPLVEREKKTGPAAVPRATGSATHQP
Ga0210393_1028143323300021401SoilMNGKGGFPISRREFARRAAIVSAVSMVPTSALPARPSMEDPSFDQSSDAPSLSTESKAEAEARYQTILGVYGARFSDTQKSDLRRLCFAAQPSLDRLRAYPIENGDCPALYLKPLVEREKKSGPAAVPRAAGSATQKP
Ga0210393_1094833923300021401SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAIENGDGPALYLKPLVEREKKTGPAAI
Ga0210385_1051716723300021402SoilMNGKGSSSISRREFARRAAIVSAATMVPADALAMPSPSIVPPTTQTPDKPSLSAEGKAEAEARYQAILAAYGTRFSETQKADLRRLSYEAQEPLDRLRAYTITNGDGPALYFKPLVEREKKTEPSVIPHAASAAND
Ga0210386_1085318923300021406SoilIAGPGCLVTWRGQFLTLTLGDEFGSFALCSSSIHFSTLGASMDGKSSSPITRREFARRAAMVSAVTMVPAGALAVHSPSAGPPVTQTPDSPSLSAEGKAEAEARYQAILAVYGARFSETQKADLRRLSYQAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEPVAVPRAAETAPKQP
Ga0210383_100000081723300021407SoilLTLEDEFGSFALYFLSIYSLTLGASMDGKSSSPITRREFARRAAMVSAVTMVPAGALAAHSPSAVPPFTQTPGLPSLSAEGKAEAEARYQAILAVYGARFSETQKTDLRRLSYEAQEPLDRLRAYTIENGEGPALYLKPLVEREKKTEPAVIPHAASAVPSKP
Ga0210383_1005478923300021407SoilMNGKSSSYISRREFARRAAIVSAASMVPATTLPGHSLIAPAVSQSPNASSLSPESQSEAEARYQAILGVYGSRFSDEKKADIRRLCFAAQPSLDRLRAYPLENSDSPALYLKPLVEREKKTQPGMIPQVANSAVKKP
Ga0210383_1030232623300021407SoilMNGKSSSSISRREFARRAAIVSAATMVPADALAMPSPSIVPPTTQTPDKPSLSAEGKAEAEARYQAILAAYGTRFSETQKADLRRLSYEAQEPLDRLRAYTITNGDGPALYFKPLVEREKKTEPSVIPHAASAAND
Ga0210383_1076347313300021407SoilMDGKNGSSISRREFARRAAIVSAVSMVPATAFPAEVTNTEPWLSQAPGAPSLSPAGQAESEARYQAILAVYGSRFSDAQKTDLRRLCFSAQEPLDHLRAYPIENGDAPALFLKPLVEREKQP
Ga0210394_1001186713300021420SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARGLPAEPSHAEPPLAQVPGTPALTPEGQAEAQARYQAILAVYGSRFSDPQKTDLRRLCFLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAAA
Ga0210394_1060023223300021420SoilMNGKNGTSLSRREFARRAAIVSAASMVPAQALQADSLSPEPRLTQAPDTPALSPKGQAEAEARYQAILAVYGTRFSDAQKTDLRRLCFLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAAA
Ga0210384_1002143373300021432SoilMNGKSGSSISRREFARRAAIVSAVSMVPASALPARPSIEDPSPGQSSDTPSLSIESQAEAEARYQAILGVYGSRFSDTQKSDLRRLCFAAQASLDHLRAYTIENGDCPALYLKPLVEREKKTGPAVIPRAARSSTQKP
Ga0210384_1006009223300021432SoilMNGLNGSPISRREFARRAAIVSAASLVPAPALPLHAANPESPLEQSADKNSSSPEHQAEAEARYQAILGVYGSRFSETQKADLRRLCFTAQEPLDRLRAYAVENGDGPSLYLKPLVEREKKPEAATAPRSAGQAAKHP
Ga0210391_1011068133300021433SoilTRREFARRAAMVSAVTMVPAGALAVHSPSAGPPVTQTPDSPSLSAEGKAEAEARYQAILAVYGARFSETQKADLRRLSYQAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEPVAVPRAAETAPKQP
Ga0210391_1017558013300021433SoilLLALLASILRTGVSMNGKSGFSISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAMENGDGPALYLKPLVEREKKTGPAAIARATGSATHKP
Ga0210390_1008240313300021474SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYKIENGDGPALYFKPLVEREKKPE
Ga0210390_1050752823300021474SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARGLPAEPSHAEPPLAQVPGTPALTPEGQAEAQARYQAILAVYGSRFSDPQKTDLRRLCFLAQEPLDHLRAYTIENSDGPALYFKPLVEREKKPEVVGAPRATSQPATKP
Ga0210392_1111923723300021475SoilMNGKNSSPISRREFARRAAIVSAVSIVPAGALPLHSSTPESAQQQTSATSSLSLESQAEAEVRYQAILAVYGSRFSDTQKADLRRLCSAAQPSLDRLRAYSIENGDGPALYLKPLVEREKKTETAAIPRSANPAVKKP
Ga0210402_10000736303300021478SoilMNGKRGSPISRREFARRAAIVSAVSIVPANALPMHSSVAELSPKDTTDLSSLSPENQAEAEARYQAILGTYGSRFSNAEKSELLRLCLLAQPPLENLRKYAIENSDGPALYLKPLVEREKKPGSTAIPRTAGQAAKKL
Ga0210402_1012730423300021478SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAESRYQTILGVYGSRFSDTQKLDLRRLCFAAQPSLDHLRAYAMENGDCPALYLKPLVEREKKTGPAAIPRATGSATHKL
Ga0210402_1049630423300021478SoilMDGKNGSSISRREFARRAAIVSAVSMVPATAFPAEVTNTEPWLSQAPGAPSLSPAGQAESEARYQAILAVYGSRFSDAQKTDLRRLCFSAQEPLDHLRAYPIENGDAPALFLKPLVEREKQPGAATAPHAAGQAA
Ga0210402_1111153413300021478SoilMDGKRGTSNGTSISRREFARRAAIVSAVSMVPASALPAESSSAEPRLTQAPGTPSLSPEGQAESEARYQAILAVYGARFSDSQKADLRKLCFSAQEPLDHLRAYPIENSDAPALYLKPLVEREKQP
Ga0210402_1121089323300021478SoilSRREFARRAAIVSTVSMVPRSALPAPPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENSDCPALYLKPLVEREKKTGPAAIPRAAGSATQKP
Ga0210410_1000642053300021479SoilMNGKSGTPISRREFARRAAFVSAVSMVPTSALAARPSMGDPFLDQSSDTSSLSTESRAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQASLDHLRAYTIENGDCPALYLKPLVEREKKTGPAAIPRAAGSATQKP
Ga0210410_1006824923300021479SoilMNSKSASPISRREFARRAAIVSAVSIVPASALPVPSAIAEPAPEQSSDTPSLSPESQAEAGARLQAILAVYGSRFSDTQKSELRRLCFAAQPSLDRLRAYSLENGDAPALYLKPLVEREKKIEPAVVPRSASPATKKP
Ga0210410_1040353723300021479SoilMNGKSGSPISRREFARRAAIVSTVSMVPRSALPAPPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDAQKSDLRRLCFAAQPTLDQLRAHTIENGDCPALYLKPLVEREK
Ga0210409_1002242363300021559SoilMNGLNGSPISRREFARRAAIVSAASLVPAPALPLHAANPESPLEQSADKKSSSPEHQAEAEARYQAILGVYGSRFSETQKADLRRLCFTAQEPLDRLRAYAVENGDGPSLYLKPLVEREKKPEAATAPRSAGQAAKHP
Ga0242662_1032412913300022533SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAESRYQTILGVYGSRFSDTQKLDLRRLCFAAQPSLDHLRAYAMENGDCPALYLKPLVE
Ga0212123_100002701003300022557Iron-Sulfur Acid SpringMNGKSSSAISRREFARRAAIVSAVSLVPPGTSPMYSAVGQPAPQQPSDTPSLSPEGQAEAEARFQAILAVYGSRFSDAQKPELRRLCFTAQAPLSHLRAYAIENGDGLALYLKPLMEREKKPEPAAIPRKASQPAKKP
Ga0212123_1001127753300022557Iron-Sulfur Acid SpringMNGKGGSSISRREFAWRAAIVSAVSMVPTNALPARPSMEGPPPNQSPDTASLSTESKAEAETRYQAILGVYGSRFSDTQKTDLRRLCFAAQPSLDHLRAYAIENGDGPALYLKPLVEREKKTGPAVVPRTTGSATQKP
Ga0228598_100424023300024227RhizosphereMNGKSGTSISRREFARRAAIVSAASMVPARALPADSLSAEPRLTQAPGTPALSPQSQEEAQARYQAILAVYGSRFSDAQKTDLRRLCFLAQEPLDQLRAYTIENGDGPALYLKPLMEREKKPEAVTAPRPVGQAAAKP
Ga0224564_101660813300024271SoilTGASMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAMENGDGPALYLKPLVEREKKTGPAAIPRATGSATHKP
Ga0209648_1023720713300026551Grasslands SoilMNGKGGSPISRREFAQRAAIVSVASMIPASALPMQSSAAKAPLRQTPDTPSISPESQAEAEARYQAILTVYGSRFSDEQKAELRRLCFAAQPSLDRLRTYTVENSDAPALYLKPLVEREKKIAPAASPHSASPTPKRP
Ga0208859_104612013300027069Forest SoilMNGKSGTSLSRREFARRAAIVSAVSMVPARALPPDSPSAEPRLTQAPGTPALSPEGQAEGQARYQAILALYGSRFSDAQKTDLRRLCFLAQEPLDHLRAYKIENGDGPALYFKPLVEREK
Ga0208099_105466513300027096Forest SoilDEFGSFALCSSSIHFSTLGASMDGKSSSPITRREFARRAAMVSAVTMVPAGALAVHSPSAGPPVTQTPDSPSLSAEGKAEAEARYQAILAVYGARFSETQKADLRRLSYQAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEPVAVPRAAETAPKQP
Ga0209736_100571733300027660Forest SoilMNGKRSSPISRREFARRAAIVSAVSMVPTSALPARPSIEEPPLIQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDAQKSDLRRLCFAAQPSLDHLRVYEIENGDGPALYLKPLVEREKKTGPPVIPRAAGAAIQKP
Ga0209446_100009423300027698Bog Forest SoilMNGKSGTSLSRREFARRAAIVSAASMVPARALPADSSSADPRLTQAPGTPALSPEGQTEAQTRYQAILALYGSRFSEAQKTDLRRLCFLAQEPLDHLRSYKIENGDDPALYFKPLVEKEKKPEAAATSRAAGQAAAKP
Ga0209248_1023124413300027729Bog Forest SoilMDGKNGSSISRREFARRAAIVSAVSLVPARGLPTEPSNVEPGLAQVTGTPALSPEGQAEAQARYQTILATYGSRFSDAQKIELRRLCFLAQEPLDHLRAYPIENGDGPALYLKPLMEREKKPEAVTAPHPVGQAAAKP
Ga0209139_1000130783300027795Bog Forest SoilMNGKSSSSISRREFARRAAIVSAATMVPPNALAVPSPGAVPPTTQTPDSPSLSAEGKAEAEARYQAILAAYGPRFSETQKADLRRLSYEAQEPLDRLRAYSITNGDGPALYLKPLVEREKKTEPAVIPHAASAATTKP
Ga0209274_1002516433300027853SoilMNGKSGSSISRREFARRAAIVSAVSMVPAGAIPARTTISDLSGEQSSGLPALSPESQTEAEARYQSILAVYGARFSEAQKADLRRLCFSAQEPLDHLRAYTVENGDAPALYFKPLVEREKKPEPAAIAHAAQPTPKL
Ga0209274_1003178943300027853SoilMNGNSNAPISRREFARRAAIVSAVSIVPAGKLPLNSPLPESAQPQTPAASSLTPESQADAEARYQAILAVYGARFSDAQKNELRRLCSAAQPTLDRLRAYSIENGDGPALYLKPLVEREKKSEPAVLVREASPAAKKP
Ga0209693_1004991233300027855SoilMDGKSSLPITRREFARRAAMVSAVTMVPAGALAVDSPSAVPPLTQTPDSPSLSVEGKAEAEARYQAILAVYGARFSETQKADLRRLSYEAQEPLDRLRAYSIENSDGPALYLKPLVERERKTGSAVIPPTPGEAPNKP
Ga0209275_1033165123300027884SoilMNGKSSSSISRREFARRAAIASAVSIVPAGALPLHSSIPQSAQQQPSGTLSLSPESQAEAEARYQAILAVYGSRFSDAQKADLRRLCFAAQPSLDRLRAYPIENDDGPALYLKPLVEREKKTEPAAIPRAASPATKKP
Ga0209275_1038054413300027884SoilSMDGKSSLPITRREFARRAAMVSAVTMVPAGALAVGSPSAVPPLTQTPDSPSLSVEGKAEAEARYQAILAVYGARFSETQKADLRRLSYEAQEPLDRLRAYSIENSDGPALYLKPLVEREKKTEAAAVPRAAETAPKQP
Ga0209415_1000335393300027905Peatlands SoilMNGKSGSHISRREFARRAAIVSAVSMVPATTLPGHSLTSALAVAQSPDASSLSPESQSEAEARYQAILGVYGSRFSDEKKADLRRLCFAAQPSLDRLRAYPLENGDSPALYLKPLVEREKKTQPGMIPQAADSAAKKP
Ga0209006_1004679933300027908Forest SoilMNGKNSSPISRREFARRAAIVSAVSIVPAGALPLHSSTPESAQQQTSATSSLSIESQAEAEVRYQAILAVYGSRFSDTQKADLRRLCSAAQPSLDRLRAYSIENGDGPALYLKPLVEREKKTAAIPRAANPAVKKP
Ga0209168_1001385733300027986Surface SoilMNGMSGSPISRREFARRAAIVSAASMVPVTGLPVHAATPESPRQQSADTHSLSPESQAEAEARYQTILNVYGSRFSEAQKADLRRLCFSAQAPLDRLRAYTLENGDGSALYLKPLVERDKKPGFAPAPRSASPVAKKP
Ga0308309_1000205863300028906SoilMNGKSSSSISRREFARRAAIASAVSIVPAGALPLHSSIPQSAQQQPSGTLSLSPESQAEAEARYQAILAVYGSRFSDAQKADLRRLCFAAQPSLGRLRAYPIENDDGPALYLKPLVEREKKTEPAAIPRAASPATKKP
Ga0308309_1007570923300028906SoilMNGKNSSHISRREFARRAAIVSAVSMVPASTLPGHSLISAPAVAQSPDASSLSPQSQAEAEARYQAILGAYGSRFSDVKKADIRRLCFAAQPSLDRLRAYPLENGDSPALYLKPLVEREKKTQSVMIPKAADSTAKKP
Ga0308309_1047358613300028906SoilMDGKRDTSNGTSISRREFARRAAIVSAVSMVPASALPVETSSAEPRLTQAPAAPSLSHEGQAESEARYQAILAVYGSRFSDPQKDDLRKLCFSAQDPLDHLRAYPIENSDAPALYLKPLVEREKQPGAATASHAAGQTAVKP
Ga0308309_1104468713300028906SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAMENGDGPALYLKPLVEREKKTGPAAIARATGS
Ga0222749_1002946813300029636SoilMNGKSGTPISRREFARRAAFVSAVSMVPTSALAARPSMGDPFLDQSSDTSSLSTESRAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQASLDHLRAYTIENGDCPALELKPLVEREKKTEPAAIPRAAGSATQKP
Ga0265461_1210206013300030743SoilMNGKSGSPISRREFARRAAIVSAVSMVPTSALPARPSMEDPCFDQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKVDLRRLCFAAQPPLDHLREYAIENGDCPALYLKPLVEREKKTGPAAIPRAIGSATHKP
Ga0170834_11218653013300031057Forest SoilMNGKRGSPISRREFARRAAIVSAVSIVPASALPMHSSVAELSPKDTTDLSSLSPENQAEAEARYQAILGTYGSRFSNAEKSELLRLCLLAQPPLENLRKYAIENSDGPALYLKPLVEREKKPGSTAIPRTAGQAAKKP
Ga0265320_1023034923300031240RhizosphereMNGKSSISISRREFARRAAIISAVSMVSAAPLSGHSSIPESGLKQAPDGSTLSRESQAEAEARYQAILGVYGARFSDGQKADLRRLCFAAQPSLDRLRAYTIENGDSPALYLKPLVEREKKAPAAATPKAASSETKKP
Ga0310686_10204376323300031708SoilMNGKSGSPISRREFARRAAIVSAVSMVPASALPARPSMEDPSLEQSSDTPSLSTESKAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPPLDHLRAYAMENGDGPALYLKPLVEREKKTGPAAIPRATGSATHKP
Ga0310686_10213423823300031708SoilMNGKSGSSISRREFARRAAIVSAVSMVPPRALPADSPSGEPLLTQVPGTPTLSPDGQAEAQARYQAILAVYGSRLSDTQQADLRRLCFLAQEPLDHLRAYTIENDDSPALYFKPLMEREKKPEALATPHTAGQAATKP
Ga0310686_10486366023300031708SoilMNGKSGSPISRREFARRAAIVSAVSMVPTSALPARPSMEVPSQNQSSDTPSLSTESQAEAEARYQTILNVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENGDCPALYLKPLVEREKKTASAAIPRAAGSATQKP
Ga0310686_10839085723300031708SoilMNGKSGTSISRREFARRVAIVSAVSMVPARTLPADSLSAEPRLTQAPGTPALSPQSQEEAQARYQAILAVYGSRFSDAQKTDLRRLCFLAQEPLDQLRAYTIENGDGPALYLKPLMEREKKPEAVTAPRPVGQAAAKP
Ga0310686_11520880333300031708SoilMNGKSGTSLSRREFARRAAIVSAASMVPAQALQADSLSPEPRLTQAPDTPALSPKGQAEAEARYQAILAVYGTRFSDAQKTDLRRLCFLAQEPLDHLRAYTIENGDGPALYFKPLVEREKKPEAAATPRAPGQAATRP
Ga0307476_1002330253300031715Hardwood Forest SoilMNGRSNFPISRREFARRAAIVSAVSIVHAGTLPLDSSLPASAQPQTQAQSALSPESQAEAEARYQAILASYGSRFSEAQKVELRRLCSAAQPPLDRLREYTIENGDGPALYLKPLVAREKKPEAALPPQGASPSTKKP
Ga0307474_1002679853300031718Hardwood Forest SoilMNGKSSSHISRREFARRAAVVSAASMLPATALPGHSLIAAPDVSQSPDASSLSPESQSEAEARYQAILGVYGPRFSDEKKADIRRLCFAAQPSLDRLRAYALENGDSPAIYLKPLVEREKKTQPGMIPQAANSAAKKP
Ga0307474_1016733723300031718Hardwood Forest SoilMKGMNGSPISRREFALRAAIVSAASMVPATALPIHAANSESPEQQSADTHSSSPESHAEAEARYQAILGLYGSRFSETQKADLRRLCFTAQEPLDRLRAYPIENGDGPALYLKPLVEREKKSEAAAAPRPAGQAAKKP
Ga0307469_1091908813300031720Hardwood Forest SoilMNGKRSSPISRREFARRAAIVSAVSIVPANALPMHSSVAELSPKDTTDLSSLSPENQAEAEARYQAILGTYGSRFSNAEKSELLRLCLLAQPPLDNLRKYAIENSDGPALYLKPLVEREK
Ga0307478_1059796623300031823Hardwood Forest SoilMNGKSSSHISRREFARRAAVVSAASMVPATALPGHSLVAAPVVSQSPDASSLSPESQSEAEARYQAILGVYGPRFSDEKKADIRRLCFAAQPSLDRLRAYALENGDSPAIYLKPLVEREKKTQPGMIPQTANSAAKKP
Ga0307478_1140560223300031823Hardwood Forest SoilMNGKSGSPISRREFARRAAIVSAVSMVPTSARPARPSMEDPFPDQSSDTPSLSTESQAEAEARYQTILGVYGSRFSDTQKSDLRRLCFAAQPSLDHLRAYTIENGDGPALYLKPLVEREKKAGPAAIPRAAGSATQKP
Ga0307471_10088680923300032180Hardwood Forest SoilMNGKRGSPISRREFARRAAIVSAVSIVPANALPMHSSVAELSPKDTTDLSSLSPENQAEAEARYQAILGTYGSHFSNAEKSELLRLCLLAQPPLDNLRIYAIENSDGPALYLKPL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.