NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F056369


Overview

Basic Information
Family ID F056369
Family Type Metagenome / Metatranscriptome
Number of Sequences 137
Average Sequence Length 173 residues
Representative Sequence MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRINAGLMSGIARYRGRSHHTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRELIEELDAGVRSASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Number of Associated Samples 93
Number of Associated Scaffolds 137

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 63.24 %
% of genes near scaffold ends (potentially truncated) 36.50 %
% of genes from short scaffolds (< 2000 bps) 75.91 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.48


Most Common Taxonomy
Group Unclassified (67.883 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(36.496 % of family members)
Environment Ontology (ENVO) Unclassified
(38.686 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(40.146 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 59.71%, β-sheet: 0.00%, Coil/Unstructured: 40.29%

Predicted 3D Structure

pTM-score: 0.48

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
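
The rule in the note above can be written as a one-line check. This is a minimal sketch, assuming only the cutoff quoted in the note (pTM < 0.7); the function name `is_low_confidence` is hypothetical and is not part of NMPFamsDB.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above

def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < threshold

# pTM-score reported for family F056369
print(is_low_confidence(0.48))
```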



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name             % Frequency in 137 Family Scaffolds
PF00441  Acyl-CoA_dh_1    11.68
PF00535  Glycos_transf_2  5.84
PF02771  Acyl-CoA_dh_N    5.11
PF02770  Acyl-CoA_dh_M    4.38
PF13279  4HBT_2           2.92
PF13549  ATP-grasp_5      2.19
PF13977  TetR_C_6         1.46
PF13442  Cytochrome_CBB3  1.46
PF01593  Amino_oxidase    0.73
PF00873  ACR_tran         0.73
PF08669  GCV_T_C          0.73
PF08402  TOBE_2           0.73
PF00378  ECH_1            0.73
PF05853  BKACE            0.73
PF02537  CRCB             0.73
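
The frequencies above are percentages of the 137 family scaffolds, so they can be converted back to approximate scaffold counts. This is a minimal sketch, assuming each percentage was computed as count / 137 and rounded to two decimals (the page does not state this); `scaffold_count` is a hypothetical helper name.

```python
TOTAL_SCAFFOLDS = 137  # "Number of Associated Scaffolds" for family F056369

def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Return the nearest whole number of scaffolds for a percentage frequency."""
    return round(percent / 100 * total)

# A few rows copied from the Pfam table above
pfam_freq = {
    "PF00441": 11.68,
    "PF00535": 5.84,
    "PF02771": 5.11,
    "PF13977": 1.46,
    "PF01593": 0.73,
}
counts = {pfam: scaffold_count(pct) for pfam, pct in pfam_freq.items()}
print(counts)
```

Under that assumption, a frequency of 0.73 corresponds to a single scaffold.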

Neighboring Clusters of Orthologous Genes (COGs)

COG ID  Name  Functional Category  % Frequency in 137 Family Scaffolds
COG1960  Acyl-CoA dehydrogenase related to the alkylation response protein AidB  Lipid transport and metabolism [I]  21.17
COG0239  Fluoride ion exporter CrcB/FEX, affects chromosome condensation  Cell cycle control, cell division, chromosome partitioning [D]  0.73
COG3246  Uncharacterized conserved protein, DUF849 family  Function unknown [S]  0.73



Phylogeny

NCBI Taxonomy

Name  Rank  Taxonomy  Distribution
Unclassified  root  N/A  67.88 %
All Organisms  root  All Organisms  32.12 %


Associated Scaffolds


Scaffold  Taxonomy  Length
3300002245|JGIcombinedJ26739_100923946  All Organisms → cellular organisms → Bacteria  755
3300002906|JGI25614J43888_10068143  Not Available  1016
3300002917|JGI25616J43925_10003177  All Organisms → cellular organisms → Bacteria → Proteobacteria  6875
3300006893|Ga0073928_10006415  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  15854
3300006893|Ga0073928_10050363  All Organisms → cellular organisms → Bacteria → Proteobacteria  3763
3300006893|Ga0073928_10174691  All Organisms → cellular organisms → Bacteria  1707
3300007255|Ga0099791_10221233  Not Available  895
3300007265|Ga0099794_10008564  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  4257
3300007265|Ga0099794_10420992  Not Available  699
3300009038|Ga0099829_10441777  Not Available  1078
3300009088|Ga0099830_11129606  Not Available  650
3300009143|Ga0099792_10000855  All Organisms → cellular organisms → Bacteria → Proteobacteria  11170
3300010159|Ga0099796_10022972  All Organisms → cellular organisms → Bacteria → Proteobacteria  1941
3300010855|Ga0126355_1024675  Not Available  590
3300010860|Ga0126351_1294638  Not Available  722
3300011120|Ga0150983_14620739  Not Available  603
3300011269|Ga0137392_10037300  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  3575
3300011269|Ga0137392_10141117  All Organisms → cellular organisms → Bacteria → Proteobacteria  1938
3300012189|Ga0137388_10834245  Not Available  854
3300012202|Ga0137363_10006417  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  7315
3300012202|Ga0137363_10286637  Not Available  1348
3300012203|Ga0137399_10892686  Not Available  748
3300012205|Ga0137362_10481299  Not Available  1074
3300012205|Ga0137362_10892301  Not Available  760
3300012359|Ga0137385_10573633  Not Available  950
3300012361|Ga0137360_11241867  Not Available  644
3300012363|Ga0137390_10074082  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  3338
3300012582|Ga0137358_10048936  All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → unclassified Candidatus Omnitrophica → Omnitrophica WOR_2 bacterium RIFCSPHIGHO2_02_FULL_50_17  2813
3300012582|Ga0137358_10524859  Not Available  797
3300012685|Ga0137397_10002264  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  13481
3300012685|Ga0137397_10297938  Not Available  1202
3300012923|Ga0137359_10039165  All Organisms → cellular organisms → Bacteria → Proteobacteria  4098
3300012923|Ga0137359_10063480  All Organisms → cellular organisms → Bacteria → Proteobacteria  3221
3300012923|Ga0137359_10087261  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  2745
3300012927|Ga0137416_10124048  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  1981
3300012927|Ga0137416_10239428  Not Available  1477
3300012929|Ga0137404_10190468  Not Available  1732
3300012929|Ga0137404_10417498  Not Available  1186
3300012930|Ga0137407_11452065  Not Available  652
3300012944|Ga0137410_10032037  All Organisms → cellular organisms → Bacteria → Proteobacteria  3678
3300012944|Ga0137410_10058862  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  2758
3300014501|Ga0182024_10566797  Not Available  1429
3300015241|Ga0137418_10127978  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  2258
3300015264|Ga0137403_10142381  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  2370
3300020140|Ga0179590_1021805  Not Available  1513
3300020199|Ga0179592_10016711  All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Methylophilaceae → Methylobacillus → Methylobacillus rhizosphaerae  3226
3300020199|Ga0179592_10144720  Not Available  1088
3300020580|Ga0210403_10039215  All Organisms → cellular organisms → Bacteria → Proteobacteria  3787
3300020580|Ga0210403_10069752  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  2823
3300020580|Ga0210403_10089545  All Organisms → cellular organisms → Bacteria  2488
3300020580|Ga0210403_10580068  Not Available  907
3300020581|Ga0210399_10338655  Not Available  1256
3300020581|Ga0210399_10589051  Not Available  921
3300020581|Ga0210399_11061361  Not Available  650
3300021151|Ga0179584_1041974  Not Available  593
3300021151|Ga0179584_1108386  Not Available  681
3300021151|Ga0179584_1442754  Not Available  658
3300021151|Ga0179584_1450485  Not Available  638
3300021168|Ga0210406_10018140  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  6628
3300021168|Ga0210406_10329260  Not Available  1236
3300021168|Ga0210406_10584994  Not Available  873
3300021170|Ga0210400_10743853  Not Available  805
3300021170|Ga0210400_10761407  Not Available  795
3300021315|Ga0179958_1257154  Not Available  502
3300021406|Ga0210386_10363037  Not Available  1246
3300021432|Ga0210384_10197017  Not Available  1811
3300021432|Ga0210384_11416398  Not Available  600
3300021478|Ga0210402_10056197  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  3451
3300021479|Ga0210410_10012671  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  7262
3300021559|Ga0210409_10234401  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  1665
3300022504|Ga0242642_1048323  Not Available  658
3300022506|Ga0242648_1035432  Not Available  710
3300022506|Ga0242648_1096081  Not Available  512
3300022508|Ga0222728_1007454  Not Available  1345
3300022509|Ga0242649_1055100  Not Available  568
3300022510|Ga0242652_1039935  Not Available  568
3300022523|Ga0242663_1034461  Not Available  833
3300022528|Ga0242669_1083801  Not Available  595
3300022530|Ga0242658_1164837  Not Available  581
3300022532|Ga0242655_10153334  Not Available  676
3300022532|Ga0242655_10199103  Not Available  612
3300022533|Ga0242662_10077238  Not Available  912
3300022533|Ga0242662_10149514  Not Available  705
3300022533|Ga0242662_10166491  Not Available  676
3300022533|Ga0242662_10179944  Not Available  656
3300022533|Ga0242662_10335857  Not Available  512
3300022557|Ga0212123_10007925  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  16549
3300022724|Ga0242665_10263147  Not Available  591
3300022726|Ga0242654_10261071  Not Available  624
3300022726|Ga0242654_10276342  Not Available  610
3300022726|Ga0242654_10280104  Not Available  607
3300022726|Ga0242654_10311310  Not Available  582
3300024330|Ga0137417_1117942  Not Available  622
3300024347|Ga0179591_1001050  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  3316
3300026319|Ga0209647_1085259  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  1548
3300026320|Ga0209131_1002914  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  11488
3300026482|Ga0257172_1010333  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  1528
3300026551|Ga0209648_10560762  Not Available  636
3300026551|Ga0209648_10727719  Not Available  542
3300026555|Ga0179593_1015579  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  3380
3300026557|Ga0179587_10117744  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  1626
3300026557|Ga0179587_10422299  Not Available  871
3300027908|Ga0209006_10745351  Not Available  798
3300028047|Ga0209526_10467045  Not Available  828
3300028536|Ga0137415_10081139  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  3102
3300028536|Ga0137415_10300569  Not Available  1409
3300028536|Ga0137415_11335847  Not Available  536
3300030730|Ga0307482_1017981  Not Available  1428
3300030839|Ga0073999_11074124  Not Available  611
3300030845|Ga0075397_11133896  Not Available  615
3300030923|Ga0138296_1677640  Not Available  505
3300030937|Ga0138302_1175920  Not Available  630
3300030937|Ga0138302_1243125  Not Available  868
3300031022|Ga0138301_1606807  Not Available  589
3300031023|Ga0073998_11150627  Not Available  591
3300031023|Ga0073998_11607247  Not Available  730
3300031047|Ga0073995_12086335  Not Available  583
3300031057|Ga0170834_101551421  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  1514
3300031057|Ga0170834_105443267  Not Available  1000
3300031128|Ga0170823_17294613  Not Available  1315
3300031231|Ga0170824_125280462  Not Available  1000
3300031236|Ga0302324_100569513  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  1633
3300031469|Ga0170819_13231345  Not Available  565
3300031474|Ga0170818_101451141  Not Available  632
3300031525|Ga0302326_11459965  Not Available  919
3300031590|Ga0307483_1018231  Not Available  672
3300031663|Ga0307484_112312  Not Available  589
3300031663|Ga0307484_114790  Not Available  557
3300031715|Ga0307476_10177767  Not Available  1539
3300031753|Ga0307477_10045337  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  3023
3300031754|Ga0307475_11121332  Not Available  615
3300031823|Ga0307478_10124531  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → unclassified Rhodobacteraceae → Rhodobacteraceae bacterium  2024
3300031962|Ga0307479_10224673  All Organisms → cellular organisms → Bacteria  1852
3300032174|Ga0307470_10001077  All Organisms → cellular organisms → Bacteria → Proteobacteria  11213
3300032180|Ga0307471_104347916  Not Available  500
3300032515|Ga0348332_13098299  Not Available  878




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  36.50%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  36.50%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  8.03%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  4.38%
Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil  4.38%
Iron-Sulfur Acid Spring  Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring  2.92%
Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil  2.92%
Palsa  Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa  1.46%
Boreal Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil  1.46%
Permafrost  Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost  0.73%
Plant Litter  Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter  0.73%




Associated Samples

Taxon OID  Sample Name  Habitat Type
3300002245  Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)  Environmental
3300002906  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm  Environmental
3300002917  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm  Environmental
3300006893  Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG  Environmental
3300007255  Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1  Environmental
3300007265  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1  Environmental
3300009038  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG  Environmental
3300009088  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG  Environmental
3300009143  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2  Environmental
3300010159  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3  Environmental
3300010855  Boreal forest soil eukaryotic communities from Alaska, USA - W1-5 Metatranscriptome (Eukaryote Community Metatranscriptome)  Environmental
3300010860  Boreal forest soil eukaryotic communities from Alaska, USA - C5-2 Metatranscriptome (Eukaryote Community Metatranscriptome)  Environmental
3300011120  Combined assembly of Microbial Forest Soil metaT  Environmental
3300011269  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG  Environmental
3300012189  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG  Environmental
3300012202  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG  Environmental
3300012203  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG  Environmental
3300012205  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG  Environmental
3300012359  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG  Environmental
3300012361  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG  Environmental
3300012363  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG  Environmental
3300012582  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG  Environmental
3300012685  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG  Environmental
3300012923  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG  Environmental
3300012927  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)  Environmental
3300012929  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)  Environmental
3300012930  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)  Environmental
3300012944  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)  Environmental
3300014501  Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)  Environmental
3300015241  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)  Environmental
3300015264  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)  Environmental
3300020140  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly)  Environmental
3300020199  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)  Environmental
3300020580  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M  Environmental
3300020581  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M  Environmental
3300021151  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_06_16RNAfungal (Metagenome Metatranscriptome)  Environmental
3300021168  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M  Environmental
3300021170  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M  Environmental
3300021315  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_2_06_16RNAfungal (Metagenome Metatranscriptome)  Environmental
3300021406  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O  Environmental
3300021432  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M  Environmental
3300021478  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M  Environmental
3300021479  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M  Environmental
3300021559  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M  Environmental
3300022504  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-2-M (Metagenome Metatranscriptome) (v2)  Environmental
3300022506  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-26-M (Metagenome Metatranscriptome) (v2)  Environmental
3300022508  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-19-M (Metagenome Metatranscriptome)  Environmental
3300022509  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-O (Metagenome Metatranscriptome) (v2)  Environmental
3300022510  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-14-M (Metagenome Metatranscriptome) (v2)  Environmental
3300022523  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2)  Environmental
3300022528  Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O (Metagenome Metatranscriptome) (v2)  Environmental
3300022530  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-O (Metagenome Metatranscriptome) (v2)  Environmental
3300022532  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)  Environmental
3300022533  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)  Environmental
3300022557  Paint Pots_combined assembly  Environmental
3300022724  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)  Environmental
3300022726  Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)  Environmental
3300024330  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)  Environmental
3300024347  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)  Environmental
3300026319  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)  Environmental
3300026320  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)  Environmental
3300026482  Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B  Environmental
3300026551  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)  Environmental
3300026555  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)  Environmental
3300026557  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal  Environmental
3300027908  Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)  Environmental
3300028047  Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)  Environmental
3300028536  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)  Environmental
3300030730  Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome)  Environmental
3300030839  Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil TCEFB (Eukaryote Community Metatranscriptome)  Environmental
3300030845  Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - OA7 Emin (Eukaryote Community Metatranscriptome)  Environmental
3300030923  Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A3_MS_autumn Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2)  Environmental
3300030937  Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A4_MS_spring Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2)  Environmental
3300031022  Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A3_MS_spring Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2)  Environmental
3300031023  Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil TCEFA (Eukaryote Community Metatranscriptome)  Environmental
3300031047  Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1B (Eukaryote Community Metatranscriptome)  Environmental
3300031057  Oak Coassembly Site 11 - Champenoux / Amance forest  Environmental
3300031128  Oak Summer Coassembly Site 11 - Champenoux / Amance forest  Environmental
3300031231  Coassembly Site 11 (all samples) - Champenoux / Amance forest  Environmental
3300031236  Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1  Environmental
3300031469  Fir Spring Coassembly Site 11 - Champenoux / Amance forest  Environmental
3300031474  Fir Coassembly Site 11 - Champenoux / Amance forest  Environmental
3300031525  Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_3  Environmental
3300031590  Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 (Metagenome Metatranscriptome)  Environmental
3300031663  Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 (Metagenome Metatranscriptome)  Environmental
3300031715  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05  Environmental
3300031753  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515  Environmental
3300031754  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515  Environmental
3300031823  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05  Environmental
3300031962  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515  Environmental
3300032174  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05  Environmental
3300032180  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515  Environmental
3300032515  FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)  Environmental

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1009239462 | 3300002245 | Forest Soil | MIDSEVMRLRRLRNTALRARALAAALDPDWARRSSVFSRSSVNCWRITRVITGWLRAHPYLSYHQGPSEMRGLYDRLSTGLLGAIARYRGRSLQTFSSELQRVARELDDARALTWSSELSDTLGRAQIQIRSLIQELEADARNEGASRD
JGI25614J43888_100681432 | 3300002906 | Grasslands Soil | MIDTEVTRLRRLRNTALRARAVAAALGSDPAARDSAFSRGAASCWRIARVITGRLRAHPYLSYQRGPSEMRGVYDGVSAGLLGAITRYRGRSQQTFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGACSEAGR
JGI25616J43925_100031772 | 3300002917 | Grasslands Soil | MIDSEVTRLRRLRNTALRARAVAATLDSEPARRDSVFSRGAVSCWQIARIITGLLRAHPYLSYQRGPSEVRGVYDRLSAGLLGGIARYRGRSHQTFSDELRRVARELDDARALTWSSDLSDTLGRSQIQIRGLIKELGAGALNESGSRHETAPRVETRIGAVRDDSGSVAGNWPYLAI*
Ga0073928_1000641512 | 3300006893 | Iron-Sulfur Acid Spring | MIDTEVARLRRLRNTALRARALAATLDSDPARRESVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRISAGLMGGIARYRGRTHQTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETAPRVDTRISAARDDTGSVAGNWPYLAI*
Ga0073928_100503632 | 3300006893 | Iron-Sulfur Acid Spring | MIDAEVMRLRRLRNTALKARALATALDSDPGRGSSVFSRSAVNCWRISRVITGWLRAHPYLSYQQGPSEARGVYDRLSAGVQSAVARSRGRSRQTLSGELLRVARELDDARALTWSSDLSDTFGRSQMQIRALIKELDADALNEAASRHETPMRLDTRTGNRRDDAGSIAGNWPYLAI*
Ga0073928_101746912 | 3300006893 | Iron-Sulfur Acid Spring | MIDTEVMRLRQLRNTALKARALAAALDSDPARRNSVFSQSAVSCWRIARVITGRLRAHPYLSYQRGPSAVRGVYDRLSAGLLSAIARYRGRSAQTFSRELQRVARGLDDARALTWSSDLSDTLGRSQMQIRTLIKELNADAREVASRHETLARVETRAADGRGDAGSVAGNWPYLAI*
Ga0099791_102212331 | 3300007255 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRVVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0099794_100085644 | 3300007265 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDTRALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVPGNWPYLAI
Ga0099794_104209921 | 3300007265 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAAALGSDPAARDSAFSRGAASCWRIARVITGRLRAHPYLSYQRGPSEMRGVYDRVSAGLLGAITRYRGRSQQSFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGAYSEAGRRNEV
Ga0099829_104417772 | 3300009038 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDTLGRSQMQIRELIKELDAVARNESGSRHET
Ga0099830_111296061 | 3300009088 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAAALGSDPAARDSAFSRGAASCWRIARVITGRLRAHPYLSYQRGPSEMRGVYDRVSAGLLGAITRYRGRSQQSFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGACSEAGRRNEVESRHAPRIETRTGAVGNDAADVAGNWPYLAI*
Ga0099792_1000085510 | 3300009143 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAVLDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTQHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0099796_100229724 | 3300010159 | Vadose Zone Soil | SDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI*
Ga0126355_10246751 | 3300010855 | Boreal Forest Soil | MIDVEVMRLRRLRYTALRARALAAALDSDPVQGRSVFSRSAVSCWRISRVITGWLRAHPYLSYQQGPSELRGVYDRFGAGLLSAVARSRGRSRQALCGELQRVARELDDARALTWSSELSDTFGRSQMQIRSLIKELDADAFNEAASRHETPMRLDSRTGNRRDNAGSIAGNWPYLAI*
Ga0126351_12946381 | 3300010860 | Boreal Forest Soil | MIDSEVMRLRRLRNTALRARALAAALDLDSARRNSVFSRSSVNCWRIARVITGWLRAHPYLSYHQGPSAMRGFYDRLSTALLGAVARYRGRSLQTFSGELRRVARGLDDARALTWSFELSDTLGRAQTQIRSLIQELDADAHKEGASRDATMARVDLRVGDGRDGASSVAGNWPYLAI*
Ga0150983_146207391 | 3300011120 | Forest Soil | MIYTEVARLRRLRNTALRARALAATLDSDPARRDSLFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSEVRGIYDRVGAGLLGGIARYRGRSHQAFSDELRRVARELDDARALTWSSDLSDTLGRSQIQIRGLINELDAGASKESGSQCETASRVETRTDAVRDDTGSVAGNWPYLAI*
Ga0137392_100373004 | 3300011269 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0137392_101411172 | 3300011269 | Vadose Zone Soil | MIDTEVTRLRQLRNTALRARAVAATLGSDPAARDSAFSRGAASCWRIARVITGRLRAHPYLSYQRGPSEMRGAYDRVSAGLLGAITRYRGRSQQTFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGACSEAGRRNEVESRHAPRIETRTGAVGNDAADVAGNWPYLAI*
Ga0137388_108342451 | 3300012189 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAGTLDSDSARRDSVFSRGAVNCWRIARVITGLLRGHPYLSYHQGPGAVRGVYDRVSAGLLGGIARYRGRSLQTFSDELRRVARELDDARALTWSSDLSDTLGRSQMQIRELIKELDAVARNESGSRHETAPRVETRTADGRDDAVSVAGNWPYLAI*
Ga0137363_100064173 | 3300012202 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRRETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0137363_102866371 | 3300012202 | Vadose Zone Soil | MIDSEVTRLRRLRNTALRARAVAAALDSGTSRRDSVFSRSAVNFWQITRVITGLLRGHPYLNYQREPGEVRGLYDCAGAGLLGGIARYRGRSRQALSEELRRVARELDDARALTWSSDLSDTLGRSQMRVRGLIKELDADARSESESRHATSPRVETRIGAVREDAGSVAGNW
Ga0137399_108926861 | 3300012203 | Vadose Zone Soil | TALRARAVAATLGSGPPRRDSVFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDRINAGVLGAFARYRGRSHETLSDELRGVARELDDARALTLSSALSDTLGRSQMQIRALIKELDTGARHETGSRHEPAQRVETRIGAVRDDSGSVEGTWPYLAI*
Ga0137362_104812993 | 3300012205 | Vadose Zone Soil | RARAVAATLGSDPAARDSAFSRGAASCWRIARVITGRLRAHPYLSYQRGPSEMRGAYDRVSAGLLGAITRYRGRSQQTFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGACSEAGRRNEVESRHAPRIETRTGAVGNDAADVAGNWPYLAI*
Ga0137362_108923011 | 3300012205 | Vadose Zone Soil | MIDSEVTRLRRLRNTALRARAVAAALDSGTSRRDSVFSRSAVNFWQITRVITGLLRGHPYLSYQREPGEVRGLYDCAGAGLLGGIARYRGRSRQALSEELRRVARELDDARALTWSSDLSDTLGRSQMRVRGLIKELDADARSESESRHATSPRVETRIGAVREDAGSVAGNWPYLAI*
Ga0137385_105736332 | 3300012359 | Vadose Zone Soil | LRRLRNTALRARALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDQINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRIETRIAAVRDDAGSVAGNWPYLAI*
Ga0137360_112418671 | 3300012361 | Vadose Zone Soil | MIDTEVMRLRRLRNTALRARALAAVLDSEPARRRSVFSRSAVSCWQIARVVTGWLRAHPYLSYQRGPSAVRGVYDRLGAGLLGAVARYQGRTLQTFFHELQGVARELDDARALTWSSDLSDTLGRSQMHLRSLIKELDAETHSDVASRHETLVRVEARSTNGRDDAGSVAGNWPYLAI*
Ga0137390_100740822 | 3300012363 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAATLDSDSARRDSVFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDRASAGLLGGIARYRGRSQQTFSDELRRVARELDDARALTWSSDLSDTLGRSQMQIRELIKELDAVARNESGSRHETAPRVETRIGAVRYDTAVRDDSGSVAGNWPYLAI*
Ga0137358_100489364 | 3300012582 | Vadose Zone Soil | MIDSEVTRLRRLRNTALRARAVAAALDSGTPRRDSVFSRSAVNFWQITRVITGLLRGHPYLSYQRGPGEVRGLYDRAGAGLLGGIARYRGRSRQALSEELRRVARELDDARALTWSSDLSDTLGRSQMRVRGLIKELDADARSESESRHETSPRIETRIGAVREDAGSVAGNWPYLAI*
Ga0137358_105248592 | 3300012582 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGAVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSR
Ga0137397_1000226412 | 3300012685 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSAALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0137397_102979381 | 3300012685 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVASTLDSGPPRRDSVFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDHINAGVLGAFARYRGRSHETLSDELRGVARELDDARALTLSSALSDTLGRSQMQIRALIKELDASARHETGSRHEPASRVETRIGAVGDDSNRVEGSWPYLAI*
Ga0137359_100391653 | 3300012923 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVASTLDSGPPRRDSVFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDRINAGVLGAFARYRGRSHETLSDELRGVARELDDARALTLSSALSDTLGRSQMQIRALIKELDASARHETGSRHEPAPRVETRIGAVGDDSNRVEGSWPYLAI*
Ga0137359_100634802 | 3300012923 | Vadose Zone Soil | MIDSEVTRLRRLRNTALRARAVAAALDSGTPRRDSVFSRSAVNFWQITRVITGLLRGHPYLSYQRGPGEVRGLYDRAGAGLLGGIARYRGRSRQALSEELRRVARELDDARALTWSSDLSDTLGRSQMRVRGLIKELDADARSESESRHATSPRVETRIGAVREDAGSVAGNWPYLAI*
Ga0137359_100872612 | 3300012923 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAVLDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRRETAPRVETRIAAVRDDAGSVPGNWPYLAI
Ga0137416_101240482 | 3300012927 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAAALGSDPAPRDSAFSRSAASCWRIARVITGRLRAHPYLSYQRGPSEMRGVYDRVSAGLLGAIARYRGRSQQTFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGAYSEAGRRNEVESRHAPRIETRTGAVGNDAADVAGSWPYLAI*
Ga0137416_102394282 | 3300012927 | Vadose Zone Soil | MIDIEVARLRRLRNTALSARALAAALDSDPARLGSVFSRGAVTCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVCTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0137404_101904682 | 3300012929 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVASALDYDSARHGSVFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDRVSAGLLGGIARYRGRSRQTFSDELRRVAHELDDARALTWSSDLSDTLGRSQMQIRGLIKELDAGARNESASRHETAARVETQIGAAREGSGSVAGNWPYLAI*
Ga0137404_104174981 | 3300012929 | Vadose Zone Soil | PENKWIHMIDTEVARLRRLRNTALRARTLAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRVSAGLLGGVARYRGRTHQALSDELRCVARELDDARALTWSSDLSDTLGRSQMQIRGLIKDLDARARAESGSRHETAPQVETRIAAVRDDAGSVAGNWPYLAI*
Ga0137407_114520651 | 3300012930 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAAALNSDPAPRDSAFSRSAASCWRIARVVTGRLRAHPYLSYQRGPSELRGGYDRLSAGLLGAIARYRGRTQQIFAEELRRVARELDDARALTLSPDLSDTLGRSQTQIRRLIKELDVGALNEAGAFNEAGTFNEAGAFNEAGAFN
Ga0137410_100320371 | 3300012944 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGSSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0137410_100588624 | 3300012944 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAAALGSDPAARDSAFSRGAASCWRIARVITGRLRAHPYLSYQRGPSEMRGVYDRVSAGLLGAITRYRGRSQQSFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGAYSEAGRRNEVESRHAPRIETRTGAVGNDAADVAGNWPYLAI*
Ga0182024_105667972 | 3300014501 | Permafrost | MIDAEVMRLRRLRNTALKARALAGMLDSDAARRDSLFSRSALSCWRIARLTTGTLRAHPYQSFQRGPSALRGIYNRLVAGTVGATARRQERRLQAFYPELLRVARELDDARALTWSAELSDTLGRSQTEIRGLLRELHAGARSEAGPPREMAPEVEARTGAARADAGSVAGNWPYLAF*
Ga0137418_101279781 | 3300015241 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDQARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRRETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0137403_101423811 | 3300015264 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARTLAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSAALGRSQRQIRGLIEELDADAGVRTASGSRHETEPRVETRNAA
Ga0179590_10218052 | 3300020140 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAVLDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVPGNWPYLAI
Ga0179592_100167112 | 3300020199 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVASTLDSGPPRRDSVFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDHINAGVLGAFARYRGRSHETLSDELRGVARELDDARALTLSSALSDTLGRSQMQIRALIKELDASARHETGSRHEPAPRVETRIGAVGDDSNRVEGSWPYLAI
Ga0179592_101447202 | 3300020199 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0210403_100392153 | 3300020580 | Soil | MIDSEVMRLRRLRNTALKARALAAALDPDSARRNSVFSRSSVNCWRISRVITGWLRAHPYLSYHQGPSAMRGFYDRLSTALLGVFARYRGRSLQTFSGELRRVARGLDDARALTWSSELSDTLGRAQTQIRSLIQELDADAHKEGASREATIARVDLRVGDGRDGASSVAGNWPYLAI
Ga0210403_100697523 | 3300020580 | Soil | MIDTEVARLRRLRNTALWARAVAAALNTEPAGRGSVFSRSAVRSWQIARVITGWLRAHPYLSYQRGPSEVRGVYDRLSAGLLAGFTRYRGRTLQAFSPVLRRLARELDDSRALTWSSDLSDTLGRSQKQIRTLLEELDADARDEAGSRRETVARVATRTGNERDDAGVAASWPYLAI
Ga0210403_100895452 | 3300020580 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRDSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGLYDRLNAGLLGGIAGYRGRSRQTLSDELRRVARELDDARALTWSSDLSDALGRSQRHIRGLIEELDAGVRSASGSRHETSQRIGAARDDTGSVAGNWPYLAI
Ga0210403_105800681 | 3300020580 | Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRGAVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLQGGIARSRGRSLQAFSNELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARNE
Ga0210399_103386553 | 3300020581 | Soil | MIDTEVVRMQRLRNTALSARALAGALDPGPARRGSLFSRGALICWQIARVVTGTLRSHPYPKYQREPSVLRADYDRLSAAWRGGITRYRRRSLQALSDELRRVARELDDARALTWSAEMSNTFGRLQVHVRKLLRELDLGVRSDAGSHDDPVSALQPEIGTSADGDVAGNWPYLAF
Ga0210399_105890511 | 3300020581 | Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRGAVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLLGGIARSRGRSLQAFSDELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARSEAGSRHETAPRVETRTGAVRNDSGSVAGNWPYLAI
Ga0210399_110613611 | 3300020581 | Soil | RLRNTALRARALAATLDSDPARRESLFSRGAVSCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRISAGLLGGIVRYRGRTHQTLSDELRRVARELDDARALTWSSDLSDILGRSQMQIRGLIQDLDAGARAESGSRHETAPQVETRIAAVRDDAGSVAGNWPYLAI
Ga0179584_10419741 | 3300021151 | Vadose Zone Soil | MIDAEVMRLRRLRNTALKVRALAGALDSDPARGSSVFSRSAASCWRISRVITGWLRAHPYLSYQQGPSEARGIYDRFSAGLLSAVARSRGRSRQTFSGELLRVARELDDARALTWSSDLSDTFGRSQMQIRALIKELDADALNEATSRHETPVRLGTRTGNRRDDAGSIAGNWPYLAI
Ga0179584_11083861 | 3300021151 | Vadose Zone Soil | MIDTEVIRLRRLRNTALRARAVAAALDSDSARRGSVFSRSAVNCWRIARVITGLLRGHPYLSYQRGPSELRGLYDRVSAGLLGGIARYRGRSLQIFSDELRRVARELDDARALTLSSDLSDTLGRSQTQIRGLIQELDAGARNEAGSRHETAPRVETRTGAVRNDTGSVAGNWPYLAI
Ga0179584_14427541 | 3300021151 | Vadose Zone Soil | VFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDRINAGVLGAFARYRGRSHETLSDELGGVARELDDARALTLSSALSDTLGRSQMQIRALIKELDASARHETGSRHEPASRVETRIGAVGDDSNRVEGSWPYLAI
Ga0179584_14504851 | 3300021151 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARLGSVFSRGGVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINAGLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRELIEELDAGARSASGSRHETAARVDTRIGAARDDAGSAAGNWPYLAI
Ga0210406_100181406 | 3300021168 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARHDSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGLYDRLNAGLLGGIAGYRGRTRQTLSDELRRVARELDDARALTWSSDLSDALGRSQRHIRGLIEELDAGVRSASGSRHETSQRIGAARDDTGSVAGNWPYLAI
Ga0210406_103292603 | 3300021168 | Soil | RLRRLRNTALRARAVAATLDSDSARRDSMFSRGAVNCWRIARIITGLLRGHPYLSYQRGPSGVRAIYDRVIAGLLGGLARHRGRSHLTFFDELRRVARELDDARALTWSSHLSDTLGRSLTQIRGLINELDAAARNESGSRQEAAPRVETRIGAVRDDSGSAGSWPYLAI
Ga0210406_105849941 | 3300021168 | Soil | MIDSEVVRLRRLRDTALRVRAIAAVLDSHPARRSSLVARSGRSCWRIARAITGTLRAHPYLNYQRGPSEVRAVYDRIRAGLLGGIARYRGRSLQTFCAELQRVAHELDDTRALTWSADLSDTLGRSQTQMRRLIKELDAAVRTEAGSLIETDTRVEARSAALREDAGSVAGNWPYLAF
Ga0210400_107438531 | 3300021170 | Soil | MIDTEVTRLRRLRNTALRARALAAALDSDSARRDSVFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSEVRGVYDRVSAGLLGSVARYRGRSHETFSDELRRVARELDDARALTWSPGLSDTLGRSQMQIRGLIKELDAGVRNESGLRRETAPRVETRIGTVRDDSSSLSGNWPYLAI
Ga0210400_107614071 | 3300021170 | Soil | RARDLRISLCAPTYGSALRIQGTKNNWVHMIDTEVARLRRLRNTALRARALAAALDSDPARHDSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPGEMRGAHDRISAGLMGGIARYRGRTHQTLSDELRRVARELDDARALTWSSDLSDALGRSQRHIRGLIEELDAGVRSASGARHETSPRVDTRIGAARGDTGSVAGNWPYLAI
Ga0179958_12571541 | 3300021315 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASG
Ga0210386_103630371 | 3300021406 | Soil | MIDTEVMRLRRLRKTALKARALARALNSDPAQRSSVFSRSAVNCWRIAGVATGWLRGHPYLSYQRGPSEVRAVYDRLTADWRSSVARSRGRILQTLSHELQRVARELDDARALTWSPDLSDTLGRSQVQLRSLIKELDADAHKADAYKEVAVRRETPVRVETRTGAGRD
Ga0210384_101970172 | 3300021432 | Soil | MIDTEVVRMQRLRNTALSARALAGALDPGPARRGSLFSRGALICWQIARVVTGTLRSHPYPKYQREPSVLRADYDRLSAAWRGGITRYRRRSLQALSDELRRVARELDDARALTWSAEMSDTFGRLQVHVRKLLRELDLGVRSDAGSHDDPVSALQPEIGTSADGDVAGNWPYLAF
Ga0210384_114163981 | 3300021432 | Soil | HMIDTEVARLRRLRNTALRARALAAALDSDPARHDSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGLYDRLNAGLLGGIAGYRGRTRQTLSDELRRVARELDDARALTWSSDLSDALGRSQRHVRGLIEELDAGVRSASGARHETSPRVDTRIGAARDDTGSVAGNWPYLAI
Ga0210402_100561974 | 3300021478 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARHDSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGLYDRLNAGLLGGIAGYRGRTRQTLSDELRRVARELDDARALTWSSDLSDALGRSQRHIRGLIEELDAGVRSASGARHETSPRVDTRIGAARGDTGSVAGNWPYLAI
Ga0210410_100126714 | 3300021479 | Soil | MIYTEVARLRRLRNTALRARALAATLDSDPARRDSLFSRGAVNCGRIARVITGLLRGHPYLSYQRGPSEVRGIYDRVGAGLLGGIARYRGRSHQAFSDELRRVARELDDARALTWSSDLSDTLGRSQIQIRGLINELDAGASKESGSQCETASRVETRIDAVRDDTGSVAGNWPYLAI
Ga0210409_102344012 | 3300021559 | Soil | MIDTEVARLRRLRNTALRARAVAATLDSDSARRDSMFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSGVRGIYDRVIAGLLGGLARHRGRSHLTFFDELRRVARELDDARALTWSSHLSDTLGRSLTQIRGLINELDAAARNESGSRQEAAPRVETRIGAVRDDSGSAGSWPYLAI
Ga0242642_10483231 | 3300022504 | Soil | MIDSEVTRLRRLRNTALRARAVAAALDSGAPRRDSVFSRSAVNFWQITRVITGLLRGHPYLSYQRGPGEVRGLYDRAGAGLLGGIARYRGRSRQALSEELRRVARELDDARALTWSSDLSDALGRSQMRLRGLIRELNADARNVSESRHETLPRVETRIGAVREDAGSVAGNWPYLAI
Ga0242648_10354321 | 3300022506 | Soil | RSSVFSRSAVNCWRIAGVATGWLRGHPYLSYQRGPSEVRAVYDRLTADWRSSVARSRGRILQTLSHELQRVARELDDARALTWSQDLSDTLGRSQVQLRSLIKELDADAHKADAYKEVAVRRETPVRVETRTGAGRADGGSVAGNWPYLAL
Ga0242648_10960811 | 3300022506 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRDSVFSRGAVNCWQIARVITGLLRGHPHLSFQRGPSELRGVYDRLSANWLGGIAGYRGRTRQTLSDELRRVARELDDARALTWSSDLSDALGRSQRHIRGLIEELDAGVRSASGARHETSPRVDT
Ga0222728_10074543 | 3300022508 | Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRGAVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLQGGIARSRGRSLQAFSNELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARSEAGSRHETAPRVETRTGAVRNDSGSVAGNWPYLAI
Ga0242649_10551001 | 3300022509 | Soil | MIDTEVARLRRLRNTALRARAVAATLDSDSARRDSMFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLLGGIARFRGRGLQAFSDELRHVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARNEAGSRHETAPRVETRTGAVRNDSGSVAGNW
Ga0242652_10399351 | 3300022510 | Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRGAVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLLGGIARFRGRGLQAFSDELRRVARELDDARALTWSAELSDSFGRSQRQIRRLIEELNAGARHELGGDHDTATREALVAECQKEGGSAANW
Ga0242663_10344611 | 3300022523 | Soil | MIDTEVMRLRRLRKTALKARALARALNSDPAQRSSVFSRSAVNCWRIAGVATGWLRGHPYLSYQRGPSEVRAVYDRLTADWRSSVARSRGRILQTLSHELQRVARELDDSRALTWSPDLSDTLGRSQVQLRSLIKELDADAHKASAHKADAYKEVAVRRETPVRVETRTGTGRADGGSVAGNWPYLAI
Ga0242669_10838011 | 3300022528 | Soil | MIDTEVMRLRRLRKTALKARALARALNSDPAQRSSVFSRSAVNCWRIAGVATGWLRGHPYLSYQRGPSEVRAVYDRLTADWRSSVARSRGRILQTLSHELQRVARELDDSRALTWSPDLSDTLGRSQVQLRSLIKELDADAHKADAYKEVAVRRETPVRVETRTGTGRA
Ga0242658_11648371 | 3300022530 | Soil | MIDVEVMRLRRLRNTSLRARALAAALDSDPVQGRSVFSRSAASCWRISRVITGWLRAHPYLSYQQGPSELRGVYDRFGAGLLSAVARSRGRSRQALCGELQRVARELDDARALTWSSDLSDTFGRSQMQIRSLIKELDADAFNEAALRHETPMRLDSRTGNRRDDAGSIAGNWPYLAI
Ga0242655_101533341 | 3300022532 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRDSVFSRGAVNCWQIARVITGLLRGHPHLSFQRGPSELRGVYDRLSANWLGGIAGYRGRTRQTLSDELRRVARDLDDARALTWSSDLSDALGRSQRHIRGLIEELDAGVRSASGARHETSPRVDTRIGAARGDTGSVAGNWPYLAI
Ga0242655_101991031 | 3300022532 | Soil | RPYRGKCVEDRGSKNKWIHMIYTEVARLRRLRNTALRARALAATLDSDPARRDSLFSRGAVNCGRIARVITGLLRGHPYLSYQRGPSEVRGIYDRVGAGLLGGIARYRGRSHQAFSDELRRVARELDDARALTWSSDLSDTLGRSQIQIRGLINELDAGASKESGSQCETASRVETRIDAVRDDTGSVAGNWPYLAI
Ga0242662_100772382 | 3300022533 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRDSVFSRGAVNCWQIARVITGLLRGHPHLSFQRGPSELRGVYDRLSANWLGGIAGYRGRTRQTLSDELRRVARELDDARALTWSSDLSDALGRSQRHIRGLIEELDAGVRSASGARHETSPRVDTRIGAARGDTGSVAGNWPYLAI
Ga0242662_101495141 | 3300022533 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRDSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDTISADLMGGIARYRGRTHQALSDELRRVARELDDARALTWSSDLSDALGRSQRQIRALIEELDAGVRSASGSRHETAPRVDTRIAAAGDDTGSVAGNWPYLAI
Ga0242662_101664911 | 3300022533 | Soil | MIDTEVARLRRLRNTALRARAVAATLDSGSARRGSMFSRGAVNCWRIARGITGLLRGHPYLSYQRGPSEVRGIYDRVIAGLLGGFARYRGRSHLTFSDELRRVARELDDARALTWSSHLSDTWGRSLMQIRGLINELDAIARNESGSRPEAAPRVETRIGALREDPGRVAGSWPYLAI
Ga0242662_101799441 | 3300022533 | Soil | MIGTEVARLRRLRNTALRARALAAALDSDPAQRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPGEMRGAHDRISAGLMGGIARYRGRTHQTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETAPQVETRIAAVPDDAGSVAGNWPYLAI
Ga0242662_103358571 | 3300022533 | Soil | SARRDSVFSRSAVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLLGGIARSRGRSLQAFSDELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARSEAGSRHETAPRVETRTGAVRNDSGSVAGNWPYLAI
Ga0212123_100079256 | 3300022557 | Iron-Sulfur Acid Spring | MIDTEVARLRRLRNTALRARALAATLDSDPARRESVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRISAGLMGGIARYRGRTHQTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETAPRVDTRISAARDDTGSVAGNWPYLAI
Ga0242665_102631471 | 3300022724 | Soil | MIDTEVMRLRRLRKTALKARALARALNSDPAQRSSVFSRSAVNCWRIAGVATGWLRGHPYLSYQRGPSEVRAVYDRLTADWRSSVARSRGRILQTLSHELQRVARELDDARALTWSQDLSDTLGRSQVQLRSLIKELDADAHKASAHKADAYKEVAVRRETPVRVETR
Ga0242654_102063231 | 3300022726 | Soil | MIDTEVARLRRLRNTALRARAVAATLDSGAARRGSMFSRGAVNCWRIARGITGLLRGHPYLSYQRGPSEVRGIYDRVIAGLLGGFARYRGRSHLTFSDELRRVARELDDARALTWSSHLSDTLGRSLTQIRGLINELDAAARN
Ga0242654_102610711 | 3300022726 | Soil | MIDVEVMRLRRLRNTSLRARALAAALDSDPVQGRSVFSRSAVSCWRISRVITGWLRAHPYLSYQQGPSELRGVYDRFGAGLLSAVARSRGRSRQALCGELQRVARELDDARALTWSSDLSDTFGRSQMQIRSLIKELDADAFNEAAPRHETPMRLDSRTGNRRDDAGSIAGNWPYLAI
Ga0242654_102763421 | 3300022726 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRDSAFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDTISADLMGGIARYRGRTQQALSDELRRVARELDDARALTWSSDLSDALGRSQRQIRALIEELDAGVRSASGSRHETAPRVDTRIAAAGDDTGSVAGNWPYLAI
Ga0242654_102801041 | 3300022726 | Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRGAVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLQGGIARSRGRSLQAFSNELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARNEAGSRHETAPRVETRTGAVRNDSGSVAGNWPYLAI
Ga0242654_103113101 | 3300022726 | Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRSSVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLLGGIARFRGRGLQAFSDELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARNEAGSRHETAPRVETRTGAVRNDSGSVAGNW
Ga0137417_11179421 | 3300024330 | Vadose Zone Soil | RALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0179591_10010503 | 3300024347 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0209647_10852592 | 3300026319 | Grasslands Soil | MIDTEVTRLRRLRNTALRARAVAAALGSDPAARDSAFSRGAASCWRIARVITGRLRAHPYLSYQRGPSEMRGVYDGVSAGLLGAITRYRGRSQQTFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGACSEAGRRNEVESRH
Ga0209131_100291410 | 3300026320 | Grasslands Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGSSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRRETAPRVETRIAAVRDDAGSVAGSVAGNWPYLAI
Ga0257172_10103332 | 3300026482 | Soil | MIDTEVIRLRRLRNTALGARAVAATLGADPAPRDSVFSRGAVNCWQIARVITGLLRAHPYLSYQRGPSEVRGIYDRVRAGLLGGIARYRGRSHQTFSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0209648_105607621 | 3300026551 | Grasslands Soil | RARALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0209648_107277191 | 3300026551 | Grasslands Soil | RNTALKARALAAALDSAPARRSSVFSRSAVSCWQIARVITGSLRAHPYLSYQRGPSEVRGLYDRLSAGFLSSIARYRGRSLQIFSREMQRVARGLDDARALTWSSDLSDTLGRSQMQFRRLIKELDADAPNEVASRRETLARVETRDGRDDAGSVAGDWPYLAI
Ga0179593_10155794 | 3300026555 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVASTLDSGPPRRDSVFSRGAVNCWRIARVITGLLRAHPYLSYQRGPSEVRGVYDHINAGVLGAFARYRSRSHETLSDELRGVARELDDARALTLSSALSDTLGRSQMQIRALIKELDASARHETGSRHEPAPRVETRIGAVGDDSNRVEGSWPYLAI
Ga0179587_101177442 | 3300026557 | Vadose Zone Soil | MIDTEVARLRRLRNTALRARALAAVLDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0179587_104222991 | 3300026557 | Vadose Zone Soil | MIDTEVMRLRRLRNTALSARAIAAALESNPARRDSPFSRGAVSCWQIARVITGRLRAHPYLSYQRGPSGVRGICDRVSAGVRGAIARSRDLSLQIFCGELQRVARELNDARALTWSADLSDALGRSQTQIRKLIKELDVGARNEVGLHRESAGRGYARTGAVEDDAGSTAGNWPYLAF
Ga0209006_107453511 | 3300027908 | Forest Soil | MIDTEVARLRRLRNTALRARAVAATLDSDSARRGSMFSRGAVNCWRIARGITGLLRGHPYLSYQRGPSEVRGIYDRVIAGLLGGFARYRGRSHLTFSDELRRVARELDDARALTWSLHLSDTLGRSLMQIRGLINELDAIARNESGSRPEAAPRVETRIGALREDSGRVAGSWPYLAI
Ga0209526_104670451 | 3300028047 | Forest Soil | PMIDSEVMRLRRLRNTALRARALAAALDPDWARRSSVFSRSSVNCWRITRVITGWLRAHPYLSYHQGPSEMRGLYDRLSTGLLGAIARYRGRSLQTFSSELQRVARELDDARALTWSSELSDTLGRAQIQIRSLIQELDADARNEGASRDATMARVDLRVGDGRDGPGSVAGNWPYLAI
Ga0137415_100811392 | 3300028536 | Vadose Zone Soil | MIDTEVTRLRRLRNTALRARAVAAALGSDPAPRDSAFSRSAASCWRIARVITGRLRAHPYLSYQRGPSEMRGVYDRVSAGLLGAIARYRGRSQQTFSEELRRVARELDDARALTWSSELSDSLGRSQTQIRALIKELDDGAYSEAGRRNEVESRHAPRIETRTGAVGNDAADVAGSWPYLAI
Ga0137415_103005693 | 3300028536 | Vadose Zone Soil | MIDIEVARLRRLRNTALSARALAAALDSDPARLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGVYDRINASLMSGITRYRGRTHHALSDELRGVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDADAGVRTASGSRRETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0137415_113358471 | 3300028536 | Vadose Zone Soil | DSEPARRRSVFSRSAVRCWQIARVVTGWLRAHPYLSYQRGPSAVRGVYDRLGAGLLGAVARYQGRTLQTFFHELQGVARELDDARALTWSSDLSDTLGRSQMHLRSLIKELDAETHSDVASRHETLVRVEARSTNGRDDAGGVAGNWPYLAI
Ga0307482_10179811 | 3300030730 | Hardwood Forest Soil | MIDTEVTRLRRLRNTALRARAVAATLDSGPARRDSLFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSEVRAVYDRISAALLGGIARYRDRSHRAFSIELRRVARELDDARALTWSSDLSDTLGRSQMQIRGLIKELDADARNASGSLHETAPRVETRIGAVQDDSGSVAGNWPYLAI
Ga0073999_110741241 | 3300030839 | Soil | MIGTEVARLRRLRNTALRARALAAALDSDPARPASVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGSYDRISAGLMGGISRYRGRTRQALSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETAPRIDSRIAAARDDTGNVSGNWPYLAI
Ga0075397_111338961 | 3300030845 | Soil | MIGTEVARLRRLRNTALRARALAAALDSDPAQLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRINAGLMSGIARYRGRSHHTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRELIEELDAGVRSASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0138296_16776401 | 3300030923 | Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRIDAGLMSGIARYRGRSHRTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETAARVDTRIGAARDDTG
Ga0138302_11759201 | 3300030937 | Soil | MIGTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARIITGLLRGHPYLSYQRGPSELRGVYDRINAGLMSGIARYRGRTHQTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETAARVDTRIGAARDDTGSVAEKK
Ga0138302_12431251 | 3300030937 | Soil | MIDTEVMRLRRLRKTALKARALARALNSDPAQRSSVFSRSAVNCWRIAGVVTGWLRGHPYLSYQRGPSEVRAVYDRLTADWRSSVARSRGRSLQTLSHELQRVARELDDARALTWSPDLSDTLGRSQVQLRSLIKELDADAHKADANKEVAVRRQTPVRVETRTGAGRADGGSVAGNWPYLAI
Ga0138301_16068071 | 3300031022 | Soil | MIGTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRIDAGLMSGIARYRGRSHHALSDELRRVARELDDARALTWSSDLSDALGRSQRQIRELIEELDADAGVRSASGSRHETAPRVETRIG
Ga0073998_111506271 | 3300031023 | Soil | GSHMIDTEVARLRRLRNTALRARAVAAALDSDSARRGSVFSRSAVNCWRIARVITGLLRGHPYLSYQRGPSELRGLYDRVSAGLLGGIARYRGRGLQTFSDELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGVRNEAGSRHETAPRVETRTGAVRNDSGSVAGNWPYLA
Ga0073998_116072471 | 3300031023 | Soil | HMIDTEVARLRRLRNTALRARALAAALDSDPARPASVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGSYDRISAGLMGGISRYRGRTRQALSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETAPRIDSRIAAARDDTGNVAGNWPYLAI
Ga0073995_120863351 | 3300031047 | Soil | MRLRRLRNAALKARALAAALDADPGRRSSVFSRSAVNCWRIARVVTGWLRAHPYLSYQRGPSAVRDVYDRLGAGLLGAVARSQGRSQQALSSELQRVARGLDDARALTWSSELSDTLGRAQIQIRTLIKELDCDARDAAASRHETLARVDTRA
Ga0170834_1015514212 | 3300031057 | Forest Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRINAGLMSGIARYRGRSHHTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRELIEELDAGVRSASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0170834_1054432671 | 3300031057 | Forest Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRGAVSCWQIARVITGLLRGHPYLSYQRGPSGLRGLYDRVSAGLLGGIARSRGRSLQAFSDELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARNEAGSRHETAPRVETRTGAVRNDSGSVAGNWPYLAI
Ga0170823_172946131 | 3300031128 | Forest Soil | AALDSDPAQLGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRINAGLMSGIARYRGRSHHTLSDELRRVARELDDARALTWSSDLSDALGRSQRQIRELIEELDAGVRSASGSRHETAPRVETRIAAVRDDAGSVAGNWPYLAI
Ga0170824_1252804621 | 3300031231 | Forest Soil | MIDIEVTRLRRLRNTVLRARAVAATLDLDSARRDSVFSRGAVSCWQIARVVTGLLRGHPYLSYQRGPSGLRGLYDRVSAGLLGGIARSRGRSLQAFSDELRRVARELDDARALTLSSDLSDTLGRSQTRIRGLIKELDAGARNEAGSRHETAPRVETRTGAVRNDSGSVAGNWPYLAI
Ga0302324_1005695131 | 3300031236 | Palsa | MIDAEVMRLRRLRNTALKARALAGMLDSDAAQRNSLFSRSALSCWRIARLTTGTLRAHPYQSFQRGPSALRGMYNRLVAGMVGATARRQERRLQAFYPELLRVARELDDARALTWSAELSDTLGRSQTEIRGLLRELQAGARSEAGPTHELAPKVEARTGAARADAGSVA
Ga0170819_132313451 | 3300031469 | Forest Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRISAGLMGGVARYRGRTHQALSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETSPRVDTRIGAARDDAGRVAGKG
Ga0170818_1014511411 | 3300031474 | Forest Soil | MIGTEVARLRRLRNTALRARALAAALDSDPARRGSVFSRGAVNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRISAGLMGGVARYRGRTHQALSDELRRVARELDDARALTWSSDLSDALGRSQRQIRGLIEELDAGVRSASGSRHETSPRVDTRIGAARDDAGSVAGNWPYLAI
Ga0302326_114599651 | 3300031525 | Palsa | RSALSCWRIARLTTGTLRAHPYQSFQRGPSALRGMYNRLVAGMVGATARRQERRLQAFYPELLRVARELDDARALTWSAELSDTLGRSQTEIRGLLRELQAGARSEAGPTHELAPKVEARTGAARADAGSVAGNWPYLAF
Ga0307483_10182311 | 3300031590 | Hardwood Forest Soil | MIDTEVTRLRRLRNTALRARAVAATLDSGPARRDSLFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSEVRAVYDRISAALLGGIARYRDRSHRAFSIELRRVARELDDARALTWSSDLSDTLGRSQMQIRGLIKELDADALIESGSLHETAPRVETRIGAVQDDSGSVAGNWPYLAI
Ga0307484_1123121 | 3300031663 | Hardwood Forest Soil | MIDTEVTRLRRLRNTALRARAVAATLDSGPARRDSLFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSGVRAVYDRISAALLGGIARYRDRSHRAFSNELRRVARELDDARALTWSSDLSDTLGRSQMQIRGLIKELDADALIESGSLHETAPRVETRIGAVQDDSGSVAGNWPYL
Ga0307484_1147901 | 3300031663 | Hardwood Forest Soil | VRALAATLDSDSARSDSVFSRAAVSCWRIGRVITGLLRAHPYLSYRRGPSEVRGVYDRVSARLLGGIARYRGRSHQTFFVELRRVARELDDTRALTWSSDLSDTLGRSQIQIRGLINELDACARNESGSRHETASQVETRIGPVQDDSGSGAGNWPYLAI
Ga0307476_101777674 | 3300031715 | Hardwood Forest Soil | RLRNTALRARAVAATLDSGPARRDSLFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSEVRAVYDRISAALLGGIARYRDRSHRAFSNELRRVARELDDARALTWSSDLSDTLGRSQMQIRGLIKELDADALIESGSLHETAPRVETRIGAVQDDSGSVAGNWPYLAI
Ga0307477_100453371 | 3300031753 | Hardwood Forest Soil | MIDTEVTRLRRLRNTALRARAVAATLDSGPARRDSLFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSEVRAVYDRISAALLGGIARYRDRSHRAFSNELRRVARELDDARALTWSSDLSDTLGRSQMQIRGLIKELDADALIESGSLHETAPRVETRIGAVQDDSGSVAGNWPYLAI
Ga0307475_111213321 | 3300031754 | Hardwood Forest Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRESVFSRGAVNCWQIARVITGLLRGHPYLNYQRGPSELRGAYDRISAGLMGGIARYRGRAHQTLSDELRRVARELDDARALTWSSDLSDTLGRSQMRIRGLINDLDAGTRAESGSRHETATQVETRIAAAQDDAGSVAGNWPYLAI
Ga0307478_101245312 | 3300031823 | Hardwood Forest Soil | MIDTEVTRLRRLRNTALRARAVAATLDSGPARRDSLFSRGAVNCWRIARVITGLLRGHPYLSYQRGPSGVRAVYDRISAALLGGIARYRDRSHRAFSNELRRVARELDDARALTWSSDLSDTLGRSQMQIRGLIKELDADALIESGSLHETAPRVETRIGAVQDDSGSVAGNWPYLAI
Ga0307479_102246733 | 3300031962 | Hardwood Forest Soil | MIDTEVTRLRRLRNTALRVRAVAATLDSDSARSHSVFSRGAVSCWRIARVITGLLRAHPYLSYRRGPSKVRGVYDRVSARLLGGIARYRGRSHQTFSVELRRVARELDDARALTWSSDLSDTLGRSQTQIRGLINELDAGARNESGSRRETASQVETRIGPVQDDSGSGAGNWPYLAI
Ga0307470_100010774 | 3300032174 | Hardwood Forest Soil | MIDTEVARLRRLRNTALRARALAAALDSDPARRESVFSRGAMNCWQIARVITGLLRGHPYLSYQRGPSELRGAYDRISAGLLGAVARYRGRTHQTLSDELRRVARELDDARALTWSSDLSDTLGRSQMQIRGLTKDLDAGARAESGSRHETAPQVETRSAAVRDDAGSVAGNWPYLAI
Ga0307471_1043479161 | 3300032180 | Hardwood Forest Soil | MIDTEVTRLRRLRNTALRARAVAAALDSDSAPRDSAFSRSAASCWRIARVVTGRLRAHPYLSYQRGPSELRGVYHRLCAGLLAAIARYRGRTQQIFAEELRRVARELDDARALTLSPDLSDTLGRSQMQIRRLIKELDDGALNEAGALNEAG
Ga0348332_130982991 | 3300032515 | Plant Litter | MIDTEVMRLRRLRKTALKARALARALNSDPAQRNSVLSRSAVNCWRIAGVATGWLRAHPYLSYQRGPSEVRAVYDRLTADWRSSVARSRGRSLQTLSRELQRVAHELDDARALTWSSDLSDTLGRSQVQLRSLIKELDADAHKASALKADANKEVTVRRETPVRVETRTGAGRADGGSVAGNWPYLAI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice