NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F066535

Overview

Basic Information
Family ID F066535
Family Type Metagenome / Metatranscriptome
Number of Sequences 126
Average Sequence Length 180 residues
Representative Sequence MSLNKWYETLVNSQVDGSALTAAAAASAIPAAAKFTLPTNYFDTIGKKLRIRATGRISSLITTPGTARFDVRFGATVVFDGLAVLLDTVAGHTNVGWILDIELTCRAIGATGNLMGQGLWTCEDILGVPATAPKGVLSAILPWNSAPAVGNNFDTTATQQIDLFFTQTVATGSMTVHQYEVIAPN
Number of Associated Samples 111
Number of Associated Scaffolds 126

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 79.20 %
% of genes near scaffold ends (potentially truncated) 26.19 %
% of genes from short scaffolds (< 2000 bps) 48.41 %
Associated GOLD sequencing projects 104
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.84

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (67.460 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil (29.365 % of family members)
Environment Ontology (ENVO) Unclassified (45.238 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline) (42.063 % of family members)
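
The shares reported above are fractions of the 126 family members, so they can be converted back into approximate member counts. The following is a minimal Python sketch, assuming each percentage is a rounded count divided by the family size; the function name `members_from_percent` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 126  # "Number of Sequences" from the Overview


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage of family members to the nearest whole count."""
    return round(percent / 100 * family_size)


# Percentages copied from the Most Common Taxonomy / Ecosystem blocks.
shares = {
    "Unclassified taxonomy": 67.460,
    "GOLD ecosystem (Soil)": 29.365,
    "ENVO Unclassified": 45.238,
    "EMPO Subsurface (non-saline)": 42.063,
}

counts = {label: members_from_percent(p) for label, p in shares.items()}
for label, n in counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE} members")
```

The same conversion applies to the frequency columns in the Gene Neighborhood tables, which are also expressed relative to the 126 family scaffolds.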




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 1.41%; β-sheet: 45.54%; Coil/Unstructured: 53.05%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.84

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
b.29.1.0: Concanavalin A-like lectins/glucanases | d4u36a_ | 4u36 | 0.63
b.29.1.13: Concanavalin A-like lectins/glucanases | d3a4ua_ | 3a4u | 0.59
b.29.1.16: Concanavalin A-like lectins/glucanases | d1ux6a1 | 1ux6 | 0.57
b.29.1.13: Concanavalin A-like lectins/glucanases | d2a6za1 | 2a6z | 0.57
b.29.1.0: Concanavalin A-like lectins/glucanases | d5z5ia2 | 5z5i | 0.57



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 126 Family Scaffolds
PF01833 | TIG | 23.02
PF06381 | DUF1073 | 1.59
PF01471 | PG_binding_1 | 1.59
PF00176 | SNF2-rel_dom | 1.59
PF08708 | PriCT_1 | 0.79
PF03864 | Phage_cap_E | 0.79
PF02709 | Glyco_transf_7C | 0.79
PF04860 | Phage_portal | 0.79
PF00535 | Glycos_transf_2 | 0.79
PF05050 | Methyltransf_21 | 0.79
PF04480 | DUF559 | 0.79
PF10983 | DUF2793 | 0.79
PF13659 | Obsolete Pfam Family | 0.79
PF13227 | DUF4035 | 0.79
PF00801 | PKD | 0.79
PF00589 | Phage_integrase | 0.79
PF07505 | DUF5131 | 0.79
PF01612 | DNA_pol_A_exo1 | 0.79
PF05272 | VirE | 0.79
PF02653 | BPD_transp_2 | 0.79
PF01531 | Glyco_transf_11 | 0.79
PF04865 | Baseplate_J | 0.79
PF05063 | MT-A70 | 0.79
PF10518 | TAT_signal | 0.79
PF01464 | SLT | 0.79
PF11651 | P22_CoatProtein | 0.79

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 126 Family Scaffolds
COG4725 | N6-adenosine-specific RNA methylase IME4 | Translation, ribosomal structure and biogenesis [J] | 1.59
COG4422 | Bacteriophage protein gp37 | Mobilome: prophages, transposons [X] | 0.79
COG5545 | Predicted P-loop ATPase and inactivated derivatives | Mobilome: prophages, transposons [X] | 0.79



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 67.46 %
All Organisms | root | All Organisms | 32.54 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300000364|INPhiseqgaiiFebDRAFT_105038092 | Not Available | 1014
3300001213|JGIcombinedJ13530_102759070 | Not Available | 1049
3300002549|JGI24130J36418_10004120 | Not Available | 4957
3300003320|rootH2_10026934 | All Organisms → cellular organisms → Bacteria | 27825
3300004481|Ga0069718_10057323 | All Organisms → cellular organisms → Bacteria | 17733
3300005172|Ga0066683_10589086 | Not Available | 675
3300005178|Ga0066688_10397253 | Not Available | 894
3300005253|Ga0073583_1130832 | All Organisms → cellular organisms → Bacteria | 4207
3300005334|Ga0068869_101455687 | Not Available | 607
3300005345|Ga0070692_10948740 | Not Available | 598
3300005440|Ga0070705_100000026 | Not Available | 77617
3300005468|Ga0070707_100179117 | Not Available | 2066
3300005536|Ga0070697_100888531 | Not Available | 790
3300005537|Ga0070730_10215218 | All Organisms → cellular organisms → Bacteria | 1277
3300005559|Ga0066700_10001204 | All Organisms → cellular organisms → Bacteria | 10555
3300005577|Ga0068857_100262812 | All Organisms → cellular organisms → Bacteria | 1584
3300005615|Ga0070702_101408156 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Roseobacteraceae → Leisingera → Leisingera aquimarina | 570
3300006028|Ga0070717_10004383 | Not Available | 10184
3300006853|Ga0075420_100229007 | Not Available | 1619
3300006881|Ga0068865_100043063 | All Organisms → cellular organisms → Bacteria | 3082
3300007516|Ga0105050_10032861 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 4148
3300007523|Ga0105052_10508036 | Not Available | 780
3300007722|Ga0105051_10516098 | Not Available | 884
3300009147|Ga0114129_10000360 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → unclassified Hyphomicrobiaceae → Hyphomicrobiaceae bacterium | 52606
3300009177|Ga0105248_10542255 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1312
3300009609|Ga0105347_1086310 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1168
3300009678|Ga0105252_10000125 | Not Available | 61922
3300009777|Ga0105164_10253544 | Not Available | 883
3300009813|Ga0105057_1001624 | Not Available | 2647
3300010397|Ga0134124_10081769 | Not Available | 2777
3300010400|Ga0134122_10112459 | Not Available | 2170
3300010403|Ga0134123_10000072 | Not Available | 61643
3300011271|Ga0137393_10831581 | Not Available | 789
3300011402|Ga0137356_1000002 | Not Available | 56173
3300011410|Ga0137440_1004227 | Not Available | 2002
3300011417|Ga0137326_1169184 | Not Available | 505
3300011420|Ga0137314_1010068 | Not Available | 2492
3300011423|Ga0137436_1108112 | Not Available | 736
3300011429|Ga0137455_1006828 | Not Available | 3082
3300011429|Ga0137455_1262750 | Not Available | 508
3300011433|Ga0137443_1089619 | Not Available | 876
3300011441|Ga0137452_1000040 | Not Available | 54528
3300011441|Ga0137452_1011779 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2664
3300011441|Ga0137452_1274577 | Not Available | 566
3300011445|Ga0137427_10002766 | All Organisms → cellular organisms → Bacteria | 7354
3300012034|Ga0137453_1000573 | Not Available | 3723
3300012034|Ga0137453_1052985 | Not Available | 741
3300012040|Ga0137461_1009938 | Not Available | 2215
3300012140|Ga0137351_1000224 | Not Available | 6434
3300012160|Ga0137349_1011970 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1286
3300012171|Ga0137342_1000920 | All Organisms → Viruses → Predicted Viral | 3657
3300012172|Ga0137320_1003531 | Not Available | 2982
3300012225|Ga0137434_1000003 | All Organisms → cellular organisms → Bacteria | 19895
3300012231|Ga0137465_1000273 | All Organisms → cellular organisms → Bacteria | 30007
3300012231|Ga0137465_1198565 | Not Available | 603
3300012232|Ga0137435_1000046 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 56447
3300012232|Ga0137435_1041566 | Not Available | 1342
3300012685|Ga0137397_10292579 | Not Available | 1213
3300012923|Ga0137359_10018177 | Not Available | 5941
3300012943|Ga0164241_10000679 | Not Available | 54530
3300012944|Ga0137410_10285019 | Not Available | 1304
3300012944|Ga0137410_10612751 | Not Available | 901
3300014613|Ga0180008_1195283 | Not Available | 778
3300014872|Ga0180087_1032720 | Not Available | 953
3300014876|Ga0180064_1028438 | Not Available | 1086
3300014878|Ga0180065_1000507 | Not Available | 5957
3300014879|Ga0180062_1009491 | Not Available | 1933
3300014881|Ga0180094_1020681 | Not Available | 1284
3300014884|Ga0180104_1014832 | Not Available | 1866
3300014884|Ga0180104_1032486 | All Organisms → Viruses → Predicted Viral | 1345
3300014885|Ga0180063_1015189 | All Organisms → Viruses → Predicted Viral | 2072
3300015163|Ga0167665_1000280 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → unclassified Gemmatimonadaceae → Gemmatimonadaceae bacterium | 18433
3300015196|Ga0167627_1029792 | All Organisms → cellular organisms → Bacteria | 1895
3300015203|Ga0167650_1005279 | Not Available | 3816
3300015204|Ga0167626_1014757 | All Organisms → cellular organisms → Bacteria | 3486
3300015245|Ga0137409_10287453 | Not Available | 1449
3300015259|Ga0180085_1186581 | Not Available | 622
3300018059|Ga0184615_10028730 | Not Available | 3074
3300018059|Ga0184615_10293452 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium | 907
3300018068|Ga0184636_1002301 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4999
3300018068|Ga0184636_1078295 | Not Available | 1112
3300018070|Ga0184631_10006139 | Not Available | 3556
3300018070|Ga0184631_10021839 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → unclassified Anaerolineaceae → Anaerolineaceae bacterium | 2151
3300018084|Ga0184629_10002526 | Not Available | 6978
3300018084|Ga0184629_10003125 | Not Available | 6187
3300018482|Ga0066669_10049975 | All Organisms → cellular organisms → Bacteria | 2634
3300019249|Ga0184648_1444592 | Not Available | 633
3300020027|Ga0193752_1017402 | All Organisms → cellular organisms → Bacteria | 3513
3300020580|Ga0210403_10113021 | Not Available | 2206
3300021090|Ga0210377_10004931 | Not Available | 11089
3300021090|Ga0210377_10071354 | Not Available | 2364
3300021178|Ga0210408_10000213 | Not Available | 83579
3300021363|Ga0193699_10006008 | Not Available | 4373
3300021478|Ga0210402_10099482 | Not Available | 2609
3300022213|Ga0224500_10001704 | Not Available | 12658
3300022745|Ga0228698_1000298 | All Organisms → cellular organisms → Bacteria | 42010
3300024224|Ga0247673_1000006 | All Organisms → cellular organisms → Bacteria | 41956
3300025173|Ga0209824_10199725 | Not Available | 706
3300025725|Ga0209638_1009115 | Not Available | 5346
3300025922|Ga0207646_10167498 | Not Available | 1984
3300025938|Ga0207704_10114135 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1833
3300026089|Ga0207648_10407309 | All Organisms → cellular organisms → Bacteria | 1233
3300026116|Ga0207674_10260637 | Not Available | 1681
3300026542|Ga0209805_1117181 | Not Available | 1261
3300027068|Ga0209898_1019450 | Not Available | 843
3300027513|Ga0208685_1037356 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1062
3300027573|Ga0208454_1000118 | Not Available | 56217
3300027887|Ga0208980_10547500 | Not Available | 662
3300029901|Ga0247051_1046979 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1436
3300031670|Ga0307374_10164211 | Not Available | 1684
3300031671|Ga0307372_10349045 | Not Available | 793
3300031672|Ga0307373_10025819 | Not Available | 7159
3300031707|Ga0315291_10046609 | All Organisms → Viruses → Predicted Viral | 4921
3300031707|Ga0315291_10453541 | Not Available | 1203
3300031772|Ga0315288_10094427 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → unclassified Anaerolineaceae → Anaerolineaceae bacterium | 3433
3300031873|Ga0315297_10808215 | Not Available | 782
3300031885|Ga0315285_10144426 | All Organisms → Viruses → Predicted Viral | 1989
3300031997|Ga0315278_10600744 | Not Available | 1126
3300031999|Ga0315274_11298996 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 710
3300032053|Ga0315284_10066807 | All Organisms → Viruses → Predicted Viral | 4891
3300032118|Ga0315277_10751818 | Not Available | 929
3300032156|Ga0315295_10008348 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → unclassified Anaerolineaceae → Anaerolineaceae bacterium | 8898
3300032164|Ga0315283_12157774 | Not Available | 549
3300032516|Ga0315273_10011663 | Not Available | 11244
3300032516|Ga0315273_10098707 | Not Available | 3988
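
Each scaffold identifier listed above joins two parts with a `|`: a leading number that appears to match a Taxon OID in the Associated Samples table, followed by the scaffold name. The following is a small Python sketch of splitting such identifiers and grouping scaffolds by sample. The names `ScaffoldId`, `parse_scaffold_id` and `group_by_sample` are illustrative assumptions, not an NMPFamsDB or IMG/M API.

```python
from collections import defaultdict
from typing import NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str      # leading numeric part, e.g. "3300011441"
    scaffold_name: str  # part after the "|", e.g. "Ga0137452_1000040"


def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Split 'taxon_oid|scaffold_name' into its two parts."""
    taxon_oid, sep, name = raw.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, name)


def group_by_sample(raw_ids):
    """Map each taxon OID to the scaffold names found in that sample."""
    groups = defaultdict(list)
    for raw in raw_ids:
        sid = parse_scaffold_id(raw)
        groups[sid.taxon_oid].append(sid.scaffold_name)
    return dict(groups)


# A few identifiers copied from the Associated Scaffolds listing.
example_ids = [
    "3300011441|Ga0137452_1000040",
    "3300011441|Ga0137452_1011779",
    "3300011441|Ga0137452_1274577",
    "3300000364|INPhiseqgaiiFebDRAFT_105038092",
]
grouped = group_by_sample(example_ids)
print(grouped["3300011441"])
```

Grouping this way shows how the scaffold count can exceed the sample count: some taxon OIDs, such as 3300011441, contribute more than one scaffold.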




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 29.37%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 10.32%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 6.35%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.56%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 4.76%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.97%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.17%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 3.17%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 2.38%
Freshwater | Environmental → Aquatic → Freshwater → Ice → Glacial Lake → Freshwater | 2.38%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.38%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.38%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 2.38%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 2.38%
Wastewater | Environmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater | 1.59%
Wetland | Environmental → Aquatic → Marine → Wetlands → Sediment → Wetland | 1.59%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 1.59%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.59%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.59%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.59%
Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Sediment | 0.79%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater | 0.79%
Cryconite | Environmental → Aquatic → Freshwater → Ice → Unclassified → Cryconite | 0.79%
Freshwater | Environmental → Aquatic → Freshwater → Creek → Unclassified → Freshwater | 0.79%
Marine Sediment | Environmental → Aquatic → Marine → Oceanic → Sediment → Marine Sediment | 0.79%
Sediment | Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment | 0.79%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.79%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.79%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Glacier Forefield Soil | 0.79%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.79%
Sugarcane Root And Bulk Soil | Host-Associated → Plants → Rhizome → Unclassified → Unclassified → Sugarcane Root And Bulk Soil | 0.79%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.79%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001213 | Combined assembly of wetland microbial communities from Twitchell Island in the Sacramento Delta (Jan 2013 JGI Velvet Assembly) | Environmental
3300002549 | Arctic peat soil from Barrow, Alaska - Barrow Graham LP Ref core NGADG0002-212 | Environmental
3300003320 | Sugarcane root Sample H2 | Host-Associated
3300004481 | Combined Assembly of Gp0112041, Gp0112042, Gp0112043 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005253 | Marine sediment microbial community near Loki's castle | Environmental
3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005577 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 | Host-Associated
3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300006881 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 | Host-Associated
3300007516 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - FRY-01 | Environmental
3300007523 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - FRY-03 | Environmental
3300007722 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - FRY-02 (megahit assembly) | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300009609 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 | Environmental
3300009678 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 | Environmental
3300009777 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water | Environmental
3300009813 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011402 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT830_2 | Environmental
3300011410 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT222_2 | Environmental
3300011417 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT500_2 | Environmental
3300011420 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT199_2 | Environmental
3300011423 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT119_2 | Environmental
3300011429 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2 | Environmental
3300011433 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT300_2 | Environmental
3300011441 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT513_2 | Environmental
3300011445 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2 | Environmental
3300012034 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT526_2 | Environmental
3300012040 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT746_2 | Environmental
3300012140 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT690_2 | Environmental
3300012160 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT630_2 | Environmental
3300012171 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT466_2 | Environmental
3300012172 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT366_2 | Environmental
3300012225 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2 | Environmental
3300012231 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT828_2 | Environmental
3300012232 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT100_2 | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012943 | Backyard soil microbial communities from Emeryville, California, USA - Original compost - Back yard soil (BY) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014613 | Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - MM_PW_MetaG | Environmental
3300014872 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT790_16_10D | Environmental
3300014876 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200_16_10D | Environmental
3300014878 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200A_16_10D | Environmental
3300014879 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_10D | Environmental
3300014881 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1Da | Environmental
3300014884 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1Da | Environmental
3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental
3300015163 | Arctic soil microbial communities from a glacier forefield, Rabots glacier, Tarfala, Sweden (Sample Rb1b, glacier snout) | Environmental
3300015196 | Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G2C, Ice surface) | Environmental
3300015203 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-3c, vegetated patch on medial moraine) | Environmental
3300015204 | Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G2B, Ice surface) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental
3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental
3300018068 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_b2 | Environmental
3300018070 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_b1 | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019249 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome) | Environmental
3300020027 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c1 | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300021090 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021363 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3c2 | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300022213 | Sediment microbial communities from San Francisco Bay, California, United States - SF_Oct11_sed_USGS_4_1 | Environmental
3300022745 | Freshwater microbial communities from McNutts Creek, Athens, Georgia, United States - 1-17_Aug_MG | Environmental
3300024224 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK14 | Environmental
3300025173 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes) | Environmental
3300025725 | Arctic peat soil from Barrow, Alaska - Barrow Graham LP Ref core NGADG0002-211 (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated
3300026089 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes) | Host-Associated
3300026116 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes) | Host-Associated
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300027068 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 (SPAdes) | Environmental
3300027513 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes) | Environmental
3300027573 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 (SPAdes) | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027887 | Wetland microbial communities from Twitchell Island in the Sacramento Delta, sample from surface sediment Aug2011 Site A1 Bulk | Environmental
3300029901 | Cryconite microbial communities from ice sheet in Kangerlussuaq, Greenland - KAN_P-B3a | Environmental
3300031670 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-3 | Environmental
3300031671 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-1 | Environmental
3300031672 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-2 | Environmental
3300031707 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_20 | Environmental
3300031772 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_20 | Environmental
3300031873 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G15_0 | Environmental
3300031885 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_36 | Environmental
3300031997 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G06_0 | Environmental
3300031999 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_20 | Environmental
3300032053 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_16 | Environmental
3300032118 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G05_15 | Environmental
3300032156 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G14_0 | Environmental
3300032164 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_0 | Environmental
3300032516Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_0EnvironmentalOpen in IMG/M

Geographical Distribution
Powered by OpenStreetMap




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_10503809223300000364SoilMSKQTWMGELIRAQSDGAALTAAAAASCLPTHAKYTLPPNYFDAPGKGLRIKASGRISSVITTPGTARFDVRLGSTVVFDGLAIVLDSIAAHTNVGWALEIELTARTVGTGTNATLIGTGWWTCEDIIGVPATAPKGVLSAILPWNSAPAVGNGFDSTIANVLDLFFTQTQATGSMTLHQYTAIEPN*
JGIcombinedJ13530_10275907023300001213WetlandMLNGQSFEEIVVTSQVDGSALTAAAAASMIPAAAKFTLPPNWWAIGRKALVRAAGKISSVITTPGTARFDVRFGTDVVFDSLAILLDSVAAHTDVGWWLEIMLTCRGIGAAATLMGHGIFVCEDLLGVPATAPKGCLTAMLPWNAAPAVGATFDSTVSNVVDLYFTQTVATGSLTCQQYSLSLMN*
JGI24130J36418_1000412023300002549Arctic Peat SoilMLNGQTFDEIVIVSQVDGAALTAAAAASMIPAAAKFTLPPNWCQIGRQILLRAAGKISSVITTPGTARFDVRLGTDVVFDGLAILLDSVAARTDAAWWLEILLTCRGIGAAATFMGHGRWSCEDILGVPATAPKGALTAMLPWNSAPAVGASFDSTVSQVIDVFFTQTAATGSLTCQQYSVSLLN*
rootH2_10026934173300003320Sugarcane Root And Bulk SoilMALGYRKTLVSAQGDGPALTAAAAATCLPGQAKVLIPGGTFDAAGKLLEIEAWGRISSVITTPGTARFDVRLGGTVLFDSLAILLDSVAAHTNVGWYLKILMSLRTPGTAATFFGQGTFTCEDILGVPATAPKGVLSAILPWNSAPAVSGNTFDATAAQTLDLFFTQTVATGSLTLHQYRASIESYE*
Ga0069718_1005732363300004481SedimentMLNGQTFEEVVITSQEDGAALTAAAAASMIPAAAKFTLPPNWCQIGRQIHIRAAGKISSVITTPGTARFDIRFGATVVFDGLAALLDSVAAHADVGWWLEIILTVRVIGTSAALMGHGIFITEDLLGVPATAPKGVLVAMLPWNATPANGNTFDSTVSQVIDAFFTQTAATGSLTCQQYSVALLN*
Ga0066683_1058908613300005172SoilGAAAAMCLPAAAKTTIPVNTFDTIGKGLLITASGRISSVITSPGTARFDVRLGGTVVFDSLAILLDSVAAHTNVGWWLQIYLTLRTLGAATAPAAANFMGQGLWTCEDLLGVPATAPKGVLSAILPWNSAPAVGGNIDPTTALTLELFFTQTLRTGSMTLHQYMAELIAWE*
Ga0066688_1039725323300005178SoilMSLQTWQETLVSSQVDGPVLTAAAAASCIPPAAKITLPNNYFYIGRVLRITATGRISSVITTPGTARFDVRIGGVIAFDSLVILLDTVAGHTNVGWWLDLFLTCRAIGNGTLTTLMGQGSWTCEYILGVPATAPKGVLTAMLPWNSAPAVGTGFDNTLANTLDMFFTQTAATGSLTVHQYIVSSLN*
Ga0073583_113083223300005253Marine SedimentMSFQSWQETLITSQGDGATITAAAATSMLPAAAVYTLPANFFDVIGKQVMIKASGRITSVITTPGTARYDVRLGGGVVFDGLATLLDTVAGHTTVAWDLEILLTCRVIGATANLFGIGKWASEDLLGVPATAPKGVLTAMLPWDTTPAVGSNFDSTTSQTVDVQFTQTAATGSMTCHQYSLISMN*
Ga0068869_10145568713300005334Miscanthus RhizosphereATCIPTAAKITLPNNYLTIGRELWIRATGRISSLITTPGTARFDIRIGAVVAFDSLAILLDTVAGHTNVGWVLDLHLTVRAQGSGTSGNLIGQGTWACEDILGVPATAPKGSLVAVLPWNSAPAVGTGYDSTAASTLDMFFTQTAATGSLTVHTYRVISLN*
Ga0070692_1094874013300005345Corn, Switchgrass And Miscanthus RhizosphereEVLVNAQSNGTALTAAAAASCIPAAAKKTMWNNYFDTIGQQLRITACGIISSVITTPGTARFDFRLGGTVVFDGLAVLLDTVAGHTNVGWWLEILLRLDTIGSGTSAKFMGHGKWTCEDILGVPATAPKGVLTASLPWNSAPALGGGFDSTAALSMDLFFTQTVATGSCQLQQFMIEALN
Ga0070705_10000002623300005440Corn, Switchgrass And Miscanthus RhizosphereMSLQTWRETLIVASTDGTTLTAAAAATCIPATARFTLPANYWTIGRQLKITASGRISSVITTPGTARFDIRMGAAGNTVVFDSLAILLDTVAAHTTVGWYLEILMTCRAIGTAANFMGQGKWTCEDILGTPAAAPKGSLTAILPWNAAPAVGGNFDSTVTNIVDMFFTQTVATGSMTLHQYSAVACN*
Ga0070707_10017911713300005468Corn, Switchgrass And Miscanthus RhizosphereMSFNSWHQTLVTAQGDGTALTAAAAASCLPAHAKFPLPGCFFDTVGKQLMIRASGRISSLITTPGTARFDVRFGSTVVADGLAVLLDTVAAHTNVGWWLKMLLTCRAVGASASLMQSGIWVCEDILGVPATAPKGSLTALLPWNSAPTVGATFDSTAAQIVDLFFTQTVATGSLTLHQYSLIS
Ga0070697_10088853123300005536Corn, Switchgrass And Miscanthus RhizosphereMSLQSWRETLAIAAGDGTAITAAAAATCIPATARLTLPANYWSVGRQLKITASGRISSVITTPGTARYDIRMGSAGNTVVFDSQAILLDTVAAHTNVGWYLEILMTCRAIGTAANFMGQGKWTCEDILGVPATAPKGVLTAILPWNAAPAVGGNFDSTVTNIVDMFFTQTVATGSMTLHQYSAVACN*
Ga0070730_1021521823300005537Surface SoilMSQQSWQQTLISSQVDGSALTAAAAASMLPLQAKYTLPANFVNFVGQQLLVKASGRISTVITTPGTARFDIRLGGTVDFDSLAIALDTAAAYTNVGWYLEILLTARSISGTTSNFMGQGTWVCTQMAGTIQTPPKGALTAILPWNSAPAVGSNVDLTTALQVDAFFTQTVATGSLTCHQFSLIAMN*
Ga0066700_1000120453300005559SoilMSLQTWQETLVSSQVDGPVLTAAAAASCIPPAAKITLPNNYFYIGRVLRITATGRISSVITTPGTARFDVRIGGVIAFDSLAILLDTVAGHTNVGWWLDLFLTCRAIGNGTLTTLMGQGSWTCEDILGVPATAPKGVLTAMLPWNSAPAVGTGFDNTLANTLDMFFTQTAATGSLTVHQYIVSSLN*
Ga0068857_10026281223300005577Corn RhizosphereMGAQRWSQPLIVQSVTDGPTLTAAAAASCIPTPNKITLPNNYWEVGRQWRVSLHGRISTVVTTPGTARFDLRIGAVVAFDSLAIALDAVAGYTTVPWWLEIMLTCRAAGAGTVTQLLGVGKWTSPNIAGAIAGAPKGSLTALLPWNAAPALGTGFDNTLANTVDMFFTQTVATGSLTVHGYQIEELN*
Ga0070702_10140815613300005615Corn, Switchgrass And Miscanthus RhizosphereTLVSSQGDGPTLTAAAAATCLPGQAKFTIPAGTFDAPGKILHIWASGRISSLITTPGTARYDVRLGGTVMFDSLAILLDSVAAHTTVGWTLDITMTMRTAGSSATFWGQGTWTSEDILGVPATAPKGVLSAILPWNSTPAVSGNTFDANASQVLDLFFTQTAATGSMTVHQYMAEIVSFE
Ga0070717_10004383163300006028Corn, Switchgrass And Miscanthus RhizosphereMSKQTWVETLVSQQIDGPTLTAASAASCIAPAAKITLPNNFFDYIGKAIRIKVYGKISSVITTPGTARFDVRFGSVVVFDGLAILLDTVAAHTNVGWELEIVLTCRSIGNSTLATMMGVGKWTCEDILGVPATAPKGVLTAILPWNSAPAVGAGFDSTAANTLDLFFTQTVATGSLTVLGYTVEALN*
Ga0075420_10022900723300006853Populus RhizosphereMAKQTWQQTLITALIDGTTLTAAAAASCIPVAAKYTLPANFFELGRQLRIQASGRISSVITTPGTARFDVRLGGTIVWDSQAILLDTVAAHTNVGWFLDVLLTCRAIGSAANLIGQGQWACEDILGVPATAPKGVLTALLPWNQAPAVGANFDSTAAQQLDLFFTQTQGTGSLTVHQYRLESVN*
Ga0068865_10004306323300006881Miscanthus RhizosphereMALGYRKTLVSSQGDGPTLTAAAAATCLPGQAKFTIPAGTFDAPGKILHIWASGRISSLITTPGTARYDVRLGGTVMFDSLAILLDSVAAHTTVGWTLDITMTMRTAGSSATFWGQGTWTSEDILGVPATAPKGVLSAILPWNSTPAVSGNTFDANASQVLDLFFTQTAATGSMTVHQYMAEIVSFE*
Ga0105050_1003286133300007516FreshwaterMSLQTWQETLVTAQVNGTALTAAARASCIPAAAKFTLPANYFSQIGKALRIKASGRITSVVTTPGTARFDVAFGSNVVADGLAILLDSVAGHTDVGWSLEFLLTCRAIGATANLMQSGIWVCEDILGVPASAPKGVLTALLPWNAAPAVGANFDSTASQVVDLFFTQTAATGSLQVHQYSLESLN*
Ga0105052_1050803623300007523FreshwaterMSLQTWQETLVTAQVNGTALTAAARASCIPAAAKFTLPANYFSQIGKALRIKASGRITSVVTTPGTARFDVAFGSNVVADGLAILLDSVAGHTDVGWSLEFLLTCRAIGATANLLQSGIWVCEDILGVPASAPKGVLTALLPWNAAPAVGANFDSTASQVVDLFFTQTAATGSLQVHQYSLESLN*
Ga0105051_1051609823300007722FreshwaterMSLQTWQETLVTAQVNGTALTAAARASCIPAAAKFTLPANYFSQIGKSLRIKASGRISSVVTTPGTARFDVAFGSNVVADGLAILLDSVAGHTDVGWSLEFLLTCRAIGATANLMQSGIWVCEDILGVPASAPKGVLTALLPWNAAPAVGANFDSTASQVVDLFFTQTAATGSLQVHQYSLESLN*
Ga0114129_10000360783300009147Populus RhizosphereMTSQADGPTLSAAAAATAIPAAAKLTLPANFFRGLGQQLRIQACGRISSVITTPGTARFDVRFGSTVVFDGLAILLDTVAAHTNVGWWLEILLTCRSLGASAQLFGQGYWTCEDILGVPATAPKGVLSAMLPWNSAPALGNAFDSTIAQVVDFFFTQTVATGTLIVHQYSVISEN*
Ga0105248_1054225513300009177Switchgrass RhizosphereVDGPTLTAAAAATCIPTATKITLPNNYLTIGRELWIRATGRISSLITTPGTARFDIRIGAVVAFDSLAILLDTVAGHTNVGWVLDLHLTVRAQGSGTSGNLIGQGTWACEDILGVPATAPKGSLVAVLPWNSAPAVGTGYDSTAASTLDMFFTQTAATGSLTVHTYRVISLN*
Ga0105347_108631023300009609SoilMSVGYWETLVEGQVDGPTLTAAARASCIPPAAKFTLRPNFFDRIGKKLFIRAQGRISSLITTPGTARYDIVFGAALMFDSLAILLDTVAGHTTVPWWLEIEMTCRAIGATGNLFTQGRWSCEDILGVPATAPKGVLTAMLPWNVAPAVGANFDTTASQTIDMFFTQTVATGQMILHQYTVISPN*
Ga0105252_10000125153300009678SoilMALGYVETLITSSENGTAVTAAAATTLLPASAVKTLPANYFDFVGKQVLVKASGRISSLITTPGTARFDIRLGGIVVFDGLAVLLDTVAAHTNVGWHLEVLLTCRAIGATGNFFGQGQWTCEDILGVPATAPKGVLTAVLPWNSAPAVGGNVDTTTSLALNLFFTQTVATGSCQLHQFGAYGMN*
Ga0105164_1025354423300009777WastewaterMSLGYWETLISCQVDGAAVTAAAATSLLPAQAKKTIPANYFDVIGKKLIITATGRISSVVTTPGTARFDVRLGSIVVFDGLAILLDSVAAHTTVGWDLYIDLTCRAVGATANLFGYGRWTCEDILGVPATAPKGVLSAVLPWNATPAVGANFDSTVSAVVDLFFTQTVATGSCTVHQFNVYSPN*
Ga0105057_100162413300009813Groundwater SandMSQQTWGEVLVNGSVDGPTLTAAAAASCIPTPSRIILPNNYFYVGRMLRITASGRISSVVTTPGTARFDVRMGPSGTIIVFDGLAILLDTVAAHTTVGWFLEILLTCRAVGSGTATNFMGQGKWTCEDILGGPATAPKGVLTAILPWNTAPAVGTGFDNTAANAVDMFFTQTVATGSMTVHQYMVESLN*
Ga0134124_1008176923300010397Terrestrial SoilMSLQGWQETLITSQVDGTTLTAAAAASCIPAAAKYTFPTNYFSIGKQLLVKFSGRISSVITTPGTARFDVRLGASVVFDSQAMLLDTAAGHTTESFFGEVLLTCRSIGATATIIGQGFWAADNLKNVGTMPVGGLVAPMPWSGTAPAVGSTFDSTATQQLDFFFTQTVATGSMTLHQYSLISLN*
Ga0134122_1011245913300010400Terrestrial SoilLGYVEPLITSQVDGTANTAGTAASAIPAAAKKTLPANYFDVIGKQLLIKASGRISTVITTPGVVTYDIRFGATKVFDGLGILPDTVAAHSNVGWMLEILLTCRAIGTSGNLMGQGFWVCEDLLGVPATAPKGVLSAILPWNSAPAVGSNFDTTATQQIDMFFANTVGTGSHTVHQFGVYGLN*
Ga0134123_10000072963300010403Terrestrial SoilMVGYSAVLVNSQVDGTANTTGVAASAIPAAAKFTLPTNFFDVIGKKLVIEATGRISTVITTPGVITFDVRFGATKVFDGLGILPDTVAAHTNVGWMLKIDLTCRAIGTAANLMGQGMWTCEDILGVPATAPKGVLSAILPWNSAPAVGSNFDSTATQQVDLFFTNTVGTGSMTVHQFCLYSPN*
Ga0137393_1083158123300011271Vadose Zone SoilMQQGWVGTLISAQVDGTALTAVGPASLLPAAAKVTIPNNFFAIGSALRIKASGRISSLITTPGTARFDVRLGGTVVFDGLAILLDTVAAHTTVGWWLEILLTCRAIGATGNFMGQGVWTCEDILGVPATAPKGVLSAMLPWNSAPTVGGNVDTTTALQLDLQFTQTVT
Ga0137356_1000002283300011402SoilMALGYVETLIASTENGAARVVAAAATMLPASAIRTLPANYFDFVGKQILIKASGRISSVITTPGTARFDVRMGGTVVFDGLAILLDTVAGHTNVGWELEILLSCRAIGATANFFGIGKWTCEDILGVPATAPKGVLVAILPWNTAPAVGGNFDSTVSQAVDVQFTQTVATGSLTCHTYGLYGLN*
Ga0137440_100422713300011410SoilMIQSWHETLVTSQVDGTAVTAASPTSCIPAAAKYTLRPNFLERLGQKLYIKASGRVSSVITTPGTMRFDVRLGATVVFDGLAVLLDTAAAHTNVGWWLEMELTVRAIGTSANLMGQGFLTSRNLLGAPTAPAPALTAMLPWNSAPAVGNNFDSTASQQVDIFFTQTVATGSLTVHQYELIAAN*
Ga0137326_116918413300011417SoilASCIPPASKFTLRPNFFDKIGKKLFIRAQGRISSVITTPGTARYDIVFGAALMFDSLAILLDTVAGHTTVPWWLEIEMTCRAIGATGNLFTQGRWSCEDILGVPATAPKGVLTAMLPWNVAPAVGANFDSTASQTIDMFFTQTVATGQMILHQYTVISPN*
Ga0137314_101006823300011420SoilLTAAAAASAIPAAAKFTLPTNYFDTIGKKLRIRATGRISSLITTPGTARFDVRFGATVVFDGLAVLLDTVAGHTNVGWILDIELTCRAIGATGNLMGQGLWTCEDILGVPATAPKGVLSAILPWNSAPAVGNNFDTTATQQIDLFFTQTVATGSMTVHQYEVIAPN*
Ga0137436_110811213300011423SoilMLPAAAIKTLPANYFDTIGKQISIVASGRISSVITTPGTARYDVRLGGTVVFDGLAILLDTVAAHTNVGWQLEILLTCRAIGTAGNLMGVGKWTCEDILGVPAAAPKGVLSAILPWNSAPAVGGNFNTTVSQTLDVQFTQTVATGSMTCHQYGAFGHCGPN*
Ga0137455_100682833300011429SoilMSAQGWNETLITATTDGATLTAAAAATALPAQARVTLKPNFFYYIGQQMIVKAAGRISSVITTPGTARFDIRFGASIVFDGLAVLLDTVAGHTTEHWHLEILMTCRAIGATANLFGQGRWECQDILGTPAGTPRGGIVAMLPWN
Ga0137455_126275013300011429SoilMRIRATGRISSVITTPGTARFDIRFGGTVVFDSLAILLDTVAGKTTVGWVLNIELTQRVVGAVASLFGQGTWTCDDLLGRPVGTPVGVLSAIIPWNAAPAVGGTFVSTTAMQMDMFFTQTAATGSMTVHQYKVINEN*
Ga0137443_108961923300011433SoilMSAQGWQETLITATGDGSALTAAAAASALPPQAKYTLKPNFFNYIGQQMLVKASGRISSVITTPGTARFDIRFGASIVFDGLAVLLDTVAGHTTEHWHLEILMTCRAIGATANLFGQGRWECQDILGTPAGTPRGGIVAMLPWNTAPAVGANFDSTVSQQVDMFFTQTVATGSMTVHQFSVQAMN*
Ga0137452_1000040453300011441SoilMSAQGWNETLITATTDGATLTAAAAATALPAQARVTLKPNFFYYIGQQMIVKAAGRISSVITTPGTARFDIRFGASIVFDGLAVLLDTVAGHTTEHWHLEILMTCRAIGATANLFGQGRWECQDILGTPAGTPRGGIVAMLPWNTAPAVGANFDSTVSQQVDMFFTQTVATGSMTVHQFSVQAMN*
Ga0137452_101177913300011441SoilMSALTWGETLITSQVDGAAVTAAAATTLLPAAAKKTLEPNFFKTLGQQLIIRASGRVSTVITTPGTFRVDVRFGGTVVFDGLAVALATADAYTNVGWWLEIVLTLRALGTSANLMGQGRLDLPNAAGAAATPPKGALTAMLPWNSAPAVGANFDSGASQVVDLFFTQTAATGSCTLHQYALISGN*
Ga0137452_127457713300011441SoilMIQSWHETLVTSQVDGTAVTAASPTSCIPAAAKYTLRPNFLERLGQKLYIKASGRVSSVITTPGTMRFDVRLGATVVFDGLAVLLDTAAAHTNVGWWLEMELTVRAIGTSANLMGQGFLTSRNLLGAPTAPAPALTAMLPWNSAPAVGNNFDSTASQQVDIFFTQTVATGSLTVHQYE
Ga0137427_1000276673300011445SoilMSLNKWQETLVNSQVDGSALTAAAATSAIPAAAKFTLPTNYFDTIGKKLRIRATGRISSLITTPGTARFDVRFGATVVFDGLAVLLDTVAGHTNVGWILDIELTCRAIGATGNLMGQGLWTCEDILGVPATAPKGVLSAILPWNSAPAVGNNFDTTATQQIDLFFTQTVATGSMTVHQYEVIAPN*
Ga0137453_100057323300012034SoilMSAQGWQETLITATGDGSALTAAAAASALPPQAKYTLKPNFFNYIGQQMLVKASGRISSVITTPGTARFDLRFGATVVFDGLAVLLDTVAGHTTVGWFLEILLTCRVIGTSASLFGQGKWTCEDILGVPATAPKGVLTAILPWNSAPATGTVFDSTVSQQMDLFFTQTAATGSMTLHQYQVVALN*
Ga0137453_105298513300012034SoilMSAQGWNETLITATTDGATLTAAAAATALPAQARVTLKPNFFYYIGQQMIVKAAGRISSVITTPGTARFDIRFGASIVFDGLAVLLDTVAGHTTEHWHLEILMTCRAIGATANLFGQGRWECQDILGTPAGTPRGGIVAMLPWNTAPAVGANFDSTVSQQVDMFF
Ga0137461_100993823300012040SoilQGWNETLITATTDGATLTAAAAATALPAQARVTLKPNFFYYIGQQMIVKAAGRISSVITTPGTARFDIRFGASIVFDGLAVLLDTVAGHTTEHWHLEILMTCRAIGATANLFGQGRWECQDILGTPAGTPRGGIVAMLPWNTAPAVGANFDSTVSQQVDMFFTQTVATGSMTVHQFSVQAMN*
Ga0137351_100022443300012140SoilMSLGFQETLVTATTDGTTLTAAAAASALPAQALYTLPANYFAYVGKQLLVKAYGRISSVITTPGTARFDIRLGGTVVFDGLAVLLDTVAAHTTVGWKLEILMTCRAIGTSANLFGVGTWTCEDILGVPATAPKGVLTAILPWNTAPAVGSNFTSTSALQVDMFFTQTVATGSMTVHGYSLIDPTSNGS*
Ga0137349_101197023300012160SoilMSVGYWETLVEGQVDGPTLTAAARASCIPPASKFTLRPNFFDKIGKKLFIRAQGRISSLITTPGTARYDIVFGAALMFDSLAILLDTVAGHTTVPWWLEIEMTCRAIGATGNLFTQGRWSCEDILGVPATAPKGVLTAMLPWNVAPAVGANFDTTASQTIDMFFTQTVATGQMILHQYTVISPN*
Ga0137342_100092023300012171SoilMALGYVETLITSVENGTAVTAAAATTLLPASAVKTLPANYFDFVGKQVLVKASGRISSLITTPGTARFDVRLGGTVVFDGLAILLDSVAAHTNVGWTLEILLTCRAIGATGNFFGQGTWTCEDILGVPATAPKGVLSAILPWNSAPAVGGNVDTTTSLALNLFFTQTVATGSCQLHQFGAYGLN*
Ga0137320_100353123300012172SoilMIGQPETLMVAQVDGAAVTAAAATSLLHASSKRTLPANIFNYIGQTLRVVARGRISSVITTPGTARFDLRLGGALAFDGLAILLDTVAGHTNVGWKLVMDLTVRAIGTTGNFMGSGEWTCEDILGVPATAPKGVLTAILPWNSVPAVGAAVDMTTSLQADLFFTQTVATGSCTLHQFELSLPNGY*
Ga0137434_100000393300012225SoilMSLQELLIASQVDGSALTAAAAATAIPGAAKFTLRPNFFDTIGKQLAVKATGRISTVVTTPGTARFDVRLGGTVIFDSLAIALATADAYTNVGWELEILLTCRAIGATANLFGQGRWTAPNIAGASATPPKGSLIAMLPWNSAPAVGANFDSTASQQVDLFFTQTVATGSLTVHQYSLSSLMPYPSQ*
Ga0137465_1000273283300012231SoilMSLNKWYETLVNSQVDGSALTAAAAASAIPAAAKFTLPTNYFDTIGKKLRIRATGRISSLITTPGTARFDVRFGATVVFDGLAVLLDTVAGHTNVGWILDIELTCRAIGATGNLMGQGLWTCEDILGVPATAPKGVLSAILPWNSAPAVGNNFDTTATQQIDLFFTQTVATGSMTVHQYEVIAPN*
Ga0137465_119856513300012231SoilTGETQMSLGFQETLVTATTDGTTLTAAAAASALPAQALYTLPANYFAYVGKQLLVKAYGRISSVITTPGTARFDIRLGGTVVFDGLAVLLDTVAAHTTVGWKLEILMTCRAIGTSANLFGVGTWTCEDILGVPATAPKGVLTAILPWNTAPAVGSNFTSTSALQVDMFFTQTVATGSMTVHGYSLIDPTSNGS*
Ga0137435_1000046983300012232SoilMATAGFEELLITAQVDGTAVTAAAATTLLPPAALYTLPANFFSAIGKQLRIEASGRISSVITTPGTARFDVRLGGTVVFDGQAILLDTVAGHTTVGWKLDILLTCRAIGTAANVFGIGTWTCEDILGVPATAPKGVLTAILPWNTTPAVGANFNSTVSQQVDLQFTQTVATGTCTLHQYALMSRNLPQ*
Ga0137435_104156613300012232SoilETLMTSQVDGAALTAAATASMLPAAAIKTLPANYFDTIGKQISIVASGRISSVITTPGTARYDVRLGGTVVFDGLAILLDTVAAHTNVGWQLEILLTCRAIGTAGNLMGVGKWTCEDILGVPAAAPKGVLSAILPWNSAPAVGGNFNTTVSQTLDVQFTQTVATGSMTCHQYGAFGHCGPN*
Ga0137397_1029257923300012685Vadose Zone SoilMSGQNWKETLTTSIVDGPTITAIGAASCIPTANVITLPPGFFTIGKILKIKASGRISSVITTPGTARFDIRLGGTVVFDGLAILLDTVAAHVTVGWMLELYLTCRAQGNGATANLFGQGTWTCEDILGVPATAPKGVLTAILPWNTAPAVGGGFNSTISQTLDMQFTQTVATGSMTCHQYTVLSPNAGTN*
Ga0137359_1001817773300012923Vadose Zone SoilMSQVTWRETLTSSIVDGPPLTAASAASCIPTGNKITLPNNFFDRPGKSLRIEAHGRISSLITTPGTARFDVRLGGIVVFDSQAILLDTVAAHINVAWWLEIMLVCRAVGNSTTANLMGVGRWTCEDTLGVPASAPKGVLSAILPWNAAPAAGGGFDSTAANALDLFFTQTVATGSMTVHQYRAIAEN*
Ga0164241_1000067913300012943SoilMSLQTWQETLITAQVDGGAITAASATSMLPAAAKYTLPANYFQVGRKLQIKAAGRISSVITTPGTARFDVRFGATVVFDGLAVLLDSVAAHTNVGWILDIELTCRAIGSSANLMGAGLWHCEDILGVPASAPKGVLAALLPWNSAPAVGNNFDSTATQQVD
Ga0137410_1028501933300012944Vadose Zone SoilMSQQTWCETLVNSIVDGPTLTAAAAATCIPTAAKITLPNNYLTIGRELWIRATGRISSLITTPGTARFDIRIGAVVAFDSLAILLDTVAAHTTVGWVLDLHLTVRAQGSGTSGNLIGQGTWTCEDILGVPATAPKGVLSAILPWNSAPAVGTGYDSTAANTLDMQFTQTAATGSMTCHTYRVISLN*
Ga0137410_1061275113300012944Vadose Zone SoilTTLLPPHATFTLPANFFKRLGQKLRIKASGRISSVITTPGVAQFKVKFGATAVFDGLAVLLDTVAAHTNVGWMLEIEMTARALGTSANLMGQGMFTTEDLLGVPASAPKGVLSAILPWNSAPAVGSNFNSSVSQLVDLTFTQTVTSGSITLHTFELIACN*
Ga0180008_119528323300014613GroundwaterMSLQSWGETLITSQVDGPALTAAAAATALPAAAKYTLPANFFSLGKQLLIEASGRISSVITTPGTARFDVRFGATVVFDGLAVLLDSVAAHTNVGWRLKIQMTCRAIGATGNLFGQGRWTCEDILGVPATAPKGVLTAILPWNSAPAVGNNFDTTASQVVDLFFTQTAATGSMTVH
Ga0180087_103272023300014872SoilMIGQPETLMVAQVDGAAVTAAAATSLLHASSKRTLPANIFNYIGQTLRVVARGRISSVITTPGTARFDLRLGGALAFDGLAILLDTVAGHTNVGWKLVMDLTVRAIGTTGNFMGSGEWTCEDILGVPATAPKGVLTAILPWNSVPAVGAAVDMTTSL
Ga0180064_102843813300014876SoilMSLNKWYETLVNSQVDGSALTAAAATSAIPAAAKFTLPTNYFDTIGKKLRIRATGRISSLITTPGTARFDVRFGATVVFDGLAVLLDTVAGHTNVGWILDIELTCRAIGATGNLMGQGLWTCEDILGVPATAPKGVLSAILPWNSAPAVGNNFDTTATQQIDLFFTQTVATGSMTVHQYEVIAPN*
Ga0180065_100050723300014878SoilMSLNKWYETLVNSQVDGSALVAAAAASAIPAAAKFTLPTNYFDTIGKKLRIRATGRISSLITTPGTARFDVRFGATVVFDGLAVLLDTVAGHTNVGWILDIELTCRAIGATGNLMGQGLWTCEDILGVPATAPKGVLSAILPWNSAPAVGNNFDTTATQQIDLFFTQTVATGSMTVHQYEVIAPN*
Ga0180062_100949123300014879SoilMIGWPETLIVQQGDGPTLSAAAAATCLSAQGKKTLPANFFNYIGQTLRIEAAGRISSIITTPGTARFDIRLGGSVIFDSAAILLDTVAAHTTRQWRLVIDMTVRVVGSAGNFFGHGLWTCEDILGTPATAPKGSLSAVLPWNTTPAVGGNADFTTAGVLDMFFTQTEATGSMTVHQYSAAAPNGY*
Ga0180094_102068123300014881SoilERLGQKLYIKASGRVSSVITTPGTMRFDVRLGATVVFDGLAVLLDTAAAHTNVGWWLEMELTVRAIGTSANLMGQGFLTSRNLLGAPTAPAPALTAMLPWNSAPAVGNNFDSTASQQVDIFFTQTVATGSLTVHQYELIAAN*
Ga0180104_101483223300014884SoilMSLGYKEVLVTSQIDGAAVTAAAATTLLPAAAKFTLPANFFRNIGQRLKITAAGRISSLITTPGTARFSITFGATTVFDGLAVLLDTVAAHTNVGWWLEIDLTCRAIGTSGNLMGVGRWTCEDILGVPAAAPKGVLTAMLPWNTAPAVGNNFDTTATQQVDLVFTQTVATGSCTCHQYSLESQN*
Ga0180104_103248623300014884SoilMALGYVETLITSVENGTAVTAAAATTLLPASAVKTLPANYFDFVGKQVLVKASGRISSLITTPGTARFDVRLGGTVVFDGLAILLDSVAAHTNVGWTLEILLTCRAIGATGNFFGQGTWTCEDILGVPATAPKGVLSAILPWNSAPAVGGNVDTTTSLALNLFFTQTVATGSCQLHQFG
Ga0180063_101518923300014885SoilMALGYVETLITSSENGTAVTAAAATTLLPASAVKTLPANYFDFVGKQVLVKASGRISSLITTPGTARFDIRLGGTVVFDGLAVLLDTVAAHTNVGWHLEVLLTCRAIGATGNFFGQGQWTCEDILGVPATAPKGVLTAVLPWNSAPAVGGNVDTTTSLALNLFFTQTVATGSCQLHQFGAYGMN*
Ga0167665_100028073300015163Glacier Forefield SoilMSLQGWQETLVTALSDGAVITTGIGTSCIPTGAKITFPNNYFQIGKALRIRAHGRISSVITTPGTARFDVRFGSSVVFDGLAVLLDTVAAHTSVAWDLEILLTCRAIGSGTAANMFGVGKWTSEDILGVPATAPKGVLTAMLPWNTAPAVGAGFDSTAANTLDLFFTQTVSTGSMTLHGYIVESLN*
Ga0167627_102979213300015196Glacier Forefield SoilMFITQPQVITPPTLVDGPVLTLGTAASCIPTNSKITLPNNYWYPGRKWRMTSWGRVSSVITTPGTARFDLRIGGVIAWDSQAVLLDTVAAHTTVAWKLVVELTCRSVGSGTATTLIGNGTWSCEDILGVPATAPKGVLVAILPWNATPGVPGTGFDNTLATNTVDLFFTQTAAT
Ga0167650_100527923300015203Glacier Forefield SoilMGFPEVLVIGQADGTPNTAGTAASCIPAHAKITLPNNFFYVGRSLRITASGRISSLITTPGTARFDVRVGAVVAFDSQAILLDTVAAHTNVGWYLDLLLTCRAIGNGTTTTLFGSGKWTCEDILGVPATAPKGVLSAILPWNAAPAVGTGFDNTLANTLDMFFTQTAATGSMTLHQYAVEVLSPWN*
Ga0167626_101475723300015204Glacier Forefield SoilMFITQPQVITPPTLVDGPVLTLGTAASCIPTNSKITLPNNYWYPGRKWRMTSWGRVSSAITTPGTARFDLRIGGVIAWDSQAVLLDTVAAHTNVGWKLVLELTCRSVGSGTATTLVGSGMWISEDILGVPATAPKGVLVAILPWNTTPGVPGTGFDNTLATNTVDLFFTQTAATGSLTVHDYMIEEMN*
Ga0137409_1028745323300015245Vadose Zone SoilLAAAGVRSTCHAVPDAFAAPNNLPANFFKRLGQKLRIKASGRISSVITTPGVAQFKVKFGATAVFDGLAVLLDTVAAHTNVGWMLEIEMTARALGTSANLMGQGMFTTEDLLGVPASAPKGVLSAILPWNSAPAVGSNFNSSVSQLVDLTFTQTVTSGSITLHTFELIACN*
Ga0180085_118658113300015259SoilTSQVDGTAVTAASPTSCIPAAAKYTLRPNFLERLGQKLYIKASGRVSSVITTPGTMRFDVRLGATVVFDGLAVLLDTAAAHTNVGWWLEMELTVRAIGTSANLMGQGFLTSRNLLGAPTAPAPALTAMLPWNSAPAVGNNFDSTASQQVDIFFTQTVATGSLTVHQYELIAAN*
Ga0184615_1002873023300018059Groundwater SedimentMSLQSWQETLITSQVDGAALTAAAAATCIPAAAKYTFPPNSFAIGKQYRVKASGRISSVITTPGTARYDLRLGGTVIFDSLAILLDTAVAATTVGWLLDILLTCRAIGATANFMGQGLWTSYNILGTPATPPKGALSAILPWNSAPAVGGNVDSTTSLQLDMFFTQTVATGSMTVHQYSVQALN
Ga0184615_1029345223300018059Groundwater SedimentMALGYVETLIASTENGAARVVAAAATMLPASAIRTLPANYFDFVGKQVLIKASGRISSVITTPGTARFDVRLGGTVVFDGLAILLDTVAGHTNVGWDLEILLSCRAIGATANFFGIGKWTCEDILGVPATAPKGVLVAILPWNAAPAVGGNFDSTISQAVDVQFTQTVASGSLTCHTYGLYGLN
Ga0184636_100230133300018068Groundwater SedimentMSLQTWQETLIVAPGDGPVLTAAAAAICLPSTALVTLYPNFWAVGRTMRIKASGRISSIITTPGTARFDVRMTSSVVFDSLAILLDTVAAHANVGWLLEIILTCRAIGPTGNLFGQGKWICEDILGVPATAPKGVLSAILPWNSAPAVGGNFDTTVAHILNLYFTQTEATGSLTLHQFIAEMLN
Ga0184636_107829513300018068Groundwater SedimentGPALTAAAAARCIPATGLVTLPPNFWAVGRAMRIKASGKITSVITTPGTARFDVRMTASVVFDSLAILLDSVAAHTNVGWLLEIILTCRAIGATGNLFGQGKWTCEDILGVPATPPKGVLSAILPWNSAPAVGGNFDTTISHILDLYFTQTEATGSLTLNQFLVEMLN
Ga0184631_1000613943300018070Groundwater SedimentMSLQTWQETLIVAPGDGPVLTAAAAAICLPSTALVTLYPNFWAVGRTMRIKASGRISSIITTPGTARFDVRMTSSVVFDSLAILLDTVAAHANVGWLLEIILTCRAIGPTGNLFGQGKWICEDILGVPATAPKGVLSAILPWNSVPAVGGNFDTTVAHILNLYFTQTEATGSLTLHQFIAEMLN
Ga0184631_1002183933300018070Groundwater SedimentMSLQTWQETLIVAPGDGPALTAAAAARCIPATALITLPPNFWAVGRAMRIKASGKITSVITTPGTARFDVRMTASVVFDSLAILLDSVAAHTNVGWLLEIILTCRAIGATGNLFGQGKWTCEDILGVPATAPKGVLSAILPWNSAPAVGGNFDTTISHILDLYFTQTAATGSLTLNQFLVEMLN
Ga0184629_1000252613300018084Groundwater SedimentVHMSAQGWNETLITATTDGATLTAAAAATALPAQARVTLKPNFFYYIGQQMIVKAAGRISSVITTPGTARFDIRFGASIVFDGLAVLLDTVAGHTTEHWHLEILMTCRAIGATANLFGQGRWECQDILGTPAGTPRGGIVAMLPWNTAPAVGANFDSTVSQQVDMFFTQTVATGSMTVHQFSVQAMN
Ga0184629_1000312563300018084Groundwater SedimentMIGWPETLIVQQGDGPTLSAAAAATCLSAQGKKTLPANFFNYIGQTLRIEAAGRISSIITTPGTARFDIRLGGSVIFDSAAILLDTVAAHTTRQWRLVIDMTVRVVGSAGNFFGHGLWTCEDILGTPATAPKGSLSAVLPWNTTPAVGGNADFTTAGVLDMFFTQTEATGSMTVHQYSAAAPNGY
Ga0066669_1004997523300018482Grasslands SoilMSLQSWQETLIASQVDGTTLTAAAAASMIPAAAKYTLPANFFSIGKQLLVKASGRISSLITTPGTARFDIRFGATVVFDSQAILLDTVAAHTTVGWYLELLLTCRAIGASANLMGQGFWMCEDILGVPATAPKGVLSAILPWNTTPAVGTSFDSTASQQVDVFFTQTVATGSLTCHQYSLISLN
Ga0184648_144459213300019249Groundwater SedimentMSAQGWNETLITATTDGATLTAAAAATALPAQARVTLKPNFFYYIGQQMIVKAAGRISSVITTPGTARFDIRFGASIVFDGLAVLLDTVAGHTTEHWHLEILMTCRAIGATANLFGQGRWECQDILGTPAGTPRGGIVAMLPWNTAPAVGANFDSTVSQQVDMFFTQTVATGSMTVHQFSVQAMN
Ga0193752_101740223300020027SoilMSQMSWTERLITAQSDGTAVTAAAATTLLPASAKFTLPPNYFKFLGQQLEVRASGRISSVITTPGTARFQLLLGAAIVFDSQAILLDSVAAHTNVHWQLELLLTLRVLGATANLMGSGRFISEDILGVPATAPKGALIALLPWNAAPAVGANFDSGASQVVDLSFTQTAATGSIQLHQYSLIAPN
Ga0210403_1011302123300020580SoilMSRQGWQEVLVAAQVDGPAITATGAASCLPTHAKISLPAGYFSEAGKQLIIRASGRITSVITTPGTARYDVRLGGTIVFDSQAILLDTAAAHTAVGWLLEITMTCRAVGNATNANLMGQGTWTCEDILGVPAIAPKGVLTAILPWNAAPAVGGGFDSTIAQSIDLFFTQTVATGSMTLHQYSVISPN
Ga0210377_10004931143300021090Groundwater SedimentMSLQSWGETLITTSIDGTAVTAAAATTLLPAAGKYTLPANFFAVGKQLIVKASGRISSLITTPGTARFDLRFGGTVVFDGLAILLDTVAGHTNVGWYLEVMLTCRAIGASANLMGQGTWACEDLLGVPATAPKGVLSAILPWNSAPAVGANFDSTTTQQVDLFFTQTVATGSVQLHQYSLISPN
Ga0210377_1007135413300021090Groundwater SedimentMTVYESLIESQIDGPTLAAAAAATCLPAAAKRTLPANFFDKIGKKLRITASGRISSVITTPGTARFDIRLGGTVVFDGLAILLDTVAAHTNVGWQLDIEMTIRAIGTAANFFSIGKWTCEDILGVPATAPKGVLTAILPWNTAPAVGGNFSSIVSQELDMFFTQTVATGSLTVHTFEAACPNSSV
Ga0210408_100002131743300021178SoilMSRQGWQEVLVAAQVDGPAITATGAASCLPTHAKISLPAGYFSEAGKQLIIRASGRITSVITTPGTARYDVRLGGTIVFDSQAILLDTVAAHTAVGWLLEITMTCRAVGNATNANLMGQGTWTCEDILGVPAIAPKGVLTAILPWNAAPAVGGGFDSTIAQSIDLFFTQTVATGSMTLHQYSVISPN
Ga0193699_1000600823300021363SoilMASGYVETLINAQIDGPALTAAAAATCLAGASKKTLPANYFDFIGKQLLIKATGRITSLITTPGTARFDIRLGGTVVFDSLAILLDTVAAHTNVGWTLEILLTARAIGATGNLMGQGLWTCEDILGVPATAPKGVLSAILPWNSAPAVGGNFDTTTSQQLDMFFTQTAATGSMTVHQFAAYGVN
Ga0210402_1009948223300021478SoilMSRQGWQETLITAEVDGPTLTAAAAASCLPPAAVFTLPANLWDIGRQMRINATGRISSVITTPGTARFDIRIAGGVVFDSLAILLDTVAAHTTVGWKLEILLTCRAIGATGNLFGQGIWTCEDILGVPATAPKGVLTAMLPWNSAPAVGANFSTTPAATLDMFFTQTVATGSLTLHQYSVEMLN
Ga0224500_1000170493300022213SedimentMSLGYAETLVTSQADGTAVTAAAATTLLPTGAVFTLPANFFKAAGQQLLIKASGRISSVITTPGTARFDVRFGATVVWDGLAILLDSVAAHTNVGWSLEVLLTCRAIGTSANLIGQGKWTCEDILGTPATAPKGSLSAVLPWNSAPAVGSNFDSTVSQAVNLFFTQTVATGSCLCHQYSLISLN
Ga0228698_1000298513300022745FreshwaterMGVQTFPQVLAVGKVDGPTLTAAAAASCIPTPEKLTLPNNYWYIGRTWRLTMHGRISTVITTPGTARFDLRIGAVVAWDSQAILLDTVAGHTTMPWVLEVLLTCRAVGTGTSTQLFGVGKWSCEDILGTPATSPKGSLVAMLPWNAAPALGTGFDNTALNTVDVFFTQTAATGSLTVHQYLLEELT
Ga0247673_1000006483300024224SoilMSVGYRKTLVSAQGDGPTLTAAAAATCLPGQAKVLIPGGALDAPGKLLHIKASGRISSVITTPGTARFDVRLGGTVMFDSLAILLDSVAAHTNVGWMLDILLTMRTPGTAATLFGQGTFTCEDILGVPASAPKGVLSAILPWNSAPAVSGNTFDATAANILDLFFTQTVATGSLTLHQYCAELVSFE
Ga0209824_1019972513300025173WastewaterMSLGYWETLISCQVDGAAVTAAAATSLLPAQAKKTIPANYFDVIGKKLIITATGRISSVVTTPGTARFDVRLGSIVVFDGLAILLDSVAAHTTVGWDLYIDLTCRAVGATANLFGYGRWTCEDILGVPATAPKGVLSAVLPWNATPAVGANFDSTVSAVVDLFFTQTVATGSCTVHQFNVYSPN
Ga0209638_100911523300025725Arctic Peat SoilMLNGQTFDEIVIVSQVDGAALTAAAAASMIPAAAKFTLPPNWCQIGRQILLRAAGKISSVITTPGTARFDVRLGTDVVFDGLAILLDSVAARTDAAWWLEILLTCRGIGAAATFMGHGRWSCEDILGVPATAPKGALTAMLPWNSAPAVGASFDSTVSQVIDVFFTQTAATGSLTCQQYSVSLLN
Ga0207646_1016749823300025922Corn, Switchgrass And Miscanthus RhizosphereMSFNSWHQTLVTAQGDGTALTAAAAASCLPAHAKFPLPGCFFDTVGKQLMIRASGRISSLITTPGTARFDVRFGSTVVADGLAVLLDTVAAHTNVGWWLKMLLTCRAVGASASLMQSGIWVCEDILGVPATAPKGSLTALLPWNSAPTVGATFDSTAAQIVDLFFTQTVATGSLTLHQYSLISLN
Ga0207704_1011413513300025938Miscanthus RhizosphereKTLVSSQGDGPTLTAAAAATCLPGQAKFTIPAGTFDAPGKILHIWASGRISSLITTPGTARYDVRLGGTVMFDSLAILLDSVAAHTTVGWTLDITMTMRTAGSSATFWGQGTWTSEDILGVPATAPKGVLSAILPWNSTPAVSGNTFDANASQVLDLFFTQTAATGSMTVHQYMAEIVSF
Ga0207648_1040730923300026089Miscanthus RhizosphereMALGYRKTLVSSQGDGPTLTAAAAATCLPGQAKFTIPAGTFDAPGKILHIWASGRISSLITTPGTARYDVRLGGTVMFDSLAILLDSVAAHTTVGWTLDITMTMRTAGSSATFWGQGTWTSEDILGVPATAPKGVLSAILPWNSTPAVSGNTFDANASQVLDLFFTQTAATGSMTVHQYMAEIVSFE
Ga0207674_1026063723300026116Corn RhizosphereMGAQRWSQPLIVQSVTDGPTLTAAAAASCIPTPNKITLPNNYWEVGRQWRVSLHGRISTVVTTPGTARFDLRIGAVVAFDSLAIALDAVAGYTTVPWWLEIMLTCRAAGAGTVTQLLGVGKWTSPNIAGAIAGAPKGSLTALLPWNAAPALGTGFDNTLANTVDMFFTQTVATGSLTVHGYQIEELN
Ga0209805_111718113300026542SoilMSLQTWQETLVSSQVDGPVLTAAAAASCIPPAAKITLPNNYFYIGRVLRITATGRISSVITTPGTARFDVRIGGVIAFDSLAILLDTVAGHTNVGWWLDLFLTCRAIGNGTLTTLMGQGSWTCEDILGVPATAPKGVLTAMLPWNSAPAVGTGFDNTLANTLDMFFTQTAATGSLTVHQYIVSSLN
Ga0209898_101945013300027068Groundwater SandMSQQTWGEVLVNGSVDGPTLTAAAAASCIPTPSRIILPNNYFYIGRMLRITASGRISSVVTTPGTARFDVRMGPSGTIIVFDGLAILLDTVAAHTTVGWFLEILLTCRAVGSGTATNFMGQGKWTCEDILGVPATAPKGVLTAILPWNTAPAVGTGFDNTAANAVDMFFTQTVATGSMTVHQYMVESLN
Ga0208685_103735623300027513SoilMSVGYWETLVEGQVDGPTLTAAARASCIPPAAKFTLRPNFFDRIGKKLFIRAQGRISSLITTPGTARYDIVFGAALMFDSLAILLDTVAGHTTVPWWLEIEMTCRAIGATGNLFTQGRWSCEDILGVPATAPKGVLTAMLPWNVAPAVGANFDTTASQTIDMFFTQTVATGQMILHQYTVISPN
Ga0208454_1000118853300027573SoilMALGYVETLITSSENGTAVTAAAATTLLPASAVKTLPANYFDFVGKQVLVKASGRISSLITTPGTARFDIRLGGIVVFDGLAVLLDTVAAHTNVGWHLEVLLTCRAIGATGNFFGQGQWTCEDILGVPATAPKGVLTAVLPWNSAPAVGGNVDTTTSLALNLFFTQTVATGSCQLHQFGAYGMN
Ga0209117_102433913300027645Forest SoilAAASCIPAAARYTIPANFFDIGRALRIAATGRISSVIATPGTARFDVRFAGVVVFDGLAILLDTVAAHVNVGWLLEIRLTCRSIGNGVQATLMGEGKWICEDILGTPAVAPKGSLCAVLPWNTPPNVGGGFDSTIANPVDIFFTQTVATGSLTVHQYYLESLN
Ga0208980_1054750013300027887WetlandRGNREIRRMALYTYRETVYTSLGDGSALTAAAAATMLPNTSSRFTFPAGFFHYVGQSMRIVASGRISSVITTPGTARYDIRLGGTVVFDGLAVLLDTVAAHTNVGWMLEIEMTVRAVGTAGNFMGQGKWTCEDILGVPASAPKGVLSAILPWNSAPAVGSNVNFESALALDSFFTQTVATGSMTCHQFKVISEN
Ga0247051_104697923300029901CryconiteMFITQPQVITPPTLVDGPVLTLGTAASCIPTNSKITLPNNYWYPGRKWRMTSWGRVSSVITTPGTARFDLRIGGVIAWDSQAVLLDTVAAHTTVAWKLVVELTCRSVGSGTATTLVGSGMWISEDILGVPATAPKGVLTAMLPWNTTPGVPGTGFDNTLATNTVDLFFTQTAATGSLTVHDYMIEEMN
Ga0307374_1016421113300031670SoilAAATSLLPGAAKYTIPANFFEIGKALRLTAAGKISSVVTTPGTARFDVRLGGTVVFDGLAALLDSVAAHTNVGWWLEILLTCRAIGASANLMGHGKWTSEDLLGVPASAPKGVLTAMLPWNSAPAVGSNFDSTTSQQLDLYFTQTVATGSVTLQQFALEALN
Ga0307372_1034904513300031671SoilMSLQTWQETLISAQVDGGAVTAAAATSLLPGAAKYTIPANFFEIGKALRLTAAGKISSVVTTPGTARFDVRLGGTVVFDGLAALLDSVAAHTNVGWWLEILLTCRAIGASANLMGHGKWTSEDLLGVPASAPKGVLTAMLPWNSAPAVGSNFDST
Ga0307373_1002581963300031672SoilMSLQTWQETLISAQVDGGAVTAAAATSLLPGAAKYTIPANFFEIGKALRLTAAGKISSVVTTPGTARFDVRLGGTVVFDGLAALLDSVAAHTNVGWWLEILLTCRAIGASANLMGHGKWTSEDLLGVPASAPKGVLTAMLPWNSAPAVGSNFDSTTSQQLDLYFTQTVATGSVTLQQFALEALN
Ga0315291_1004660943300031707SedimentMSLQTWQETLVNSSVDGPVLTAAAAATAIPTAAKITLPNNYFYVGRMLRVTASGRISSVITTPGTARFDVRIGAVVAFDSQAILLDSVAAHTNVGWKLDLLLTCRSIGSGVAATLLGQGTWTCEDILGVPATAPKGVVSAILPWNAAPAVGTGFDSTAANTLDLFFTQTAATGSLTVHQYMCESLN
Ga0315291_1045354123300031707SedimentMSLQTWQETLIVAPGDGPALTAAAAARCIPATALVTLPPNFWAVGRAMRIKASGRISSAITTPGTARFDVRMTSTVVFDSLAILLDSVAAHTNVGWLLEIMLTCRAIGATGNLFGQGRLICEDILGVPATAPKGVLVAILPWNSAPAVGGNFDTTISHVLDLYFTQTEATGSLTLHQFLVEMLN
Ga0315288_1009442713300031772SedimentMSLQTWQETLIVAPGDGPALTAAAAARCIPATALVTLPPNFWAVGRAMRIKASGRISSAITTPGTARFDVRMTSTVVFDSLAILLDSVAAHTNVGWLLEIMLTCRAIGATGNLFGQGRLICEDILGVPATAPKGVLVAILPWNSAPAVGGNFDTTISHVLDLY
Ga0315297_1080821513300031873SedimentMSLQTWQETLVNSSVDGPVLTAAAAATAIPTAAKITLPNNYFYVGRMLRVTATGRISSVITTPGTARFDLRIGAVVAFDSQAILLDTVAAHVNVGWKLDLLLTCRSIGSGTAATLLGQGVWTCEDILGVPATAPKGVVSAILPWNAAPAVGTGFDSTAANTLDLFFTQTAATGSLTVHQYMCESLN
Ga0315285_1014442633300031885SedimentMSLQTWQETLVNSSVDGPVLTAAAAATAIPTAAKITLPNNYFYVGRMLRVTASGRISSVITTPGTARFDVRIGAVVAFDSQAILLDSVAAHTNVGWKLDLLLTCRSIGSGVAATLLGQGTWTCEDILGVPATAPKGVLSAILPWNAAPAVGTGFDSTAANTLDLFFTQTAATGSLTVHQYMCESLN
Ga0315278_1060074423300031997SedimentMPIQSWSQTLVVAQGDGTALTAAATASALPAAAKFTLPANFFDTIGKSLRVQASGRVSTVITTPGTFQWQVKFGATAVFDSLSILPDTVAGHTNVGWLLDILLVCRVVGTSAALFGQGTFTSEDILGVPATAPKGVLSAVLPWNSAPASGTTFDSTASQQVDLAFTQTAATGSLTLHNYLLTSLN
Ga0315274_1129899623300031999SedimentMSLQTWQETLVVAQVDGTALTAAAAASCIPAAAKYTLPANFFGTVGKTIRITASGRITSLITTPGTARFDVRFGAAVVADSLAILLDTVAGHTNVGWTLQWLLTCRAIGTAGNLFVQGQWASEDILGSIAAPPKGQLVAQLPWNAAPAVGANFDTTVSQVVDMFFTQ
Ga0315284_1006680793300032053SedimentMSLQTWQETLVNSSVDGPVLTAAAAATSIPTAAKITLPNNYFYVGRMLRVTASGRISSVITTPGTARFDVRIGAVVAFDSQAILLDSVAAHTNVGWKLDLLLTCRSIGSGVAATLLGQGTWTCEDILGVPATAPKGVVSAILPWNAAPAVGTGFDSTAANTLDLFFTQTAATGSLTVHQYMCESLN
Ga0315277_1075181823300032118SedimentMSLQTWQETLVNSSVDGPVLTAAAAATAIPTAAKITLPNNYFYVGRMLRVTATGRISSVITTPGTARFDLRIGAVVAFDSQAILLDSVAAHTNVGWKLDLLLTCRSIGSGVAATLLGQGTWTCEDILGVPATAPKGVLSAILPWNAAPAVGTG
Ga0315295_1000834853300032156SedimentMSLQTWQETLIVAPGDGPALTAAAAARCIPATALLTLPPNFWAVGRAMRIKASGRISSAITTPGTARFDVRMTSTVVFDSLAILLDSVAAHTNVGWLLEIMLTCRAIGATGNLFGQGRLICEDILGVPATAPKGVLVAILPWNSAPAVGGNFDTTISHVLDLYFTQTEATGSLTLHQFLVEMLN
Ga0315283_1215777413300032164SedimentMSLQTWQETLIVAPGDGPALTAAAAARCIPATALLTLPPNFWAVGRAMRIKASGRISSAITTPGTARFDVRMTSTVVFDSLAILLDSVAAHTNVGWLLEIMLTCRAIGATGNLFGQGRLICEDILGVPATAPKGVLVAILPWNSAPAVGGNFDTTISH
Ga0315273_10011663133300032516SedimentMSLQTWQETLVVAQVDGTALTAAAAASCIPAAAKYTLPANFFGTVGKTIRITASGRITSLITTPGTARFDVRFGAAVVADSLAILLDTVAGHTNVGWTLQWLLTCRAIGTAGNLFVQGQWASEDILGSIAAPPKGQLVAQLPWNAAPAVGANFDTTVSQVVDMFFTQTVATGSMTLHQYMIEALN
Ga0315273_1009870723300032516SedimentMPAYFETFITSQSDGSTLTAAATATMLPAAALKTLPANFFDVIGKQLLIKAAGRISSLITTPGTARFDVRMGGTVVFDSLAILLDTVAGHTTVGWELEILLTCRAIGTSANFHGVGKWTSEDILGVPATAPKGVLVAMLPWNTTPAVGANFDSTVSQQVDVRFTQTVGTGSLTCHQYGLYCPNSN




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice