NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F053199



Overview

Basic Information
Family ID F053199
Family Type Metagenome / Metatranscriptome
Number of Sequences 141
Average Sequence Length 111 residues
Representative Sequence VRKLAPILVVMTLTATRLHAQYARRYEVGLFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNP
Number of Associated Samples 116
Number of Associated Scaffolds 141

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 81.43 %
% of genes near scaffold ends (potentially truncated) 97.87 %
% of genes from short scaffolds (< 2000 bps) 84.40 %
Associated GOLD sequencing projects 110
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.51

Hidden Markov Model logo (rendered with Skylign)

Most Common Taxonomy
Group Bacteria (99.291 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (29.078 % of family members)
Environment Ontology (ENVO) Unclassified (41.135 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (46.809 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 10.34%, β-sheet: 31.72%, Coil/Unstructured: 57.93%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.51

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
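
The cutoff stated in the note above can be written as a one-line check. This is a minimal sketch that assumes only the pTM < 0.7 threshold given on this page; the names `LOW_CONFIDENCE_PTM_CUTOFF` and `is_low_confidence` are hypothetical and are not part of any NMPFamsDB interface.

```python
# Hypothetical helper: only the 0.7 cutoff is taken from the page.
LOW_CONFIDENCE_PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float) -> bool:
    """Return True when a model's pTM-score is below the cutoff."""
    return ptm_score < LOW_CONFIDENCE_PTM_CUTOFF


# pTM-score reported for family F053199
print(is_low_confidence(0.51))
```

With the reported score of 0.51, the check classifies this family's model as low confidence, matching the note.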



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name       % Frequency in 141 Family Scaffolds
PF00691  OmpA       43.26
PF02412  TSP_3      32.62
PF13505  OMP_b-brl  9.22
PF00015  MCPsignal  2.84
PF13185  GAF_2      2.13
PF00365  PFK        0.71
PF00440  TetR_N     0.71
PF10262  Rdx        0.71
PF03061  4HBT       0.71

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name                                       Functional Category                        % Frequency in 141 Family Scaffolds
COG0840  Methyl-accepting chemotaxis protein (MCP)  Signal transduction mechanisms [T]         5.67
COG0205  6-phosphofructokinase                      Carbohydrate transport and metabolism [G]  0.71



Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
All Organisms  root  All Organisms  99.29 %
Unclassified   root  N/A            0.71 %


Associated Scaffolds


Scaffold  Taxonomy  Length (bp)
3300002557|JGI25381J37097_1061312  All Organisms → cellular organisms → Bacteria → FCB group  604
3300002560|JGI25383J37093_10014787  All Organisms → cellular organisms → Bacteria  2587
3300002561|JGI25384J37096_10026821  All Organisms → cellular organisms → Bacteria  2246
3300002908|JGI25382J43887_10177154  All Organisms → cellular organisms → Bacteria  1052
3300002911|JGI25390J43892_10066973  All Organisms → cellular organisms → Bacteria  831
3300002916|JGI25389J43894_1050936  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  700
3300004268|Ga0066398_10084539  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  710
3300005171|Ga0066677_10228915  All Organisms → cellular organisms → Bacteria  1049
3300005177|Ga0066690_10508705  All Organisms → cellular organisms → Bacteria  810
3300005180|Ga0066685_10407562  All Organisms → cellular organisms → Bacteria  944
3300005180|Ga0066685_10411459  All Organisms → cellular organisms → Bacteria  939
3300005334|Ga0068869_100558762  All Organisms → cellular organisms → Bacteria  962
3300005444|Ga0070694_100501975  All Organisms → cellular organisms → Bacteria  965
3300005446|Ga0066686_10101360  All Organisms → cellular organisms → Bacteria  1854
3300005446|Ga0066686_10392458  All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria  948
3300005518|Ga0070699_100251036  All Organisms → cellular organisms → Bacteria  1580
3300005536|Ga0070697_100828042  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  819
3300005536|Ga0070697_101398080  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  625
3300005540|Ga0066697_10731969  All Organisms → cellular organisms → Bacteria → FCB group  540
3300005546|Ga0070696_100923475  All Organisms → cellular organisms → Bacteria  725
3300005553|Ga0066695_10303287  All Organisms → cellular organisms → Bacteria  1007
3300005556|Ga0066707_10133485  All Organisms → cellular organisms → Bacteria  1559
3300005557|Ga0066704_10204158  All Organisms → cellular organisms → Bacteria  1337
3300005568|Ga0066703_10758416  All Organisms → cellular organisms → Bacteria → FCB group  556
3300005576|Ga0066708_10486130  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  795
3300006034|Ga0066656_10077658  All Organisms → cellular organisms → Bacteria  1976
3300006755|Ga0079222_12336603  All Organisms → cellular organisms → Bacteria  534
3300006797|Ga0066659_10224232  All Organisms → cellular organisms → Bacteria  1391
3300006800|Ga0066660_10040998  All Organisms → cellular organisms → Bacteria  2945
3300006844|Ga0075428_102324078  All Organisms → cellular organisms → Bacteria  551
3300006847|Ga0075431_100159832  All Organisms → cellular organisms → Bacteria → Proteobacteria  2317
3300006852|Ga0075433_11724035  All Organisms → cellular organisms → Bacteria → FCB group  539
3300006852|Ga0075433_11882183  All Organisms → cellular organisms → Bacteria → FCB group  513
3300006854|Ga0075425_103119347  All Organisms → cellular organisms → Bacteria → FCB group  505
3300007076|Ga0075435_101278792  All Organisms → cellular organisms → Bacteria → FCB group  642
3300007076|Ga0075435_101999079  All Organisms → cellular organisms → Bacteria → FCB group  509
3300007788|Ga0099795_10308515  All Organisms → cellular organisms → Bacteria → FCB group  698
3300009012|Ga0066710_100000894  All Organisms → cellular organisms → Bacteria  20354
3300009088|Ga0099830_11156430  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  642
3300009137|Ga0066709_101244179  All Organisms → cellular organisms → Bacteria  1095
3300009137|Ga0066709_102811500  All Organisms → cellular organisms → Bacteria → FCB group  645
3300009162|Ga0075423_10637128  All Organisms → cellular organisms → Bacteria  1124
3300009162|Ga0075423_11897003  All Organisms → cellular organisms → Bacteria → FCB group  644
3300010136|Ga0127447_1090553  All Organisms → cellular organisms → Bacteria → FCB group  528
3300010301|Ga0134070_10054282  All Organisms → cellular organisms → Bacteria  1353
3300010304|Ga0134088_10656197  All Organisms → cellular organisms → Bacteria → FCB group  524
3300010323|Ga0134086_10220225  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  714
3300010323|Ga0134086_10438125  All Organisms → cellular organisms → Bacteria → FCB group  531
3300010325|Ga0134064_10411445  All Organisms → cellular organisms → Bacteria → FCB group  542
3300010329|Ga0134111_10461899  All Organisms → cellular organisms → Bacteria → FCB group  552
3300010333|Ga0134080_10203302  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  859
3300011269|Ga0137392_10476136  All Organisms → cellular organisms → Bacteria  1037
3300012198|Ga0137364_10001362  All Organisms → cellular organisms → Bacteria  11400
3300012202|Ga0137363_11146093  All Organisms → cellular organisms → Bacteria  661
3300012203|Ga0137399_10072668  All Organisms → cellular organisms → Bacteria  2584
3300012203|Ga0137399_10384271  All Organisms → cellular organisms → Bacteria  1169
3300012203|Ga0137399_10663693  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  877
3300012206|Ga0137380_11731358  All Organisms → cellular organisms → Bacteria  509
3300012207|Ga0137381_10013236  All Organisms → cellular organisms → Bacteria  6372
3300012207|Ga0137381_10143035  All Organisms → cellular organisms → Bacteria  2054
3300012207|Ga0137381_10412904  All Organisms → cellular organisms → Bacteria  1178
3300012207|Ga0137381_11680504  All Organisms → cellular organisms → Bacteria → FCB group  525
3300012208|Ga0137376_10054158  All Organisms → cellular organisms → Bacteria  3292
3300012210|Ga0137378_10540614  All Organisms → cellular organisms → Bacteria  1073
3300012210|Ga0137378_11061002  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  724
3300012211|Ga0137377_11657676  All Organisms → cellular organisms → Bacteria → FCB group  562
3300012285|Ga0137370_10010387  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  4424
3300012349|Ga0137387_10227717  All Organisms → cellular organisms → Bacteria  1341
3300012351|Ga0137386_10383263  All Organisms → cellular organisms → Bacteria  1012
3300012358|Ga0137368_10409440  All Organisms → cellular organisms → Bacteria  889
3300012362|Ga0137361_10808374  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  853
3300012392|Ga0134043_1178653  All Organisms → cellular organisms → Bacteria  1131
3300012395|Ga0134044_1147206  All Organisms → cellular organisms → Bacteria → FCB group  538
3300012399|Ga0134061_1156910  All Organisms → cellular organisms → Bacteria  1020
3300012406|Ga0134053_1132389  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  714
3300012407|Ga0134050_1039038  All Organisms → cellular organisms → Bacteria  1006
3300012409|Ga0134045_1200907  All Organisms → cellular organisms → Bacteria  581
3300012683|Ga0137398_10408069  All Organisms → cellular organisms → Bacteria  925
3300012685|Ga0137397_10678419  All Organisms → cellular organisms → Bacteria → FCB group  766
3300012907|Ga0157283_10324434  All Organisms → cellular organisms → Bacteria → FCB group  544
3300012918|Ga0137396_10980098  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  614
3300012918|Ga0137396_11099845  All Organisms → cellular organisms → Bacteria → FCB group  568
3300012923|Ga0137359_11343294  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  603
3300012924|Ga0137413_10661927  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  788
3300012925|Ga0137419_10051268  All Organisms → cellular organisms → Bacteria  2663
3300012925|Ga0137419_10940814  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  713
3300012927|Ga0137416_11653965  All Organisms → cellular organisms → Bacteria → FCB group  583
3300012972|Ga0134077_10040759  All Organisms → cellular organisms → Bacteria  1686
3300014157|Ga0134078_10014249  All Organisms → cellular organisms → Bacteria  2400
3300014166|Ga0134079_10098361  All Organisms → cellular organisms → Bacteria  1113
3300014166|Ga0134079_10119015  All Organisms → cellular organisms → Bacteria  1031
3300015054|Ga0137420_1436338  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  693
3300015256|Ga0180073_1017053  All Organisms → cellular organisms → Bacteria  1298
3300015359|Ga0134085_10013997  All Organisms → cellular organisms → Bacteria  2990
3300017654|Ga0134069_1257775  All Organisms → cellular organisms → Bacteria  608
3300017656|Ga0134112_10363440  All Organisms → cellular organisms → Bacteria  592
3300018028|Ga0184608_10264946  All Organisms → cellular organisms → Bacteria → FCB group  756
3300018031|Ga0184634_10086636  All Organisms → cellular organisms → Bacteria  1352
3300018063|Ga0184637_10392078  All Organisms → cellular organisms → Bacteria  828
3300018076|Ga0184609_10059427  All Organisms → cellular organisms → Bacteria  1649
3300018076|Ga0184609_10145708  All Organisms → cellular organisms → Bacteria  1088
3300018468|Ga0066662_11229235  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  761
3300018482|Ga0066669_10798418  All Organisms → cellular organisms → Bacteria  836
3300018482|Ga0066669_11601000  All Organisms → cellular organisms → Bacteria  595
3300019259|Ga0184646_1294133  All Organisms → cellular organisms → Bacteria  999
3300019362|Ga0173479_10739246  All Organisms → cellular organisms → Bacteria → FCB group  536
3300019789|Ga0137408_1131567  All Organisms → cellular organisms → Bacteria  1400
3300020170|Ga0179594_10035731  All Organisms → cellular organisms → Bacteria  1613
3300021073|Ga0210378_10030491  All Organisms → cellular organisms → Bacteria  2153
3300021080|Ga0210382_10407862  All Organisms → cellular organisms → Bacteria → FCB group  601
3300021344|Ga0193719_10005112  All Organisms → cellular organisms → Bacteria  5358
3300022756|Ga0222622_10613746  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  786
3300025318|Ga0209519_10301887  All Organisms → cellular organisms → Bacteria  929
3300026296|Ga0209235_1121345  All Organisms → cellular organisms → Bacteria  1096
3300026301|Ga0209238_1155257  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  691
3300026304|Ga0209240_1073882  All Organisms → cellular organisms → Bacteria  1265
3300026304|Ga0209240_1225772  All Organisms → cellular organisms → Bacteria  569
3300026313|Ga0209761_1006788  All Organisms → cellular organisms → Bacteria  7528
3300026329|Ga0209375_1209636  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  727
3300026333|Ga0209158_1327178  All Organisms → cellular organisms → Bacteria → FCB group  530
3300026528|Ga0209378_1003806  All Organisms → cellular organisms → Bacteria  10086
3300026528|Ga0209378_1115057  All Organisms → cellular organisms → Bacteria  1165
3300026532|Ga0209160_1112779  All Organisms → cellular organisms → Bacteria  1353
3300027643|Ga0209076_1038308  All Organisms → cellular organisms → Bacteria  1347
3300027643|Ga0209076_1043841  All Organisms → cellular organisms → Bacteria  1262
3300027671|Ga0209588_1020530  All Organisms → cellular organisms → Bacteria  2069
3300027765|Ga0209073_10475758  All Organisms → cellular organisms → Bacteria  523
3300027882|Ga0209590_10014685  All Organisms → cellular organisms → Bacteria  3799
3300027903|Ga0209488_10045456  All Organisms → cellular organisms → Bacteria  3228
3300027903|Ga0209488_10312069  All Organisms → cellular organisms → Bacteria  1173
3300027903|Ga0209488_11095733  All Organisms → cellular organisms → Bacteria → FCB group  544
3300028380|Ga0268265_12296147  All Organisms → cellular organisms → Bacteria → FCB group  546
3300028771|Ga0307320_10351488  All Organisms → cellular organisms → Bacteria → FCB group  589
3300028878|Ga0307278_10342650  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  659
3300030990|Ga0308178_1031411  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  908
3300031114|Ga0308187_10098978  All Organisms → cellular organisms → Bacteria  902
3300031720|Ga0307469_11581599  All Organisms → cellular organisms → Bacteria → FCB group  629
3300033813|Ga0364928_0098917  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  685
3300033813|Ga0364928_0193147  All Organisms → cellular organisms → Bacteria → FCB group  510
3300034164|Ga0364940_0123746  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  737
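
Each scaffold identifier in this table joins two parts with a `|`: the leading number matches a Taxon OID in the Associated Samples table, and the remainder is the scaffold name. The following is a minimal Python sketch of splitting identifiers and computing the share of scaffolds below the 2000 bp threshold named in the Quality Assessment. The names `Scaffold`, `parse_scaffold`, and `percent_short` are hypothetical, the three rows are copied from the table only as sample input, and the sketch counts scaffolds where the page reports a per-gene percentage.

```python
from typing import Iterable, NamedTuple


class Scaffold(NamedTuple):
    taxon_oid: str  # part of the identifier before the "|"
    name: str       # part of the identifier after the "|"
    length: int     # scaffold length in bp


def parse_scaffold(identifier: str, length: int) -> Scaffold:
    """Split a 'TaxonOID|ScaffoldName' identifier into its two parts."""
    taxon_oid, name = identifier.split("|", 1)
    return Scaffold(taxon_oid, name, length)


def percent_short(scaffolds: Iterable[Scaffold], threshold: int = 2000) -> float:
    """Percentage of scaffolds strictly shorter than `threshold` bp."""
    items = list(scaffolds)
    short = sum(1 for s in items if s.length < threshold)
    return 100.0 * short / len(items)


# Sample input: the first three rows of the table (illustrative subset only).
rows = [
    ("3300002557|JGI25381J37097_1061312", 604),
    ("3300002560|JGI25383J37093_10014787", 2587),
    ("3300002561|JGI25384J37096_10026821", 2246),
]
scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]

print(scaffolds[0].taxon_oid, scaffolds[0].name)
print(round(percent_short(scaffolds), 2))
```

Running the same functions over all rows gives a per-scaffold figure, which need not equal the per-gene percentage reported in the Quality Assessment.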




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  29.08%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  14.89%
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  14.18%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  12.06%
Populus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere  6.38%
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  4.96%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  3.55%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  3.55%
Groundwater Sediment  Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment  2.13%
Sediment  Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment  2.13%
Agricultural Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil  1.42%
Soil  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil  0.71%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  0.71%
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  0.71%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  0.71%
Soil  Environmental → Terrestrial → Soil → Loam → Grasslands → Soil  0.71%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil  0.71%
Switchgrass Rhizosphere  Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere  0.71%
Miscanthus Rhizosphere  Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere  0.71%




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300004268Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBioEnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010136Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012392Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012395Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012399Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012406Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012407Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012409Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012907Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S044-104R-1EnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015256Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT333_16_10DEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019259Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025318Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300030990Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031114Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M
3300034164Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map of sample locations; powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_106131223300002557Grasslands SoilVRTLAATVMVLALGTTSLSAQFERRYEVGLFGAFTRYDQNFGLQDKLGGGVRFAYALGPSASFEVEALFQSPQTPAPSTPIEPLIGSASVVFYALNASRMSAYVLGGYSLLDFGN
JGI25383J37093_1001478733300002560Grasslands SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQSPQNLPASAQIEPLIGSGSLL
JGI25384J37096_1002682113300002561Grasslands SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQSPQNLPASAQIEPLIGSGSLLLYALNASXM
JGI25382J43887_1017715413300002908Grasslands SoilMLMVATRSLSAQYDRRYEVGLFGAFTRYDKAFNLANKIGGGVRFAYAFTPMIDLEVEALFQSPQDVGTVHLEPLIGGGSLVVNALNAPRMSVYV
JGI25390J43892_1006697313300002911Grasslands SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQSPQNLPASAQIEPLIGSGSLLLYAL
JGI25389J43894_105093613300002916Grasslands SoilMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIDVEVEALFQSPQDVGTAHLEPLIGGGSLVVNALNAP
Ga0066398_1008453913300004268Tropical Forest SoilMRKLAAVFVILVLGGNGRLAAQYDRRYEVGLFGAYTKYDATFGLANKPGGGARFSYALTPMVGLEVEALFQSPQDITATSTTIEPMIGSGSLIVNALNKTRMTFFVLGGY
Ga0066677_1022891513300005171SoilLRKFALALLLVVGTTPLYAQYERRYEVGLFGAFTKYDKAFGLSDKPGGGVRFAYALGPAVSLEVEALFQSPQDFSSASIEPMIGSGSLLLYALNASRMSLYLIGGYSRLDFG
Ga0066690_1050870513300005177SoilVRKLAPIFVVMALTGTRLEAQYERRYEVGIFGAFTRYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGS
Ga0066685_1040756213300005180SoilVRKLAPILVVMTLTATRLNAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNAS
Ga0066685_1041145923300005180SoilVRKLAPILVVMTLTATRLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNAS
Ga0066678_1046794223300005181SoilVRRLAAAFIVLAFVGGHRLAAQYDRRYEVGLFGAFTKYDKTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHIEPLIGGG
Ga0068869_10055876213300005334Miscanthus RhizosphereLKAFGALAVMLLLGSATLPAQYSRRYEVGFFGGFTKYDQSFQLADKSGGGVRFAYAFAPLVAVEVEGLFQSPQDVGSVHVEPLIGSASLVVNPFNTDRMSLYVLGGYTR
Ga0070694_10050197523300005444Corn, Switchgrass And Miscanthus RhizosphereVRTLAAVGLAVVLGSSTLAAQYERRYEVGLFGAFTKYDKGFGLEDKIGGGVRFAYALGPALSLEVEALFQPPHNIPPSTELEPVIGGGSLVFNVMNRDRLSFYVLGGYSLLDFGNTNP
Ga0066686_1010136013300005446SoilVRKLAPILVVMTLTATRLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMINLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSV
Ga0066686_1039245813300005446SoilVRKLAAALTILALSGSTRLAAQFSRRYEVGLFGAYTKYDQTFGLTNKPGGGARFSYALSPLISLEVEALFQSPQDISSSTLEPMIGAGSLIVSPLNASRATF
Ga0070699_10025103613300005518Corn, Switchgrass And Miscanthus RhizosphereVRTFAAAGLALALGASTLTAQYERRYEVGAFAAFTKYDQVFGLADKIGGGVRFGYALGPALTLEVEALFQPPHHIPPSTEIEPVIAGGSLVFNALNRDRLAFYVLAGYS
Ga0070697_10082804213300005536Corn, Switchgrass And Miscanthus RhizosphereVRTIATIAMMMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIGVEVEALFQSPQDVGTVHLEPLIGGGSLVVNALNAPRMSVYV
Ga0070697_10139808023300005536Corn, Switchgrass And Miscanthus RhizosphereVRKLAPMLVVMALTATRLEAQYSRRYEVGLFGAFTKYDKAFGLADKPGGGVRFAYAFTPMIDLEVEALFQYPQDVGAAHIEPLIGSGSLVVNALNAQRMSIYVLGGY
Ga0066697_1073196913300005540SoilVRTLAPLFAIVALTATRLDAQYARRYEVGLFGAYTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNA
Ga0070696_10092347513300005546Corn, Switchgrass And Miscanthus RhizosphereLRTLGAFAALLILGSATLPAQYSRRYEVGLFGGFTKYDESFQLADKSGGGVRFAYAFTPLVALEVEGLFQSPQDVGSVHVEPLIGSASLVLNPFNKSRMSLYMLGGYTRLDFGNSSPYNFTDGGFHGGAGAKF
Ga0066695_1030328713300005553SoilVRRLAAAFIVLAFVGGHRLAAQYDRRYEVGLFGAFTKYDKTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHIEPLIGGGSVVVN
Ga0066707_1013348513300005556SoilVRKLAPILVLLALSANPLAAQYDRRYEVGLFGAFTKYDNTFSLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHLEPLIGGGSLVVNTLNASRMTVYLLGGYSRLDFGGTNPYRFTDGGVHG
Ga0066704_1020415813300005557SoilVRKLAPILVVMTLTATRLHAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPYRFTDGGFHGGA
Ga0066703_1075841623300005568SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQSPQNLPASAQIEPLIGSGSLLLYALNASRMSLYLIGGYSRLDFGGTSPYRFTD
Ga0066708_1048613013300005576SoilVRKLAPILVVMALTGTRLEAQYQRRYEVGIFGAFTRYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNARRMSVYVLGGYSRLDFGGTNPYRFTDGGFHGGAGAKFFMSS
Ga0066656_1007765823300006034SoilVRKLAPILVLLALSANPLAAQYDRRYEVGLFGAFTKYDNTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHLEPLIGGGSLVVNTLNASRMTVYLLGGYSRLDFGGTNPYRFTDGGVHG
Ga0079222_1233660313300006755Agricultural SoilVRKLAPILAILVALSSSLSAQYARRYEVGLFGAFTKYDNSFGLANKLGGGVRFAYAVTPMIGLEVEALFQAPQDVGTVHIEPMIGGG
Ga0066659_1022423213300006797SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQSPQNLPASAQIEPLIGSGSLLLYALNASRMSLYLIGGYSRLDFG
Ga0066660_1004099813300006800SoilVRKLAPIFVVMALTGTRLEAQYERRYEVGIFGAFTRYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNARRMSVYVL
Ga0075428_10232407823300006844Populus RhizosphereMVLVLGSTTLSAQYSRRYEVGLFGGFTKYDDAFQLGNKSGGGVRFAYSFTPLIAVEVEGLFQSPQDVSSVHVEPLIGSASLVFNALNKSRMSAY
Ga0075431_10015983233300006847Populus RhizosphereVRTIAAGLVVLALGGGTATLSAQYDRRYEVGMFGGFTKYDKSFGLADKIGGGVRFAYALTPMLGLEVEGLFQSPQDIGSVHMEPLVGSGSL
Ga0075433_1172403513300006852Populus RhizosphereVRTFVAVGLAFVLGSSTLAAQYERRYEVGLFGAFTKYDKGFGLDDKIGGGVRFAYAFGPALSLEVEAIFQPPYNIPPSTELEPVIGGGSLVFNVMNRDRLSFYVLGGYSILDFGNTNPYHFTDGGVHGGAGVRLFF
Ga0075433_1188218323300006852Populus RhizosphereVRKLAPILAMLVALSSSLSAQYARRYEVGLFGAFTKYDNSFGLANKLGGGVRFAYAVTPMIGLEVEALFQSPQDVSTVHIEPMIGGGSLVINTLNASRMTVYVLGGYSRLDFGGSNPYRFTDGGIHSGAGVKLYMS
Ga0075425_10311934713300006854Populus RhizosphereVRTFVAVGLAFVLGSSTLAAQYERRYEVGLFGAFTKYDKGFGLDDKIGGGVRFAYAFGPALSLEVEAIFQPPYNIPPSTELEPVIGGGSLIFNVMNRDRLSFYVLGGYSILDFGNTN
Ga0075435_10127879223300007076Populus RhizosphereMAAGLAMALGTSTLAAQYERRYEVGLFGSFTKYDQAFGLANKIGGGVRFAYALGPALSLEVEALFQPPYNIPPSTEIEPIIAGGSLVFNALNRDRLAVYVLGGYSRLDFGATNPYH
Ga0075435_10199907923300007076Populus RhizosphereVRTIATIVMMMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIGVEVEALFQSPQDVGTVHLEPLIGGGSLVVNALNAPRMSVYVL
Ga0099795_1030851533300007788Vadose Zone SoilMALTATRLDAQYARRYEVGVFGAYTKYDKAFGLADKPGGGVRFAYALTPMIDLEIEALFQSPQDVGTAHIEPLIGSGSLVINALNASRMSV
Ga0066710_100000894183300009012Grasslands SoilVRKLAPILVVMTLTATRLHAQYARRYEVGLFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNP
Ga0099830_1115643013300009088Vadose Zone SoilMALTATRLEAQYDRRYEVGLFGAFTKYDKTFGLSNKIGGGVRFSYAITPMIGLEVEALFQSPQTVSSSTQIEPMIGAGSLVINTLNASRMTVYVLGGYSRLDFGGTSPYR
Ga0066709_10124417923300009137Grasslands SoilVRKLAAALTILALSGSTRLAAQFSRRYEVGLFGAYTKYDQTFGLTNKPGGGARFSYALSPLISLEVEALFQSPQDISSSTLEPMIGAGSLIV
Ga0066709_10281150013300009137Grasslands SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDQSFGLADKPGGGVRFAYALGPALSLEVEALFQAPQNLPAAAQIEPLIGSGSLLLYALNASRMSLYLIGGYSRLDFGSTSPYRFTDGGVH
Ga0075423_1063712813300009162Populus RhizosphereVRKLAPLLAIAALTATRLEAQYARRYEVGLFGAYTKYDKAFGLASKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEGLIGSGSLVVNALNASRMS
Ga0075423_1189700323300009162Populus RhizosphereVRTFVAVGLAFVLGSSTLAAQYERRYEVGLFGAFTKYDKGFGLDDKIGGGVRFAYAFGPALSLEVEAIFQPPYNIPPSTELEPVIGGGSLVFNVMNRDRLSFYVLGGYSILDFGNTNPYHFTDGGVHGGAGV
Ga0127447_109055313300010136Grasslands SoilVKRIAPILIVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPIIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNALNASRMSVYVLGGYSLLDFGNTTPYHFTDGGFHGG
Ga0134070_1005428213300010301Grasslands SoilVLLALSANPLAAQYDRRYEVGLFGAFTKYDNTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHLEPLIGGGSLVVNTLNASRMTVYLLGGYSRLDFGGTNP
Ga0134088_1065619713300010304Grasslands SoilVALTATRLDAQYARRYEVGLFGAYTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVY
Ga0134086_1022022513300010323Grasslands SoilVRTLAAILLVTAITGSRLEAQYDRRYEVGLFGAFTKYDKTFGLADKIGGGVRFSYAVTPMIGLEVEALFQSPQDVTASTQIEPMIGAGSLVINTLNASRMTIYVLGGYSRLDFG
Ga0134086_1043812523300010323Grasslands SoilVRKLAAALTILALSGSTRLAAQFSRRYEVGLFGAYTKYDQTFGLTNKPGGGARFSYALSPLISLEVEALFQSPQDISSSTLEPMIGAGSLIVSPLNASRATFYLIGGYS
Ga0134064_1041144513300010325Grasslands SoilVRTLVAILLVTAITRSSLEAQYDRRYEVGLFGAFTKYDKTFGLSDKIGGGVRFSYAVTPMIGLEVEALFQSSQDVTASTQIEPMIGAGSLVVNALNASRMTIYVLGGYSRLDFGG
Ga0134111_1046189923300010329Grasslands SoilVALTATRLDAQYARRYEVGLFGAYTKYDKAFGLADKPGGGVRFAYALTPMVDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPY
Ga0134080_1020330223300010333Grasslands SoilVTKVAVILGVLVLSSTTLTAQYERRYEVGLFGAFTKYDKGFNLADKIGGGVRFAYGLTPMLGLEVDALFQAPQDVGPSAQIEPLIGSASLVVNALNASRMSVYVLGGYSRLDFGATSPY
Ga0137392_1047613613300011269Vadose Zone SoilMMMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIGVEVEALFQSPQDVGTVHLEPLIGGGSLVVNALNAPRMSVYVLGGYSRLDFGATSPYRFTDGGFHGGAGAKF
Ga0137364_1000136213300012198Vadose Zone SoilVLVLSSTTLTAQYERRYEVGLFGAFTKYDKGFNLADKLGGGVRFAYGLTPMLGLEVDALFQAPQDVGPSSQIEPLIGSASLVVNALNASRMSVYVLGGYSR
Ga0137363_1114609313300012202Vadose Zone SoilMALTATRLEAQYERRYEVGLFGAFTKYDKAFGLSDKIGGGVRFSYAVTPMVGLEIEALFQSPQDISATTQIEPLIGGASLVVNTLNASRMTIYALGGYS
Ga0137399_1007266813300012203Vadose Zone SoilVRKLAAVLVLFALNGTRLAAQYERRYEVGLFGAYTKYDKSFGLADKPGGGVRFAYALGPTLSLEVEALFQSPQNLPASAQIEPLIGSGSLLLYALNASRMSLYVIGGYSRLDFGGTSPYRFTDGG
Ga0137399_1038427123300012203Vadose Zone SoilMALIAAARLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNA
Ga0137399_1066369323300012203Vadose Zone SoilVLGSSSLAAQYERRYEVGLFGAFTKYDKGFGLGDKIGGGVRFAYALGPALSLEVEALFQPPYNLPPSTELEPVIGGGSLVFNVMNRDRLSLYVLGGFSVLDFGITNPYHFTDFGGHAGAGIRLFFSDH
Ga0137380_1173135823300012206Vadose Zone SoilMTLTATRLEAQYDRRYEVGIFGAFTKYDNAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNAS
Ga0137381_1001323663300012207Vadose Zone SoilVRRLAAAFIVLAFVGGHRLAAQYDRRYEVGLFGAFTKYDKTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHIEPLIGGGSLVVNTLNASRMSVYVLGGYSRLDFGGTNPYRFTDGGFHGGAGVKM
Ga0137381_1014303533300012207Vadose Zone SoilVRKLAAVLVLFALSGTRLAAQYERRYELGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQSPQNLPAAAQIEPLIGSGSLLLYALNASRMSL
Ga0137381_1041290413300012207Vadose Zone SoilVRTLAAILLVTAITGSRLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVY
Ga0137381_1168050413300012207Vadose Zone SoilVLVLSSTTLTAQYERRYEVGLFGAFTKYDKGFNLADKLGGGVRFAYGLTPMLGLEVDALFQAPQDVGPSSQIEPLIGSASLVVNALNASRMSVYVLGGYSRLDFGATSPYRFTDGGVHGGAGAK
Ga0137376_1005415833300012208Vadose Zone SoilVKKLAPILVVMTLTATRLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEIEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFG
Ga0137378_1054061423300012210Vadose Zone SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDQLFGLADKPGGGVRFAYALGPALSLEVEALFQAPQNLPASAQIEPLIGSGSLLLYALNASRMSLYLI
Ga0137378_1106100213300012210Vadose Zone SoilVRTLAAILLVTAITGSRLEAQYDRRYEVGLFGAFTKYDKTFGLADKIGGGVRFSYAVTPMIGLEVEALFQSPQDVTASTQIEPMIGAGSLVVNALNASRMSVYVLGGYSLLDFGNTTPYHFTDGGFHG
Ga0137377_1165767623300012211Vadose Zone SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQAPQNLPASAQIEPLIGSGSLLLYALNASRMSLYLIGGYSRLDFGGTSPYRFTDG
Ga0137370_1001038713300012285Vadose Zone SoilVRTLAATVLVLALGTTSLSAQFERRYEVGLFGAFTRYDQNFGLQDKLGGGVRFAYALGPSASFEVEALFQSPQTPAPSTPIEPLIGSASVVFYALNASRMSAYVLGGYSLLDFGNTSPYHFTDGGFHGGAGVKFFMSSRFA
Ga0137387_1022771723300012349Vadose Zone SoilVRKLAAVLVLFALSGTRLAAQYERRYELGLFGAYTKYDKTFGLADKPGGGVRFAYALGPALSLEVEALFQAPQNLPASAQIEPLIGSGSLLLYALNA
Ga0137386_1038326313300012351Vadose Zone SoilVRTLAAILLVTAITGSRLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGRCAPCRRP
Ga0137368_1040944023300012358Vadose Zone SoilVRTLAAILLVTALAGSSLEAQYGRRYEVGLFGAFTKYDKAFALSNKIGGGVRFAYAVTPMIGLEVEALFQSPQNITPSTEIEPLIGAASLVVNTLNASRMTVYVLGGYSRLDFGGTN
Ga0137361_1080837413300012362Vadose Zone SoilVRKLAAVLVLFALSGTQLAAQYERRYEVGLFGAYTKYDQSFGLADKPGGGVRFAYALGPALSLEVEALFQSPQNLPASAQIEPLIGSGSLLLYALNASRMSLYLIGGYSRLDFGGTSPYRFTDGGV
Ga0134043_117865323300012392Grasslands SoilVKRIAPILIVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNALNASRMSVYVLGGYSLLDFGNTTPYHFTDGGFH
Ga0134044_114720613300012395Grasslands SoilVALTATRLDAQYARRYEVGLFGAYTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGNTNPYHFT
Ga0134061_115691013300012399Grasslands SoilVRKLAAALTILALSGSTRLAAQFSRRYEVGLFGAYTKYDQTFGLTNKPGGGARFSYALSPLISLEVEALFQSPQDISSSTLEPMIGAGSLIVSPLNASRATFYLIGGYSRLDFGGTDPYRFTDGGV
Ga0134053_113238923300012406Grasslands SoilVALTATRLDAQYARRYEVGLFGAFTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGNTNPYHF
Ga0134050_103903823300012407Grasslands SoilVKRITPILIVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIRSGSLVVNALNASRMSVYVLGGYSLLDFGNT
Ga0134045_120090713300012409Grasslands SoilVKRIAPILIVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNALNASRMSVYVLGGYSLLISAIP
Ga0137398_1040806913300012683Vadose Zone SoilVALTATRLEAQYVRRYEVGLFGAFTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLG
Ga0137397_1067841923300012685Vadose Zone SoilVRKHAAALVILALSGSTRLAAQYNRRYEVGLFGAFTKYDQAFGLASKPGGGARFSYALTPMIGLEVEALFQSPQDVSSSTLEPMIGAGSVIVSPLNASRATFYLLAGYSRLDFAAPIPIGSLTAACTGGSARRCT*
Ga0157283_1032443413300012907SoilLKAFGALAVMLLLGSATLPAQYSRRYEVGFFGGFTKYDQSFQLADKSGGGVRFAYAFAPLVAVEVEGLFQSPQDVGSVHVEPLIGSASLVVNPFNSSRMSLYVLGGYTRLDFGNSSPYDFTDGGFHGG
Ga0137396_1098009823300012918Vadose Zone SoilMTLTATRLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQNVGAAHIEPLIGSGS
Ga0137396_1109984513300012918Vadose Zone SoilVRTLAAVGLAVVLGSSSLAAQYERRYEVGLFGAFTKYDKGFGLGDKIGGGVRFAYALGPALSLEVEALFQPPYNLPPSTELEPVIGGGSLVFNLLNSDRNVLYILGGYSRQDYGAQNPYRFTDGAAHAAVGEKRSARVERESLGPVVCRQAVA
Ga0137359_1134329413300012923Vadose Zone SoilVALTATRLDAQYARRYEVGLFGAFTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVY
Ga0137413_1066192713300012924Vadose Zone SoilVKSIAPILVVLALTATRLEAQYARRYEVGIFGAYTRYDKAFGLADKPGGGVRFAYALTPMINLEVEALFQSPQDVGAAHIEGLIGSGSLVVNVLNASRMSVYALGGYSLLDFGNTSPYHFTDGGFHGGAG
Ga0137419_1005126813300012925Vadose Zone SoilMALTAAARLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSIVVNALNASRMSVYVLGGYSRLDFGGTNPYRFTD
Ga0137419_1094081413300012925Vadose Zone SoilMTLTATRLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPYRFTDG
Ga0137416_1165396513300012927Vadose Zone SoilMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIDVEVEALFQSPQDVGTAHLEPLIGGGSLVVNALNAPRMSVYVLGGYSRLDFGATSPYRFTDGGFHGSAGA
Ga0134077_1004075923300012972Grasslands SoilMRKLAPILVLLALTASRLAAQYDRRYEVGLFGAFTKYDNTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHIEPMIGGGSLVVNTLNASRMTVYLLGGYSRLDFGGTNPYRFTDGGVHGGAGVKM
Ga0134078_1001424913300014157Grasslands SoilVLLALSANPLAAQYDRRYEVGLFGAFTKYDNTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHLEPLIGGGSLVVNTLNASRMTVYL
Ga0134079_1009836113300014166Grasslands SoilMALTGTRLEAQYERRYEVGIFGAFTRYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNARRMSVYLLGGYSRLDFGGTNPYRFTDGGFHGGA
Ga0134079_1011901523300014166Grasslands SoilVALTATRLDAQYARRYEVGLFGAFTKYDKAFGLADKPGGGVRFAYALTPMVDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGT
Ga0137420_143633823300015054Vadose Zone SoilVKSIALILVVLALTATRVEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQAPQDVGAAHIEGLIGSGSLVVNVLNASRMSVYVLGGYSLL
Ga0180073_101705313300015256SoilVGLAALLGASSLAAQYERRYEVGAFGAFTKYDKAFGLEDKIGGGVRFAYALGPAVSLEVEALFQPPHTIAPSTDIEPVIGGGSLVFNALNRDRLSFYVLGGYSRLDFGGTNPYRFTDGGFHGGAG
Ga0134085_1001399733300015359Grasslands SoilVTKVAVILGVLVLSSTTLTAQYERRYEVGLFGAFTKYDKGFNLADKIGGGVRFAYGLTPMLGLEVDALFQAPQDVGPSAQIEPLIGSASL
Ga0134069_125777513300017654Grasslands SoilVKRIAPILIVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNALNASRMSVYVLGGYSLLDFGNTTP
Ga0134112_1036344013300017656Grasslands SoilVRRLAAAFIVLAFVGGHRLAAQYDRRYEVGLFGAFTKYDKTFGLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHIEPLIGGGSLVVNTLNASR
Ga0184608_1026494613300018028Groundwater SedimentVRTLAAVGLAVVLGSSTLSAQYERRYEVGLFGAFTKYDKGFGLEDKIGGGVRFAFALGPAVSLEVEALFQPPHNIPPSTELEPVIGGGSLVFNVMNRDRLSFYVLGGYSLLDFGNTNPYHFT
Ga0184634_1008663623300018031Groundwater SedimentMRILAALGLASVLGASQVPAQFERRYEVGLFGAFTKYDKTFSLDDKLGGGVRFAYALGPAVSLEFEALFQSPHTIAPSTQIEPLIGGGSLVLNALNASRMSLYLLGGYSRLDFGGTSP
Ga0184637_1039207823300018063Groundwater SedimentVSKLTPMLLVLGLTGTRLEAQYDRRYEVGMFGAFTKYDKAFNLNDKIGGGVRFAYALTPMMALEVEGLFQSPQDVGSVHMEPLIGSGSLVVNAL
Ga0184609_1005942723300018076Groundwater SedimentMPVRTLAAVGLAIALGGGTSTLAAQYERRYEVGAFAAFTKYDKAFGLEDKIGGGVRFAYALGPALSLEIEALFQPPHNIPPSSEIEPVIGGGSLVLNTLNRDRLSFYVLAGY
Ga0184609_1014570823300018076Groundwater SedimentVRKLAPILLVLGLTGTRLEAQYDRRYEVGLFGAFTKYDQAFNLSNKIGGGVRFAYAFTPMLSLEGEGLFQSPQDIGSVHIEPLIGAASLVVNVLNASRMSVYA
Ga0066662_1122923513300018468Grasslands SoilVRKLAAALLLFALNGTRLAAQYERRYEVGLFGAYTKYDKTFGLSDKPGGGVRFAYALGPAMSLEVEALFQSPQSFSSSSIEPLIGSGSLLLYALN
Ga0066669_1079841813300018482Grasslands SoilVKRIAPILIVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNA
Ga0066669_1160100023300018482Grasslands SoilVRTLAAVGLVVGLGCSTLAAQYERRYEVGLFGAFTKYDQGFGLADKIGGGVRFAYAFGPAISLEVEALFQPPQNLPPSTELEPVIGGGSLVFNVMNRDRLSLYVLGGYSVLDFGNTNPYH
Ga0184646_129413323300019259Groundwater SedimentVSKLTFMLLALGLTGARLEAQYDRRYEVGMFGAFTKYDKAFNLNDKIGGGVRFAYSLTPMMALEVEGLFQSPQDVGSVHMEPLIGSGSLVVNALNASRMSVYVLGGYTLLDFGNTNP
Ga0173479_1073924613300019362SoilLKAFGALAVMLLLGSATLPAQYSRRYEVGFFGGFTKYDQSFQLADKSGGGVRFAYAFAPLVAVEVEGLFQSPQDVGSVHVEPLIGSASLVVNPFNTDRMSLYVLGGYTRLDFGNSSPYDFTDG
Ga0137408_113156723300019789Vadose Zone SoilVKRIALILVVLALTATRVEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNVLNASRMSVYALGGYSLLDFGNTSPYH
Ga0179594_1003573113300020170Vadose Zone SoilVRKLAPILVVMTLTATRLEGQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEIEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPYRF
Ga0210378_1003049113300021073Groundwater SedimentVKRIAPILVVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNVLNA
Ga0210382_1040786213300021080Groundwater SedimentVRTLAAAGLAFVLGSSSLAAQYERRYEVGLFGAFTKYDKGFGLEDKIGGGVRFAYAFGPALSLELEALFQPPQNLPPSTELEPVIGGGSLIFNIMNRDRLSFYVLGGFSVLDF
Ga0193719_1000511213300021344SoilVRKLAPILVVIALTATRLEAQYARRYEVGLFGAYTKYDKAFGLTDKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPYRFTD
Ga0222622_1061374613300022756Groundwater SedimentVKRIAPILVVLALTATRLEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLVVNVLNASRMSVYALGGYSLLDFGNTTP
Ga0209519_1030188723300025318SoilVRIVAAVGLAALLGASTLAAQYERRYEVGAFAAFTKYDKAFGLDDKIGGGVRFAYALGPAVSLELEALFQPPHTIAPSTEIEPVIGGGTLVLNALNRDRLSFYLLAGY
Ga0209235_112134523300026296Grasslands SoilVRTIATIVIMMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIDVEVEALFQSPQDVGTAHLEPLIGGGSLVVNALNAPRMSVYVLGGYSRLDFGATSP
Ga0209238_115525723300026301Grasslands SoilVRKLAPMLAVMALTATRLEAQYDRRYEVGLFGAFTKYDKTFGLSNKIGGGVRFSYAVTPMIGLEVEALFQSPQTVSSSTQIEPMIGAGSLVINTLNASRM
Ga0209240_107388223300026304Grasslands SoilVRRLAPLFALVALTATRLDAQYARRYEVGLFGAFTKYDKAFGLADKPGGGVRFAYALTPMVDLEVEALFQSPQDVGTAHIEALIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPYRFTDGGFHGG
Ga0209240_122577213300026304Grasslands SoilVKSIAPILVVLALTATRLEAQYARRYEVGIFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSLV
Ga0209761_100678863300026313Grasslands SoilVRTLVTIVMLMVATRSLSAQYDRRYEVGLFGAFTRYDKAFNLANKIGGGVRFAYAFTPMIDLEVEALFQSPQDVGTVHLEPLIG
Ga0209375_120963613300026329SoilVRKLAAALTILALSGSTRLAAQFSRRYEVGLFGAYTKYDQTFGLTNKPGGGARFSYALSPLISLEVEALFQSPQDISSSTLEPMIGAGSLIVSPLNASRATFYLIGGYSRLDFGGTDPYRFTD
Ga0209158_132717813300026333SoilVRKLAAVLVLFALSGTRLAAQYERRYEVGLFGAYTKYDQLFGLADKPGGGVRFAYALGPALSLEVEALFQAPQNLPAAAQIEPLIGSGSLLLYALNASRMSLYLIGGYSRLDFGGTSPYRFTDGGVHGGAG
Ga0209378_100380613300026528SoilVRKLAPILVVMALTATRLEAQYDRRYEVGLFGAFTKYDKTFGLSNKIGGGVRFSYAVTPMIGLEVEALFQSPQTVSSTTQIEPMIGAGSLVINTLNASRMTVYVLGGYSRLDFGGTS
Ga0209378_111505713300026528SoilVRKLAPILVLLALSANPLAAQYDRRYEVGLFGAFTKYDNTFSLSNKLGGGVRFAYAVTPMIGLEVEALFQSPQDVGTAHLEPLIGGGSLVVNTLNASRMTVYLLGGYSRLDFGGTNPYRFTDG
Ga0209160_111277913300026532SoilVRKLAPILVVMTLTATRLHAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPYRFTDGGFHGGAVAKF
Ga0209076_103830813300027643Vadose Zone SoilVRKLAPILVVMALTVTRLEAQYDRRYEVGLFGAFTKYDKTFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFGGTNPYRFTD
Ga0209076_104384123300027643Vadose Zone SoilVRKLAPILAVMALIAAARLEAQYDRRYEVGIFGAFTKYDKAFNLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEPLIGSGSLVVNALNASRMSVYVLGGYSRLDFEKAAPYRFTDDAVHGAIGDRIFLGDQAALR
Ga0209588_102053013300027671Vadose Zone SoilVRKLAPIFLVMALTAPRLEAQYDRRYEVGLFGAFTKYDKTFGLSNKIGGGVRFSYAVTPMIGLEVEALFQSPQNVSSSTQIEPMIGAGSLVINTLNASRMTVYVLGGYSRLDFGGTSPYRFTDGGFHGGAGA
Ga0209073_1047575823300027765Agricultural SoilVRKLAPILVLMALTATQLEAQYDRRYEVGLFGAFTKYDKAFGLSDKLGGGVRFSYAVTPMVGLEIEALFQSPQDISATTQIEPLIGGASLVVNTLNASRMT
Ga0209590_1001468533300027882Vadose Zone SoilVRKLAPILVVMALTATRLEAQYDRRYEVGLFGAFTKYDKTFGLSNKIGGGVRFSYAVTPMIGLEVEALFQSPQTVSSSTQIEPMIGAGSLVINT
Ga0209488_1004545613300027903Vadose Zone SoilVRKLAPIFLVMALTGTTLAAQYERRYEVGLFGAFTKYDKAFSLADKIGGGVRFAYGVTPMIGLEVEALFQSPQTIGASSEIEPLIGSASLIVNALNASRM
Ga0209488_1031206923300027903Vadose Zone SoilVKRIAPILVVLALTATRVEAQYARRYEVGVFGAYTRYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGAAHIEGLIGSGSVVVNALNASRMSVYVLGGFSLLDFGNTNPYHF
Ga0209488_1109573313300027903Vadose Zone SoilMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIDVEVEALFQSPQDVGTAHLEPLIGGGSLVVNALNAPRMSVYVLGGYSRLDFGATSPYR
Ga0268265_1229614713300028380Switchgrass RhizosphereLKAFGALAVMLLLGSATLPAQYSRRYEVGFFGGFTKYDQSFQLADKSGGGVRFAYAFAPLVAVEVEGLFQSPQDVGSVHVEPLIGSASLVVNPFNTSRMSLYVLGGYTRL
Ga0307320_1035148813300028771SoilVRTLAAAGLAIALGASTLEAQFERRYEVGAFAAFTKYDKVFGLDDKIGGGVRFAYALGPALSLEVVALFQPPYHIPPSTEIEPVIGGASLVLNAMNRDRMSFYVLAG
Ga0307278_1034265013300028878SoilVRKLAPLLAIVALTATRLDAQYARRYEVGLFGAYTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEGLIGSGSLVVNALNASRM
Ga0308178_103141113300030990SoilVRTLAAVGLAVVLGSSTLAAQYERRYEVGLFGAFTKYDKGFGLEDKIGGGVRFAYALGPALSLEVEALFQPPHNIPPSTELEPVIGGGSLVFNVMNRDRLSFYVLGGISVLDFGNSNPYHFTDFGGHGGAGKQTHAGAHQPRHPDHCYPPAA
Ga0308187_1009897823300031114SoilVRKLAPLFAIVALTATRLDAQYARRYEVGLFGAYTKYDKAFGLADKPGGGVRFAYALTPMIDLEVEALFQSPQDVGTAHIEGLIGSGSLVVNALNASRMSVYVLGGFSLLDFGNTNPYHFTDHGFHGGA
Ga0307469_1158159913300031720Hardwood Forest SoilVRTIATIAMMMVATSSLSAQYDRRYEVGLFGAFTKYDKAFNLASKIGGGVRFAYAFTPMIGVEVEALFQSPQDVGTVHLEPLIGGGSLVVNALNAPRMSVYVLGGYSRLDFGATSPYRFT
Ga0364928_0098917_355_6843300033813SedimentMLLVLGLTGTRLEAQYDRRYEVGMFGAFTKYDKAFNLNDKIGGGVRFAYAFTPMIALEAEGLFQSPQDVSSVHMEPLIGSGSLVVNALNASRMSVYVLGGYTLLDFGNTN
Ga0364928_0193147_125_5083300033813SedimentVRTLAAVGLAIALGTSTLAAQYERRYEVGAFGAFTKYDKTFGLEDKIGGGVRFAYALGPALSLELEALFQPPHSISPSTDIEPVIGGGSLVLNALNRDRLSFYVLAGYSRLDFGGTNPYRFTDGAVHG
Ga0364940_0123746_2_2623300034164SedimentMLLVLGLTGTRLEGQYDRRYEVGMFAAFTKYDKAFNLNDKIGGGVRFAYALTPMMALEVEGLFQSPQDVGSVHMEPLIGSGSLVVNA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice