NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F072781

Metagenome Family F072781

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F072781
Family Type Metagenome
Number of Sequences 121
Average Sequence Length 129 residues
Representative Sequence MKNNSKSKTFLALVLAIAIAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQASTADIAVIQNISAQASRDLNLLQTLYAEYKANPSASTLQKIQNVIADLNQNLPTLLQSAHV
Number of Associated Samples 102
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 88.24 %
% of genes near scaffold ends (potentially truncated) 87.60 %
% of genes from short scaffolds (< 2000 bps) 90.08 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (95.868 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(22.314 % of family members)
Environment Ontology (ENVO) Unclassified
(34.711 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(49.587 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 68.24%    β-sheet: 0.00%    Coil/Unstructured: 31.76%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 121 Family Scaffolds
PF02663FmdE 31.40
PF01425Amidase 22.31
PF09190DALR_2 1.65
PF00589Phage_integrase 0.83
PF01966HD 0.83
PF01258zf-dskA_traR 0.83
PF12627PolyA_pol_RNAbd 0.83
PF13735tRNA_NucTran2_2 0.83
PF13432TPR_16 0.83
PF05258DciA 0.83
PF00437T2SSE 0.83
PF02686Glu-tRNAGln 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 121 Family Scaffolds
COG2191Formylmethanofuran dehydrogenase subunit EEnergy production and conversion [C] 31.40
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 22.31
COG0215Cysteinyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 1.65
COG0721Asp-tRNAAsn/Glu-tRNAGln amidotransferase C subunitTranslation, ribosomal structure and biogenesis [J] 0.83
COG1734RNA polymerase-binding transcription factor DksATranscription [K] 0.83
COG5512Predicted nucleic acid-binding protein, contains Zn-ribbon domain (includes truncated derivatives)General function prediction only [R] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms95.87 %
UnclassifiedrootN/A4.13 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001305|C688J14111_10151415All Organisms → cellular organisms → Bacteria713Open in IMG/M
3300002568|C688J35102_117908522All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300002916|JGI25389J43894_1008849All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter1661Open in IMG/M
3300004479|Ga0062595_100452417All Organisms → cellular organisms → Bacteria944Open in IMG/M
3300005175|Ga0066673_10041396All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter2277Open in IMG/M
3300005175|Ga0066673_10061952Not Available1918Open in IMG/M
3300005176|Ga0066679_10275814All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1089Open in IMG/M
3300005177|Ga0066690_10636590All Organisms → cellular organisms → Bacteria711Open in IMG/M
3300005179|Ga0066684_10382559All Organisms → cellular organisms → Bacteria944Open in IMG/M
3300005184|Ga0066671_11081999All Organisms → cellular organisms → Bacteria501Open in IMG/M
3300005186|Ga0066676_10784569All Organisms → cellular organisms → Bacteria646Open in IMG/M
3300005434|Ga0070709_10331993All Organisms → cellular organisms → Bacteria1119Open in IMG/M
3300005434|Ga0070709_11468565All Organisms → cellular organisms → Bacteria553Open in IMG/M
3300005436|Ga0070713_100575781All Organisms → cellular organisms → Bacteria1068Open in IMG/M
3300005437|Ga0070710_10292102All Organisms → cellular organisms → Bacteria1061Open in IMG/M
3300005437|Ga0070710_10818608All Organisms → cellular organisms → Bacteria666Open in IMG/M
3300005439|Ga0070711_101356114All Organisms → cellular organisms → Bacteria618Open in IMG/M
3300005439|Ga0070711_101955265All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300005471|Ga0070698_101945762All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Selenomonadales → Selenomonadaceae → Selenomonas → unclassified Selenomonas → Selenomonas sp. ND2010541Open in IMG/M
3300005530|Ga0070679_101230947All Organisms → cellular organisms → Bacteria694Open in IMG/M
3300005537|Ga0070730_10441479All Organisms → cellular organisms → Bacteria839Open in IMG/M
3300005537|Ga0070730_10597735All Organisms → cellular organisms → Bacteria704Open in IMG/M
3300005547|Ga0070693_101219840All Organisms → cellular organisms → Bacteria578Open in IMG/M
3300005553|Ga0066695_10339902All Organisms → cellular organisms → Bacteria942Open in IMG/M
3300005560|Ga0066670_10308448All Organisms → cellular organisms → Bacteria963Open in IMG/M
3300005563|Ga0068855_102367399All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300005575|Ga0066702_10154585All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1360Open in IMG/M
3300006028|Ga0070717_11592831All Organisms → cellular organisms → Bacteria592Open in IMG/M
3300006032|Ga0066696_10096382All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1769Open in IMG/M
3300006163|Ga0070715_10617430All Organisms → cellular organisms → Bacteria638Open in IMG/M
3300006175|Ga0070712_100899956All Organisms → cellular organisms → Bacteria763Open in IMG/M
3300006755|Ga0079222_10827103Not Available762Open in IMG/M
3300006796|Ga0066665_11665564All Organisms → cellular organisms → Bacteria504Open in IMG/M
3300006797|Ga0066659_11070007All Organisms → cellular organisms → Bacteria674Open in IMG/M
3300006804|Ga0079221_11696560All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300006893|Ga0073928_10003509All Organisms → cellular organisms → Bacteria24673Open in IMG/M
3300009088|Ga0099830_10728473All Organisms → cellular organisms → Bacteria817Open in IMG/M
3300009093|Ga0105240_11639895All Organisms → cellular organisms → Bacteria672Open in IMG/M
3300009137|Ga0066709_100803540All Organisms → cellular organisms → Bacteria1363Open in IMG/M
3300009162|Ga0075423_11582791All Organisms → cellular organisms → Bacteria704Open in IMG/M
3300009174|Ga0105241_12439406All Organisms → cellular organisms → Bacteria523Open in IMG/M
3300009551|Ga0105238_12951167All Organisms → cellular organisms → Bacteria511Open in IMG/M
3300009792|Ga0126374_11531047All Organisms → cellular organisms → Bacteria548Open in IMG/M
3300010320|Ga0134109_10230598All Organisms → cellular organisms → Bacteria692Open in IMG/M
3300010321|Ga0134067_10396976All Organisms → cellular organisms → Bacteria552Open in IMG/M
3300010321|Ga0134067_10499747All Organisms → cellular organisms → Bacteria504Open in IMG/M
3300010337|Ga0134062_10382025All Organisms → cellular organisms → Bacteria685Open in IMG/M
3300010359|Ga0126376_12648565All Organisms → cellular organisms → Bacteria550Open in IMG/M
3300010375|Ga0105239_11635361Not Available745Open in IMG/M
3300010396|Ga0134126_11325903All Organisms → cellular organisms → Bacteria797Open in IMG/M
3300010396|Ga0134126_11573919All Organisms → cellular organisms → Bacteria723Open in IMG/M
3300011269|Ga0137392_10007486All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae7049Open in IMG/M
3300011269|Ga0137392_11190942All Organisms → cellular organisms → Bacteria620Open in IMG/M
3300011270|Ga0137391_10020289All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis5481Open in IMG/M
3300011270|Ga0137391_11475748All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300012004|Ga0120134_1013972All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi1245Open in IMG/M
3300012204|Ga0137374_10252131All Organisms → cellular organisms → Bacteria1479Open in IMG/M
3300012205|Ga0137362_11666114All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300012208|Ga0137376_10367011All Organisms → cellular organisms → Bacteria1250Open in IMG/M
3300012209|Ga0137379_10129990All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter2421Open in IMG/M
3300012209|Ga0137379_10194863All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1939Open in IMG/M
3300012209|Ga0137379_11494055All Organisms → cellular organisms → Bacteria577Open in IMG/M
3300012210|Ga0137378_10004224All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis12411Open in IMG/M
3300012210|Ga0137378_10424973All Organisms → cellular organisms → Bacteria1232Open in IMG/M
3300012210|Ga0137378_11716404All Organisms → cellular organisms → Bacteria534Open in IMG/M
3300012211|Ga0137377_11533541All Organisms → cellular organisms → Bacteria592Open in IMG/M
3300012349|Ga0137387_11216042All Organisms → cellular organisms → Bacteria531Open in IMG/M
3300012351|Ga0137386_10119539All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter1874Open in IMG/M
3300012351|Ga0137386_11131796All Organisms → cellular organisms → Bacteria551Open in IMG/M
3300012357|Ga0137384_10076365All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter2784Open in IMG/M
3300012363|Ga0137390_10733680All Organisms → cellular organisms → Bacteria949Open in IMG/M
3300012917|Ga0137395_10117994All Organisms → cellular organisms → Bacteria1783Open in IMG/M
3300012927|Ga0137416_11464787All Organisms → cellular organisms → Bacteria619Open in IMG/M
3300012929|Ga0137404_11846596All Organisms → cellular organisms → Bacteria562Open in IMG/M
3300012960|Ga0164301_10211607All Organisms → cellular organisms → Bacteria1244Open in IMG/M
3300012986|Ga0164304_10898953All Organisms → cellular organisms → Bacteria692Open in IMG/M
3300012987|Ga0164307_10430950All Organisms → cellular organisms → Bacteria980Open in IMG/M
3300014166|Ga0134079_10477312All Organisms → cellular organisms → Bacteria597Open in IMG/M
3300014497|Ga0182008_10587366All Organisms → cellular organisms → Bacteria623Open in IMG/M
3300014969|Ga0157376_10785013All Organisms → cellular organisms → Bacteria964Open in IMG/M
3300015262|Ga0182007_10440151All Organisms → cellular organisms → Bacteria501Open in IMG/M
3300015264|Ga0137403_10394860All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1264Open in IMG/M
3300015357|Ga0134072_10206233All Organisms → cellular organisms → Bacteria683Open in IMG/M
3300017937|Ga0187809_10056622All Organisms → cellular organisms → Bacteria1270Open in IMG/M
3300018433|Ga0066667_10934394All Organisms → cellular organisms → Bacteria747Open in IMG/M
3300018468|Ga0066662_10059010All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis2498Open in IMG/M
3300018468|Ga0066662_11270940All Organisms → cellular organisms → Bacteria751Open in IMG/M
3300018482|Ga0066669_11951119All Organisms → cellular organisms → Bacteria548Open in IMG/M
3300019888|Ga0193751_1079159All Organisms → cellular organisms → Bacteria1318Open in IMG/M
3300020006|Ga0193735_1064187All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1070Open in IMG/M
3300020010|Ga0193749_1074155All Organisms → cellular organisms → Bacteria632Open in IMG/M
3300021170|Ga0210400_11021093All Organisms → cellular organisms → Bacteria672Open in IMG/M
3300021405|Ga0210387_10838248All Organisms → cellular organisms → Bacteria811Open in IMG/M
3300021478|Ga0210402_10810124All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300022557|Ga0212123_10002312All Organisms → cellular organisms → Bacteria46703Open in IMG/M
3300025898|Ga0207692_10239029All Organisms → cellular organisms → Bacteria1084Open in IMG/M
3300025905|Ga0207685_10075441All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis1379Open in IMG/M
3300025913|Ga0207695_10354150All Organisms → cellular organisms → Bacteria1355Open in IMG/M
3300025916|Ga0207663_11192732All Organisms → cellular organisms → Bacteria613Open in IMG/M
3300025924|Ga0207694_10707842All Organisms → cellular organisms → Bacteria849Open in IMG/M
3300025928|Ga0207700_11913309All Organisms → cellular organisms → Bacteria519Open in IMG/M
3300025929|Ga0207664_10044604All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis3473Open in IMG/M
3300025929|Ga0207664_10792596All Organisms → cellular organisms → Bacteria852Open in IMG/M
3300025929|Ga0207664_11794514All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300026078|Ga0207702_10730430All Organisms → cellular organisms → Bacteria977Open in IMG/M
3300026301|Ga0209238_1033182All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis1911Open in IMG/M
3300027842|Ga0209580_10300805All Organisms → cellular organisms → Bacteria799Open in IMG/M
3300027846|Ga0209180_10099243All Organisms → cellular organisms → Bacteria → Acidobacteria1658Open in IMG/M
3300027846|Ga0209180_10104627All Organisms → cellular organisms → Bacteria1615Open in IMG/M
3300027862|Ga0209701_10489991All Organisms → cellular organisms → Bacteria669Open in IMG/M
3300028828|Ga0307312_10421080All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium878Open in IMG/M
3300031716|Ga0310813_11831422All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300031720|Ga0307469_11044392All Organisms → cellular organisms → Bacteria765Open in IMG/M
3300031754|Ga0307475_11035434All Organisms → cellular organisms → Bacteria644Open in IMG/M
3300031962|Ga0307479_11929799All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300031996|Ga0308176_12966652All Organisms → cellular organisms → Bacteria503Open in IMG/M
3300032205|Ga0307472_100367547All Organisms → cellular organisms → Bacteria1189Open in IMG/M
3300032805|Ga0335078_12636785All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300032896|Ga0335075_10749542All Organisms → cellular organisms → Bacteria927Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.31%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere14.05%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil9.09%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil6.61%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.79%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil4.96%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere4.96%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.31%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.31%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.48%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil2.48%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring1.65%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.65%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.65%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.65%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.65%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.65%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere1.65%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.83%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost0.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.83%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.83%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001305Grasslands soil microbial communities from Hopland, California, USAEnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005530Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005547Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005563Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2Host-AssociatedOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012004Permafrost microbial communities from Nunavut, Canada - A30_5cm_6MEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300014497Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-129_1 metaGHost-AssociatedOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015262Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-113_1 MetaGHost-AssociatedOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015357Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300017937Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_4EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020010Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s2EnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300025898Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025913Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025924Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026078Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300031996Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M
3300032896Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.4EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
C688J14111_1015141513300001305SoilMNFKSKTILTLTLAIALAAIGCSAQWINTALQDLPVLTQMALNIATLVSTLAAGQQASAADNAVIQNISAQASRDLNLLQTLYNEYKASPSPTTLQKLQSAIADLNQNLPTMLQ
C688J35102_11790852213300002568SoilVNFKSKSLLALVLAISITTAGCSAQWINIALQDLPVLTQMALNIATLVSAFASGKQANPGDVAVIQNISAQASRDLNLLQSLYAEYKASPSATTLQKLQSVISAMNQNLSALLQSAHISN
JGI25389J43894_100884913300002916Grasslands SoilMNSKSKPFLALVLSILIATAGCSAQWINTALQDLPVLTQMALNIATLVSTLAAGQQASTGDIAVIQNISAQASRDLNLLQTLYSEYKAGPSATTLKKIQNAISDLNQNLPAMLQSAHISNATVSTRIAAAVNLILT
Ga0062593_10198808513300004114SoilMLSRSKSLLALVLAITMSATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANSGDVAVIQNISAQASRDLNLLQSLYAEYKANPSATT
Ga0062595_10045241713300004479SoilMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANSGDVAVIQNISAQASRDLNLLQSLYAEYKANPSATTLQKIQNVISDLDQNLPALLESAHISNSTLAARITAAINLILTTVNNFAALMPQTAPATSQRLP
Ga0066673_1004139613300005175SoilMNLRSKLKSLLALVLALLTATTGCSPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASTADFAVIQNISAQASRDLNLLQSLYNDYKANPSSATLQKIQNVISGLNQNLPALLQAAHI
Ga0066673_1006195233300005175SoilMKANSRSKCLIAAVLALTIAATGCSAQWINVALQDLPVLTEMALNIASLVGTLGSGKQASSADVAVVQNISAQASRDLNLLQTLYTEYKSNPNSATLQKIQNVISGLNQNLPALLQSAHISSATLWRGSLLL*
Ga0066679_1027581413300005176SoilMKLNSKSNILLILALAIAIAATGCSAQWINIALQDLPVLTQMSLNIATLVSTLASGKQASAADLAVIQNISAQASRDLNLLQTLYGEYKANPSSTTLAKIQNVISDLNQNLP
Ga0066690_1063659013300005177SoilMNLRSKLKSLLALVLALVSATTGCTSQWVNVAVQDQPVLTQMALNIATLVSTLAAGKQASTGDVAVIQNISAQVSRDLNLLQSLYNEYKASPNNTTLQKIQNIISGLNQNLPALLQAAHISNPILSARVSAAIN
Ga0066684_1038255923300005179SoilMNLRSKLKSLLALVLALVSATTGCTSQWVNVAVQDLPVLTQMALNIATLVSTLAAGKQASTGDVAVIQNISAQVSRDLNLLQSLYNEYKASPNNTTLQKIQNVISGLNQNLPALLQAAHISNPTLSTRVSAAVNLIISTVNSVASLMPQSSAATSRK
Ga0066671_1108199913300005184SoilMKANSRSKCLIAAVLALTIAATGCSAQWINVALQDLPVLTEMALNIASLVGTLGSGKQASSADVAVVQNISAQASRDLNLLQTLYNEYKANPSASTRQKIQNVISDLNQNLPALLEAAHLSNATLAARVTAAVNLILTTVNSFASLIPQATIPQSTP
Ga0066676_1078456923300005186SoilKLNFRSKSLLAVVLAISIAATGCSAQWINIALEDLPVLTQMTLSIATLVSALASGKQANPGDVAVIQNISAQASRDLNLLQSLYAEYKASANVTTRQKIQSVFLIWIRTFRRCCNQRTFRMPC*
Ga0070709_1033199313300005434Corn, Switchgrass And Miscanthus RhizosphereMHAYSNAHSKFKPLLALVLAMAIATTACSPNWINIALQDLPVLTQMALNIATLASTFSPQQNPADLAVIQNISGQASRDLNLLLTLYNEYKDSPNATTLGKIQSAIGLIN
Ga0070709_1146856513300005434Corn, Switchgrass And Miscanthus RhizosphereMNPHSKCKSLLALVLAILIAATGCSAQWIKIALEDLPVLTQMALNIAALVGTMTAGKQTNNADLAVIQNLSTQASRDLNLLQTLYNEYQASPSDTTLARIQTVIAALNQSLPSLLESAHISNPL
Ga0070713_10057578113300005436Corn, Switchgrass And Miscanthus RhizosphereMKNNSKSKTFLALVLAIAIAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQASTADIAVIQNISAQASRDLNLLQTLYAEYKANPSASTLQKIQNVIADLNQNLPTLLQSAHV
Ga0070710_1029210213300005437Corn, Switchgrass And Miscanthus RhizosphereMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANPGDVAVIQNISAQASRDLNLLLSLYAEYKANPSATTLQKIQNVISDLDQNLPALLES
Ga0070710_1081860813300005437Corn, Switchgrass And Miscanthus RhizosphereMKLKSKSKSLLAAVLAITIATTGCSAQWINIALQDLPVLTQMALNIATLVSALVSGQQANPGDVAVIQNISAQASRDLNLLQSLYAQYKANPSATTLQKIQNAISDLNQNLPALIESAHISNLTLAARITAAVNLILTTV
Ga0070711_10135611423300005439Corn, Switchgrass And Miscanthus RhizosphereMNSKSKSLLALVLAISIAATGCSAQWIKIALQDLPVLTQMALNIAALVGTMSAGKQTNNADLAVIQNISAQASRDLNLLQTLYNEYEASPNDTTLAKIQTVIGNLNQNLPSLLES
Ga0070711_10195526513300005439Corn, Switchgrass And Miscanthus RhizosphereSLVVRRSQKSHSPENHMKTHSKSKCLLALVLAIAIAATGCSSQWINIALQDMPVLTQMALNIATLAATLASGNQASTADVAVIQNISAQASRDLNLLQTLYNQYKANPSASTRQKIQNVISDLNQNLPALLEAAHISSATLIARVTAAVNLILTTVNSFASLIPQSTIPQA
Ga0070698_10194576213300005471Corn, Switchgrass And Miscanthus RhizosphereFQIENSPCPRARDHYRGDGCSAQWISLALQDLPVLTQMALNVATLVSTLASGPQASAADVAVIENVSAQASRDLSLLQSLYSEYKANPNATTLQKIQNVISDLNQNMPALLQSAHIGNPVLSARITAAGI*
Ga0070679_10123094713300005530Corn RhizosphereMHFKFKSLLPLVLAFSITLTACSSQWITIALQDLPVLTQMALNIATLAGTFSRQQNTADLAVIQNISAQASRDLNLLLTLYNEYKANPNAATLSKIQTGITGINQHLPALLESAHISNPLLSSRVTAAVNLILVTVNNF
Ga0070730_1044147913300005537Surface SoilMAIATTACSPNWINIALQDLPVLTQMALNIATLASTFSPQQNPADLAVIQNISGQASRDLNLLLTLYNEYKDSPDATTLAKIQSAIGLINQHLPALLESAHISNALLKARVTIAVNLILATVNN
Ga0070730_1059773523300005537Surface SoilMKRHSNAHSKSKSLLVLALAISIASTGCSPQWINIALQDLPVLTQMALNIATLASTLSGQQNPADLAVIQNISAQASRDLNLLLTLYNEYKASPNATTMARIQGAIGVINQHLPALLESAHISNALLTARV
Ga0070693_10121984013300005547Corn, Switchgrass And Miscanthus RhizosphereMHAYSNAHSKFKPLLALVLAMAIATTACSPNWINIALQDLPVLTQMALNIATLASTFSPQQNPADLAVIQNISGQASRDLNLLLTLYNEYKDSPNATTLAKIQSAIGLINQHLPALLESAHISNALLKTR
Ga0066695_1033990213300005553SoilVKLNFRSKSLLAVVLAISIAATGCSAQWINIALEDLPVLTQMTLSIATLVSALASGKQANPGDVAVIQNISAQASRDLNLLQSLYAEYKASANVTTRQKIQSVFLIWIRTFRRCCNQRTFRMPC*
Ga0066670_1030844813300005560SoilMKLNFRSKSLLAVVLAISIAATGCSAQWINIALQDLPVLTQMSLNIATLVSAFASGKQANPGDVAVIQNISAQASRDLNLLQSLYADYKASANVTTLQKIQSVISDMNHNL
Ga0068855_10236739923300005563Corn RhizosphereMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANPGDVAVIQNISAQASRDLNLLLSLYAEYKANPSATTLQKIQNVISDLDQNLLALLESAHISNSTLAARITAAINLIL
Ga0066702_1015458513300005575SoilMNLRSKLKSLLALVLALLTATTGCSPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASVADVAVIQNISSQASRDLNLLQTLYSEYKANPSSATLQKIQNLISDLNQNLPALLQAAHISNPTLSARIAAAVNLILSTVNSV
Ga0070717_1159283113300006028Corn, Switchgrass And Miscanthus RhizosphereMNSKSKTLLALALAIALAATGCSAQWINTALQDLPVLTQMALNIATLVSTLAAGQQASTADTAVIQNISAQASRDLNLLQTLYNEYKTSPTPTTLQKLQSAISDLNQNLPTML
Ga0066696_1009638213300006032SoilMNSKSKPFLALVLAILIATAGCSAQWINTALQDLPVLTQMALNIATLVSTLAAGQQASTGDVAVIQNVSAQASRDLNLLQTLYSEYKASPNATTLKKIQNAISDLNQNLPAMLQSAHISSATVSTRIAAAVNLILTTVNSFAALMPQ
Ga0070715_1061743013300006163Corn, Switchgrass And Miscanthus RhizosphereMNSKSKSLLALVLAIAIATTGCSTQWINIALQDMPVLTQMALNIATLAATLASGNQASTADVAVIQNISAQASRDLNLLQTLYNEYKANPSASTRQK
Ga0070712_10089995613300006175Corn, Switchgrass And Miscanthus RhizosphereMHAYSNAHSKFKPLLALVLAMAIATTACSPNWINIAQKDLPVLTQMALNIATLASTFSPQQNPADLAVIENISGQASRDLNLLLTLYNEYKDSPNATTLAKIQSAIGLINQHLPALLESAHISNALLKTRVTIAVNL
Ga0079222_1082710313300006755Agricultural SoilMHFKFKSLLPLVLAFSITLTACSSQWITIALQDLPVLTQMALNIATLAGTFSRQQNTADLAVIQNISAQASRDLNLLLTLYNEYKANPNAATLSKIQTGITGI
Ga0066665_1166556423300006796SoilMNTHSKSRIVLALVLAILIAATGCSAQWINIALQDLPVLTQMALNIATLVSSLASGQQISAADTAVIQNISAQASRDLNLLQTLYSEYKADPSATTLAKVQKVISDLNQNLPALLESAH
Ga0066659_1107000723300006797SoilMKLNSKSNILLILALAIAIAATGCSAQWINIALQDLPVLTQMSLNIATLVSTLASGKQASAADLAVIQNISAQASRDLNLLQTLYGEYKANPSSTTLAKIQNVISDLNQNLPTLLQSAHISNAVLSARVTAAVNLIRTTVNSFASLTPQ
Ga0079221_1169656013300006804Agricultural SoilMNVHFKSKSLLALVLAIAITATGCSAQWINIALQDLPVLTQMALNIATLVGTLSSNKPPNTADLAVIQNISAQASRDLNLLQALYSEYKENPSDTTLRKIQNVIAGLNRNLPALL
Ga0073928_1000350923300006893Iron-Sulfur Acid SpringMNPNSKSKTLLALVLAITIAATGCSAQWISLALQDLPVLTQKALNVATLVSTLPSGQQASAADVAVIQNVSAQASRDLSLLQSLYSEYKANPNATTLQKIGTDLRLSKIDR*
Ga0099830_1072847323300009088Vadose Zone SoilVRQKATTNDCLSGENMNPNSKSKPLLALVLAITIAATGCSAQWVNLALQDLPVLTQMALNIATLVSTLASGNQASAADTAVIQNISAQASRDLNLLQSLYSEYKRNPNATALQKIQGVASDLNQNLPALLESAHIGNPVLSARITA
Ga0105240_1163989513300009093Corn RhizosphereMEKTMKTDSKSKILLALVLAIALAATGCSAQWINTALQDLPVLTQMALNIATLVSTLAAGQQASAADTAVIQNISAQAGRDLNLLQTLYNEYKASPSPTTLQKLNSAISDLNQNLPTMLQSAHISNATLSARIAAAVNLILTTVNS
Ga0066709_10080354033300009137Grasslands SoilVLAILIAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQISAADTAVIQNISAQASRDLNLLQTLYSEYKADPSATTLAKVQKVISDLNQNLPALLESAHVSNPTLSARVTAAVNLILTTVNSFASLIPQQSASTSRRAKLALPT
Ga0075423_1158279123300009162Populus RhizosphereMNLRSKLKFLLALVLALLTATIGCTPQWINVAVQDLPVLTQTALNIATLMSTLASGKQASTADVAAIQNISAQASRDLNLRKRFTTNTRPIRHNTTLQKIQNVIADVSQNLPSLLQAAHIPNPTLSARVTAAVNLIVSTVNSVATLMPQSSA
Ga0105241_1243940613300009174Corn RhizosphereMKTHSKSKSLLALVLVISIAATGCSAQWIRIALQDLPVLTEMALNIVALVGTMSEGKRTNNADLAVIQNISAQASRDLNLLQALYNEYEANPNDATLAKIQTVIAGLNQNLPALLESAHISNPLLVARVTAAVNLILGTVNSFAALIPQTSTMSARIAV
Ga0105238_1295116713300009551Corn RhizosphereMAIATTACSPNWINIALQDQTVLTQMALNIATLASTFSPQQNPADLAVIQNISGQASRDLNLLLTLYNEYKDSPNATTLGKIQSAI
Ga0126374_1153104713300009792Tropical Forest SoilMNIRSKLKSLLALVMALITATVGCTPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASAADVAVIQNISAQASRDLNLLQSLYNDYRANPRSTTLQKIQNVISDLNQNLPALLQA
Ga0134109_1023059813300010320Grasslands SoilSMKLNFRSKSLLAVVLAISIAATGCSAQWINIALEDLPVLTQMTLSIATLVSALASGKQANPGDVAVIQNISAQASRHLNLLQSLYAEYKASANVTTRQKIQSVFLIWIRTFRRCCNQRTFRMPC*
Ga0134067_1039697613300010321Grasslands SoilMKANSRSKCLIAAVLALTIAATGCSAQWINVALQDLPVLTEMALNIASLVGTLGSGKQASSADVAVVQNISAQASRDLNLLQTLYTEYKSNPNSATLQKIQNVISGLN
Ga0134067_1049974713300010321Grasslands SoilMNLRSKLKSLLALVLALVSATTGCTSQWVNVAVQDLPVLTQMALNIATLVSTLAAGKQASTADVAVIQNISAQASRDLNLLQSLYNEYKASPNSTTLQKIQNVISGLNQNLPALLQAAHI
Ga0134062_1038202513300010337Grasslands SoilVKLNFRSKSLLAVVLAISIAATGCSAQWINIALEDLPVLTQMTLSIATLVSALASGKQANPGDVAVIQNISAQASRHLNLLQSLYAEYKASANVTTRQKIQSVFLIWIRTFRRCCNQRTF
Ga0126376_1264856513300010359Tropical Forest SoilMNISSKLKSLLALVLALLTATIGCTPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASTADVAAIQNISAQASRDLNLLQTLYSEYKANPNSSTLQKIQNVNSDVNQNLPELLQAAHISNPTLSARVTAAVSLIVSTV
Ga0105239_1163536113300010375Corn RhizosphereMHFKFKSLLPLVLAFSITLTACSSQWITIALQDLPVLTQMALNIATLAGTFSRQQNTADLAVIQNISAQASRDLNLLLTLYNEYKANPNAATLSKIQTGITGINQHLP
Ga0134126_1132590313300010396Terrestrial SoilMHSKFKSPLALVLATSIATTACSPQWINIALQDLPVLTQMALNIATLAGAFSRQQNTADLAVIQNISTQASRDLNLLLTLYNEYKATPNATTLGKIQAAINQINQHLPALLESAHISNALLTARVTAAVNLI
Ga0134126_1157391923300010396Terrestrial SoilMTVHFKSKSLLALVLAMAIATTGCSAQWINVALQDLPVLTQMALNIATLVGTLSTNKPPSTADLAVIQNISAQAGRDLNLLQTLYAEYKENPSDTTLG
Ga0137392_1000748683300011269Vadose Zone SoilMNLHSKSKIVLALVLAITIAPTGCSAQWVNLALQDLPVLTQMALNIATLVSTLASGNQASAADTAVIQNISAQASRDLNLLQSLYSEYKRNPNATALQKIQGVASDLNQNLPALLESAHIGNPVLSARITAAVNLILTTATVLRR*
Ga0137392_1119094223300011269Vadose Zone SoilMTTHSKSRIVPAFVLAIVIAATGCSAQWINLALQDLPVLTQMALNVATLVSTLASGKQASSADVALIQNISAQASRDLNLLQTLYSEYKATPSATTLAKIQSVLSDLNQNLPALLESAHLSN
Ga0137391_1002028933300011270Vadose Zone SoilMNLHSKSKIVLALVLAITIAATGCSAQWVNLALQDLPVLTQMALNIATLVSTLASGNQASAADTAVIQNISAQASRDLNLLQSLYSEYKRNPNATALQKIQGVASDLNQNLPALLESAHIGNPVLSARITAAVNLILTTATVLRR*
Ga0137391_1147574813300011270Vadose Zone SoilNDRAVEMASCSGAAQANDQRRTANNCLSGENMNPHSKSKSILAFVFAIAVVSTGCSAQWVNLALQDLPVLTSMALNIATLVSTLASGQQASAADNAVIQNISVQASRDLNLLQSLYGEYKANPSPTTLQKIQNVISDVNQNLPPLLESAHIANPVLSARITAAVNLILTTVNSF
Ga0120134_101397213300012004PermafrostVRQKANDCLSGENMNPNSKSKTLRALVLAITIAATECSAQWISLALQGLPVLTQMALNVATLVSTLASGQQASAADVAVIQNVSAQPSRDLSLLQSLHSEYKANPNATPLQKIQNVISDL
Ga0137374_1025213123300012204Vadose Zone SoilMTIHRTTKPFLALVLAIAIAATRCSTQWINIALQDLPVLTQMALTIATLASTLSSGKQASSGDVAVIQNISAQASRNLNLLQTLYNEYKAGPNATILAKTQTVISSLNQNLPSTALPH*
Ga0137362_1166611413300012205Vadose Zone SoilMNLHSKSKIVLALVLAITIAATGCSAQWVNLALQDLPVLTQMALNIATLVSTLASGNQASAADTAVIQNISAQASRDLNLLQSLYSEYKRNPNATALQKIQGVAS
Ga0137376_1036701113300012208Vadose Zone SoilMNIRSRLKSLLALVLALLTATTGCSPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASVADVAVIQNISSQASRDLNLLQTLYNDYKANPSSATLQQIQN
Ga0137379_1012999033300012209Vadose Zone SoilMNIRSRLKSLLALVLVLLTATTGCSPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASVADVAVIQNISSQASRDLNLLQTLYNDYKANPSSATLQKI
Ga0137379_1019486323300012209Vadose Zone SoilMPGFIHSYAAIPPLILVNDQRRFLQLPRRNMKLNSKSNILLILALAIAIAATGCSAQWINIALQDLPVLTQMSLNIATLVSTLASGKQASAADLAVIQNISAQASRDLNLLQTLYGEYKANPSSTTLAKIQNVISDLNQNLPTLLQSAHISNAVLSARVTAAVNLILTTVNSFASLMP*
Ga0137379_1149405513300012209Vadose Zone SoilMKSKSNILLILALAIAIATTGCSAQWINIALQDLPVLTQMSLNIATLVSTLASGKQASAADLAVIQNISAQASRDLNLLQALYGEYKGNPSSTTLAKIQNLISDLNQNLPTLLQSAHISN
Ga0137378_1000422453300012210Vadose Zone SoilMTTHSKSRIVLAFVLAIMIAATGCSAQWINLALQDLPVLAQMALNVAMLASTLASGTQASSADVAVIQNISAQTSRDLNLLQTLYSEHRATPAATTLAKIQNVISDLNQNLPALLESAHLSNPTLSARIAAR*
Ga0137378_1042497323300012210Vadose Zone SoilMNTRSRLKSLLALVLALLTATTGCSPQWINVAVQDLPVLTQMALNIATLVGTLASGKQASTADVAVIQNISAQASRDLNLLQTLYNEYKANPSSATLQK
Ga0137378_1171640413300012210Vadose Zone SoilMNNHSKSNALLALVLAISVAVTGCSAQWINIALQDLPVLTQMALNVATLVSTLAAGKQASAADVAVIQNISAQASRDLNLLQTLYSNYKANPSGTTLQKIQDVVSDLNQNLPTLLESAH
Ga0137377_1153354113300012211Vadose Zone SoilVLALAIATTGCSSQWINIALQDLPVLTQMALNIAMLASTLSSGKQASTADVAVIQNISAQASRNLNLLQTLYSEYKAGPNATKLVKIQNVISSLNQNLPALLESAHISNSLLSARISAAVNLILTTVNSFAALMPQSSAPTSRRMQPTS
Ga0137387_1121604213300012349Vadose Zone SoilMNIRSRLKSLLALMLALLIATTGCSPQWINVAVQDLPVLTQMALNIATLVSILASGKQASTADVAVIQNISAQASRDLNLLQTLYNEYKANPSSATLQKIQNVISDLNQNLPALLQAAHISNPTLSARITAA
Ga0137386_1011953923300012351Vadose Zone SoilMKSKSNILLILLLTIAIAATGCSAQWINIALQDLPVLTQMSLNIATLVSTLASGKQASAADLAVIQNISAQASRDLNLLQALYGEYKANPSSTTLARIQM*
Ga0137386_1113179613300012351Vadose Zone SoilMPGFIHSYAAIPPLILVNDQRRFLQLPRRNMKLNSKSNILLILALAIAIAATGCSAQWINIALQGLPVLTQMSLNIATLVSTLASGKQASAADLAVIQNISAQASRDLNLLQTLYGEYKANPSSTTLAKIQNLISDLNQNLPTLLQSAHISNTVLSARVTAAVNLILTTVNSFASLM
Ga0137384_1007636533300012357Vadose Zone SoilVQRPTTTDERPPTFFLQGETMKSKSNILLILLLTIAIAATGCSAQWINIALQDLPVLTQMSLNIATLVSTLASGKQASAADLAVIQNISAQASRDLNLLQALYGEYKANPSSTTLARIQM
Ga0137390_1073368023300012363Vadose Zone SoilMENTMTTHSKSRIVLAFVLAIVIAATGCSAQWINLALQDLPVLTQMALNVATLVSTLASGKQASSADVALIQNISAQASRDLNLLQTLYSEYKATPSATTLAKIQSVLSDLNQNLPALLESAHLLIPHSLPASPRR*
Ga0137395_1011799423300012917Vadose Zone SoilMARCGKRPTTNDCLSGENMNPNSKSKPLLALVLAITIAATGCSAQWISLALQDLPVLTQMALNVATLVSTLVSGPQASAADVAVIQNVSAQASRDLSLLQSLYREYKANPNATTLQKIQNVISDLNQNMPALLQSAHIGNPVLSARITAAVNLILTTVNSFAALIPQTA
Ga0137416_1146478713300012927Vadose Zone SoilMKTRSKSKAVLALVLTITLAATGCSAQWLNIALQDLPVLTQMALNIATLVSTMASGQQTSAADTAVIRNISAQASRDLNLLQTLYGEYKASPNGATLEKVQNAISDLNQNLAALL
Ga0137404_1184659623300012929Vadose Zone SoilMMKLRSKLKSLLALVLALLTATTGCSPQWINVAVQDLPVLTQMALNIATLVSTLAAGKQASTADVAVIQNISAQASRDLNLLQSLYNDYKANPSSATLQKIQNVISDLNQNLPALLQAAHISNPT
Ga0164301_1021160723300012960SoilMNIRSKLKSLLALVLALLTATIGCTPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASAADMAVIQNISAQASRDLNLLQSLYSEYKANPNNNTLQKIQNVIADVNQNLPALLQAAHISNPTLSARV
Ga0164304_1089895313300012986SoilMAIATTACSPNWINIALQDLPVLTQMALNIATLASTFSPQQNPADLAVIQNISGQASRDLNLLQTLYNEYKDSPNATTLAKIQSAIGLINQHLPALLESAHISNALLKTRVTIAVNLILATVNNFAALIPSH
Ga0164307_1043095023300012987SoilMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANPGDVAVIQNISAQASRDLNLLLSLYAEYKANPSATTLQKIQNVISDLDQNLPALLESAHISNSTLAARITAAINLILTTVNNFAALMPQTAPATSLRLPITPP
Ga0134079_1047731213300014166Grasslands SoilMNTHSKCKSLLALALAISIAATGCSAQWIKIALEDLPVLTQMALNIAALVGTMTAGKQTNNADLSVIQNISAQASRDLNLLQTLYNEYEAGPSETTLARIQTVIASLNQNLPALLESAHISNPLLAARVT
Ga0182008_1058736623300014497RhizosphereMHFKFKSLLPLVLAFSITLTACSSQWITIALQDLPVLTQMALNIATLAGTFSRQQNTADLAVIQNISAQASRDLNLLLTLYNEYKANPNAATLSKIQAGITGISQHLPALL
Ga0157376_1078501313300014969Miscanthus RhizosphereMNIRSKLKSLLALVLALLTATIGCTPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASAADMAVIQNIYAQASRDLNLLQSLYSEYKANPNNSALQKIQNVISDVNQNLPALLQAAHI
Ga0182007_1044015113300015262RhizosphereMHFKFKSLLPLVLAFSITLTACSSQWITIALQDLPVLTQMALNIATLAGTFSRQQNTADRAVIQNISAQASRDLNLLLTLYNEYKANPNAATLSKIQAGITGISQHLPALLESAHISNPLLSS
Ga0137403_1039486013300015264Vadose Zone SoilMMKLRSKLKSLLALVLALLTATTGCSPQWINVAVQDLPVLTQMALNIATLVSTLAAGKQASTADVAVIQNISAQASRDLNLLQSLYNDYKANPSSATLQKIQNVISDLNQNLPALLQAAHISNPTLSARIAAAVNLILSTVNSVASLMPHSS
Ga0134072_1020623313300015357Grasslands SoilMKANSRSKCLIAAVLALTIAATGCSAQWINVALQDLPVLTQMSLNIATLVSAFASGKQANPGDVAVIQNISAQASRDLNLLQSLYADYKASANVTTLQKIQSVISDMNHNLPALLQSA
Ga0187809_1005662223300017937Freshwater SedimentMHSKSKSLLALVLAISIVATGCSAQWINIALEDLPVLTQMALNIATLVATLASGQQATSADVAVIQNISAQASRDLNLLQTLYGEYKAAPSATTLQKIQSVVADLNQNMPALLE
Ga0066667_1093439423300018433Grasslands SoilMNAHSKSKAILALVLAISIAATGCSAQWIKIALEDLPVLTQMALNIAALVGTMTAGKQTNNADLSVIQNISAQASRDLNLLQTLYNEYEAGPSETTLARIQTVIASLNQNLPALLESAHISNPLLAARVTAAVNLILATVNSFAALIPHTS
Ga0066662_1005901023300018468Grasslands SoilMKANSRSKCLIAAVLALTIAATGCSAQWINVALQDLPVLTEMALNIASLVGTLGSGKQASSADVAVVQNISAQASRDLNLLQTLYTEYKSNPNSATLQKIQNVISGLNQNLPALLQSAHISSATLWRGSLLL
Ga0066662_1127094023300018468Grasslands SoilMKSNSRLKVLFAVVLAATIVITGCSGQWINLALQDLPVLTQMALNIATLASTLASGQQANSGDVAVIQNISAQASRDLSLLQSLYSEYKANPSATTLAKIQNVI
Ga0066669_1195111913300018482Grasslands SoilMKLRSKLKSLLALVLALVTATTGCSPQWINVAVQDLPVLTQMALNIATLVSTLASGKQASTADVAVIQNISAQASRDLNLLQSLYNDYKANPSSATLQK
Ga0193751_107915913300019888SoilLGHLLALVLAITIAATGCSAQWISLALQDLPVLTQMALNVATLVSTLASGQQASAADVAVIQNVSAQAGRDLSLLQSLYSEYKANPNATTLQKIQNVISDLNQNMPALLRSAHIGNPVLPARITAAVNLIPTTVNSFAALISRPRRRLQLQLRSEPVRLGLLARKP
Ga0193735_106418723300020006SoilMNIRSRLKSLLALVLALLSATTGCSPPWINVAVQDLPVLTQMALNIATLVSTLASGKQASTADVAVIKNISAQASRDLNLLQSLYNEYKANPSSATLQKLQNVISDLNQNLPALLQAAHISNPTLSARITAAVNLILSTVNSVASLMPQSPAANSRKIH
Ga0193749_107415513300020010SoilMNLRSKLKSLLALVLALVSATTGCTSQWVNVAVQDLPLLTQMALNIATLVSTLAAGKQASTGDVAVIQNISAQASRDLNLLQSLYNEYKASPNNTTLQKIQNAISGFNQNLPALLQAAHISNPTLSARVSAAVNLIISTVNSVASLMP
Ga0210400_1102109323300021170SoilMNSKSKVFLALVLAIAIAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQASTADIAVIQNISAQASRDLNLLQTLYGEYKANPRATTLQKIQNAIADLNQN
Ga0210387_1083824813300021405SoilMNPHSKSKILLALVLAISLAATGCSAQWVNLALQDLPVLTSMALNIATLVSTLASGQQASAADTAVIQNISAQASRDLNLLQSLYSEYKAAPSPTNLQKIQNVISDLNQNLPALLESAHIGNPVLSARITAAVNLILTTVNSFAALMPQASGSPA
Ga0210402_1081012413300021478SoilMNSKSKVFLALVLAIAIAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQASTADIAVIQNISAQASRDLNLLQTLYGEYKANPRATTLQKIQNAIADLNQNLPTLLQSAHISNPTLSARIAAAV
Ga0212123_10002312473300022557Iron-Sulfur Acid SpringMNPNSKSKTLLALVLAITIAATGCSAQWISLALQDLPVLTQKALNVATLVSTLPSGQQASAADVAVIQNVSAQASRDLSLLQSLYSEYKANPNATTLQKIGTDLRLSKIDR
Ga0207692_1012044013300025898Corn, Switchgrass And Miscanthus RhizosphereMNSKSKSLLALVLTIAIATTGCSTQWINIALQDLPVLTQMALNIATLAATLASGNQANTADVAVIQNISAQASRDLNLLQTLYNEYRANPS
Ga0207692_1023902913300025898Corn, Switchgrass And Miscanthus RhizosphereMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANPGDVAVIQNISAQASRDLNLLLSLYAEYKANPSATTLQKIQNVISDLDQNLPALLESAHIS
Ga0207685_1007544113300025905Corn, Switchgrass And Miscanthus RhizosphereMNSKSKSLLALVLAIAIATTGCSTQWINIALQDMPVLTQMALNIATLAATLASGNQASTADVAVIQNISAQASRDLNLLQTLYNEYKANPSASTRQKIQNVI
Ga0207695_1035415023300025913Corn RhizosphereMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANPGDVAVIQNISAQASRDLNLLLSLYAEYKANPSATTLQKIQNVISDLDQNLPALLESAHISN
Ga0207663_1119273213300025916Corn, Switchgrass And Miscanthus RhizosphereMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANSGDVAVIQNISAQASRDLNLLLSLYAEYKANPSATTLQKIQNVISDLDQNLPALLESAHISNSTLA
Ga0207694_1070784213300025924Corn RhizosphereMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQANSGDVAVIQNISAQASRDLNLLQSLYAEYKANPSATTLQKIQNVISDLDQNLPALLESAHISNSTLAAR
Ga0207700_1191330923300025928Corn, Switchgrass And Miscanthus RhizosphereMTHHSNAHSKSKSVLALVLVISIATTGCSPNWINIALQDLPVLTQMALNIATLASTFSQQQNTADLAVIQNISAQASRDLNLLLTLYHEFKASPNTATLAKI
Ga0207664_1004460453300025929Agricultural SoilMTHHSNAHSKSKSVLALVLVISIATTGCSPNWINIALQDLPVLTQMALNIATLASTFSQQQNTADLAVIQNISAQASRDLNLLLTLYHEYKASPNTATFAKIQSAID
Ga0207664_1079259613300025929Agricultural SoilMKTNSKSKVFLALVLAIAIAATGCSAQWINIALQDLPVLTQMALNIATLVSALASGQQASTADIAVIQNISAQASRDLNLLQTLYGEYKANPSATTLQKIQNAIADLNQNLPTLLQSAHISNPTLSARITAAVNLILTTVNSFAALMPQSSAAPARKAP
Ga0207664_1179451413300025929Agricultural SoilMLSRSKSLLVLVLAITLAATGCSAQWINIALQDLPVLTQMALNIATLVSALAPGQQANPGDVAVIQNISAQASRDLNLLLSLYAEYKANPSATTLQKIQNVISDLDQNLPALLESAHISNSTLAARITAAINLILTTVNNFAALMPQTAPATSQRL
Ga0207702_1073043023300026078Corn RhizosphereMNSKSKSLLALVLTIAIATTGCSTQWINIALQDLPVLTQMALNIATLAATLASGNQANTADVAVIQNISAQASRDLNLLQTLYNEYKANPSVSTRQKIQNVISDLNQNLPALLEAAHISSATLTARVTAAVNLILTT
Ga0209238_103318233300026301Grasslands SoilMNSKSKPFLALVLSILIATAGCSAQWINTALQDLPVLTQMALNIATLVSTLAAGQQASTGDIAVIQNISAQASRDLNLLQTLYSEYKAGPSATTLKKIQNAISDLNQNLPAMLQSAHISNATVSTRIAAAVNLILTTVNSFAALMPQSS
Ga0209580_1030080523300027842Surface SoilMNSKSKALIALVLAITIAATGCSAQWINIALQDLPVLTQMALNIAPLATALATGKQASTGDVAVIQNISVQASRDLNLLQTLYGEYKANSNSTTLQKIQNVISDLNQNLPTLLESAHISNAALTARVTAAVNLILATVNSFASLMPQTAPATAQRAPAKV
Ga0209180_1009924323300027846Vadose Zone SoilDPVCRQRLTQRLTTHDQRLLFANGETMNLHSKSKIVLALVLAITIAATGCSAQWVNLALQDLPVLTQMALNIATLVSTLASGNQASAADTAVIQNISAQASRDLNLLQSLYSEYKRNPNATALQKIQGVASDLNQNLPALLESAHIGNPVLSARITAAVNLILTTATVLRR
Ga0209180_1010462723300027846Vadose Zone SoilMNPHSKSKTFLALVLAITIAATGCSSQWINIALQDLPVLTQIAPNIAVVVSTLASGKQASAADTAVIQNISAQASRDLNLLQSLYSEYKANPNGTTLQKIQNVISDLNQNLPALLESAHIGNPVLSTRITAAVNLILTTVNSFAALIPQTALSTSQKT
Ga0209701_1048999113300027862Vadose Zone SoilMNPNSKSKPLLALVLAITIAATGCSAQWVNLALQDLPVLTQMALNIATLVSTLASGNQASAADTAVIQNISAQASRDLNLLQSLYSEYKRNPNATALQKIQGVASDLNQNLPALLESAH
Ga0307312_1042108023300028828SoilMNILVRVKSLLALVLALLTATTGCSPQWINVAVQDLPVLTQMALNISTLVSTLASGKQASTADVAVIQNISAQASRDLNLLQTLYNEYKANPSSATLQKIQNVISDVNQNLPALLQAAHISNPTLSARITAAVNLILSTVNSVASLMPQNSVVT
Ga0310813_1183142223300031716SoilMHFKFKSLLPLVLAFSITLTACSSQWITIALQDLPVLTQMALNIATLAGTFSRQQNTADLAVIQNISAQASRDLNLLLTLYNEYKANPNAATLSKIQTGITGINQHLPALLESA
Ga0307469_1104439223300031720Hardwood Forest SoilMNSKSKSLLALVLAIAIAATGCSSQWINIALQDMPVLTQMALNIATLAATLASGNQASTADVAVIQNISAQASRDLNLLQTLYNQYKANPSASTRQKIQNVISDLNQNLPALLEAAHISSATLTARVTA
Ga0307475_1103543413300031754Hardwood Forest SoilMKINSKSKSLLALVLAITIAATGCSAQWLNVALQDLPVLTQMALNIATLVSTLATGQQASTADVAIIQNISAQASRDLNLLQTLYSEYKATPSAATLQKIQAVI
Ga0307479_1192979913300031962Hardwood Forest SoilMNSKSKSFLALVLAVTIAATGCTAQWINVALQDLPVLTQMALNIATLVSAFASGQQASTADVAVIQNISAQASRDLNLLQLLYNEYKAAPSATTLQKIQNVISDLDQNLPNLLQSAHISNPTVSARITAAVNLILTTVNSFAAL
Ga0308176_1296665213300031996SoilVNFKSKSLLALVLAISITTAGCSAQWINIALQDLPVLTQMALNIATLVSAFASGKQANPGDVAVIQNISAQASRDLNLLQSLYAEYKASPSATTLQKLQSAISAMI
Ga0307472_10036754713300032205Hardwood Forest SoilMKTHSKSKCLLALVLAIAIATTGCSSQWINIALQDMPALTQMALNIATLAATLASGNQASTADVAVIQNISAQASRDLNLLQTLYNEYKANPSASTRQKIQNVISDLNQNLPALLEAAHISSATLTARVTAAVNLILTTVNSFASLIPQSTIPQLTAATSQK
Ga0335078_1263678513300032805SoilMNFKAKSVLALVLAITIAATGCSAQWITVALQDLPVLTQMALNIATLVSALASGQQASTADVAVIQNISAQASRDLNLLQTLYNDYKANPSTTTLQNIENAIADLNQNLPALLQSAHISNPTLSARIAAAVNLIL
Ga0335075_1074954223300032896SoilMNFKAKSVLALVLAITIAATGCSAQWITVALQDLPVLTQMALNIATLVSALASGQQASTADVAVIQNISAQASRDLNLLQTLYNDYKANPSTTTLQNIENAIADLNQNLPALLESAHISNPTPSARIAAAVNLILTTV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.