NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F061219

Metagenome / Metatranscriptome Family F061219

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F061219
Family Type Metagenome / Metatranscriptome
Number of Sequences 132
Average Sequence Length 119 residues
Representative Sequence MGRVKLIVGLAVLALAIIAGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEGFRNAVIGAAKKHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVTLPGFSFTLHFHPSSAK
Number of Associated Samples 77
Number of Associated Scaffolds 132

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 18.18 %
% of genes near scaffold ends (potentially truncated) 15.91 %
% of genes from short scaffolds (< 2000 bps) 59.85 %
Associated GOLD sequencing projects 65
AlphaFold2 3D model prediction Yes
3D model pTM-score0.74

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.242 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(53.030 % of family members)
Environment Ontology (ENVO) Unclassified
(50.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(54.545 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 39.46%    β-sheet: 18.37%    Coil/Unstructured: 42.18%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.74
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
d.110.6.4: Histidine kinase family 1 (HK1) sensor domainsd4xmra14xmr0.58408
f.5.1.0: automated matchesd5azpa_5azp0.5822
a.24.9.1: alpha-catenin/vinculind1dova_1dov0.57661
e.7.1.1: Inositol monophosphatase/fructose-1,6-bisphosphatase-liked1nuwa_1nuw0.57017
e.7.1.0: automated matchesd3uksa_3uks0.5693


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 132 Family Scaffolds
PF08669GCV_T_C 19.70
PF00753Lactamase_B 4.55
PF13620CarboxypepD_reg 3.79
PF13226DUF4034 3.03
PF12704MacB_PCD 2.27
PF11146DUF2905 1.52
PF03713DUF305 1.52
PF01058Oxidored_q6 1.52
PF00583Acetyltransf_1 0.76
PF00903Glyoxalase 0.76
PF01593Amino_oxidase 0.76
PF03279Lip_A_acyltrans 0.76
PF03544TonB_C 0.76
PF07045DUF1330 0.76
PF11737DUF3300 0.76
PF02371Transposase_20 0.76
PF07238PilZ 0.76
PF14559TPR_19 0.76
PF00266Aminotran_5 0.76
PF00392GntR 0.76
PF00722Glyco_hydro_16 0.76
PF00067p450 0.76
PF12849PBP_like_2 0.76
PF13560HTH_31 0.76
PF13751DDE_Tnp_1_6 0.76
PF02800Gp_dh_C 0.76
PF13424TPR_12 0.76
PF02575YbaB_DNA_bd 0.76
PF00933Glyco_hydro_3 0.76
PF12697Abhydrolase_6 0.76
PF00743FMO-like 0.76
PF09411PagL 0.76
PF00561Abhydrolase_1 0.76

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 132 Family Scaffolds
COG0377NADH:ubiquinone oxidoreductase 20 kD subunit (chain B) or related Fe-S oxidoreductaseEnergy production and conversion [C] 1.52
COG1740Ni,Fe-hydrogenase I small subunitEnergy production and conversion [C] 1.52
COG1941Coenzyme F420-reducing hydrogenase, gamma subunitEnergy production and conversion [C] 1.52
COG3260Ni,Fe-hydrogenase III small subunitEnergy production and conversion [C] 1.52
COG3544Uncharacterized conserved protein, DUF305 familyFunction unknown [S] 1.52
COG0057Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenaseCarbohydrate transport and metabolism [G] 0.76
COG0718DNA-binding nucleoid-associated protein YbaB/EfbCTranscription [K] 0.76
COG0810Periplasmic protein TonB, links inner and outer membranesCell wall/membrane/envelope biogenesis [M] 0.76
COG1472Periplasmic beta-glucosidase and related glycosidasesCarbohydrate transport and metabolism [G] 0.76
COG1560Palmitoleoyl-ACP: Kdo2-lipid-IV acyltransferase (lipid A biosynthesis)Lipid transport and metabolism [I] 0.76
COG2072Predicted flavoprotein CzcO associated with the cation diffusion facilitator CzcDInorganic ion transport and metabolism [P] 0.76
COG2124Cytochrome P450Defense mechanisms [V] 0.76
COG2273Beta-glucanase, GH16 familyCarbohydrate transport and metabolism [G] 0.76
COG3547TransposaseMobilome: prophages, transposons [X] 0.76
COG4261Predicted acyltransferase, LPLAT superfamilyGeneral function prediction only [R] 0.76
COG5470Uncharacterized conserved protein, DUF1330 familyFunction unknown [S] 0.76


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.24 %
UnclassifiedrootN/A0.76 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002907|JGI25613J43889_10043272All Organisms → cellular organisms → Bacteria → Acidobacteria1274Open in IMG/M
3300002914|JGI25617J43924_10120499All Organisms → cellular organisms → Bacteria924Open in IMG/M
3300004631|Ga0058899_11838181All Organisms → cellular organisms → Bacteria794Open in IMG/M
3300005174|Ga0066680_10189761All Organisms → cellular organisms → Bacteria → Acidobacteria1297Open in IMG/M
3300005176|Ga0066679_10245579All Organisms → cellular organisms → Bacteria1152Open in IMG/M
3300005555|Ga0066692_10009706All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4489Open in IMG/M
3300005555|Ga0066692_10518829All Organisms → cellular organisms → Bacteria757Open in IMG/M
3300005555|Ga0066692_11019173All Organisms → cellular organisms → Bacteria506Open in IMG/M
3300005557|Ga0066704_10141268All Organisms → cellular organisms → Bacteria → Acidobacteria1610Open in IMG/M
3300005559|Ga0066700_10612469All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300006173|Ga0070716_100001687All Organisms → cellular organisms → Bacteria9972Open in IMG/M
3300006797|Ga0066659_10955155All Organisms → cellular organisms → Bacteria716Open in IMG/M
3300007258|Ga0099793_10002082All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6608Open in IMG/M
3300007258|Ga0099793_10240553All Organisms → cellular organisms → Bacteria872Open in IMG/M
3300007258|Ga0099793_10360673All Organisms → cellular organisms → Bacteria711Open in IMG/M
3300007265|Ga0099794_10264979All Organisms → cellular organisms → Bacteria887Open in IMG/M
3300009038|Ga0099829_10001935All Organisms → cellular organisms → Bacteria → Acidobacteria11733Open in IMG/M
3300009038|Ga0099829_10206571All Organisms → cellular organisms → Bacteria → Acidobacteria1591Open in IMG/M
3300009038|Ga0099829_10237138All Organisms → cellular organisms → Bacteria → Acidobacteria1484Open in IMG/M
3300009038|Ga0099829_10247190All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfuromonadales1454Open in IMG/M
3300009038|Ga0099829_10883618All Organisms → cellular organisms → Bacteria742Open in IMG/M
3300009088|Ga0099830_10003367All Organisms → cellular organisms → Bacteria → Acidobacteria8752Open in IMG/M
3300009088|Ga0099830_10172477All Organisms → cellular organisms → Bacteria1677Open in IMG/M
3300009088|Ga0099830_10224352All Organisms → cellular organisms → Bacteria → Acidobacteria1479Open in IMG/M
3300009088|Ga0099830_10302754All Organisms → cellular organisms → Bacteria → Acidobacteria1276Open in IMG/M
3300009090|Ga0099827_10059377All Organisms → cellular organisms → Bacteria → Acidobacteria2910Open in IMG/M
3300011269|Ga0137392_10026521All Organisms → cellular organisms → Bacteria → Acidobacteria4148Open in IMG/M
3300011269|Ga0137392_10081527All Organisms → cellular organisms → Bacteria → Acidobacteria2512Open in IMG/M
3300011269|Ga0137392_10277912All Organisms → cellular organisms → Bacteria → Acidobacteria1382Open in IMG/M
3300011269|Ga0137392_10439348All Organisms → cellular organisms → Bacteria1084Open in IMG/M
3300011270|Ga0137391_10001062All Organisms → cellular organisms → Bacteria → Acidobacteria19795Open in IMG/M
3300011270|Ga0137391_10024751All Organisms → cellular organisms → Bacteria → Acidobacteria4990Open in IMG/M
3300011271|Ga0137393_10073055All Organisms → cellular organisms → Bacteria → Acidobacteria2741Open in IMG/M
3300011271|Ga0137393_10221008All Organisms → cellular organisms → Bacteria → Acidobacteria1602Open in IMG/M
3300011271|Ga0137393_10301153All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1365Open in IMG/M
3300012096|Ga0137389_10003572All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae9707Open in IMG/M
3300012096|Ga0137389_10335231All Organisms → cellular organisms → Bacteria → Acidobacteria1285Open in IMG/M
3300012096|Ga0137389_10448188All Organisms → cellular organisms → Bacteria1105Open in IMG/M
3300012189|Ga0137388_10531169All Organisms → cellular organisms → Bacteria1093Open in IMG/M
3300012202|Ga0137363_10010138All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6007Open in IMG/M
3300012202|Ga0137363_10327128All Organisms → cellular organisms → Bacteria → Acidobacteria1264Open in IMG/M
3300012202|Ga0137363_11054192All Organisms → cellular organisms → Bacteria691Open in IMG/M
3300012203|Ga0137399_10010349All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae5574Open in IMG/M
3300012203|Ga0137399_10027765All Organisms → cellular organisms → Bacteria3844Open in IMG/M
3300012203|Ga0137399_11243177All Organisms → cellular organisms → Bacteria627Open in IMG/M
3300012205|Ga0137362_10181226All Organisms → cellular organisms → Bacteria → Acidobacteria1810Open in IMG/M
3300012205|Ga0137362_10439956All Organisms → cellular organisms → Bacteria → Acidobacteria1129Open in IMG/M
3300012206|Ga0137380_11511482All Organisms → cellular organisms → Bacteria556Open in IMG/M
3300012207|Ga0137381_10345643All Organisms → cellular organisms → Bacteria1297Open in IMG/M
3300012351|Ga0137386_10137467All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1744Open in IMG/M
3300012351|Ga0137386_10164782All Organisms → cellular organisms → Bacteria1588Open in IMG/M
3300012361|Ga0137360_10012611All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae5432Open in IMG/M
3300012361|Ga0137360_10139609All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium → unclassified Acidobacterium → Acidobacterium sp.1908Open in IMG/M
3300012362|Ga0137361_10000402All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia23934Open in IMG/M
3300012362|Ga0137361_10000894All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae17696Open in IMG/M
3300012363|Ga0137390_10039568All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4541Open in IMG/M
3300012582|Ga0137358_10054063All Organisms → cellular organisms → Bacteria2679Open in IMG/M
3300012683|Ga0137398_10035365All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2867Open in IMG/M
3300012685|Ga0137397_10098003All Organisms → cellular organisms → Bacteria → Acidobacteria2147Open in IMG/M
3300012918|Ga0137396_10062324All Organisms → cellular organisms → Bacteria → Acidobacteria2588Open in IMG/M
3300012918|Ga0137396_10387331All Organisms → cellular organisms → Bacteria1036Open in IMG/M
3300012923|Ga0137359_11219588All Organisms → cellular organisms → Bacteria640Open in IMG/M
3300012924|Ga0137413_10567429All Organisms → cellular organisms → Bacteria845Open in IMG/M
3300012927|Ga0137416_11788302All Organisms → cellular organisms → Bacteria561Open in IMG/M
3300012929|Ga0137404_10119148All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2155Open in IMG/M
3300015054|Ga0137420_1468475All Organisms → cellular organisms → Bacteria → Acidobacteria1747Open in IMG/M
3300015241|Ga0137418_10420082All Organisms → cellular organisms → Bacteria1087Open in IMG/M
3300015242|Ga0137412_10578513All Organisms → cellular organisms → Bacteria852Open in IMG/M
3300017823|Ga0187818_10032244All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2249Open in IMG/M
3300017933|Ga0187801_10013087All Organisms → cellular organisms → Bacteria2742Open in IMG/M
3300018088|Ga0187771_10689856All Organisms → cellular organisms → Bacteria866Open in IMG/M
3300019789|Ga0137408_1100871All Organisms → cellular organisms → Bacteria → Acidobacteria844Open in IMG/M
3300020170|Ga0179594_10018239All Organisms → cellular organisms → Bacteria → Acidobacteria2090Open in IMG/M
3300020199|Ga0179592_10012272All Organisms → cellular organisms → Bacteria3723Open in IMG/M
3300020199|Ga0179592_10102938All Organisms → cellular organisms → Bacteria1311Open in IMG/M
3300020199|Ga0179592_10414661All Organisms → cellular organisms → Bacteria586Open in IMG/M
3300020579|Ga0210407_10000474All Organisms → cellular organisms → Bacteria → Acidobacteria45408Open in IMG/M
3300020579|Ga0210407_10239092All Organisms → cellular organisms → Bacteria1414Open in IMG/M
3300020579|Ga0210407_10628670All Organisms → cellular organisms → Bacteria836Open in IMG/M
3300020580|Ga0210403_10127646All Organisms → cellular organisms → Bacteria → Acidobacteria2073Open in IMG/M
3300020581|Ga0210399_10072397All Organisms → cellular organisms → Bacteria2791Open in IMG/M
3300020581|Ga0210399_10076687All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2711Open in IMG/M
3300020581|Ga0210399_10920250All Organisms → cellular organisms → Bacteria708Open in IMG/M
3300021046|Ga0215015_10058783All Organisms → cellular organisms → Bacteria1058Open in IMG/M
3300021046|Ga0215015_10059360All Organisms → cellular organisms → Bacteria733Open in IMG/M
3300021046|Ga0215015_10060069All Organisms → cellular organisms → Bacteria768Open in IMG/M
3300021168|Ga0210406_10080122All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2794Open in IMG/M
3300021168|Ga0210406_10915832All Organisms → cellular organisms → Bacteria658Open in IMG/M
3300021170|Ga0210400_10000050All Organisms → cellular organisms → Bacteria196437Open in IMG/M
3300021171|Ga0210405_10001744All Organisms → cellular organisms → Bacteria23933Open in IMG/M
3300021171|Ga0210405_10015052All Organisms → cellular organisms → Bacteria6392Open in IMG/M
3300021171|Ga0210405_10377514All Organisms → cellular organisms → Bacteria1117Open in IMG/M
3300021559|Ga0210409_10232440All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp.1672Open in IMG/M
3300021559|Ga0210409_10844933All Organisms → cellular organisms → Bacteria789Open in IMG/M
3300025939|Ga0207665_10000103All Organisms → cellular organisms → Bacteria → Acidobacteria55197Open in IMG/M
3300026320|Ga0209131_1038551All Organisms → cellular organisms → Bacteria → Acidobacteria2757Open in IMG/M
3300026324|Ga0209470_1195623All Organisms → cellular organisms → Bacteria860Open in IMG/M
3300026334|Ga0209377_1054083All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1782Open in IMG/M
3300026360|Ga0257173_1039727All Organisms → cellular organisms → Bacteria646Open in IMG/M
3300026377|Ga0257171_1001315All Organisms → cellular organisms → Bacteria3390Open in IMG/M
3300026482|Ga0257172_1077196All Organisms → cellular organisms → Bacteria612Open in IMG/M
3300026497|Ga0257164_1021183All Organisms → cellular organisms → Bacteria925Open in IMG/M
3300026551|Ga0209648_10017420All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6270Open in IMG/M
3300026551|Ga0209648_10771951All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300026555|Ga0179593_1143974All Organisms → cellular organisms → Bacteria → Acidobacteria3257Open in IMG/M
3300026999|Ga0207949_1008279All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium939Open in IMG/M
3300027643|Ga0209076_1004353All Organisms → cellular organisms → Bacteria → Acidobacteria3281Open in IMG/M
3300027643|Ga0209076_1153888All Organisms → cellular organisms → Bacteria642Open in IMG/M
3300027671|Ga0209588_1221611All Organisms → cellular organisms → Bacteria584Open in IMG/M
3300027846|Ga0209180_10148213All Organisms → cellular organisms → Bacteria1351Open in IMG/M
3300027862|Ga0209701_10005567All Organisms → cellular organisms → Bacteria8258Open in IMG/M
3300027862|Ga0209701_10026815All Organisms → cellular organisms → Bacteria3748Open in IMG/M
3300027882|Ga0209590_10071370All Organisms → cellular organisms → Bacteria → Acidobacteria2001Open in IMG/M
3300027903|Ga0209488_10357133All Organisms → cellular organisms → Bacteria → Acidobacteria1086Open in IMG/M
3300031753|Ga0307477_10000250All Organisms → cellular organisms → Bacteria71079Open in IMG/M
3300031753|Ga0307477_10103847All Organisms → cellular organisms → Bacteria → Acidobacteria1976Open in IMG/M
3300031753|Ga0307477_10164220All Organisms → cellular organisms → Bacteria1548Open in IMG/M
3300031753|Ga0307477_10302020All Organisms → cellular organisms → Bacteria1105Open in IMG/M
3300031753|Ga0307477_10369795All Organisms → cellular organisms → Bacteria984Open in IMG/M
3300031754|Ga0307475_10003435All Organisms → cellular organisms → Bacteria → Acidobacteria9626Open in IMG/M
3300031754|Ga0307475_10128015All Organisms → cellular organisms → Bacteria → Acidobacteria2004Open in IMG/M
3300031823|Ga0307478_10787838Not Available796Open in IMG/M
3300031962|Ga0307479_10000165All Organisms → cellular organisms → Bacteria61579Open in IMG/M
3300031962|Ga0307479_10001030All Organisms → cellular organisms → Bacteria → Acidobacteria25267Open in IMG/M
3300031962|Ga0307479_10034699All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4821Open in IMG/M
3300031962|Ga0307479_10466420All Organisms → cellular organisms → Bacteria → Acidobacteria1247Open in IMG/M
3300031962|Ga0307479_11208608All Organisms → cellular organisms → Bacteria719Open in IMG/M
3300032180|Ga0307471_100092649All Organisms → cellular organisms → Bacteria → Acidobacteria2709Open in IMG/M
3300032180|Ga0307471_100631382All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → unclassified Terriglobales → Acidobacteriales bacterium 13_2_20CM_55_81232Open in IMG/M
3300032180|Ga0307471_101918766All Organisms → cellular organisms → Bacteria741Open in IMG/M
3300032180|Ga0307471_103624796All Organisms → cellular organisms → Bacteria546Open in IMG/M
3300032205|Ga0307472_100083038All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium → Acidobacterium ailaaui2139Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil53.03%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil14.39%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil13.64%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil7.58%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.79%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.27%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment1.52%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.52%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.52%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.76%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017823Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3EnvironmentalOpen in IMG/M
3300017933Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1EnvironmentalOpen in IMG/M
3300018088Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_10_MGEnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026999Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF044 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25613J43889_1004327223300002907Grasslands SoilMGKVKLILGLAVLGLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGSPPKQIIFLAADYKARVTLPGFPFTLHFHPSSAR*
JGI25617J43924_1012049913300002914Grasslands SoilMRKVKLILGLAVLALAIIASWQIASCELANLELHEDLRDLAAQTGAYIGLXSFNSDEDFRNAVIRAAKKHEIQLEPEQVTVQRTGTAQVPIIYLAADYKV
Ga0058899_1183818123300004631Forest SoilMGKVKLILGLAVLALAIIAGWRITSCVLANLELHGDLVDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTGTAIAPIIYLMADYKVRVTLLGCSFTLHFNPSSAK*
Ga0066680_1018976123300005174SoilMRNAKLILGLAVLALAVITGWQIAWCELANLQLREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKKHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0066679_1024557933300005176SoilGWRIASCELANIEFRGELRDLAAQAGAKIGLNSFSTDEELRDAVIREAKKYQIQLEPEQVIVERTGTPPAQIIYLVADYQARVTLPGFSFTLHFHPSSAK*
Ga0066692_1000970663300005555SoilMGRVKLILGLAVLALAIIAGWQIALCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTLERTGTGPTQIIYLAADYKTRVALPGFSFTLHFHPSSAK*
Ga0066692_1051882923300005555SoilGLAVLGLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGSPPKQIIFLAADYKARVTLPGFPFTLHFHPSSAR*
Ga0066692_1101917313300005555SoilMRNAKLILGLAVLALAVIAGWQIAWCELANLQLREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKKHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0066704_1014126833300005557SoilMGKVRLILGLAVLGLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGSPPKQIIFLAADYKARVTLPGFPFTLHFHPSSAR*
Ga0066700_1061246923300005559SoilMMGKVKLILGLAVLGLAIIAGWQIASCELANLELHEDLRDLAAQTGAYIGVNPFNTDDDFRNAIIRAAKKYEIRLEPEQVTVGRTGTPPAQIIYLAADYKARVALPGCSFTLHFHPSSAR
Ga0070716_10000168783300006173Corn, Switchgrass And Miscanthus RhizosphereMRKVKLILGIAVLALAIIAGWQIASCELANSEFRGELRDLAAQAGARIGLNSFSTDEELRDAVIRKAKTHEIQIEPEQVTVERTGSGPEQIIHLAADYKMRVTLPGFSFSLHFHSSSAK*
Ga0066659_1095515523300006797SoilMGKVKLILGLAVLGLAIIAGWQIASCELANLELHEDLRDLAAQTGAYIGVNPFNTDDDFRNAIIRAAKKYEIRLEPEQVTVGRTGTPPAQIIYLAADYKARVALPGCSFTLHFHPSSAR*
Ga0099793_1000208263300007258Vadose Zone SoilMGKVKLILGLAVLGLVIIAGWQIASCELSNLELHEEIRDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK*
Ga0099793_1024055323300007258Vadose Zone SoilMGLAFTKAVNPVQSIVGMEKVKFIFGLVVLALAIMTGWQIASCELANIEFHEELRDIAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERTGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK*
Ga0099793_1036067313300007258Vadose Zone SoilMGKVKLILGLAVLALAITAGWQIASCVLANLELHVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTNTATAPIIVLVADYKVRVTLL
Ga0099794_1026497923300007265Vadose Zone SoilMGKVRLILGLAVLGLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK*
Ga0099829_10001935113300009038Vadose Zone SoilMGKVKLILGLAVLALAIIAGWQVASFELANLELHEDLRDLAAQGGARIGLGNFSTDEDLRDAVIREAKRHEIQLGPEQVTVQRTGTAPAQIIYLAADYKVRVMLPGCSFTLHFNPSSAR*
Ga0099829_1020657133300009038Vadose Zone SoilMRNLKLILGLAVLALAVIAGWQIASCELANLQLREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKKHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0099829_1023713823300009038Vadose Zone SoilVLNRREVYSSGKPNYPDVTIGKLKLILGLAALALAINAGWQIASCELANLELREDLRDLASQAGSRIGLVHFNTDEDFRKAVIHHAERHDMRLEPAQVIVQRTGTGPATTGIIYFAADYKARVTLLGFSFNLHFRPSSAR*
Ga0099829_1024719023300009038Vadose Zone SoilVNLILGLAVLALAINAGWQIGACEVTNLELREDLRDIAAQTGSRIGLNSFSTDEELRAAVIRAAKEYDLQIEPEQVTVQSTGAGAKVVTYLAVDYKARVKLIGFSFTLHFNPSSVR*
Ga0099829_1088361823300009038Vadose Zone SoilMGRVKLIVGLAVLALAIIAGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEGFRNAVIGAAKKHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVTLPGFSFTLHFHPSSAK*
Ga0099830_1000336793300009088Vadose Zone SoilMGKGKLILGLAVLALAIIAGWQVVSCELANLGLREDLRDIASQAGAYIGLVSFNSDEDFRKAVIRAAQSHDIQLEAEQVTVQRTGTVPNQSIYLEVDYKARVKLPGFSFALHFHPTSAK*
Ga0099830_1017247723300009088Vadose Zone SoilMGKVKLILGLAVLALAITAGWQIASCVLANLELRVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTSTATAPIIVLVADYKVRVTLLGCSFTLHFNPSSAK*
Ga0099830_1022435213300009088Vadose Zone SoilMGRVKLIAGLAVLALAIITGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEGFRNAIIGAAKKHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVTLPGFSFTLHFHPSSAK*
Ga0099830_1030275413300009088Vadose Zone SoilMGKGKLILGLAVLALAIIAGWQVVSCELANLGLREDMHDLAAQAGAYIGLVSFNSDEDFRKAVIRAAQSHDIQLEAEQVTVQRTGAVPNQSIYLAVDYKARVKLPGFSFALHFHPTSAK*
Ga0099827_1005937733300009090Vadose Zone SoilMRNVKLILGLAVLVLAVIAGWQIASCELANLQLREELRDIAAQGGARIWLLSFNTDEELRNAVIREAKKHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0137392_1002652113300011269Vadose Zone SoilMGKGKLILGLAVLALAIIAGWQIASCELANLTLRGDLRDLAAQAGAYIGLVSFNTDEDFRNAVIRAAKSHDIQLEPEQVTVQRTGTVPAQSIYLAVDYKARVVMPGFSFTV
Ga0137392_1008152733300011269Vadose Zone SoilMGRVKLIVGLAVLALAIIAGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEGFRNAIIGAAKKHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVTLPGFSFTLHFHPSSAK*
Ga0137392_1027791213300011269Vadose Zone SoilMRNAKLILGLAVLALAVIAGWQTASCELANLQLREELRDIAAQGGARIGLLSFNTDEELRSAVIREAKKHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0137392_1043934823300011269Vadose Zone SoilMGKVKLILGLAVLALAITAGWQIASCVLANLELHVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTNTATAPIIVLVADYKVRVTLLGCSFTLHFNPSSAR*
Ga0137391_10001062113300011270Vadose Zone SoilMGKVKLILGLAVLALAIIAGWQVASFELANLELHEDPRDLAAQGGARIGLGNFSTDEDLRDAVIREAKRHEIQLGPEQVTVQRTGTAPAQIIYLAADYKVRVMLPGCSFTLHFNPSSAR*
Ga0137391_1002475143300011270Vadose Zone SoilMRKVKLILGLAVLALAIIASWQIASCELANLELHEDLRDLAAQTGAYIGLFSFNSDEDFRNAVIRAAKKHEIQLEPEQVTVQRTGTAQVPIIYLAADYKVRVTLPGCSFTLHFHPSSAK*
Ga0137393_1007305533300011271Vadose Zone SoilMRNAKLILGLAVLALAVIAGWQIASCELANLQLREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKKHEIQLVPEQITVERKGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0137393_1022100823300011271Vadose Zone SoilMGKGKLILGLAVLALAIIAGWQIASCELANLTLRGDLRDLAAQAGAYIGLVSFNTDEDFRNAVIRAAKSHDIQLEPEQVTVQRTGTVPAQSIYLAVDYKARVAMPGFSFTVHFHPTSAK*
Ga0137393_1030115333300011271Vadose Zone SoilMGKVKLILGLAVLALAITAGWQIASCVLANLELHVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTSTATAPIIVLVADYKVRVTLLGCSFTLHFHPSSA
Ga0137389_1000357273300012096Vadose Zone SoilMGKVKLILGLAVLGLAIIAGWQIASCELANLELHEDLRDLAAQTGAHIGLNPFNTDEDFRNAIIRAAKKYEIQLEPEQVTVERTGTPPAQTICLAADYKARVALPGCSLTLHFHPSSAK*
Ga0137389_1033523123300012096Vadose Zone SoilMRNAKLILGLAVLALAVIAGWQIASCELANLQLREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKKHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0137389_1044818823300012096Vadose Zone SoilMGKVKLILGLAVLALAITAGWQIASCVLANLELHVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTGTAIAPIIYLVADYKVRVTLLGCSLTLHFNPSSAK*
Ga0137388_1053116913300012189Vadose Zone SoilMGRVKLIVVLAVLALSIIAGWQIASCALANLELHEDLRDLAAQGGARIGLVSFRTDEDLRDAVMRAAKGHEIQLEPEQVTVQRTGTAGAPVIYLAADYKVRVTLPGCSFTLRFNPSSAK*
Ga0137363_1001013833300012202Vadose Zone SoilMGRVKLILGLAVLALAIIAGWQIALCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTVERTGTGPTQIIYLAADYKTRVTLPGFSFTLHFHPSSAK*
Ga0137363_1032712823300012202Vadose Zone SoilMRNAKLILGLAVLALAVIAGWQTASCELANLQLREELRDIAAQCGARIGLLSFNTDEELRNAVIREAKQHEIQLVPEQITVERKGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR*
Ga0137363_1105419213300012202Vadose Zone SoilMGKVKLILGLAVLALAIIAGWQVASFELANLELHEDLRDLAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERTGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK*
Ga0137399_1001034943300012203Vadose Zone SoilMGKVKLILGLAVLGLVIIAGWQIASCELSNFELHEEIRDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK*
Ga0137399_1002776533300012203Vadose Zone SoilMGLAFTKAVNPVQSIVGMEKVKFIFGLVVLALAIMTGWQIASCELANIEFHEELRDIAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERAGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK*
Ga0137399_1124317713300012203Vadose Zone SoilGWQMASCELANLALHEDMRDLAAQAGAYIGLVSFNTDEDFRNAVIRAAKAHEIQLEPEQVTVQRTGSAPAQIIYLAVDYRARVRLPGFSFTLHFHPTSAK*
Ga0137362_1018122623300012205Vadose Zone SoilMRNAKLILGLAVLALAVIAGWQIASCELANLQLREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKQHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFQPSSAR*
Ga0137362_1043995623300012205Vadose Zone SoilMGRVKLILGLAVLALGIIAGWQIASCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPAQVTVERTGTGPTQIIYLAADYKTRVTLPGFSFTLHFHPSSAK*
Ga0137380_1151148213300012206Vadose Zone SoilMGKVKLILGLAVLAVAITAGWQIASCQLANIELRDDLRDLAVQTGAHIGLVPFKTDEDFRNAVIRDAKKYGIQLEPQQVTVQRTGTTQVPIIYLAADYKVRVTLPGYSFTLHFHPSSVK*
Ga0137381_1034564323300012207Vadose Zone SoilMGKVKLILGLAVLAVAITAGWQIASCQLANIELRDDLRDLAVQTGAHIGLVPFKTDEDFRNAVIRDAKKYGIQLAPEQVTVQRTGAAQVPIIYLAADYKMPVTLAGGNSLTR*
Ga0137386_1013746723300012351Vadose Zone SoilMGRVKLILGLAVLALAIIAGWQIALCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTLERTGTGPTQIIYLAADYKTRVVLPGFSFTLHFHPSSAK*
Ga0137386_1016478233300012351Vadose Zone SoilMGKVKLILGLAVLAVAITAGWQIASCQLANVELRDDLQDLAVQTGAHIGLVPFKTDEDFRNAVIRDAKKYGIQLEPQQVTVQRTGTTQVPIIYLAADYKVRVTMPGYSFTLHFHPSSVK*
Ga0137360_1001261133300012361Vadose Zone SoilMGRVKLILGLAVLALAIIAGWQIASCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTLERTGTGPTQIIYLAADYKTRVALPGFSFTLHFHPSSAK*
Ga0137360_1013960913300012361Vadose Zone SoilMRNAKLILGLAVLALAVIAGWQIASCELANLQFREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKKHEIPLAPEQITVERTGTPPAEIIYLSADYRARVTLPGFSF
Ga0137361_10000402123300012362Vadose Zone SoilMGRVKLILGLAVLALAIIAGWQIASCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPAQVTVERTGTGPTQIIYLAADYKTRVTLPGFSFTLHFHPSSAK*
Ga0137361_10000894113300012362Vadose Zone SoilLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGSPPKQIIFLAADYKARVTLPGFPFTLHFHPSSAR*
Ga0137390_1003956863300012363Vadose Zone SoilMRNAKLILGLAVLALAVIAGWQTASCELANLQLREELRDIAAQGGARIGLLSFNTDEELRSAVIREAKKHEIQLVPEQITVERKGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSTR*
Ga0137358_1005406323300012582Vadose Zone SoilMGLAFTKAVNPVKSIVGMEKVKFIFGLVVLALAIMTGWQIASCELANIEFHEELRDIAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERAGTPPAQIIYLAADYKSRVKVPGFSFTLHFHPSSAK*
Ga0137398_1003536553300012683Vadose Zone SoilMGRVKLILGLAVLALAIIAGWQIASCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTVERTGTGPTQIIYLAADYKTRVTLPGFSFTLHFHPSSAK*
Ga0137397_1009800323300012685Vadose Zone SoilMGLAFTKAVNPVQSIVGMEKVKFIFGLVVLASAIMTGWQIASCELTNIEFHEELRDIAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERAGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK*
Ga0137396_1006232433300012918Vadose Zone SoilMFGLAVLALAIIAGWQMASCELANLALHEDMRDLAAQAGAYIGLVSFNTDEDFRNAVIRAAKAHEIQLEPEQVTVQRTGSAPAQIIYLAVDYRARVRLPGFSFTLHFHPTSAK*
Ga0137396_1038733123300012918Vadose Zone SoilMGKVKLILGLAVLALAITAGWQIASCVLANLELHVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPDQVTVQRTSTATAPIIVLVADYKVRVTLLGCSFTLHFNPSSAK*
Ga0137359_1121958823300012923Vadose Zone SoilCVVVMGRVKLILGLAVLALAIIAGWQIASCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVIGKAKTHKIQLEPAQVTVERTGTGPRQIIYLAADYKTRVTLPGFSFTLHFHPSSAK*
Ga0137413_1056742923300012924Vadose Zone SoilMGLAFTKAVNPVQSIVGMEKVKFIFGLVVLALAIMTGWQIASCELANIEFHEELRDIAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERAGTPPAQIIYLAADYKSRVKVPGFSFTLHFHPSSAK*
Ga0137416_1178830213300012927Vadose Zone SoilIASCVLANLELHVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHGIQLEPEQVTVQRTSTATAPIIVLVADYKVRVTLLGCSFTLHFNPSSAK*
Ga0137404_1011914823300012929Vadose Zone SoilMGRVKLILGLAVLALDIIAGWQIASCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPAQVTVERTGTGPTQIIYLAADYKSRVTLPGFSFTLHFHPSSAK*
Ga0137420_146847513300015054Vadose Zone SoilMGKVKLILGLAVLGLVIIAGWQIASCELSNLELHEEIRDLAGQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK*
Ga0137418_1042008213300015241Vadose Zone SoilVLHLSAAIASAWLLYRRGLPFVLGLAVLALAIIAGWQMASCELANLALHEDMRDLAAQAGAYIGLVSFNTDEDFRNAVIRAAKAHEIQLEPEQVTVQRTGSAPAQIIYLAVDYRARVRLPGFSFTLHFHPTSAK*
Ga0137412_1057851313300015242Vadose Zone SoilMGLAFTKAVNPVQSIVGMEKVKFIFGLVVLALAIMTGWQIASCELVNIEFHEELRDIAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERAGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK*
Ga0187818_1003224413300017823Freshwater SedimentMGSTFTNTINPVQSIVEMGKVKFILALAILALAIIAGWQIASCELDNLGFHEDLRDLAAQGGARIGLLSFSSDEELRDDVVRKAKKRGIQVEPEQVTVERTGTLPAQTIYLAVEYKTRVKLPGCSFTLHFHASSAK
Ga0187801_1001308713300017933Freshwater SedimentMGKLKFILGLAVLALAIIVGWQIASCELANYEFHEDLRDLAAQGGARIGLLSFSTDEELRNAVVREAKKRGIQVEPEQVTVERTGTLPAQTIYLAVEYKTRVKLPGCSFTLHFHASSAK
Ga0187771_1068985613300018088Tropical PeatlandMAKVKFILGLAILALAIIAGWQIASCELANLEFHENLRDLAAQGGARIGWFSFSTDEELRDAVIREAKKHDIQIEPEQVTVERTGTPPSQTICLVADYKARVKLPGCSFTLHFHPSSAK
Ga0137408_110087113300019789Vadose Zone SoilMGKVRLILGLAVLGLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGSPPKQIIFLAADYKARVTLPGFPFTLHFHPSSAR
Ga0179594_1001823923300020170Vadose Zone SoilMGKVKLILGLAVLGLVIIAGWQIASCELSNLELHEEIRDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK
Ga0179592_1001227243300020199Vadose Zone SoilMGKVKLMFGLAVLALAIIAGWQMASCELANLALHEDMRDLAAQAGAYIGLVSFNTDEDFRDAVIRAAKAHEIQLEPEQVTVQRTGSAPAQIIYLAVDYRARVRLPGFSFTLHFHPTSAK
Ga0179592_1010293813300020199Vadose Zone SoilMGLAFTKAVNPVQSIVGMEKVKFIFGLVVLALAIMTGWQIASCELANIEFHEELRDIAAQGGAKIGLLSFRTDEELRDAVIHEAKRHDIQLESTQVTVERAGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK
Ga0179592_1041466123300020199Vadose Zone SoilMGKVKLILGLAVLGLVIIAGWQIASCELSNLELHEEIRDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYK
Ga0210407_10000474133300020579SoilMGKVKLILGLAVLALAIVTGWQIASCELANLAFREEIRDIAAQGGARIGLLSFNTDEELRDAVIREAKKHEIQLEPGQVTVERTGTPPAQIIYLSVDYKARVTLLPGWSFTLHFRPSSAR
Ga0210407_1023909223300020579SoilMGKGKLILGLAVLALAIIAGWQIVSCELANLELRGDLRDIASQAGAYIGLVSFNSDEDFRKAVIRAAKGHEIELEAEQVTVQRTGTAPNQSIYLAVDYKARVKVPGFSLALHFHPTSAR
Ga0210407_1062867013300020579SoilMGKVKLILGLAVLAVAIIAGWQIASCELADYELREEMRDLSTQTGAHIGLLSFKTDEEFRDSVIRAAKRHEIQLEPDQVTVERTGTPQAPIIFLAADYKARVVLLAFSFTLRFHPSSAK
Ga0210403_1012764633300020580SoilMGKVKLILGLAVLALAIITGWQIASCELANLAFREDIRDIASQGGARIGLLSFNTDEELRDAVIREAKQHEIQLEPGQVTVERTGTPPAQIIYLSVDYKACVTLLPGWSFTLHFRPSSAR
Ga0210399_1007239723300020581SoilMGKVKLILGLAVLAVAIIAGWQIASCELADYELREEMRDLSTQTGAHIGLLSFKTDEEFRDSVIRAAKRHEIQLEPEQVTVERTGTPQAPIIFLAADYKARVVLLAFSFTLRFHPSSAK
Ga0210399_1007668723300020581SoilMGKGKLILGLAVLALAVIAGWQIVSCELANLELRGDLRDLAAQAGAYIGLVSFNSDEDFRNAVIRAAKGHEIELEAEQVTVQRTGTAPNQSIYLAVDYKARVRLPGFSLALHFHPTSAK
Ga0210399_1092025013300020581SoilMGKGKLIVGLAVLAMVVIAGWQIASCELANLALREDLRDIASQAGAYIGLVSFNSDEDFRNAVIRAAQSHDIQLEPEQVTVQRTGTVPAQSIYIAVDYKARVKLPGFSFALHFHPKSAR
Ga0215015_1005878313300021046SoilMNVMGKVKLILGLAVLALAIIAGWQIASCELANLELHEDLHDLAAQAGVRIGLVSSSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTGTVTAPVIYLVADYKVRVTLLGCSFTLHFHPSSAK
Ga0215015_1005936023300021046SoilVKLILGLAVLALGVIAGWQIASCELANLELQEELHDFAAQTGAHIGLNPFNTDEDFRNAIIRAAKKYQIQLEPEQVTVERTGTPPKQIIVLAVDYKARVALLGFSFGLHFQASSGK
Ga0215015_1006006913300021046SoilMGKVKLILGLAVLALAITAGWQIASCVLADLELHVDLRDLAAQVGARIGLVSFSTDDELRDAVIRKAKTHEIQLEPEQVTVQRTGTAIAPVIYLVADYNCLLYTSDAADDMQC
Ga0210406_1008012213300021168SoilMGLVFTKAVNPVQSIVGMEKVKFIFGLVVLALAIMTGWQIASCELANIEFHAELRDIAAQIGAKIGLLSFSTDEELRDAVIHEAKRHDIQLESTQVTVQRTGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK
Ga0210406_1091583213300021168SoilMGKVKLILGLAVLAVAIIAGWQIASCELADYELREEMRDLSTQTGAHIGLLSFKTDEEFRDSVIRAAKRHEIQLEPEQVIVERTGTPQAPIIFLAADYKARVVLLAFSFTLRFHPSSAK
Ga0210400_1000005023300021170SoilMGKGKLILGLAVLALAVIAGWQIVSCELANLELRGDLRDIASQAGAYIGLVSFNSDEDFRKAVIRAAQSHDIQLEPEQVTVQRTGTALNQSIYLAVDYKARVKVPGFSLAFHFHPTSAR
Ga0210405_1000174473300021171SoilMGKGKLILGLAVLALAIIAGWQIVSCELANLELRGDLRDIASQAGAYIGLVSFNSDEDFRNAVIRAAKGHEIELEAEQVTVQRTGTAPNQSIYLAVDYKARVKVPGFSLALHFHPTSAR
Ga0210405_1001505243300021171SoilMGKGKLILGLAVLALAVIAGWQIVSCELANLELRGDLRDIASQAGAYIGLVSFNSDEDFRNAVIRAAKGHEIELEAEQVTVQRTGTAPNQSIYLAVDYKARVALPGFSFTLHFHPTSAK
Ga0210405_1037751413300021171SoilMGKGKLILGLAVLALAVIAGWQIVSCELANLELSGDLRDIASQAGAYIGLVSFNSDEDFRKAVIRAAQSHDIQLEPEQVTVQRTGTALNQSIYLAVDYKARVRLPGFSLALHFHPTSAK
Ga0210409_1023244013300021559SoilMGKGKLILGLAVLALAVIAGWQIVSCELANLELSGDLRDIASQAGAYIGLVSFNSDEDFRKAVIRAAQSHDIQLEPEQVTVQRTGTALNQSIYLAVDYKARVKVPGFSLAFHFHPTSAR
Ga0210409_1084493313300021559SoilMGTVKLIVGLAVLALAIITGWQIASCELANLGLHEELHDLAAQTGAYIGLNSFNTDEDFRNAIIGAAKKHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVTLLGFSFTLHFHPSSAK
Ga0207665_10000103313300025939Corn, Switchgrass And Miscanthus RhizosphereMGKVKLILGIAVLALAIIAGWQIASCELANSEFRGELRDLAAQAGARIGLNSFSTDEELRDAVIRKAKTHEIQIEPEQVTVERTGSGPEQIIHLAADYKMRVTLPGFSFSLHFHSSSAK
Ga0209131_103855143300026320Grasslands SoilMGKVKLILGLAVLGLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGSPPKQIIFLAADYKARVTLPGFPFTLHFHPSSAR
Ga0209470_119562323300026324SoilMGRVKLILGLAVLALAIIAGWQIALCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTLERTGTGPTQIIYLAADYKTRVALPGFSFTLHFHPSSAK
Ga0209377_105408333300026334SoilRVKLILGLAVLALAIIAGWQIALCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTLERTGTGPTQIIYLAADYKTRVALPGFSFTLHFHPSSAK
Ga0257173_103972713300026360SoilMGKVKLILGLAVLALAIIAGWQVASFELANLELHEDLRDLAAQGGARIGLGNFSTDEDLRDAVIREAKRHEIQLGPEQVTVQRTGTAPAQIIYLAADYKVRVMLPGCSFTLH
Ga0257171_100131523300026377SoilMGKVKLILGLAVLALAIIAGWQVASFELANLELHEDLRDLAAQGGARIGLGNFSTDEDLRDAVIREAKRHEIQLGPEQVTVQRTGTAPAQIIYLAADYKVRVMLPGCSFTLHFNPSSAR
Ga0257172_107719613300026482SoilQIASCELSNLELHEEIRDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVQRTNTATAPIIVLVADYKARVTLLGCSFTLHFNPSSAK
Ga0257164_102118313300026497SoilMGKVKLILGLAVLALAIIAGWQVASFELANLELHEDLRDLAAQGGARIGLGNFSTDEDFRDAVIREAKRHEIQLGPEQVTVQRTGTAPAQIIYLAADYKVRVMLPGCSFTLHFNPSSAR
Ga0209648_1001742043300026551Grasslands SoilMRKVKLILGLAVLALAIIASWQIASCELANLELHEDLRDLAAQTGAYIGLFSFNSDEDFRNAVIRAAKKHEIQLEPEQVTVQRTGTAQVPIIYLAADYKVRVTLPGCSFTLHFHPSSAK
Ga0209648_1077195113300026551Grasslands SoilLILGLAVLALAITAGWQIASCVLANLELHVDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTSTATAPIIVLVADYKVRVTLLGCSFTLHFNPSSAK
Ga0179593_114397453300026555Vadose Zone SoilLVSRIAGCRHGKSETHPRLAVLGLVIIAGWQIASCELSNLELHEEIRDLAGQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK
Ga0207949_100827923300026999Forest SoilMGRVKLILGLAVFALAIIAGWQIASCELANMALHEDLRDLAAQAGAYTGLVSFNTDEDFRNAVIRAAKGHKIQLEPAQVTVQRMGSAPAQIIYLAVDYKARVGLAGFSLALHFHPTSAK
Ga0209076_100435323300027643Vadose Zone SoilMGKVKLILGLAVLGLVIIAGWQIASCELSNLELHEEIRDLAGQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK
Ga0209076_115388823300027643Vadose Zone SoilAGWQMASCELANLALHEDMRDLAAQAGAYIGLVSFNTDEDFRNAVIRAAKAHEIQLEPEQVTVQRTGSAPAQIIYLAVDYRARVRLPGFSFTLHFHPSSAK
Ga0209588_122161113300027671Vadose Zone SoilMGKVRLILGLAVLGLVIIGGWQIASCELANLELHEELHDLAAQTGAYIGLNPFNTDEDFRNAIIRAAKRHEIQLEPEQVTVERTGTPPKQIIYLAADYKARVALPGWSFTLHFHPSSAK
Ga0209180_1014821313300027846Vadose Zone SoilVVKIGKVNLILGLAVLALAINAGWQIGACEVTNLELREDLRDIAAQTGSRIGLNSFSTDEELRAAVIRAAKEYDLQIEPEQVTVQSTGAGAKVVTYLAVDYKARVKLIGFSFTLHFNPSSVR
Ga0209701_1000556733300027862Vadose Zone SoilMGRVKLIAGLAVLALAIITGWQIASCELANLGLREDMHDLAAQAGAYIGLVSFNSDEDFRKAVIRAAQSHDIQLEAEQVTVQRTGAVPNQSIYLAVDYKARVKLPGFSFALHFHPTSAK
Ga0209701_1002681523300027862Vadose Zone SoilMGKGKLILGLAVLALAIIAGWQVVSCELANLGLREDLRDIASQAGAYIGLVSFNSDEDFRKAVIRAAQSHDIQLEAEQVTVQRTGTVPNQSIYLEVDYKARVKLPGFSFALHFHPTSAK
Ga0209590_1007137033300027882Vadose Zone SoilMRNVKLILGLAVLVLAVIAGWQIASCELANLQLREELRDIAAQGGARIGLLSFNTDEELRNAVIREAKKHEIQLVPEQITVERTGTPPAEIIYLSADYRARVTLPGFSFTLHFHPSSAR
Ga0209488_1035713313300027903Vadose Zone SoilMGRVKLILGLAVLALAIIAGWQIASCELANLEFHGELVDLAAQGGARIGLLSFNTDEELRDAVISKAKTHKIQLEPEQVTVERTGTGPTQIIYLAADYKTRVTLPGFSFTLHFHPSSAK
Ga0307477_10000250483300031753Hardwood Forest SoilMGKVKLILGLAVLALAIIAGWRITSCVLANLELHGDLVDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTGTAIAPFIYLVADYKVRVTLLGCSFTLHFNPSSAK
Ga0307477_1010384723300031753Hardwood Forest SoilMGKVKLVFGLAVLGLVIIAGWQIASCELANAELTETLRDLSSQTGAHIGLGSFKTDEEFRDAVIREAKTHEIQLQPEQVTVQRTGTAQAPIIFLAADYKARVTLPGFSFTLHFHPSSAK
Ga0307477_1016422033300031753Hardwood Forest SoilCQSRTMNEMGKVKLILGLAILALAIIAGWQIASCELANLELHEDLHDLAAQAGVRIGLVSSSTDDELRDAVIRKAKSHEIQLEPEQVTVQHTGTATAPVTYLMADYKVRVTLLGFSFTLHFHPSSAK
Ga0307477_1030202013300031753Hardwood Forest SoilMGKGKLIVGLAVLAMVVIAGWQIASCELANLALREDLRDIASQAGAYIGLVSFNSDEGFRNAVIRAAQSHDIQLEPEQVTVQRTGTVPAQSIYIAVDYKARVKLPGFSFALHFHPKSAR
Ga0307477_1036979523300031753Hardwood Forest SoilMGKVKLVFGLAVLALVIIAGWQIASCELANAELTEALRDLSTQTSAHIGLGSFKTDEEFRDAVIREAKTHEIQLQPEQVTVQRTGTAQAPIIFLAADYKARVTLPGFSFTLHFHPSSAK
Ga0307475_1000343543300031754Hardwood Forest SoilMGKVKLVFGLAVLALVIIAGWQIASRELANAELTEALRDLSTQTSAHIGLGSFKTDDEFRDAVIREAKAHEIQLQPEQVTVQRTGTAQAPIIFLAADYKARVTLPGFSFTIHFHPSSAK
Ga0307475_1012801513300031754Hardwood Forest SoilMGRAKLILGLAVLALAVIAGWQIASCELANLELREELRDIAAQGGARIGLLSFNTDEELRSAVIREAKKHEIQLAPEQITVERTGTPPAQTIYLSADYRARVALPGCSFTLHFHPSSAR
Ga0307478_1078783813300031823Hardwood Forest SoilKAMGKGKLILGLAVLALAVIAGWQIVSCELANLELSGDLRDIASQAGAYIGLVSFNSDEDFRNAVIRAAKGHDIELEAEQVTVQRTGTAPNQSIYLAVDYKARVRLPGFSLALHFHPTSA
Ga0307479_10000165203300031962Hardwood Forest SoilMNEMGKVKLILGLAVLALAIIAGWQIASCELANLELHEDLHDLAAQAGVRIGLVSSSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTGTVTAPVIYLVADYKVRVALLGCSFTLHFNPSSAK
Ga0307479_10001030123300031962Hardwood Forest SoilMGKVKLILGLAVLALAIIAGWRITSCVLANLELHGDLVDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTGTAIAPIIYLVADYKVRVTLLGCSFTLHFNPSSAK
Ga0307479_1003469913300031962Hardwood Forest SoilMGKGKLILGLAVLALAVIAGWQIVSCELANLELSGDLRDIASQAGAYIGLVSFNSDEDFRNAVIRAAKGHDIELEAEQVTVQRTGTAPNQSIYLAVDYKARVRLPGFSLALHFHPTSAK
Ga0307479_1046642023300031962Hardwood Forest SoilMGRAKLILGLAVLASAVIAGWQIASCELANLELREELRDIAAQGGARIGLLSFNTDEELRSAVIREAKKHEIQLAPEQITVERTGTPPAQTIYLSADYRARVALPGCSFTLHFHPSSAR
Ga0307479_1120860813300031962Hardwood Forest SoilKLILGLAVLALAIIAGWQIASCKLANLELHEDLRDLAAQAGVRIGLVSFSTDEELRDAVIRDAKKYEIQLEPEQVTVQRTGTATAPIIYLMADYKVRVTLLGCSFTLHFHPSSAK
Ga0307471_10009264933300032180Hardwood Forest SoilMGRVKLILGLAVLILLIIAGWQIGSCELANLEFNGDLRDLAAQTGARIGLLSFSTDEEIRSAVIREARKYEIQLEPEQVKVERTGTPPAQIITLAVDYRARVTLPGFSFTLHFHPSSAR
Ga0307471_10063138223300032180Hardwood Forest SoilMVKVKLILGLAVLAVAIIAGWQITSYVLANLELHEDLRDLAAQAGARIGLVSFSTDDELRDAVIRKAKSHEIQLEPEQVTVQRTGTARAPIIYLAADYKARVTLPGCSFTLHFNPSSAK
Ga0307471_10191876613300032180Hardwood Forest SoilVIIAGWQIASCELANAELTEALRDLSTQTSAHIGLGSFKTDDEFRDAVIREAKTHEIQLQPEQVTVQRTGTAQAPIIFLAADYKARVTLPGFSFTLHFHPSSAK
Ga0307471_10362479613300032180Hardwood Forest SoilMGLAFTKAVNPLQSIVGMEKVKFIFGLAVLALAIVTCWQIASCELANIEFHEELRDIAAQGGAKIGLLSFSTDEELRDAVIHEAKRHDIQLESTQVTVERTGTPPAQIIYLAADYKSRVKLPGFSFTLHFHPSSAK
Ga0307472_10008303833300032205Hardwood Forest SoilMGKMKLILGLAVLALAIITGWQIASCELANLAFREEIRDIAAQGGARIGLLSFNTDEELRDAVIREAKKHEIQLEPGQVTVERTGTPPAQIIYLSVDYKARVTLLPGWSLTLHFRPSSAR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.