NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F035461

Metagenome / Metatranscriptome Family F035461

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F035461
Family Type Metagenome / Metatranscriptome
Number of Sequences 172
Average Sequence Length 99 residues
Representative Sequence MAKILDCGKNKITEPVPVHGFGMGPTKAKAKSVAMDMAHGFANAVAAARAAKLQCPTEECPKMIGPQVANEKTTELVTVKLQANLYLSVVKRSFDIVIFCQ
Number of Associated Samples 112
Number of Associated Scaffolds 172

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 61.63 %
% of genes near scaffold ends (potentially truncated) 29.07 %
% of genes from short scaffolds (< 2000 bps) 87.79 %
Associated GOLD sequencing projects 98
AlphaFold2 3D model prediction Yes
3D model pTM-score0.77

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (68.023 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(18.605 % of family members)
Environment Ontology (ENVO) Unclassified
(27.326 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(68.605 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 18.60%    β-sheet: 39.53%    Coil/Unstructured: 41.86%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.77
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
b.121.6.1: Papovaviridae-like VPd5y9ea_5y9e0.54371
d.230.5.1: YbjQ-liked1ybba_1ybb0.5423
b.121.5.2: Parvoviridae-like VPd4g0ra_4g0r0.53772
b.121.5.2: Parvoviridae-like VPd1s58a_1s580.53036
d.241.2.1: Trigger factor ribosome-binding domaind1p9ya_1p9y0.53017


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 172 Family Scaffolds
PF04519Bactofilin 1.16
PF07676PD40 1.16
PF00085Thioredoxin 1.16
PF13370Fer4_13 0.58
PF05231MASE1 0.58
PF14026DUF4242 0.58
PF0563523S_rRNA_IVP 0.58
PF02566OsmC 0.58
PF13432TPR_16 0.58
PF00923TAL_FSA 0.58
PF16400DUF5008 0.58
PF10114PocR 0.58
PF13576Pentapeptide_3 0.58
PF11158DUF2938 0.58
PF13365Trypsin_2 0.58
PF13520AA_permease_2 0.58
PF00144Beta-lactamase 0.58

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 172 Family Scaffolds
COG1664Cytoskeletal protein CcmA, bactofilin familyCytoskeleton [Z] 1.16
COG0176Transaldolase/fructose-6-phosphate aldolaseCarbohydrate transport and metabolism [G] 0.58
COG0642Signal transduction histidine kinaseSignal transduction mechanisms [T] 0.58
COG1680CubicO group peptidase, beta-lactamase class C familyDefense mechanisms [V] 0.58
COG1686D-alanyl-D-alanine carboxypeptidaseCell wall/membrane/envelope biogenesis [M] 0.58
COG1764Organic hydroperoxide reductase OsmC/OhrADefense mechanisms [V] 0.58
COG1765Uncharacterized OsmC-related proteinGeneral function prediction only [R] 0.58
COG2367Beta-lactamase class ADefense mechanisms [V] 0.58
COG3447Integral membrane sensor domain MASE1Signal transduction mechanisms [T] 0.58
COG3851Signal transduction histidine kinase UhpB, glucose-6-phosphate specificSignal transduction mechanisms [T] 0.58


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms68.02 %
UnclassifiedrootN/A31.98 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2067725000|GPWRP_F5MPXY302G288TNot Available519Open in IMG/M
2088090014|GPIPI_16498178All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1377Open in IMG/M
2088090014|GPIPI_16903726All Organisms → cellular organisms → Bacteria1216Open in IMG/M
2088090014|GPIPI_16966338All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1793Open in IMG/M
2088090014|GPIPI_16999762All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2014Open in IMG/M
2088090014|GPIPI_17344719All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1139Open in IMG/M
2088090015|GPICI_8868757All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1253Open in IMG/M
2124908045|KansclcFeb2_ConsensusfromContig1246715Not Available667Open in IMG/M
2124908045|KansclcFeb2_ConsensusfromContig180657Not Available597Open in IMG/M
2162886007|SwRhRL2b_contig_3247326All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium969Open in IMG/M
2166559005|cont_contig77733Not Available696Open in IMG/M
2170459005|F1BAP7Q02G2WQINot Available529Open in IMG/M
2170459005|F1BAP7Q02G4UPJNot Available500Open in IMG/M
2189573005|GZGK9D401DW8V5Not Available503Open in IMG/M
2199352025|deepsgr__Contig_151970All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium563Open in IMG/M
3300000033|ICChiseqgaiiDRAFT_c0633612All Organisms → cellular organisms → Bacteria775Open in IMG/M
3300000363|ICChiseqgaiiFebDRAFT_11098768All Organisms → cellular organisms → Bacteria627Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101225901Not Available763Open in IMG/M
3300000559|F14TC_100473526All Organisms → cellular organisms → Bacteria1345Open in IMG/M
3300000559|F14TC_101001848All Organisms → cellular organisms → Bacteria523Open in IMG/M
3300000559|F14TC_103094825Not Available971Open in IMG/M
3300000890|JGI11643J12802_10126698Not Available707Open in IMG/M
3300000890|JGI11643J12802_10852282All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium888Open in IMG/M
3300000890|JGI11643J12802_12337477All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium896Open in IMG/M
3300000893|AP72_2010_repI_A001DRAFT_1019908All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1092Open in IMG/M
3300000893|AP72_2010_repI_A001DRAFT_1061503All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia590Open in IMG/M
3300000953|JGI11615J12901_10754210All Organisms → cellular organisms → Bacteria753Open in IMG/M
3300000953|JGI11615J12901_10836304All Organisms → cellular organisms → Bacteria1125Open in IMG/M
3300000955|JGI1027J12803_100361323All Organisms → cellular organisms → Bacteria1640Open in IMG/M
3300000955|JGI1027J12803_102765211All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium568Open in IMG/M
3300000956|JGI10216J12902_100358393Not Available831Open in IMG/M
3300001431|F14TB_100552159All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1387Open in IMG/M
3300004114|Ga0062593_100712814All Organisms → cellular organisms → Bacteria981Open in IMG/M
3300004114|Ga0062593_102751245All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium561Open in IMG/M
3300004156|Ga0062589_100635357Not Available934Open in IMG/M
3300004156|Ga0062589_101155368Not Available737Open in IMG/M
3300004156|Ga0062589_101220713All Organisms → cellular organisms → Bacteria721Open in IMG/M
3300004156|Ga0062589_101841648All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300004156|Ga0062589_102097122All Organisms → cellular organisms → Bacteria576Open in IMG/M
3300004157|Ga0062590_100428513Not Available1097Open in IMG/M
3300004157|Ga0062590_101240034All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium730Open in IMG/M
3300004157|Ga0062590_101971043Not Available605Open in IMG/M
3300004463|Ga0063356_101935922All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium890Open in IMG/M
3300004479|Ga0062595_102261926All Organisms → cellular organisms → Bacteria535Open in IMG/M
3300004479|Ga0062595_102634199Not Available505Open in IMG/M
3300004480|Ga0062592_100959414Not Available777Open in IMG/M
3300005175|Ga0066673_10530545Not Available691Open in IMG/M
3300005186|Ga0066676_10534107Not Available795Open in IMG/M
3300005187|Ga0066675_10631908All Organisms → cellular organisms → Bacteria804Open in IMG/M
3300005289|Ga0065704_10100514All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2242Open in IMG/M
3300005289|Ga0065704_10201011All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1146Open in IMG/M
3300005289|Ga0065704_10357859All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium801Open in IMG/M
3300005289|Ga0065704_10525698All Organisms → cellular organisms → Bacteria648Open in IMG/M
3300005295|Ga0065707_10977596Not Available545Open in IMG/M
3300005332|Ga0066388_100895253All Organisms → cellular organisms → Bacteria1466Open in IMG/M
3300005332|Ga0066388_101212047Not Available1291Open in IMG/M
3300005332|Ga0066388_101387562All Organisms → cellular organisms → Bacteria1219Open in IMG/M
3300005332|Ga0066388_101392367All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1217Open in IMG/M
3300005332|Ga0066388_104814553Not Available686Open in IMG/M
3300005332|Ga0066388_106754779All Organisms → cellular organisms → Bacteria578Open in IMG/M
3300005332|Ga0066388_107566322All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300005363|Ga0008090_14511143Not Available618Open in IMG/M
3300005446|Ga0066686_10609063Not Available742Open in IMG/M
3300005447|Ga0066689_10700953Not Available633Open in IMG/M
3300005451|Ga0066681_10030597All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2838Open in IMG/M
3300005451|Ga0066681_10162953Not Available1317Open in IMG/M
3300005554|Ga0066661_10094114All Organisms → cellular organisms → Bacteria1782Open in IMG/M
3300005764|Ga0066903_100071092All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia4453Open in IMG/M
3300005764|Ga0066903_104046950Not Available786Open in IMG/M
3300005764|Ga0066903_104620642Not Available733Open in IMG/M
3300005764|Ga0066903_104805689All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium718Open in IMG/M
3300006169|Ga0082029_1692101All Organisms → cellular organisms → Bacteria13964Open in IMG/M
3300006796|Ga0066665_10886891Not Available692Open in IMG/M
3300006854|Ga0075425_101948339Not Available658Open in IMG/M
3300006904|Ga0075424_101299094All Organisms → cellular organisms → Bacteria774Open in IMG/M
3300007255|Ga0099791_10036168All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2182Open in IMG/M
3300009012|Ga0066710_100298597All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2358Open in IMG/M
3300009137|Ga0066709_100615817All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1549Open in IMG/M
3300009162|Ga0075423_10996636All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium890Open in IMG/M
3300009162|Ga0075423_11975020All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300009162|Ga0075423_12418745Not Available573Open in IMG/M
3300009792|Ga0126374_10421771All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium939Open in IMG/M
3300010040|Ga0126308_10130833All Organisms → cellular organisms → Bacteria1568Open in IMG/M
3300010040|Ga0126308_10184911All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1333Open in IMG/M
3300010043|Ga0126380_11598052Not Available582Open in IMG/M
3300010046|Ga0126384_11659784All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium603Open in IMG/M
3300010047|Ga0126382_10483626All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium990Open in IMG/M
3300010048|Ga0126373_10879999All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → unclassified Candidatus Udaeobacter → Candidatus Udaeobacter sp.959Open in IMG/M
3300010359|Ga0126376_10425714All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → unclassified Candidatus Udaeobacter → Candidatus Udaeobacter sp.1204Open in IMG/M
3300010359|Ga0126376_11006106Not Available834Open in IMG/M
3300010360|Ga0126372_12953053All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia527Open in IMG/M
3300010361|Ga0126378_10267243All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1808Open in IMG/M
3300010361|Ga0126378_13127093Not Available527Open in IMG/M
3300010366|Ga0126379_12149115Not Available660Open in IMG/M
3300010376|Ga0126381_101397474All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1012Open in IMG/M
3300010376|Ga0126381_102319691Not Available771Open in IMG/M
3300010376|Ga0126381_102532498Not Available735Open in IMG/M
3300010376|Ga0126381_103289248Not Available638Open in IMG/M
3300010376|Ga0126381_103958547All Organisms → cellular organisms → Bacteria577Open in IMG/M
3300010398|Ga0126383_10011489All Organisms → cellular organisms → Bacteria6460Open in IMG/M
3300010398|Ga0126383_10465425Not Available1316Open in IMG/M
3300010398|Ga0126383_13390134Not Available520Open in IMG/M
3300012199|Ga0137383_10249672Not Available1300Open in IMG/M
3300012207|Ga0137381_10024621All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium4801Open in IMG/M
3300012208|Ga0137376_10773907Not Available827Open in IMG/M
3300012285|Ga0137370_10300070All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium959Open in IMG/M
3300012285|Ga0137370_10539854All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium716Open in IMG/M
3300012356|Ga0137371_11383268Not Available517Open in IMG/M
3300012359|Ga0137385_10751220Not Available812Open in IMG/M
3300012582|Ga0137358_10040673Not Available3079Open in IMG/M
3300012896|Ga0157303_10155648All Organisms → cellular organisms → Bacteria618Open in IMG/M
3300012929|Ga0137404_11747294All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium578Open in IMG/M
3300012930|Ga0137407_10625302All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1011Open in IMG/M
3300012948|Ga0126375_10869271All Organisms → cellular organisms → Bacteria721Open in IMG/M
3300012948|Ga0126375_10989028Not Available684Open in IMG/M
3300012971|Ga0126369_11958254All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium674Open in IMG/M
3300014745|Ga0157377_11608525Not Available521Open in IMG/M
3300016270|Ga0182036_11113546Not Available654Open in IMG/M
3300016294|Ga0182041_10678727Not Available912Open in IMG/M
3300016387|Ga0182040_10449145All Organisms → cellular organisms → Bacteria1021Open in IMG/M
3300016422|Ga0182039_11512425Not Available611Open in IMG/M
3300016445|Ga0182038_11321593All Organisms → cellular organisms → Bacteria645Open in IMG/M
3300017656|Ga0134112_10216082All Organisms → cellular organisms → Bacteria753Open in IMG/M
3300018431|Ga0066655_10035456All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2469Open in IMG/M
3300018433|Ga0066667_11826448All Organisms → cellular organisms → Bacteria551Open in IMG/M
3300018482|Ga0066669_11115877Not Available712Open in IMG/M
3300019789|Ga0137408_1114330All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1719Open in IMG/M
3300019789|Ga0137408_1183451All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2652Open in IMG/M
3300019789|Ga0137408_1297838All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae4604Open in IMG/M
3300019873|Ga0193700_1062758All Organisms → cellular organisms → Bacteria565Open in IMG/M
3300020170|Ga0179594_10000383All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium10806Open in IMG/M
3300021078|Ga0210381_10048581All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1264Open in IMG/M
3300021478|Ga0210402_10909790All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium806Open in IMG/M
3300022534|Ga0224452_1234756All Organisms → cellular organisms → Bacteria562Open in IMG/M
3300024288|Ga0179589_10464584All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium585Open in IMG/M
3300026306|Ga0209468_1026781All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2030Open in IMG/M
3300026324|Ga0209470_1263327All Organisms → cellular organisms → Bacteria681Open in IMG/M
3300026538|Ga0209056_10527478Not Available607Open in IMG/M
3300026548|Ga0209161_10397219All Organisms → cellular organisms → Bacteria606Open in IMG/M
3300026551|Ga0209648_10045257All Organisms → cellular organisms → Bacteria3793Open in IMG/M
3300026551|Ga0209648_10621418All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium593Open in IMG/M
3300026791|Ga0208072_103492All Organisms → cellular organisms → Bacteria779Open in IMG/M
3300028720|Ga0307317_10079153All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1077Open in IMG/M
3300028819|Ga0307296_10105062All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1515Open in IMG/M
3300030844|Ga0075377_11689149All Organisms → cellular organisms → Bacteria674Open in IMG/M
3300030916|Ga0075386_12087028All Organisms → cellular organisms → Bacteria575Open in IMG/M
3300031231|Ga0170824_101216016All Organisms → cellular organisms → Bacteria1179Open in IMG/M
3300031231|Ga0170824_108239849All Organisms → cellular organisms → Bacteria575Open in IMG/M
3300031231|Ga0170824_114598237All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium681Open in IMG/M
3300031231|Ga0170824_116392768All Organisms → cellular organisms → Bacteria967Open in IMG/M
3300031231|Ga0170824_116725711All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae → Luteibacter973Open in IMG/M
3300031231|Ga0170824_123416558All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium665Open in IMG/M
3300031446|Ga0170820_16339809All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium882Open in IMG/M
3300031469|Ga0170819_14676964All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1094Open in IMG/M
3300031469|Ga0170819_16045376All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium557Open in IMG/M
3300031474|Ga0170818_100655010All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1006Open in IMG/M
3300031474|Ga0170818_103616096All Organisms → cellular organisms → Bacteria826Open in IMG/M
3300031543|Ga0318516_10728956All Organisms → cellular organisms → Bacteria562Open in IMG/M
3300031573|Ga0310915_10687193Not Available723Open in IMG/M
3300031720|Ga0307469_10911348Not Available815Open in IMG/M
3300031744|Ga0306918_10948315All Organisms → cellular organisms → Bacteria669Open in IMG/M
3300031879|Ga0306919_10251485All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1330Open in IMG/M
3300031890|Ga0306925_10056202All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia4113Open in IMG/M
3300031954|Ga0306926_10146468All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2922Open in IMG/M
3300032076|Ga0306924_10302123All Organisms → cellular organisms → Bacteria1837Open in IMG/M
3300032076|Ga0306924_11924374Not Available612Open in IMG/M
3300032180|Ga0307471_100146283All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2263Open in IMG/M
3300032261|Ga0306920_100783370All Organisms → cellular organisms → Bacteria1401Open in IMG/M
3300033412|Ga0310810_10048976All Organisms → cellular organisms → Bacteria5150Open in IMG/M
3300033412|Ga0310810_10186840All Organisms → cellular organisms → Bacteria2360Open in IMG/M
3300033412|Ga0310810_11181774All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium623Open in IMG/M
3300033475|Ga0310811_11283174All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium587Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil18.60%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil12.79%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil9.30%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil7.56%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil7.56%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil6.98%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil6.40%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.65%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.07%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.49%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.91%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere2.91%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.91%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil1.16%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil1.16%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.16%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.58%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.58%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.58%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.58%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil0.58%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.58%
Termite NestEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Termite Nest0.58%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.58%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.58%
SimulatedEngineered → Modeled → Simulated Communities (Sequence Read Mixture) → Unclassified → Unclassified → Simulated0.58%
Tropical Rainforest SoilEnvironmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Tropical Rainforest Soil0.58%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2067725000Soil microbial communities from Great Prairies - Wisconsin Restored Prairie soilEnvironmentalOpen in IMG/M
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2088090015Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
2124908045Soil microbial communities from Great Prairies - Kansas assembly 1 01_01_2011EnvironmentalOpen in IMG/M
2162886007Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
2166559005Simulated microbial communities from Lyon, FranceEngineeredOpen in IMG/M
2170459005Grass soil microbial communities from Rothamsted Park, UK - July 2009 direct MP BIO1O1 lysis 0-21cmEnvironmentalOpen in IMG/M
2189573005Grass soil microbial communities from Rothamsted Park, UK - FG3 (Nitrogen)EnvironmentalOpen in IMG/M
2199352025Soil microbial communities from Rothamsted, UK, for project Deep Soil - DEEP SOILEnvironmentalOpen in IMG/M
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000363Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000890Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000893Forest soil microbial communities from Amazon forest - Pasture72 2010 replicate I A001EnvironmentalOpen in IMG/M
3300000953Soil microbial communities from Great Prairies - Kansas Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005363Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome F II A100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006169Termite nest microbial communities from Madurai, IndiaEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010040Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012896Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300014745Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaGHost-AssociatedOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300016422Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111EnvironmentalOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019873Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s1EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026791Grasslands soil microbial communities from Kansas, USA that are Nitrogen fertilized - NN591 (SPAdes)EnvironmentalOpen in IMG/M
3300028720Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_357EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300030844Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA11 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030916Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031469Fir Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031543Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f20EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300033475Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPWRP_039785602067725000SoilPVPVYGFGLGSTKTKAKNPAMKMAHGFAIAAAAARTARLKCRTKEFSKIMEPVVANDKTKELVTVKLQNNLYLSVVQHRFDIVIVCK
GPIPI_031468402088090014SoilMAKILHRGKNKIAEPVPVYGFGLGPTKIKAKNPAMKMAHGFAIAAAAARAAEFKYPTEECSKMTGPLVVNDKTTELVTVKLKNNLYLSVVQSRFDIVILCQ
GPIPI_025878102088090014SoilMAKVLQCGKNKIGESEPVHGFGMGPTKTKAKSVAMDMAHGFANAVAAARAVGLQCPPEECPKMIGPQIANEKTTELVTVKLQNNLYLSVVRRSFDIVIFCQ
GPIPI_033900302088090014SoilMTKILRCGKNKVPEPLPVHGFGLGPNKAHVKSVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCR
GPIPI_016165002088090014SoilMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKSPAMKMAHGFAIAAAAARVAELKCPTEEYSKIIGPLVANDKTTELVTVKLQNSLYLSVVRRSFDIVIVCQ
GPIPI_014766602088090014SoilMAKILDCGKNKIAEPAPAHGFGMGGTKAEAKQAAMDMAHGFAKLIATAHAAEFKCXXXXXXXXXGPQVAHEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ
GPICI_021110002088090015SoilVGXRREKVAILLVCGKNKIPAPVPVHGFGIGRTKTNAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDRNLYLSVVRRTFDLKISCQ
KansclcFeb2_112025002124908045SoilVAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIDMAHGFASAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ
KansclcFeb2_105092902124908045SoilMAKILHRGKNKIAEPMPVYGFGLGPTKTKAKSPAMKMAHGFAIAVAAARAAEFKCPTEEFSKMMGPLVANDKTTELVTVKLQNNLYLSVVQSRFDIVILCQ
SwRhRL2b_0609.000037802162886007Switchgrass RhizosphereVAILLVCGENKIPAPVPVHGFGIGPNKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDRNLYLSVVRRTFDLKVSCQ
cont_0733.000049602166559005SimulatedMAILPVCGKNKIPAPVPVHGFGIGRTKANAKSVAMDLAHAFANAVAADRTKKWECPTDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
E41_109930502170459005Grass SoilVHGFGVGPNKAKAKSVAMDMAHGFANVVATTRALGLQCPTEECPKMIAPQVANEKTTELLTVKLQNNLYLSVVRRTFDIVIFCQ
E41_057803602170459005Grass SoilMAILPVCGKNKIPAPVPVHGFGIGRTKANAKSVAMDLAHAFANAVAADRTKKWECPTDCPKKIGPQVANEKTTELLTVKLDKNL
FG3_100450002189573005Grass SoilMAVLPVCGKNKIPAPVPVHGFGIGRTKANAKSVAIDLAHAFANAVAADRTKKWECPTDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCR
deepsgr_024997402199352025SoilMTKILRCGKNKVPEPVPVHGFGLGPNKAHAKSVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKL
ICChiseqgaiiDRAFT_063361213300000033SoilMVKILHCGKNKIAESAPVYGFGLGPNKAXAKSAAMNMAHGFAXXXAXARAAGFKCPTEKCSKMMRPLVGSEKTTELMTVKLQNHLYLSVVQRSFDIVIVCH*
ICChiseqgaiiFebDRAFT_1109876813300000363SoilPERPNRIAPANRSAFPMAKILHCGKNKIAEPERVYGFGLGPNKARAKSAAMNMAHGFAIGVAEARAAGFKCPTEKCSKMMRPLVGSEKTTELMTVKLQNHLYLSVVQRSFDIVIVCH*
INPhiseqgaiiFebDRAFT_10122590123300000364SoilMAKLLHCGKNKIGGAVPVHGFGLGPTKAEAKSVAMNMAHGFANAVAATRVLGLQCPIAECPKMIGPQLANEKTTELLTVKLQNTLYLSVVRRTFDIVIYCQ*
F14TC_10047352613300000559SoilVAKILQRGKNKITEPVPVYGFGLGATKTKAKGPATKMAHGFAIAVAGARAAELNARQRNFSKIMAPRVANDKTKELLTVKLQNNLYLSVVQHRFDIVIVCK*
F14TC_10100184813300000559SoilQTMAKILHRGKNTIAEPVPVYGFGLGATKTKAKDPAMKMAHGFAMAVAGARAAEFKCPTKEFSKMMGPRVANDKATELVTVKLQNNLYLSVVQHRFDIVIICK*
F14TC_10309482513300000559SoilMAKILHPGKNKIAEPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAERAAKVKCPTKEFSKVKGPLVANDKTTELVTVKLQNNLYLSVVRRRFDIVIVCQ*
JGI11643J12802_1012669813300000890SoilVAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIHMAHGFANAVAAARAAELQCPKDECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
JGI11643J12802_1085228223300000890SoilMAKILHHGKNRIAEPQPVYGFGLGSTKTKAKSPAMEMAHGFALAVAAARAAEFKYPTEECSKMTGPLVVNDKTTELVTVKLKNNLYLSVVQSRFDIVILCQ*
JGI11643J12802_1233747713300000890SoilFGIGRTKTNAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDRNLYLSVVRRTFDLKVSCQ*
AP72_2010_repI_A001DRAFT_101990823300000893Forest SoilMLAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIDMAHGFASAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
AP72_2010_repI_A001DRAFT_106150323300000893Forest SoilPAPVPVHGFGMGQSKAKAKTVAMGLAHVFANAVAAERTKKWECPKDCPKKIGPQIANEKTTELLTVKLDKNLYXSVVRRTFDIKVYCQ*
JGI11615J12901_1075421013300000953SoilMAILLDCGGNKIPAPVPVHGFGMGPTKAKAKSVAMDMAHAFANAVATERTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ*
JGI11615J12901_1083630413300000953SoilMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAARVAKLKCPTEEFSKMMGPLVANDKTTELVTVKLQNNLYLSVVQSRFDIVILCQ*
JGI1027J12803_10036132333300000955SoilMAKILHCGKNKIAEPERVYGFGLGPNKARAKSAAMNMAHGFAIGVAEARAAGFKCPTEKCSKMMRPLVGSEKTTELMTVKLQNHLYLSVVQRSFDIVIVCH*
JGI1027J12803_10276521123300000955SoilMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKGPAMKMAHGFAIAVAGARAAEFKCPTEEFSKMMGPRVANDKTTELVTVKLQN
JGI10216J12902_10035839313300000956SoilMAILLDCGGNKIPAPVPVHGFGMGPTKAKAKSVAMDMAHAFANAVATERTKKWGCPKGCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ*
F14TB_10055215923300001431SoilMAKILHRGKNKIAKPLPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAERAAKVKCPTKEFSKVMGPLVANDKITELVTVKLQNNLYLSVVRRRFDIVIVCQ*
Ga0062593_10071281433300004114SoilMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKSPAMKMAHGFAIAAAAARVAELKCPTEEYSKIIGPLVANDKTTELVTVKLQNSLYLSVVRRSFDIVIVCQ*
Ga0062593_10275124513300004114SoilMAKVLQCGKNKIAEAEPVHGFGMGRTKAKAKSVAMDMAHGFANAVTAARAVGLQCPAEECPKMIGPQIANEKTTELVTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0062589_10063535723300004156SoilVYGFGLGSTKTKAKNPAMKMAHGFAIAAAAARTARLKCRTKEFSKIMEPVVANDKTKELVTVKLQNNLYLSVVQHRFDIVIVCK*
Ga0062589_10115536823300004156SoilMAKILHRGKNKITAPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAGARAAKLKCPTKEFSKMMGPLVTNDKTTELVTVKLQNNLYLSVVRRRFDIVIVCQ*
Ga0062589_10122071323300004156SoilMAKVLQCGKNKIAEAEPVHGFGMGRTKAKAKAKSVAMDMAHGFANAVTAARAVGLQCPAEECPKMIGPQIANEKTTELVTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0062589_10184164823300004156SoilMTKVLQCGKNKILEPVPVHGFGVGPIKAKAKSTAMDMAHGFANLVAARRAAELKCPTDEGCPKMIGLQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0062589_10209712213300004156SoilMAKVLQCGKNTIAESVAHGFGLGPNKTHAKNVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQNNLYLSVVRRSFNIVIFCQ*
Ga0062590_10042851323300004157SoilVAKILHRGKNKITEPVPVYGFGLGSTKTKAKNPAMKMAHGFAIAAAAARTARLKCRTKEFSKIMEPVVANDKTKELVTVKLQNNLYLSVVQHRFDIVIVCK*
Ga0062590_10124003413300004157SoilMAKVLQCGKNKIAEAEPVHGFGMGRTKAKAKSVAMDMAHGFANAVTAARAVGLQCPAEECPKMIGPQIANEKTTELVTVKLQNNLYLSVVRRSFDIV
Ga0062590_10197104313300004157SoilMAILLVCGKNKRPAPVPVHGFGIGRTKTNARSAAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVAHEKTTQLLTVKLDENLYLSVVRRTFDLKVSCQ*
Ga0063356_10193592213300004463Arabidopsis Thaliana RhizosphereMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKGPAMKMAHGFAIAVAGARAAEFKCPTEEFSKMMGPRVANDKTTELVTVKLQNNLYLSVVQRRFDIVIVCQ*
Ga0062595_10226192613300004479SoilPMAKILHGGKNKIAEPERVYGFGLGPNKAQAKSAAMNMAHGFAIGVAEARAAGFKSPTEKCSKMMRPLVGSEKTTELMTVKLQNHLYLSVVQRSFDIVIVCH*
Ga0062595_10263419913300004479SoilMAKVLQCGKNTIAESVAHGFGLGPNKTHAKNVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQNNLYLSVVRRSFNIVI
Ga0062592_10095941423300004480SoilRQTMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAARTARLKCRTKEFSKIMEPVVANDKTKELVTVKLQNNLYLSVVQHRFDIVIVCK*
Ga0066673_1053054513300005175SoilMAKILDCGKNRIAEPAPAHGFGMGGTKAEAKHAAIDMAHGFAKLIASAHTAEFKCPREECPKMIGPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0066676_1053410723300005186SoilVPLLRDSRRAKWVITGTMTKILRCGINKVREPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAGARAAELQCPKEECPKMRRPQVANEKTTELFTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0066675_1063190813300005187SoilVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVAKEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ*
Ga0065704_1010051423300005289Switchgrass RhizosphereVAILLVCGENKIPAPVPVHGFGIGRTKTNAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDRNLYLSVVRRTFDLKVSCQ*
Ga0065704_1020101113300005289Switchgrass RhizosphereMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAATCAAKFKCPTEEFSKMMGPLVANDKTTELVTVKLQNNLYLSVVRHRFDIVIVCQ*
Ga0065704_1035785913300005289Switchgrass RhizosphereMAKILHCGKNKIGGNVPVHGFGIGLGKAKAKSVAMDMAHGFATAVAATRASALQCPTEECPKMRSPQVANLKTTELLTVKLQNNLYLSVVRRTFDILVFCQ*
Ga0065704_1052569813300005289Switchgrass RhizosphereMAKVLQCGKNKIPEPVPVHGFGVGPTKAKAKSAAMDMAHGFANLVAATRAAELKCPTDEGCPKMIGLQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0065707_1097759613300005295Switchgrass RhizosphereMAKVLQCGKNTIAESVAHGFGLGPNKTHAKNVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQNNLIFQW*
Ga0066388_10089525313300005332Tropical Forest SoilMVAKILHCGKNKITKSVPVHGFGVGPNKTKAKSVAIHMAHGFASGVAAARAAELQCPIKECLKMMCPQVVNEKTTELLTVILQANLYLSVVRISFEILIFCQ*
Ga0066388_10121204723300005332Tropical Forest SoilMTKILRCGKNKVPEPVPVHGFGLGPNKARAKSVATHMAHGFANAVAAARAAEFQCPKEDCPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRGFDIVIFCQ*
Ga0066388_10138756233300005332Tropical Forest SoilMAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIDMAHGFANAVAASRAAELQCPTEKCPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILI
Ga0066388_10139236723300005332Tropical Forest SoilMAILLACGKNKIPAPVPVYGYGIGLNKTKAKSVAMGLAHTFANGVAAARTKKWECPKDCPKKIGPQIANEKTTELLTVKLDKGLYLSVVRSTFDITVSCL*
Ga0066388_10481455313300005332Tropical Forest SoilVPVHGFGIGLTKTKAKSVAMGLAHSFANGVATECTKKWECPKDCTKKIGPQTANEKTTELLTVKLDKGLYLSVVRSTFDITVSCK*
Ga0066388_10675477913300005332Tropical Forest SoilNKIAEPIPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAARVAEFKYPTEECSKMTGPLVANDKTTELVTVKLQNNLYLSVVQSRFDVVIVCQ*
Ga0066388_10756632213300005332Tropical Forest SoilMAKVLQCGKNKIPEPVPVHGFGLGPNKTHAKSVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCR*
Ga0008090_1451114313300005363Tropical Rainforest SoilKILQCGKNTIAEPVPVHGFGLGPNKAQAKRVAIQMAHGFANAVAASRALELQCPNEKCPKMAHPQIANDKTIELLTVKLQDNLYLSVVRGSFEIVIFCR*
Ga0066686_1060906313300005446SoilVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVAKEKTTELLTVKLDKNLYLSVVRRTFNIKVSCQ*
Ga0066689_1070095313300005447SoilMAILLVCGENKIPASVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLY
Ga0066681_1003059753300005451SoilVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ*
Ga0066681_1016295313300005451SoilMAKILDCGKNKIAEPAPAHGFGMGGTKAEAKHAAIDMAHGFAKLIATAHTAEFKCPREECPKMIGPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0066661_1009411423300005554SoilMAILLVCGENKIPAPVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ*
Ga0066903_10007109253300005764Tropical Forest SoilVGDYRNDDKDAALWINKVPEPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAAARAAQLQCPKEECPKMRRLQVANEKTTELLAVKLQSNLYLSVVRRSFDILIFCQ*
Ga0066903_10404695023300005764Tropical Forest SoilVITGTMTKKLRCGINKIPEPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAAARAAELQCPKEEYPKMRCPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0066903_10462064213300005764Tropical Forest SoilVPVHGFGVGPNKTKAKSVATHMAHGFASAAAAACAAELQCPKECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRGSFDIVIFCQ*
Ga0066903_10480568933300005764Tropical Forest SoilMAKVLQCGKNKIPEPVPVHGFGLGPNKTHAKSVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCR*
Ga0082029_1692101163300006169Termite NestVAKVLQHGKNRITEPRPVYGFGLGATKIKAKVSAIKMARGFAIAVAGARAAQLNARQRNLNARRRNFSEIMAPRIANDKTKELMTVKLQNNFYLSVVQHRFDMVIVCK*
Ga0066665_1088689123300006796SoilLPLCRFLECCSGSRVVSSGIFVALTDNVPLLRDSRRAKWVITGTMTKILRCGINKVREPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAGARAAELQCPKEECPKMRRPQVANEKTTELFTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0075425_10194833923300006854Populus RhizospherePVTGQRRNTMLARILHCGKNKITESVPVHGFGVGPNKTKAKSVAIHMAHGFANAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
Ga0075424_10129909413300006904Populus RhizospherePVHGFGVGPNKTKAKSVAIDMAHGFASAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
Ga0099791_1003616823300007255Vadose Zone SoilMAKILDCGKNKITEPVPVHGFGMGPTKAKAKSVAMDMAHGFANAVAAARAAKLQCPTEECPKMIGPQVANEKTTELVTVKLQANLYLSVVKRSFDIVIFCQ*
Ga0066710_10029859713300009012Grasslands SoilMTKILRCGINKVPEPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAGARAAELQCPKEECPKMRRPQVANEKTTELFTVKLQSNLYLSVVRRSFDIVIFCQ
Ga0066709_10061581733300009137Grasslands SoilMTKILRCGINKVREPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAGARAAELQCPKEECPKMRRPQVANEKTTELFTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0075423_1099663613300009162Populus RhizosphereIHSRCEIPDRLNRIVPANRSAFPMAKILHCGKNKIAEPERVYGFGLGPNKAQAKSAAMNMAHGFAIGVAEARAAGFKCPTEKCSKMMRPLVGSEKTTELMTVKLQNHLYLSVVQRSFDIVIVCH*
Ga0075423_1197502013300009162Populus RhizosphereMARILDCGKNTIAERVPVHGFGTGPTKAKAKSAAMEMAHGFANLAAAARASKSKCPAEKNCPKMIGLQVANEKTTELLTVKLQNNLY
Ga0075423_1241874523300009162Populus RhizospherePVPVYGFGLGSTKTKSKNPAMKMAHGFAIAAAAARTARLKCRTKEFSKIIGPVVANDKAKELVTVKLQNNLYLSVVQHRFDIVIVCR*
Ga0126374_1042177123300009792Tropical Forest SoilVPVHGFGVGPNKTKAKSVAIHMAHGFANAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRISFDILIFCQ*
Ga0126308_1013083323300010040Serpentine SoilVAKILQRGKNRIAEPVPVYGFGLGVTKTKAKGPATKMAHGFAIAVAGARAAKLNARQRKFSKMMAPQVANDKTKELLTVKLQNNLYLSVVQHRFDIVIICK*
Ga0126308_1018491123300010040Serpentine SoilMAKILHRGKNKIGEPVPVYGFGLGSTKTKAKAPAIKMAHGFAIAVAGARAAEFRCPTEEFSKMMGPRVANDKTTELVTVKLQNNLYLSVVQRRFDIVIVCQ*
Ga0126380_1159805213300010043Tropical Forest SoilVAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIDMAHGFASAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
Ga0126384_1165978413300010046Tropical Forest SoilMAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIGMAHGFANTVAAARAAELQCPAEECPKMIRPQVVNEKTTELLTVILQANLYLSVVR
Ga0126382_1048362613300010047Tropical Forest SoilVVKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIDMAHGFASAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRISFDILIFCQ*
Ga0126373_1087999923300010048Tropical Forest SoilVPVHGFGVGPNKTKAKGVAIHMAHGFANAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
Ga0126376_1042571433300010359Tropical Forest SoilVTGQRRNTTVAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIDMAHGFASAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
Ga0126376_1100610633300010359Tropical Forest SoilMVKILHCRKNKIAAAVPVYGFGLGPNKAKAKSAARNMAHGFAIAVAVARAAALKCPTEKCSKMIDPLVANEQTTELLTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0126372_1295305313300010360Tropical Forest SoilMAKILHRGKNKIAEPIPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAARVAEFKYPTEECSKMTGPLVANDKTTELVTVKLQNNLYLSVV
Ga0126378_1026724323300010361Tropical Forest SoilVGDYRNDDKDTALWINKVPEPVPVHGFGLGPNKAKAKSVATHMAANAVAAARAAQLQCPKEECPKMRRLQAANEKTTELLAVKLQSNLYLSVVRRSFDILIFCQ*
Ga0126378_1312709313300010361Tropical Forest SoilVPKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIHMAHGFANAVAATRAAELQCPAEECPKMIRPQVVNEKTTELLTVILQANLYSFSGENQLRHSNFLSVKPLT*
Ga0126379_1214911523300010366Tropical Forest SoilPVYGFGLGPNKAKAKSAARNMAHGFAIAVAVARAAALKCPTEKCSKMIDPLVANEQTTELLTVKLQNNLYLSVRSFDIVIFCQ*
Ga0126381_10139747423300010376Tropical Forest SoilVPKILHCGKNKITESVPVHGFGVGPNKTKAKGVAIHMAHGFANAVAAARAAELQCPIEECPKMIRPQVVNEKTTELLTVILQAN
Ga0126381_10231969133300010376Tropical Forest SoilMVKILHCGKNKIAAAVPVYGFGLGPNKAKAKSAARNMAHGFAIAVAAARAAELKCPTEECSKMIGPLVATEATTELVTVKLQNNLYLSVRSFDIVIFCQ*
Ga0126381_10253249833300010376Tropical Forest SoilMAKILQCGKNTITEPLPVHGFGLGPNKARAKRVAIQMAHGFANAVATARALELQCPNAKCPYMTHPQIANEKTTELLTVKLQDNLYLSVVRGSFDIIIFCR*
Ga0126381_10328924813300010376Tropical Forest SoilMTGQRRHTTVAKILHCGKNKITESDPVHGFGVGPNKTKAKSVAIHMAHGFANAVAATRAAELQCPAEECPKMIRPQVVNEKTTELLTVILQANLY
Ga0126381_10395854723300010376Tropical Forest SoilMTKILHCGINKVPEPVPVHGFGVGPNKTKAKSVATHMAHGFASAVAAACAAELQCAKECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVICCQ*
Ga0126383_1001148933300010398Tropical Forest SoilVGDYRNDDKDAALWINKVPEPVPVHGFGLGPNKAKAKSVATHMAANAVAAARAAQLQCPKEECPKMRRLQAANEKTTELLAVKLQSNLYLSVVRRSFDILIFCQ*
Ga0126383_1046542523300010398Tropical Forest SoilMAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIDMAHGFASAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRISFDILIFCQ*
Ga0126383_1339013423300010398Tropical Forest SoilMTKILHCGINKVPEPVPVHGFGVGPNKTKAKSVATHMAHGFASAVAAACAAELQCPKECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0137383_1024967233300012199Vadose Zone SoilMTKILRCGINKVREPVPVHGFGLGPNKAKAKSVATHMAHGFAIAVATARATELQCPKEECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0137381_1002462133300012207Vadose Zone SoilMTKILRCGINKVREPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAGARAAELQCPKEECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0137376_1077390723300012208Vadose Zone SoilMAKILDCGKNKIAEPAPAHGFGMGGIKAEAKHAAIDMAHGFAKLIATAHTAEFKCPREECPKMIGPQVANEKTTELLTVKLENNLYLSVVRRSFDIVIFCQ*
Ga0137370_1030007023300012285Vadose Zone SoilMTKILRCGINKVPEPVPGFGLGPNKAKAKSVATHMAHGFAIAVATARATELQCPKEECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0137370_1053985423300012285Vadose Zone SoilMAILLVCGENKIPAPVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTVDIKVSCQ*
Ga0137371_1138326813300012356Vadose Zone SoilGINKVREPVPVHGFGLGPNKAKAKSVATHMAHGFANAVAGARAAELQCPKEECPKMRRPQVANEKTTELFTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0137385_1075122013300012359Vadose Zone SoilMTKILRCGINRVPEPVPVHGFGLGPNKAKAKSVATHMAHGFAIAVATARATELQCPKEECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ*
Ga0137358_1004067323300012582Vadose Zone SoilMTKILRCGINKVPEPVPVHGFGLGPNKAKAKSKATHMAHGFATAVAAARAAELQCPKEECPKMRRPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0157303_1015564823300012896SoilMTKVLQCGKNKILEPVPVHGFGVGPIKAKAKSTAMDMAHGFANLVAARRAAELKCPTDEGCPKMIGLQVANEKTTELLTVKLQNNLYLSVVRRSFDIV
Ga0137404_1174729423300012929Vadose Zone SoilCGINKVPEPVPVHGFGLGPNKAKAKSKATHMAHGFATAVAAARAAELQCPKEECPKMRRPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ*
Ga0137407_1062530213300012930Vadose Zone SoilMAILLVCGENKIPASVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ*
Ga0126375_1086927113300012948Tropical Forest SoilVGDYRNDDKDTALWINKVPEPVAVHGFGLGPNKAKAKSVATHMAANAVAAARAAQLQCPKEECPKMRRLQAANEKTTELLAVKLQSNLYLSVVRRSFDIVICCQ*
Ga0126375_1098902813300012948Tropical Forest SoilSVPVHGFGVGPNKTKAKSVAIDMAHGFANAVAASRAAELHCPTEECPKMIRPQVINEKTTELLTVILQANLYLSVVRTSFDILIFCQ*
Ga0126369_1195825413300012971Tropical Forest SoilMTGQRRHTTVAKILHCGKNKITESDPVHGFGVGPNKTKAKSVAIHMAHGFANAVAATRAAELQCPAEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRISFDILIFCQ*
Ga0157377_1160852513300014745Miscanthus RhizosphereSRRMFAVTRPLTIADETNMAKILHCGKNKIGGDVPVHGFGIGLGKVKAKSMAMDMAHGFATAVAATRASGLECPTEECPKMRGPQVANLKTTELLTVKLQNNLYLSVVRRTFDILVFCQ*
Ga0182036_1111354613300016270SoilCQRYCTAAKTRLSSRPPVHGFGMGSNKAKAKKAAMDMAHGFANLVAAAHAAEFQCPTNEGCPKMTRPQVASEKTTELLTVELQKDLYLSVVRRSFDIVIFCQ
Ga0182041_1067872723300016294SoilMLAKILRCGKNKIAESVPVHGFGVGPNKTKAKSVAIHMAHGFASAVAAARAAELQCPTEECPKMIRPQVLNEKTTELLTVILQANLYLSVVR
Ga0182040_1044914533300016387SoilMTKILRCGINKVPEPVPVHGFGVGPNKTKAKSVATHMAHGFASAVAAACAAELQCPRECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIAIFCQ
Ga0182039_1151242523300016422SoilMTKILRCGINKVPEPVPVHGFGVGPNKTKAKSVATHMAHGFASAVAAACAAELQCPRESPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIAIFCQ
Ga0182038_1132159323300016445SoilEVLLTKGHPPGGPSRIGVQKWVITGTMTKILRCGINKVPEPVPVHGFGVGPNKTKAKSVATHMAHGFASAVAAACAAELQCPRECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ
Ga0134112_1021608223300017656Grasslands SoilMAILLVCGENKIPAPVPVHGFGIGPTKANAKSVAMDRAHAFANAVAADRTKKWECPKDCPKKIGPQVAKEKTTELLTVKLDKNLYLSVVRRTFNIKVSCQ
Ga0066655_1003545623300018431Grasslands SoilVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVAKEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0066667_1182644813300018433Grasslands SoilMAILLVCGENKIPAPVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0066669_1111587723300018482Grasslands SoilMAKILDCGKNKIAEPAPAHGFGMGGTKAEAKHAAIDMAHGFAKLIATAHTAEFKCPREECPKMIGPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ
Ga0137408_111433023300019789Vadose Zone SoilMAILLVCGENKIPASVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0137408_118345113300019789Vadose Zone SoilMAILLVCGENKIPASVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKNCPKKIGPQVAKEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0137408_129783823300019789Vadose Zone SoilVPVHGFGMGPTKAKAKSVAMDMAHGFANAVAAERAAKSQCPTGEECPKMIGPQVANQKTTELVTVKLQNNVYLSVVRRSFDIVVFCQ
Ga0193700_106275823300019873SoilAEPEPVHGFGMGPTKAKAKSVAMDMAHGFANAVAAARAAKSQCPTGEECPKMIGPQVANEKTTELVTVKLQNNLYLSVVRRSFDIVVFCQ
Ga0179594_10000383113300020170Vadose Zone SoilMAKILDCGKNKITEPVPVHGFGMGPTKAKAKSVAMDMAHGFANAVAAARAAKLQCPTEECPKMIGPQVANEKTTELVTVKLQANLYLSVVKRSFDIVIFCQ
Ga0210381_1004858123300021078Groundwater SedimentVAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIHMAHGFANAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ
Ga0210402_1090979023300021478SoilSLHLWFWTCSRPAVRGNTTSGRSRTNQTMAKMLHCGENKIGGAVPVHGFGMGPNKAKAKSVAKGMAHGFANAVAATRTLGLQCPTEECPKMIGPQVAHEKTTELLTVKLQNNLYLSVVRRTFDIVIFCR
Ga0224452_123475613300022534Groundwater SedimentSLHPSFRLATAGSCQVKRRVTDQRRNTTVAKILHCGKNKITESVPVHGFGVGPNKTKAKSVAIHMAHGFANAVAAARAAELQCPTEECPKMIRPQVVNEKTTELLTVILQANLYLSVVRASFDILIFCQ
Ga0179589_1046458423300024288Vadose Zone SoilMTKILRCGINKVPEPVPVHGFGLGPNKAKAKSKATHMAHGFATAVAAARAAELQCPKEECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ
Ga0209468_102678123300026306SoilVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVAKEKTTELLTVKLDKNLYLSVVRRTFNIKVSCQ
Ga0209470_126332713300026324SoilVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0209056_1052747813300026538SoilVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLS
Ga0209161_1039721913300026548SoilMAKILDCGKNKIAEPAPAHGFGMGGTKAEAKHAAIDMAHGFAKLIATAHTAEFKCPREECPKMIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0209648_1004525753300026551Grasslands SoilMAKILACGKNKIAEPVPVHGFGVGANKAEAKSAAMDMAHAFANLVAATRAAKSKCPKEECPKMIGLQVANEKTTELLTVKLQNNLYLSVVRRSFDTVIFCQ
Ga0209648_1062141813300026551Grasslands SoilMAKMLQCGKNTISEPEPVHGFGVGPTKAKAKGVAMDMAHGFASVVAATRATELKCPTDEGCPKMIGLQVANEKTTELVTVKLQNNLYLSVVRRSFDIVIFCQ
Ga0208072_10349223300026791SoilMAILLLCGENKIPAPVPVHGFGIGPTKANAKSVAMDLAHAFANAVAADRTKKWECPKDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRIFDIKVSCQ
Ga0307317_1007915323300028720SoilRRKQKTAKILHCGKNKIAEPEPVHGFGMGPTKAKAKSVAMDMAHGFANAVAAARAAKSQCPTGEECPKMIGPQVANEKTTELVTVKLQNNLYLSVVRRSFDIVVFCQ
Ga0307296_1010506213300028819SoilVPVHGFGMGPTKAKAKSVAMDMAHGFANAVAAARAAKSQCPTGEECPKMIGPQVANQKTTELVTVKLQNNLYLSVVRRSFDIVVFCQ
Ga0075377_1168914913300030844SoilMAILLVCGENKIPAPVPVHGFGIGRTKANAKSVAMDLAHAFANAVAADRTKKWECPTDCPKKIGPQVANEKTTELLTVKLEMQLYLSVVRRTFDIKVSCQ
Ga0075386_1208702813300030916SoilMTKILRCGKNKVPEPVPVHGFGLGPNKAHAKSVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCR
Ga0170824_10121601623300031231Forest SoilMAKMLHCGKNKIGGAVPVHGFGVGPNKAKAKSVAMDMAHGFANVVATTRALGLQCPTEECPKMMCPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFLPVKPLTQRCDSRLRCD
Ga0170824_10823984923300031231Forest SoilMTKILRCGKNKVPEPVPVHGFGLGPNKAHAKSVAIHMAHGFANAVAAARAVELQCPKEKCPKKGHPQVANEKTAELLTVKLQNNLYLSVVRRSFDIVIFCR
Ga0170824_11459823723300031231Forest SoilMAILPVCGKNKIPAPVPVHGFGIGRAKANAKSVAMDLAHAFANAVAADRTKKWECPTDCPKKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0170824_11639276823300031231Forest SoilMAKILNCGKNKITEPVPVHGFGIGPSKAKAKSVAMDVAHAFANSVAAARAAKLQCPTEECPKMIDPQVANEKTIELVTVKLQDSLYLSVVRRTFDIVVFCQ
Ga0170824_11672571113300031231Forest SoilMAKVLQCGKNKIPEPVPVHGFGVGPTKAKAKSAAMDMAHGFANLVAATRAAELKCPTDEGCPKMIGLQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ
Ga0170824_12341655823300031231Forest SoilMPKILDCGKNKIAEPVPVHGFGLGPNKAHAKSVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQDNLYLSVVRRSFDIVIFC
Ga0170820_1633980923300031446Forest SoilMAKVLQCGKNKIPEPVPVHGFGVGPTKAKAKSAAMDMAHGFANLVAATRAVKLKCPTDEGCPKMIGLQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCQ
Ga0170819_1467696413300031469Forest SoilMTKILDCGKNKIAEPVPVHGFGLGPNKAHAKSVAIHMAHGFANAVAAARAVELQCPKEKCPKMRHPQVANEKTTELLTVKLQNNLYLSVVRRSFDIVIFCR
Ga0170819_1604537623300031469Forest SoilMAILLVCGKNKTPAPLPVHGFGIGSTKANAKSVAKDLAHAFANAVAAGRAKKWKCPKDCPKKVGPQVANEKT
Ga0170818_10065501013300031474Forest SoilMAKMLHCGKNKIGGAVPLHGFGVGPNKAKAKSVAMDMAHGFANVVATTRALGLQCPTEECPKMMCPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFLPVKPLTQRCDSRLRCD
Ga0170818_10361609633300031474Forest SoilMAILPVCGKNKIPAPVPVHGFGIGRTKANAKSVAMDLAHTFANAVAADRTKKWECPKDCPTKIGPQVANEKTTELLTVKLDKNLYLSVVRRTFDIKVSCQ
Ga0318516_1072895623300031543SoilMTKILRCGINKVPEPVPVHGFGVGPNKTKAKSVATHMAHGFASAVAAACAAELQCPRECPKMRRPQVANEKTTELLTVKLQSNLYLSVVRRSFDIVIFCQ
Ga0310915_1068719323300031573SoilMLAKILHCGKNKIAESVPVHGFGVGPNKTKAKSVAIHMAHGFASAVAAARAAELQCPTEECPKMIRPQVLNEKTTELLTVILQANLYLSVVRTSFDIVIFCQ
Ga0307469_1091134813300031720Hardwood Forest SoilVAKILHCGKNKITESVPVHGFGVGRNKTKAKSVAIDMAHGFANAVAAARAAELQCPTEECPKMIHPQVVNEKTTELLTVILQADLYLSVVRTSFDILIFCQ
Ga0306918_1094831523300031744SoilMARVLQCGKNKIAEPVPVHGFGVGSNKVKAKSAAMDMAHGFANLVAATRAAELRCPKEECPKMIGLQVANEKTTELLTVKLQDDLY
Ga0306919_1025148523300031879SoilRSAFPMAKILHCGKNKIAEPERVYGFGLGPNKAQAKSAAMNMAHGFAIGVAEARAAGFKCPTEKCSKMMRPLVGSEKTTELMTVKLQNHLYLSVVQRSFDIVIVCH
Ga0306925_1005620293300031890SoilMARVLQCGKNKIAEPVPVHGFGVGSNKVKAKSAAMDMAHGFANLVAATRAAELRCPKEECPKMIGLQVANEKTTELLTVKLQDDLYLSVMRRSFDIVIFCR
Ga0306926_1014646813300031954SoilMAKILHCGKNKIAEPERVYGFGLGPNKAQAKSAAMNMAHGFAIGVAEARAAGFKCPTEKCSKMMRPLVGSEKTTELMTVKLQNHLYLSVVQRSFDIVIVCH
Ga0306924_1030212323300032076SoilMARVLQCGKNKIAEPVPVHGFGVGSIKVKAKSAAMDMAHGFANLVAATRAAELRCPKEECPKMIGLQVANEKTTELLTVKLQDDLYLSVMRRSFDIVIFCR
Ga0306924_1192437413300032076SoilVHGFGMGSNKAKAKKAAMDMAHGFANLVAAAHAAEFQCPTNEGCPKMTRPQVASEKTTELLTVELQKDLYLSVVRRSFDIVIFCQ
Ga0307471_10014628333300032180Hardwood Forest SoilVAKILHCGKNKITESVPVHGFGVGRNKTKAKSVAIDMAHGFANAVAAARAAELQCPTEECPKMIHPQVVNEKTTELLTVILQANLYLSVVRTSFDILIFCQ
Ga0306920_10078337023300032261SoilMLAKILHCGKNKIAESVPVHGFGVGPNKTKAKSVAIHMAHAVAAARAAELQCPTEECPKMIRPQVLNEKTTELLTVILQANLYLSVVRTSFDIVIFCQ
Ga0310810_1004897623300033412SoilMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAARVAKVKCPTEEFSKMTGPLVANDKTTELVTVKLKDNLYLSVVQSRFDIVILCQ
Ga0310810_1018684013300033412SoilMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKGPAMKMAHGFAIAVAGARAAEFKCPTEEFSKMMGPRVANDKTTELVTVKLQNNLYLSVVQRRFDIVIVCQ
Ga0310810_1118177413300033412SoilPGRQTMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAARAAKVKCPTEEFSKMMGPLVANDKTTELVTVKLQNNLYLSVVQRRFDIVIVCQ
Ga0310811_1128317413300033475SoilMAKILHRGKNKIAEPVPVYGFGLGPTKTKAKNPAMKMAHGFAIAAAAARAAEFKCPTEEFSKMMGPLVANDKTTELVTVKLQNNLYLSVVQRRFDIVIVCQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.