NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F023652



Overview

Basic Information
Family ID F023652
Family Type Metagenome / Metatranscriptome
Number of Sequences 209
Average Sequence Length 70 residues
Representative Sequence MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRAADR
Number of Associated Samples 118
Number of Associated Scaffolds 209

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 36.07 %
% of genes near scaffold ends (potentially truncated) 22.01 %
% of genes from short scaffolds (< 2000 bps) 82.30 %
Associated GOLD sequencing projects 112
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.67

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model: profile logo (rendered with Skylign)

Most Common Taxonomy
Group Unclassified (68.421 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(23.923 % of family members)
Environment Ontology (ENVO) Unclassified
(52.632 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(78.469 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 59.60%, β-sheet 0.00%, Coil/Unstructured 40.40%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.67

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
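
The confidence figures above can be read programmatically. The following is a minimal Python sketch, not part of NMPFamsDB: the function names are hypothetical, and it assumes the upper bound of each pLDDT band is inclusive. It uses only the four pLDDT bands and the pTM < 0.7 cutoff stated on this page.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT value (0-100) to one of the four bands listed on the page.

    Assumption: each band's upper bound is inclusive (e.g. 50.0 falls in "0-50").
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


def is_low_confidence(ptm: float, threshold: float = 0.7) -> bool:
    """Return True when a model falls below the pTM cutoff quoted on the page."""
    return ptm < threshold


print(plddt_band(85.0))         # 71-90
print(is_low_confidence(0.67))  # True
```

With the pTM-score of 0.67 reported for F023652, `is_low_confidence` returns `True`, which matches the "Low Quality Model" label.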



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 209 Family Scaffolds
PF07080	DUF1348	2.87
PF00072	Response_reg	1.91
PF05598	DUF772	1.44
PF01699	Na_Ca_ex	0.96
PF13628	DUF4142	0.96
PF03972	MmgE_PrpD	0.48
PF02687	FtsX	0.48
PF00239	Resolvase	0.48
PF13561	adh_short_C2	0.48
PF08734	GYD	0.48
PF00135	COesterase	0.48
PF00149	Metallophos	0.48
PF14833	NAD_binding_11	0.48
PF13604	AAA_30	0.48
PF08402	TOBE_2	0.48
PF13751	DDE_Tnp_1_6	0.48
PF00005	ABC_tran	0.48
PF04632	FUSC	0.48

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 209 Family Scaffolds
COG3558	Uncharacterized conserved protein, nuclear transport factor 2 (NTF2) superfamily	Function unknown [S]	2.87
COG0387	Cation (Ca2+/Na+/K+)/H+ antiporter ChaA	Inorganic ion transport and metabolism [P]	0.96
COG0530	Ca2+/Na+ antiporter	Inorganic ion transport and metabolism [P]	0.96
COG1289	Uncharacterized membrane protein YccC	Function unknown [S]	0.48
COG1961	Site-specific DNA recombinase SpoIVCA/DNA invertase PinE	Replication, recombination and repair [L]	0.48
COG2079	2-methylcitrate dehydratase PrpD	Carbohydrate transport and metabolism [G]	0.48
COG2272	Carboxylesterase type B	Lipid transport and metabolism [I]	0.48
COG2452	Predicted site-specific integrase-resolvase	Mobilome: prophages, transposons [X]	0.48
COG4274	Uncharacterized conserved protein, contains GYD domain	Function unknown [S]	0.48
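
The frequency columns in the Pfam and COG tables are percentages of the 209 family scaffolds, so each value corresponds to a small whole number of scaffolds. The Python sketch below recovers those counts. It assumes the published percentages are rounded from integer counts over 209; the function name is hypothetical and only a few Pfam rows are shown.

```python
def scaffolds_from_percent(percent: float, total: int = 209) -> int:
    """Convert a rounded percentage back to the nearest whole number of scaffolds.

    Assumption: the percentage was computed as count / total * 100 and then rounded.
    """
    return round(percent / 100 * total)


# A few rows copied from the Pfam table on this page.
pfam_freq = {
    "PF07080": 2.87,
    "PF00072": 1.91,
    "PF05598": 1.44,
    "PF01699": 0.96,
    "PF03972": 0.48,
}

counts = {pfam: scaffolds_from_percent(pct) for pfam, pct in pfam_freq.items()}
print(counts)
# {'PF07080': 6, 'PF00072': 4, 'PF05598': 3, 'PF01699': 2, 'PF03972': 1}
```

Read this way, the most frequent neighbor (DUF1348) occurs on about 6 of the 209 scaffolds, and every 0.48 % entry corresponds to a single scaffold.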



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	68.42 %
All Organisms	root	All Organisms	31.58 %


Associated Scaffolds


Scaffold	Taxonomy	Length
3300001867|JGI12627J18819_10310114	Not Available	636
3300002245|JGIcombinedJ26739_100073698	All Organisms → cellular organisms → Bacteria → Proteobacteria	3154
3300003505|JGIcombinedJ51221_10060018	Not Available	1464
3300003505|JGIcombinedJ51221_10133998	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	997
3300004080|Ga0062385_10283223	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	941
3300004091|Ga0062387_100360369	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	964
3300004091|Ga0062387_101054942	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. WSM4349	627
3300004092|Ga0062389_100577078	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Oxalobacteraceae → Undibacterium → unclassified Undibacterium → Undibacterium sp. KW1	1285
3300004092|Ga0062389_101323238	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	906
3300004092|Ga0062389_103027461	Not Available	629
3300004633|Ga0066395_10559398	Not Available	666
3300005167|Ga0066672_10785245	Not Available	601
3300005184|Ga0066671_10595656	Not Available	716
3300005434|Ga0070709_10335902	Not Available	1113
3300005435|Ga0070714_101133416	Not Available	763
3300005436|Ga0070713_100035414	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	4018
3300005436|Ga0070713_101434440	Not Available	670
3300005437|Ga0070710_10125251	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1560
3300005437|Ga0070710_11311831	Not Available	538
3300005439|Ga0070711_102001221	Not Available	510
3300005454|Ga0066687_10352453	All Organisms → cellular organisms → Bacteria	843
3300005557|Ga0066704_10687636	All Organisms → cellular organisms → Bacteria → Proteobacteria	646
3300005561|Ga0066699_10305873	All Organisms → cellular organisms → Bacteria	1132
3300005568|Ga0066703_10463420	Not Available	758
3300005569|Ga0066705_10491891	All Organisms → cellular organisms → Bacteria	768
3300005587|Ga0066654_10273753	Not Available	896
3300005602|Ga0070762_10529405	Not Available	775
3300005602|Ga0070762_10532104	Not Available	774
3300005610|Ga0070763_10019766	All Organisms → cellular organisms → Bacteria	2952
3300005610|Ga0070763_10272227	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae	924
3300005610|Ga0070763_10481614	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	707
3300005610|Ga0070763_10640676	Not Available	619
3300005764|Ga0066903_100781229	Not Available	1703
3300005764|Ga0066903_101747140	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1186
3300005764|Ga0066903_103402642	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Rhodopila → Rhodopila globiformis	858
3300005764|Ga0066903_107522606	Not Available	562
3300005921|Ga0070766_10299209	Not Available	1033
3300005921|Ga0070766_10685994	Not Available	692
3300006046|Ga0066652_101980099	Not Available	520
3300006176|Ga0070765_100288823	Not Available	1513
3300006176|Ga0070765_100711933	All Organisms → cellular organisms → Bacteria → Proteobacteria	948
3300006176|Ga0070765_101059381	Not Available	766
3300006176|Ga0070765_101901217	All Organisms → cellular organisms → Bacteria	557
3300006755|Ga0079222_12320074	Not Available	536
3300006800|Ga0066660_11074473	All Organisms → cellular organisms → Bacteria	640
3300006893|Ga0073928_10149262	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1888
3300006954|Ga0079219_10636138	Not Available	792
3300006954|Ga0079219_10841516	All Organisms → cellular organisms → Bacteria	730
3300009143|Ga0099792_10581443	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	712
3300009143|Ga0099792_10659723	Not Available	673
3300009143|Ga0099792_11182734	Not Available	518
3300010046|Ga0126384_10946741	Not Available	781
3300010048|Ga0126373_10455346	All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → sulfur-oxidizing symbionts → Candidatus Thiodiazotropha → Candidatus Thiodiazotropha endolucinida	1315
3300010048|Ga0126373_10506396	Not Available	1250
3300010048|Ga0126373_11092749	Not Available	863
3300010048|Ga0126373_11431387	Not Available	757
3300010048|Ga0126373_12641104	Not Available	560
3300010048|Ga0126373_12714370	Not Available	553
3300010159|Ga0099796_10321871	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Oxalobacteraceae → Undibacterium → unclassified Undibacterium → Undibacterium sp. KW1	660
3300010335|Ga0134063_10486702	Not Available	616
3300010358|Ga0126370_10862822	Not Available	814
3300010359|Ga0126376_10742682	Not Available	950
3300010360|Ga0126372_10539143	Not Available	1105
3300010360|Ga0126372_11134321	Not Available	802
3300010360|Ga0126372_12120031	Not Available	610
3300010361|Ga0126378_10842993	Not Available	1025
3300010361|Ga0126378_11409826	Not Available	789
3300010361|Ga0126378_12846633	Not Available	552
3300010366|Ga0126379_10479236	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Rhodopila → Rhodopila globiformis	1312
3300010366|Ga0126379_10728639	Not Available	1087
3300010376|Ga0126381_100575782	All Organisms → cellular organisms → Bacteria → Proteobacteria	1599
3300010376|Ga0126381_101653666	Not Available	925
3300010376|Ga0126381_104478917	Not Available	540
3300010398|Ga0126383_10451366	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales	1335
3300010398|Ga0126383_10805730	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Rhodopila → Rhodopila globiformis	1022
3300010398|Ga0126383_10879078	All Organisms → cellular organisms → Bacteria	982
3300011120|Ga0150983_13678241	Not Available	697
3300011120|Ga0150983_14432064	Not Available	500
3300011120|Ga0150983_14697608	Not Available	749
3300012056|Ga0153925_1010990	Not Available	1039
3300012203|Ga0137399_10928445	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	733
3300012582|Ga0137358_10479123	Not Available	839
3300012683|Ga0137398_10552880	Not Available	794
3300012977|Ga0134087_10400930	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Oxalobacteraceae → Undibacterium → unclassified Undibacterium → Undibacterium sp. KW1	668
3300015357|Ga0134072_10213536	Not Available	675
3300016294|Ga0182041_10685190	Not Available	907
3300016319|Ga0182033_11847716	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	549
3300016319|Ga0182033_12188723	Not Available	504
3300016404|Ga0182037_10426518	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	1098
3300016404|Ga0182037_11233408	Not Available	658
3300018468|Ga0066662_11149525	Not Available	781
3300020579|Ga0210407_10908218	Not Available	675
3300020580|Ga0210403_10060629	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium	3030
3300020580|Ga0210403_10066691	All Organisms → cellular organisms → Bacteria → Proteobacteria	2889
3300020580|Ga0210403_10437190	Not Available	1066
3300020581|Ga0210399_10402902	Not Available	1141
3300020581|Ga0210399_10553225	Not Available	954
3300020581|Ga0210399_11596341	Not Available	503
3300020583|Ga0210401_10219089	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae	1761
3300020583|Ga0210401_10251911	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1625
3300020583|Ga0210401_10312384	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae	1433
3300020583|Ga0210401_10859891	Not Available	766
3300020583|Ga0210401_10947740	Not Available	719
3300021168|Ga0210406_11167583	Not Available	562
3300021170|Ga0210400_10205007	All Organisms → cellular organisms → Bacteria → Proteobacteria	1604
3300021170|Ga0210400_10522582	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	979
3300021170|Ga0210400_10634080	Not Available	880
3300021171|Ga0210405_10297992	Not Available	1274
3300021171|Ga0210405_10346308	Not Available	1173
3300021171|Ga0210405_10358988	Not Available	1149
3300021171|Ga0210405_10902300	Not Available	671
3300021171|Ga0210405_10985071	Not Available	636
3300021178|Ga0210408_10449833	All Organisms → cellular organisms → Bacteria → Proteobacteria	1025
3300021384|Ga0213876_10690683	Not Available	543
3300021402|Ga0210385_10171983	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	1562
3300021403|Ga0210397_10403982	Not Available	1022
3300021405|Ga0210387_11102956	Not Available	692
3300021441|Ga0213871_10007018	Not Available	2414
3300021478|Ga0210402_11049936	Not Available	742
3300021479|Ga0210410_10505268	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	1079
3300021560|Ga0126371_10087791	All Organisms → cellular organisms → Bacteria → Proteobacteria	3068
3300021560|Ga0126371_10322720	Not Available	1675
3300021560|Ga0126371_10420036	Not Available	1481
3300021560|Ga0126371_10808583	Not Available	1083
3300021560|Ga0126371_11090572	All Organisms → cellular organisms → Bacteria	938
3300021560|Ga0126371_11310192	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium	858
3300021560|Ga0126371_12339425	Not Available	646
3300021560|Ga0126371_12425962	Not Available	634
3300022532|Ga0242655_10029614	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae	1239
3300022557|Ga0212123_10397739	Not Available	925
3300022724|Ga0242665_10281247	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	576
3300024288|Ga0179589_10211717	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Oxalobacteraceae → Undibacterium → unclassified Undibacterium → Undibacterium sp. KW1	851
3300025915|Ga0207693_10039328	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	3724
3300025939|Ga0207665_10305821	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1190
3300026315|Ga0209686_1203878	Not Available	538
3300026316|Ga0209155_1153828	All Organisms → cellular organisms → Bacteria	771
3300026322|Ga0209687_1260073	Not Available	540
3300026490|Ga0257153_1047286	Not Available	883
3300026494|Ga0257159_1051895	Not Available	698
3300026527|Ga0209059_1272168	Not Available	570
3300026551|Ga0209648_10306061	Not Available	1133
3300026557|Ga0179587_10502294	Not Available	796
3300026984|Ga0208732_1000773	Not Available	2015
3300026984|Ga0208732_1001363	Not Available	1690
3300027047|Ga0208730_1018708	Not Available	777
3300027069|Ga0208859_1006840	Not Available	1226
3300027070|Ga0208365_1032467	Not Available	690
3300027172|Ga0208098_1010979	Not Available	830
3300027173|Ga0208097_1012684	Not Available	925
3300027174|Ga0207948_1050950	All Organisms → cellular organisms → Bacteria → Proteobacteria	501
3300027537|Ga0209419_1004356	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	2135
3300027701|Ga0209447_10018157	All Organisms → cellular organisms → Bacteria	1987
3300027783|Ga0209448_10097325	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	986
3300027855|Ga0209693_10020645	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	3176
3300027884|Ga0209275_10145018	Not Available	1254
3300027889|Ga0209380_10129383	Not Available	1470
3300027889|Ga0209380_10332303	All Organisms → cellular organisms → Bacteria	893
3300031561|Ga0318528_10472648	Not Available	673
3300031573|Ga0310915_10887683	Not Available	625
3300031573|Ga0310915_11050858	Not Available	567
3300031681|Ga0318572_10625577	Not Available	641
3300031715|Ga0307476_10193122	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus → Candidatus Solibacter usitatus Ellin6076	1476
3300031718|Ga0307474_11226113	Not Available	593
3300031748|Ga0318492_10088313	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0017	1509
3300031753|Ga0307477_10400651	Not Available	940
3300031754|Ga0307475_10318205	Not Available	1250
3300031754|Ga0307475_10381368	Not Available	1133
3300031754|Ga0307475_10401986	All Organisms → cellular organisms → Bacteria	1101
3300031754|Ga0307475_11099321	Not Available	622
3300031823|Ga0307478_10906528	Not Available	738
3300031880|Ga0318544_10391560	Not Available	540
3300031941|Ga0310912_11518780	Not Available	504
3300031947|Ga0310909_10867709	Not Available	742
3300031954|Ga0306926_12967465	Not Available	508
3300031962|Ga0307479_10591530	Not Available	1092
3300031962|Ga0307479_11480432	Not Available	636
3300032001|Ga0306922_10473622	All Organisms → cellular organisms → Bacteria	1336
3300032059|Ga0318533_10424616	Not Available	972
3300032076|Ga0306924_11719918	Not Available	657
3300032091|Ga0318577_10455725	Not Available	610
3300032180|Ga0307471_101638240	All Organisms → cellular organisms → Bacteria	799
3300032261|Ga0306920_102523371	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	707
3300032261|Ga0306920_104241915	Not Available	516
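
Each scaffold identifier above has the form `<Taxon OID>|<scaffold name>`, and the numeric prefix matches the Taxon OID column of the Associated Samples table. The Python sketch below splits identifiers on the pipe and groups scaffolds by sample. The helper names are hypothetical and are not part of any NMPFamsDB or IMG/M tooling; the sketch assumes Python 3.9 or later and that the Taxon OID is always purely numeric, as it is in every row listed here.

```python
from collections import defaultdict


def split_scaffold_id(identifier: str) -> tuple[str, str]:
    """Split '<taxon_oid>|<scaffold_name>' into its two parts.

    Assumption: the Taxon OID is all digits and the scaffold name is non-empty.
    """
    taxon_oid, sep, scaffold_name = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return taxon_oid, scaffold_name


def group_by_sample(identifiers: list[str]) -> dict[str, list[str]]:
    """Collect scaffold names under the Taxon OID of the sample they came from."""
    groups: dict[str, list[str]] = defaultdict(list)
    for identifier in identifiers:
        taxon_oid, scaffold_name = split_scaffold_id(identifier)
        groups[taxon_oid].append(scaffold_name)
    return dict(groups)


# Three identifiers copied from the scaffold table.
ids = [
    "3300001867|JGI12627J18819_10310114",
    "3300003505|JGIcombinedJ51221_10060018",
    "3300003505|JGIcombinedJ51221_10133998",
]
grouped = group_by_sample(ids)
print(len(grouped["3300003505"]))  # 2
```

Grouping this way makes it straightforward to join scaffold rows to sample names and habitat types by Taxon OID.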




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	23.92%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	19.62%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	7.66%
Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil	7.66%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	5.74%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	5.74%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	5.26%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	4.78%
Bog Forest Soil	Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil	4.31%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	3.83%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	3.83%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	1.44%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	1.44%
Iron-Sulfur Acid Spring	Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring	0.96%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	0.96%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	0.96%
Agricultural Soil	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil	0.48%
Plant Roots	Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots	0.48%
Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere	0.48%
Attine Ant Fungus Gardens	Host-Associated → Fungi → Mycelium → Unclassified → Unclassified → Attine Ant Fungus Gardens	0.48%




Associated Samples

Taxon OID	Sample Name	Habitat Type
3300001867	Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)	Environmental
3300002245	Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)	Environmental
3300003505	Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)	Environmental
3300004080	Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3	Environmental
3300004091	Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3	Environmental
3300004092	Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3	Environmental
3300004633	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio	Environmental
3300005167	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121	Environmental
3300005184	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005434	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG	Environmental
3300005435	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG	Environmental
3300005436	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG	Environmental
3300005437	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG	Environmental
3300005439	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG	Environmental
3300005454	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136	Environmental
3300005557	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153	Environmental
3300005561	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148	Environmental
3300005568	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152	Environmental
3300005569	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154	Environmental
3300005587	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103	Environmental
3300005602	Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2	Environmental
3300005610	Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3	Environmental
3300005764	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)	Environmental
3300005921	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6	Environmental
3300006046	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101	Environmental
3300006176	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5	Environmental
3300006755	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter	Environmental
3300006800	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109	Environmental
3300006893	Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG	Environmental
3300006954	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control	Environmental
3300009143	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2	Environmental
3300010046	Tropical forest soil microbial communities from Panama - MetaG Plot_36	Environmental
3300010048	Tropical forest soil microbial communities from Panama - MetaG Plot_11	Environmental
3300010159	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3	Environmental
3300010335	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015	Environmental
3300010358	Tropical forest soil microbial communities from Panama - MetaG Plot_3	Environmental
3300010359	Tropical forest soil microbial communities from Panama - MetaG Plot_15	Environmental
3300010360	Tropical forest soil microbial communities from Panama - MetaG Plot_6	Environmental
3300010361	Tropical forest soil microbial communities from Panama - MetaG Plot_23	Environmental
3300010366	Tropical forest soil microbial communities from Panama - MetaG Plot_24	Environmental
3300010376	Tropical forest soil microbial communities from Panama - MetaG Plot_28	Environmental
3300010398	Tropical forest soil microbial communities from Panama - MetaG Plot_35	Environmental
3300011120	Combined assembly of Microbial Forest Soil metaT	Environmental
3300012056	Attine ant fungus gardens microbial communities from New Jersey, USA - TSNJ009 MetaG	Host-Associated
3300012203	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG	Environmental
3300012582	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG	Environmental
3300012683	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG	Environmental
3300012971	Tropical forest soil microbial communities from Panama - MetaG Plot_1	Environmental
3300012977	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG	Environmental
3300015357	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG	Environmental
3300016294	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178	Environmental
3300016319	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H	Environmental
3300016404	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082	Environmental
3300018468	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111	Environmental
3300020579	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M	Environmental
3300020580	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M	Environmental
3300020581	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M	Environmental
3300020583	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M	Environmental
3300021168	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M	Environmental
3300021170	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M	Environmental
3300021171	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M	Environmental
3300021178	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M	Environmental
3300021384	Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9	Host-Associated
3300021402	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O	Environmental
3300021403	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O	Environmental
3300021405	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O	Environmental
3300021441	Rhizosphere microbial communities from Vellozia epidendroides in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R1	Host-Associated
3300021478	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M	Environmental
3300021479	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M	Environmental
3300021560	Tropical forest soil microbial communities from Panama - MetaG Plot_4	Environmental
3300022532	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)	Environmental
3300022557	Paint Pots_combined assembly	Environmental
3300022724	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)	Environmental
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026490Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-AEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300026984Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF048 (SPAdes)EnvironmentalOpen in IMG/M
3300027047Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF042 (SPAdes)EnvironmentalOpen in IMG/M
3300027069Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF002 (SPAdes)EnvironmentalOpen in IMG/M
3300027070Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF004 (SPAdes)EnvironmentalOpen in IMG/M
3300027172Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF038 (SPAdes)EnvironmentalOpen in IMG/M
3300027173Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF036 (SPAdes)EnvironmentalOpen in IMG/M
3300027174Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF040 (SPAdes)EnvironmentalOpen in IMG/M
3300027537Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027701Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027783Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300031561Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031681Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20EnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031748Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f22EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031880Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f25EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032091Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f25EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map of sample locations, powered by OpenStreetMap]


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12627J18819_103101141 | 3300001867 | Forest Soil | MPSDEHTLTLQEIAADRDCWSGAHAGEQYPELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGWAA
JGIcombinedJ26739_1000736988 | 3300002245 | Forest Soil | MSDEHTPTLXNLXADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRKLLDLARRYDARADRLSRRASVPMRGEKV*
JGIcombinedJ51221_100600183 | 3300003505 | Forest Soil | MPSDEHTLTLQEIAADRDCWSGAHAGEQYPELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGWAAGR*
JGIcombinedJ51221_101339981 | 3300003505 | Forest Soil | MSDEHTPTLQSLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADR
Ga0062385_102487681 | 3300004080 | Bog Forest Soil | MSDEHTLKLHDIPADRDRWSQHAGTHYRELAHWLRGVAAKCRLPNPQRELLGLASRYDRQADHFDRG*
Ga0062385_102832231 | 3300004080 | Bog Forest Soil | MSYEHTSTLQDIASDRDWWSVDAGKQYRELAHWLRGIAAKCGLPYSQRELLDLARRYDSRADRLSRRAAER*
Ga0062387_1003603691 | 3300004091 | Bog Forest Soil | MSYEHTSTLQDIASDRDWWSVDAGKQYRELAHWLRGIAAKCGLPYSQRELLDLARRYDSRADRLSRRAAER*S
Ga0062387_1010549421 | 3300004091 | Bog Forest Soil | MSNEHTPTLQQIAADRDWWSADAGKHYRELAHTLRGIAAKCRLPYTQKELLDLARRYDTRADRIGRGPIAR*
Ga0062389_1005770781 | 3300004092 | Bog Forest Soil | MSDEHTPTLQNIAADRDWWSANAGTHYREVAHWVRGIAAKCRLPYTQKELLELAGRYDARADRIGRGPVVR*
Ga0062389_1013232382 | 3300004092 | Bog Forest Soil | MSYEHTSTLQDIASDRDWWSVDAGKQYRELAHWLRGIAAKCGLPYSQRELLDLARRYDSR
Ga0062389_1030274612 | 3300004092 | Bog Forest Soil | MSDEHTPTLQQIAADRDRWSADAGKHYRELAHWLRGIAAKCRLPYTQKELLKLAKRYDTRADWIGRGPVAR*
Ga0066395_105593981 | 3300004633 | Tropical Forest Soil | MSDEHTLELQDIATDRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYDRQADYVDGR*
Ga0066395_109140542 | 3300004633 | Tropical Forest Soil | QRLAADRDRWSQHTGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADHHFGER*
Ga0066672_107852451 | 3300005167 | Soil | MSDEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRRAADR*
Ga0066671_105956562 | 3300005184 | Soil | MTNEHTPTLQYIAADRDCWSAHAGKHYHELAHSLRGIAAKCRLPYTQRELLDLARRYATRADRIGRGPIAR*
Ga0066388_1026401952 | 3300005332 | Tropical Forest Soil | KAGIPGECPVQRLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADHHFGER*
Ga0070709_103359023 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLNLARRYDARADRLSRGAKDR*
Ga0070714_1011334161 | 3300005435 | Agricultural Soil | MSDEHTPTLQDLAVDRNWWSVDAGRHYHELEHWLRGIAAKCRLPYSQRELLNLARRYDARADRLSRGAKDR*
Ga0070713_1000354142 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDEHTPTLQGLAVDRDWWSVNAGRHYHELAHWLRGIAAKCRLPYSQRELLALARRYDARADRLSRGAKDR*
Ga0070713_1014344401 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDEHTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR*
Ga0070710_101252513 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDEHTPTLQDLAVDRDWWSVNAGRHYHELAHWLRGIAAKCRLPYSQRELLALARRYDARADRLSRGAKDR*
Ga0070710_113118311 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR*
Ga0070711_1020012212 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MSNEHTPTLQNLAADRDWWSADAGRHYQELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSPGAKDR*
Ga0066687_103524532 | 3300005454 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELALWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRATDR*
Ga0066704_106876361 | 3300005557 | Soil | MSDEHTPTLQNIAADRVWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYD
Ga0066699_103058731 | 3300005561 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELALWLRGIAANCRLPYSQRELLDLARRYDARADRLSR
Ga0066703_104634201 | 3300005568 | Soil | MSDECTPTLQNLAADRDWWSADAGRHYHELALWLRGIAAKCRLPYSQRELLDLARRYDARADRLGRRATDR*
Ga0066705_104918911 | 3300005569 | Soil | MSDEHTPTLQNIAADRDWSSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRRAADR*
Ga0066654_102737534 | 3300005587 | Soil | LQNLAADRDWWSADAGRHYHELALWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRATDR*
Ga0070762_105294052 | 3300005602 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLNRGAKDR*
Ga0070762_105321042 | 3300005602 | Soil | NLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLAQRYDARADRLSRGAKDI*
Ga0070763_100197662 | 3300005610 | Soil | MSDEHTPTLQNIAADRDWWSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR*
Ga0070763_102722272 | 3300005610 | Soil | MTDEYTPTLQKTAADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRANVAVGSTAPVWGRPGSCALRDGRRR*
Ga0070763_104816141 | 3300005610 | Soil | MSDEHTPTLQNLAANRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLNRGAKDR*
Ga0070763_106406761 | 3300005610 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRKLLDLARRYDARADRLSRRASVPMREKV*
Ga0066903_1004948304 | 3300005764 | Tropical Forest Soil | MCMSDEHTLKLHPPKAGIPGECPVQRLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLVLASRYERQADHHFGGR*
Ga0066903_1007812293 | 3300005764 | Tropical Forest Soil | MSDEYTLTLQQSAADRNWWSAHAGKHYRDLAHRLRGIAASCRLPYTQKELLDLARRYDTTADGIGRDAAD*
Ga0066903_1017471402 | 3300005764 | Tropical Forest Soil | MSDDHTPAVPKIAADRDWWSVEAGKHYSELAHWLRGIAARCRLPYTQKELLKLARRYDSSADRLSRRVTER*
Ga0066903_1034026423 | 3300005764 | Tropical Forest Soil | MSDEHTPTLQQTAADRDWWSANAEKHYGELAHWLRGIAAKCRLPYSQDELLDLARRYDARADRIGRGAADR*
Ga0066903_1075226061 | 3300005764 | Tropical Forest Soil | MTDEHTPTLQDIAADRDWWSQNAGAHYCDLATWLREVAAKCRLPYTQKELPALAQRYEGRADRLSRI
Ga0070766_102992092 | 3300005921 | Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR*
Ga0070766_106859941 | 3300005921 | Soil | MTDEYTPTLQKTADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRANVAVGSTAAIWGRP
Ga0066652_1019800991 | 3300006046 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELALWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRAADR*
Ga0070765_1002888233 | 3300006176 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLAQRYDARADRLSRGAKDI*
Ga0070765_1007119332 | 3300006176 | Soil | MSDEHTPTLNNLDADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRKLLDLARRYDARADRLSRRASVPMRGEKV*
Ga0070765_1010593811 | 3300006176 | Soil | LAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLNRGAKDR
Ga0070765_1019012171 | 3300006176 | Soil | MPSDEHTLTLQEIAADRDCCSAHAGEQYRELAYWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR*
Ga0079222_123200742 | 3300006755 | Agricultural Soil | MTDKHTPTLQDIAADRDWWSADAGKHYRELAHWLRGIAAKCGLPYTQKELLKLAKRYDTIADRIGRRAVVR*
Ga0066660_110744732 | 3300006800 | Soil | MADEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRRAADR*
Ga0073928_101492624 | 3300006893 | Iron-Sulfur Acid Spring | MSDEHTPALQNIAADRDWWSADTGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRAKYR*
Ga0079219_106361382 | 3300006954 | Agricultural Soil | MMDEHTPTLQQIASDRDWWSADAGKHYRELAHWLRGIAANCRLPYTQKELLDLARRYDTRADRIGRQAVAR*
Ga0079219_108415162 | 3300006954 | Agricultural Soil | MPDEHTPTFQDVAVNRDWSSAHAGKHYRELAHSLRGIAATCRLPYTQKELLDLARRYDIRAHHIDPQAH*
Ga0099792_105814433 | 3300009143 | Vadose Zone Soil | FSDIPECMSDEHTPTLQNVAADRDWWSADTGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRATDR*
Ga0099792_106597232 | 3300009143 | Vadose Zone Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAARCRLPYSQRELLNLARRYDAR
Ga0099792_111827342 | 3300009143 | Vadose Zone Soil | MSDERTPTLQDIAADRDWWSADAGTHYRELVHWLRGIAAKCRLPYTQKELVNLARRYEIRADNLDRQAR*
Ga0099792_111966231 | 3300009143 | Vadose Zone Soil | MPGRARRRDWWSAGAGKHYRELAHWLRGIAAKSRLPYTQKELLDLARRYDTRAHRIGRGP
Ga0126384_109467411 | 3300010046 | Tropical Forest Soil | MSDEHTLKLQDIAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYDRQADYVD
Ga0126384_115491861 | 3300010046 | Tropical Forest Soil | MSDEHTLKLQDIAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADHHFGGR*
Ga0126373_100452636 | 3300010048 | Tropical Forest Soil | MSDEHTLKLHPPKAGIPGECPVQRLAADRDRWSQHAGKHYRELAHWLRGVAAKCWLPNPQRELLGLASRYERQADHHFGER*
Ga0126373_104553463 | 3300010048 | Tropical Forest Soil | MSDEHTPTLQQIAADRDWWSVEAGTHYRELAHWLRGIAARCRLPYTQKELLKLARRYDSSADRLSRRVTER*
Ga0126373_105063962 | 3300010048 | Tropical Forest Soil | MSDEYTPTLQDIAADRDLWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYDRQADHCDRG*
Ga0126373_110927492 | 3300010048 | Tropical Forest Soil | MTDEHTPTLQDIAADRDWWSQNAGAHYCDLATWLREVAAKCRLPYTQKELLTLARRYEATADRLDGARSS*
Ga0126373_114313872 | 3300010048 | Tropical Forest Soil | MSDEHTSTSQDIAADRNWWQGDAGEHYRDLAHWLRGIAAKCRLPNPQRELLDLARHYDIKAD
Ga0126373_117143431 | 3300010048 | Tropical Forest Soil | PPKAGIPGECPVQHLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLVLASRYERQADHHFGGR*
Ga0126373_126411041 | 3300010048 | Tropical Forest Soil | DEHTPTLQDIAADRNRWSADAGKHYRELAHALRGIAAKCQLPHTQKELLKLARRYYNTADRIGRRAAAR*
Ga0126373_127143701 | 3300010048 | Tropical Forest Soil | MSNEPPTLLDIATDKDRWSQHAGKHYRELAHCLRGVAAKCRLPAPHPQRELLGLARCYDRQADYVDGR*
Ga0099796_103218712 | 3300010159 | Vadose Zone Soil | MADEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARAADR
Ga0134063_104867021 | 3300010335 | Grasslands Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELALWLRGIATECRLPYSQRELLDLARRYDARADRLSRRATDR*
Ga0126370_107855172 | 3300010358 | Tropical Forest Soil | MPVQRLAADRDRWSQHAGKHYRELAHWLRGVAAKCWLPNPQRELLGLASRYERQADHHFGER*
Ga0126370_108628221 | 3300010358 | Tropical Forest Soil | MSDEHTLELQDIATDRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLSLASRYDRQADYVDGR*
Ga0126376_107426823 | 3300010359 | Tropical Forest Soil | MSDEYTLTLQQSAADRNWWSAHAGKHYRDLAHGLRGIAASCRLPYTQKELLDLARRYDTTADGIGRDAAD*
Ga0126376_109747352 | 3300010359 | Tropical Forest Soil | MSDEHTLKLQDIAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLVLASRYERQADHHFGGR*
Ga0126372_102872132 | 3300010360 | Tropical Forest Soil | MSDEHTLKLQHLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLVLASRYERQADHHFGGR*
Ga0126372_105391431 | 3300010360 | Tropical Forest Soil | MSDEHTLELQDIATDRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLSLASRYDRQADYVDRR*
Ga0126372_111343213 | 3300010360 | Tropical Forest Soil | MSDEHTPTLQQTAADRDWWSANAGKHYGELAHWLRGIAAKCRLPYSRGELLDLARRYDARADRIGSGAADR*
Ga0126372_121200311 | 3300010360 | Tropical Forest Soil | MSDEYTLTLQQSAADRNWWSAHAGKHYRDLAHRPRGIAASCRLPYSQDELLDLARRYDARADRIGRRAADR*
Ga0126378_108429932 | 3300010361 | Tropical Forest Soil | MSDEHTLELQDIAADRDRWSRHTGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYDRQADYVDGR*
Ga0126378_114098261 | 3300010361 | Tropical Forest Soil | MSDEHTSTSQDIAADRNWWQGDAGEHYRDLAHWLRGIAAKCRLPNPQRELLDLARHYDIKANHLSGCGAAP*
Ga0126378_117986931 | 3300010361 | Tropical Forest Soil | MSDEHTLKLQDIAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADHHFGER*
Ga0126378_128466332 | 3300010361 | Tropical Forest Soil | MSDEHTPTLQQIAADRGWWSVEAGKHYSELAHRLRGIAARCRLPYTQKELLKLARRYDSRADRLSRLVT*
Ga0126378_129877602 | 3300010361 | Tropical Forest Soil | MSDEHTLKLQDIAVDRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADRHFGGR*
Ga0126379_104792361 | 3300010366 | Tropical Forest Soil | MSDEYTLTLQQSAADRNWWSAHAGKHYRDLAHRLRGIAASCRLPYSQGELLDLARRCDARADRIGRGAADR*
Ga0126379_107286393 | 3300010366 | Tropical Forest Soil | VSDKHTPTLQHTAAYRDWWSANAGKHYGELAHWLRGIAAKCRLPYSQDELLDLARRYDARADRIGRGAADR*
Ga0126379_118236891 | 3300010366 | Tropical Forest Soil | MSDEHTLKLQDIAADRDRWSQHAGKHYRELAHWLRGVAAKCWLPNPQRELLGLASRYERQADHHFGER*
Ga0126379_127796552 | 3300010366 | Tropical Forest Soil | MSDEHTLTLRPPEARIPGECPVQDLAADEDRWLRKAGTHYREVAHWLRGVAARCHLPNPQRELLTLARRYERRADRFDTGWR*
Ga0126381_1005757823 | 3300010376 | Tropical Forest Soil | MTDENTPTLQDIAADRDWWSQHAGAHYRELAIWLREIAAKCRLPYTQKELLALARRYEARADRLSRTRA*
Ga0126381_1007095232 | 3300010376 | Tropical Forest Soil | MSDEHTLKLHPPKAGTPGECPVQHLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADRHFGGR*
Ga0126381_1012984263 | 3300010376 | Tropical Forest Soil | MSDEHTLTLRPPEAGIPGECPVQDLAADEDRWLRKAGTHYREVAHWLRGVAARCHLPNPQRELLTLARRYERRADRFDTGWR*
Ga0126381_1016536662 | 3300010376 | Tropical Forest Soil | MSDEHTLELQDIAADQDRWSRHAGKHYRELAHWLPGVAAKCRLPNPQRELLGLASRYDRQADYVDEC*
Ga0126381_1019947612 | 3300010376 | Tropical Forest Soil | MSDEHTPKLHPPKAGIPGECPVQHLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPN
Ga0126381_1031439462 | 3300010376 | Tropical Forest Soil | IPGECPVQRLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADHHFGER*
Ga0126381_1044789171 | 3300010376 | Tropical Forest Soil | MSDDHTPAVPKIAADRDWWSVEAGKHYSELAHWLRGIAARCRLPYTQKELLKLAHRYDARADRLNRRVT*
Ga0126383_104513661 | 3300010398 | Tropical Forest Soil | MSDEHTPTLQQIAADRDWWSVEAGTHYRELAHWLRGIAARCRLPYTQKELLKLARRYDSSADRLSRRVT
Ga0126383_108057303 | 3300010398 | Tropical Forest Soil | MSDEYTPTFQQTAVDRDWWSANAGKHYRELAHWLRGIAAKCRLPYSQGELLDLARRYDARADRIGCGAADR*
Ga0126383_108790781 | 3300010398 | Tropical Forest Soil | MSDEHTPKLQQIAADRDWWSDEAGKHYRELAHWLRGIAARCRLPYTQKELLKLARRYDSSADRLSRRVT*
Ga0126383_122200152 | 3300010398 | Tropical Forest Soil | MSDEHTLKLHPPKAGTPGECPVQHLATDRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADRHFGGR*
Ga0150983_136782411 | 3300011120 | Forest Soil | RDEHTLTLQEIAADRDLWSAHAGQHYGELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGWAAGR*
Ga0150983_144320642 | 3300011120 | Forest Soil | MSDEHTPTLQNLAADRDWWSADAGRHYRELAHWLRGIAAKCRLPYSQRELLDLARRHDARADRLSRRAADR*
Ga0150983_146976081 | 3300011120 | Forest Soil | MTDEHTRTLQKTAADRDWWSADAGKHYRELAHWLGGVAAKCRLPYTQKELLKLAKRYDTRADRFSRRVSDT*
Ga0153925_10109901 | 3300012056 | Attine Ant Fungus Gardens | MSDEHTPTLQNIAADRDCSSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGWAAGR*
Ga0137399_109284452 | 3300012203 | Vadose Zone Soil | MSGEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRATDR*
Ga0137358_104791233 | 3300012582 | Vadose Zone Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDAKADRLSRRATDR*
Ga0137398_105528801 | 3300012683 | Vadose Zone Soil | MADEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGIAAKCRLPYSQRELLDLAWRYDARGDRLSRRAADR*
Ga0126369_106699552 | 3300012971 | Tropical Forest Soil | MSDEHTLKLHPQKAGIPGECPVQRLAADRDRWSQHAGKHYRELAHWLRGVAAKCWLPNPQRELLGLASRYERQADHHFGER*
Ga0126369_127587831 | 3300012971 | Tropical Forest Soil | MSDEHTLKLQPPKAGIPGECPVQHLAADRDRWSQHAGKHYRELAHWLRGVAAKCPLPNPQRELLSL
Ga0134087_104009301 | 3300012977 | Grasslands Soil | MSDEHTPTLQNLAADRDWWSADAARHYHELALWLRGIAANCRLPYSQRELLDLARRYDARADRLSRRATDR*
Ga0134072_102135362 | 3300015357 | Grasslands Soil | ADRDWWSADAGRHYHELALWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRATDI*
Ga0182041_106851904 | 3300016294 | Soil | MSDEYTPTLQDIAADRDWWSAHAGAHYRELARWLRGIAAKCQLPYSRRELLDLARRYDARAERLNRQAADR
Ga0182033_118477161 | 3300016319 | Soil | TLQNTSADRDWWSAEAGNHYRELADRLRGVAAKCRLPYSRRELLDLARRYDARAGRLTRRADR
Ga0182033_121887231 | 3300016319 | Soil | HTVRNTDEHTPTLQQIATERDWWSENAGEHYRELAHGLRGVATKCRLPYTRKELLSLARRYETRADRLSRRP
Ga0182037_104265181 | 3300016404 | Soil | MPTFRDTAADRDWWQGDAKRHYRELAHWLRGIAAKCRLPNPQQELLDIARHYDIRANDLSRGGTAP
Ga0182037_111086462 | 3300016404 | Soil | MSDKHTLTLHLPEAGIPGECPVQDLAADEGRWLRKAGTHYREVAHWLRGVAARCHLPNPQRELLT
Ga0182037_112334081 | 3300016404 | Soil | NTAADRDWWSAEASNHYRELADRLRGVAAKCRLPYSRRELLDLARRYDARAGRLTRRADR
Ga0066662_111495251 | 3300018468 | Grasslands Soil | MSDEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLAWRYDARGDRLSRRAADR
Ga0210407_109082181 | 3300020579 | Soil | MSDEHTPTLQQIAADRDWWSERAGAHYRELAHWLRGVAAKCRLPATQKEILNLARRYESRANHLGRRGAAR
Ga0210403_100606292 | 3300020580 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLNRGAKDR
Ga0210403_100666913 | 3300020580 | Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQYRELAYWLRGIATKCRLPYTQKELPKLARRYDRFGGRGVGR
Ga0210403_104371901 | 3300020580 | Soil | MSDEHTPTLQNIAADRDWWSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR
Ga0210399_104029022 | 3300020581 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRRAADR
Ga0210399_105532252 | 3300020581 | Soil | MSDEHTPTLQNITADRDWSSADAGEQYRELAHWLRGIAAECRLPYTQKELLKLARRYDRLGGRGVGR
Ga0210399_115963411 | 3300020581 | Soil | MTDEYTPTLQKTAADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRANVAVGSTAAIWGRPGSCALRDGRRR
Ga0210401_102190891 | 3300020583 | Soil | MTDEYTPTLQKTAADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRADRFSRRVSDT
Ga0210401_102519111 | 3300020583 | Soil | MSNEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGLATKCRLPYSQRELLDLARRYDARADRLSRRAADR
Ga0210401_103123842 | 3300020583 | Soil | MSDEHTPTLQQIAADRDWWSERAGAHYCELAHWLRGVAAKCRLPATQKEILNLARRYESRASHLGRR
Ga0210401_108598912 | 3300020583 | Soil | MSDEHTPTLQNVAADRDWWSADAGRHYHEIAHWLRGIAAECRLPYTQKELLKLARRYDRLGGRGVGR
Ga0210401_109477401 | 3300020583 | Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQCRELAHWLRGIAAECRLPYTQKELLKLARRYDRLGGWGAGR
Ga0210406_111675831 | 3300021168 | Soil | MSDEHTPTLQNIAANRDWWSANAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0210400_102050071 | 3300021170 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELPHWLRGIAAKCRLPYSQRELLDLARRYDARADRPSPGAKDR
Ga0210400_105225823 | 3300021170 | Soil | MSDEHTPTLHNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRKLLDLARRYDARADRLSRRASVPMRGEKV
Ga0210400_106340801 | 3300021170 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSPGAKDR
Ga0210405_102979922 | 3300021171 | Soil | MSDEHTPTLQNLAADRDWWSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGWGAGR
Ga0210405_103463082 | 3300021171 | Soil | MSDEHTPTLQQIAADRDWWSERAGAHYRELAHWLRGVAAKCRLPATQNEILNLARRYESRANHLGRRGAAR
Ga0210405_103589882 | 3300021171 | Soil | MSDEHTLTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR
Ga0210405_109023002 | 3300021171 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELPHWLRGIAAKCRLPYSQRELLDLARRYDARADRPSP
Ga0210405_109850711 | 3300021171 | Soil | MTDEYTPTLQKTAADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRANVAVGSTAPVWGRPGSCALRDGRRR
Ga0210408_104498332 | 3300021178 | Soil | MPSDEHMLTLTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAKCRLPCTQKELLKLARRYDRLGGWGAGR
Ga0213876_106906831 | 3300021384 | Plant Roots | MSDEHTPTLQQLAADRDWWSADAGKHYRELAHWLRGIAAKCRLPYTQKELLDLARRYDSRADRIGHGNTNP
Ga0210385_101719831 | 3300021402 | Soil | MSDEHTPTLQNIAADRDWWSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRL
Ga0210397_104039822 | 3300021403 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRAADR
Ga0210387_111029562 | 3300021405 | Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQYRELAYWLRGIATKCRLPYTQKELLKLARRYDRFGGRGVGR
Ga0213871_100070183 | 3300021441 | Rhizosphere | MSDEHTPTLQEIAADRDLWSAEAGKHYRELAHSLRGIAAKSPLPHTQQELLDLARRYDTRADRIGRRAVDR
Ga0210402_110499362 | 3300021478 | Soil | MSNEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0210410_105052682 | 3300021479 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0126371_100877915 | 3300021560 | Tropical Forest Soil | MSDEHTPTLQQIAADRDWWSVEAGTHYRELAHWLRGIAARCRLPYTQKELLKLARRYDSSADRLSRRVTER
Ga0126371_103227202 | 3300021560 | Tropical Forest Soil | MSDEHTLELQDIATDRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLSLASRYDRQADYVDGR
Ga0126371_104200363 | 3300021560 | Tropical Forest Soil | MSDEYTLTLQQSAADRNWWSAHAGKHYRDLAHGLRGIAASCRLPYTQKELLDLARRYDTTADGIGRDAAD
Ga0126371_106044511 | 3300021560 | Tropical Forest Soil | MCMSDEHTLKLQRLAADRDRWSQHTGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADHHFGER
Ga0126371_108085831 | 3300021560 | Tropical Forest Soil | MSDEYTPTLQDIAADRDLWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYDRQADHCDRG
Ga0126371_109998002 | 3300021560 | Tropical Forest Soil | MSDEHTPKLHPPKAGIPGECPVQHLAADRDRWSQHAGKHYRELAHWLRGVAAKCRLPNPQRELLGLASRYERQADHHFGGR
Ga0126371_110905722 | 3300021560 | Tropical Forest Soil | MSDEHTPTLQQIAADRNFWSVEAGKHYRELAHRLRGIAARCRLPYTQKELLKLARRYVPL
Ga0126371_113101921 | 3300021560 | Tropical Forest Soil | GYGCMSDAHTLTLQDIAADRNQSTAEAGKHYRELAHALRGIAAKCQLPHTQKELLKLARRYDNTADRIGRRAAAR
Ga0126371_123394251 | 3300021560 | Tropical Forest Soil | HEYTPTLQQLTANRDCWSAGAGKHYRELAHRLRGIAAECRLPYTQKELLRLAGRYDAVADRLGGQVQLP
Ga0126371_124259622 | 3300021560 | Tropical Forest Soil | MSDEHTPTLQQTAADRDWWSANAGKHYGELAHWLRGIAAKCRLPYSQDELLDLARRYDARADRIGRGAADR
Ga0242655_100296141 | 3300022532 | Soil | QLFSSIPCGMTDEYTPTLQKTAADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRADRFSRRVSDT
Ga0212123_103977391 | 3300022557 | Iron-Sulfur Acid Spring | MSDEHTPALQNIAADRDWWSADAGKHYHQLAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRAKYR
Ga0242665_102812471 | 3300022724 | Soil | LQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRAADR
Ga0179589_102117172 | 3300024288 | Vadose Zone Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRATDR
Ga0207693_100393283 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MSGEHTLQNLAADRDWWSADTGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0207665_103058213 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0209686_12038782 | 3300026315 | Soil | MSDEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYCQRELLDLARRYDARADRLSRRAADR
Ga0209155_11538281 | 3300026316 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELALWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRATDR
Ga0209687_12600731 | 3300026322 | Soil | LQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRRAADR
Ga0257153_10472861 | 3300026490 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRKLLDLARRYDARADRLSRRATDR
Ga0257159_10518952 | 3300026494 | Soil | MSDEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0209059_12721681 | 3300026527 | Soil | MSDEHTPTLQNIAADRDWWSGDAGRHYHELAHWLRGLAAKCRLPYSQRELLDLARRYDARADRLSRRAADR
Ga0209648_103060613 | 3300026551 | Grasslands Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARTDRLSRRATDR
Ga0179587_105022942 | 3300026557 | Vadose Zone Soil | VLTGSFLWGMSDERTPTLQDIAADRDWWSADAGTHYRELVHWLRGIAAKCRLPYTQKELVNLARRYEMRADNLDRQAR
Ga0208732_10007731 | 3300026984 | Forest Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRKLLDLARRYDARADRLSRRASVPMRGEKV
Ga0208732_10013631 | 3300026984 | Forest Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR
Ga0208730_10187081 | 3300027047 | Forest Soil | MPSDEHTLTLQEIAADRDCWSGAHAGEQYPELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGWAAGR
Ga0208859_10068401 | 3300027069 | Forest Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAECRLPYTQKELLKLARRYDRLGGRGVGR
Ga0208365_10324672 | 3300027070 | Forest Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQCRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR
Ga0208098_10109792 | 3300027172 | Forest Soil | MPSDEHTLTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAKCRLPYTQSELLKLARRYDRLGGRGVGR
Ga0208097_10126842 | 3300027173 | Forest Soil | MSDEHTPTLQSLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLNRGAKDR
Ga0207948_10509501 | 3300027174 | Forest Soil | MSDEHTPTLQNIAADRDWWSERAGAHYRELAHWLRGVAAKCRLPATQKEILNLARRYESRANHLGRRG
Ga0209419_10043564 | 3300027537 | Forest Soil | MSDEHTPTLQNIAADRDWWSAHAGEQYRELAHWLRGIAAECRLSYTQKELLKLARRYDR
Ga0209447_100181571 | 3300027701 | Bog Forest Soil | MSYEHTSTLQDIASDRDWWSVDAGKQYRELAHWLRGIAAKCGLPYSQRELLDLARRYDSRADRLSRRAAER
Ga0209448_100973253 | 3300027783 | Bog Forest Soil | MGLPLQVRNRDWWSANAGKHYRELAHWLRGIAAKCRLPYSQRELLDLARRYDSRADRLSRRAAER
Ga0209693_100206451 | 3300027855 | Soil | MSDEHTPTLQNLAANRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLNRGAKDR
Ga0209275_101450181 | 3300027884 | Soil | MSDEHTPTLQNLAADRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLAQRYDARADRLSRGAKDI
Ga0209380_101293832 | 3300027889 | Soil | MTDEYTPTLQKTADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRADRFSRRVS
Ga0209380_103323031 | 3300027889 | Soil | DRDWWSADAGRHYHELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0318528_104726482 | 3300031561 | Soil | MSDGQMPTFRDTAADRDWWQGDAKRHYRELAHWLRGIAAKCRLPNPQQELLDIARHYDIRANDLSRGGTAP
Ga0310915_108876831 | 3300031573 | Soil | MSDEYTPTLQNTAADRDWWSAEAGNHYRELADRLRGVAAKCRLPYSRRELLDLARRYDARAGRLTRRADR
Ga0310915_110508581 | 3300031573 | Soil | SVTPCGMLDEHTPMPQDIAADRDWRSRDAGKHYRELAHWLRGIAAKCRFPGTQRELLSLARRYDARADDLSRRAAAS
Ga0318572_106255772 | 3300031681 | Soil | MLDEHTPMPQDIAAGDWRSRDAGKHYRELAHWLRGIAAKCRFPGTQRELLSLARRYDARADDLSRRAAASSSA
Ga0307476_101931222 | 3300031715 | Hardwood Forest Soil | MSDEHTPTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAECRLPYTQKELLKLARRYDRLGGRGVGR
Ga0307474_112261132 | 3300031718 | Hardwood Forest Soil | MSDEHTLTLQEIAADRDCWSAHAGEQCRELAHWLRGIAAECRLPYTQKELLKLARRYDRLGGRGVGR
Ga0318492_100883133 | 3300031748 | Soil | MTPSRRPTPQNIAADRDWWSRDAGKHYRELARWLRGIPAKCRLPGTQQELLNLARRYDARADHLS
Ga0307477_104006512 | 3300031753 | Hardwood Forest Soil | MPSDEHTLTLQEIAADRNCWSAHSGEQYRELAHWLRGIAAKCRLPYTQKELLKLARRYDRLGGRGVGR
Ga0307475_103182053 | 3300031754 | Hardwood Forest Soil | ECMSDEHTPTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAECRLPYTQKELLKLARRYDRLGGRGVGR
Ga0307475_103813683 | 3300031754 | Hardwood Forest Soil | MSDEHTLTLQDLAADRDWWSADAGSHYRELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRGAKDR
Ga0307475_104019862 | 3300031754 | Hardwood Forest Soil | LQKTAADRDWWSADAGKHYRELAHWLRGVAAKCRLPYTQKELLKLAKRYDTRADRLSRRVSDT
Ga0307475_110993211 | 3300031754 | Hardwood Forest Soil | MSDEHTLTLQEIAADRDCWSAHAGEQYRELAHWLRGIAAKCRLPYTQSELLKLARRYDRLGGRGVGR
Ga0307478_109065282 | 3300031823 | Hardwood Forest Soil | MSDEHTLTLQNLAADRDWWSADAGRHYRELAHWLRGIAAKCRLPYSQRELLDLARRYDARADRLSRRAADR
Ga0318544_103915601 | 3300031880 | Soil | CGMLDEHTPMPQDIAAGDWRSRDAGKHYRELAHWLRGIAAKCRFPGTQRELLSLARRYDARADDLSRRAAAS
Ga0310912_115187802 | 3300031941 | Soil | MTYEHTPTLQDIAADRDWWSAHAGAHYRELARWLRGIAAKCQLPYSRRELLDLARRYDARAERPNRQAADR
Ga0310909_108677091 | 3300031947 | Soil | MLDEHTPMPQDIAAGDWRSRDAGKHYRELAHWLRGIAAKCRFPGTQRELLSLARRYDARADDLSRRAAASSSARLDVTGAARI
Ga0306926_129674651 | 3300031954 | Soil | MSDEHTPTLQDIAADRDWWSAHAGAHYRELARWLRGIAAKCQLPYSRRELLDLAQRYDGRAKRLSRRAADR
Ga0307479_105915301 | 3300031962 | Hardwood Forest Soil | MSDEHTPTLQSLVADRDWWSADAGRHYHELAHWLRGIAAKCWLPYSQRELLDLARRYDARADRLSRRATGK
Ga0307479_114804321 | 3300031962 | Hardwood Forest Soil | MTDEYTPTLQKTAADRDWWSADAGKHYRELADWLRGVAAKCRLPYTQKELLKLAKRYDTRANVAVGSTAAVWGRPGSCALRDGRRR
Ga0306922_102554523 | 3300032001 | Soil | MSDKHTLTLHLPEAGIPGECPVQDLAADEGRWLRKAGTHYREVAHWLRGVAARCHLPNPQRELLTLARRYERRADRFDTGWR
Ga0306922_104736221 | 3300032001 | Soil | SCMSDGQMPTFRDTAADRDWWQGDAKRHYRELAHWLRGIAAKCRLPNPQQELLDIARHYDIRANDLSRGGTAP
Ga0318533_104246163 | 3300032059 | Soil | MSDEYTPTLQDIAADRDWWSQDAGRHYRELAHWLRGMASKCRLPYTRKELLDLARRYDARAERLNRQAADR
Ga0306924_117199181 | 3300032076 | Soil | MSDEYTPTLQDIAADRDWWSQDAGRHYRELAHWLRGMASKCRLPYTRKELLDLARRYDTRADRLG
Ga0318577_104557252 | 3300032091 | Soil | MLDEHTPMPQDIAADRDWRSRDAGKHYRELAHWLRGIAAKCRFPGTQRELLSLARRYDARADDL
Ga0307471_1016382403 | 3300032180 | Hardwood Forest Soil | MSDEHTPTLQDLAVDRDWWSVNAGRHYHELAHWLRGIAAKCRLPYSQRELLALARRYDARADRLSRRAKDR
Ga0306920_1025233712 | 3300032261 | Soil | MSDEHTPTLQQIAADRDWWSADAGAHYRELAHWLRGIAAKCRLPYSRRELLDLAQRYDGRAKRLSRRAADR
Ga0306920_1042419151 | 3300032261 | Soil | MSDEHTPTLQDIAADRDWWSAHAGAHYRELARWLRGIAAKCQLPYSRRELLDLARRYDARADRRSRQAADR


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"