NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F041291

Overview

Basic Information
Family ID F041291
Family Type Metagenome / Metatranscriptome
Number of Sequences 160
Average Sequence Length 120 residues
Representative Sequence MTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQDQQR
Number of Associated Samples 124
Number of Associated Scaffolds 160

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 15.00 %
% of genes near scaffold ends (potentially truncated) 88.75 %
% of genes from short scaffolds (< 2000 bps) 96.25 %
Associated GOLD sequencing projects 120
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.50


Most Common Taxonomy
Group Unclassified (75.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil (21.250 % of family members)
Environment Ontology (ENVO) Unclassified (27.500 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (41.875 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.78%, β-sheet: 11.11%, Coil/Unstructured: 61.11%

Predicted 3D Structure

pTM-score: 0.50

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 160 Family Scaffolds
PF01741 | MscL | 2.50
PF13298 | LigD_N | 2.50
PF03631 | Virul_fac_BrkB | 2.50
PF13378 | MR_MLE_C | 1.88
PF13653 | GDPD_2 | 1.25
PF11127 | DUF2892 | 1.25
PF01266 | DAO | 1.25
PF02464 | CinA | 1.25
PF01814 | Hemerythrin | 0.62
PF03976 | PPK2 | 0.62
PF00873 | ACR_tran | 0.62
PF00149 | Metallophos | 0.62
PF00571 | CBS | 0.62
PF14114 | DUF4286 | 0.62
PF13610 | DDE_Tnp_IS240 | 0.62
PF00903 | Glyoxalase | 0.62
PF07944 | Glyco_hydro_127 | 0.62
PF13701 | DDE_Tnp_1_4 | 0.62
PF13551 | HTH_29 | 0.62
PF13546 | DDE_5 | 0.62
PF00210 | Ferritin | 0.62
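
The Pfam frequencies listed here are easier to compare as scaffold counts. The sketch below assumes each percentage was computed as (scaffolds carrying the domain) / 160 and then rounded to two decimals; that rule is an assumption and is not stated on the page. The helper name `frequency_to_count` is illustrative only.

```python
# Total number of scaffolds in family F041291, as given in the table header.
FAMILY_SCAFFOLDS = 160

# A few percent frequencies copied from the Pfam table (one per distinct value).
pfam_frequency = {
    "PF01741": 2.50,  # MscL
    "PF13378": 1.88,  # MR_MLE_C
    "PF13653": 1.25,  # GDPD_2
    "PF01814": 0.62,  # Hemerythrin
}


def frequency_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Recover the nearest whole number of scaffolds from a rounded percentage.

    Assumes percent == 100 * count / total before rounding (hypothetical rule).
    """
    return round(percent * total / 100)


counts = {pfam: frequency_to_count(pct) for pfam, pct in pfam_frequency.items()}
for pfam, n in counts.items():
    print(f"{pfam}: {n} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under that assumption, 2.50 % corresponds to 4 scaffolds and 0.62 % to a single scaffold, so the lowest-frequency entries are each supported by one observation.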

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 160 Family Scaffolds
COG1295 | Uncharacterized membrane protein, BrkB/YihY/UPF0761 family (not an RNase) | Function unknown [S] | 2.50
COG1970 | Large-conductance mechanosensitive channel | Cell wall/membrane/envelope biogenesis [M] | 2.50
COG1546 | Nicotinamide mononucleotide (NMN) deamidase PncC | Coenzyme transport and metabolism [H] | 1.25
COG2326 | Polyphosphate kinase 2, PPK2 family | Energy production and conversion [C] | 0.62
COG3533 | Beta-L-arabinofuranosidase, GH127 family | Carbohydrate transport and metabolism [G] | 0.62



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 75.00 %
All Organisms | root | All Organisms | 25.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2228664021|ICCgaii200_c0702754 | Not Available | 942
3300000364|INPhiseqgaiiFebDRAFT_100625244 | Not Available | 684
3300000364|INPhiseqgaiiFebDRAFT_100629634 | Not Available | 1031
3300000550|F24TB_10333745 | Not Available | 2584
3300000955|JGI1027J12803_100438722 | Not Available | 1147
3300000955|JGI1027J12803_100751342 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1026
3300003324|soilH2_10260758 | Not Available | 1096
3300003911|JGI25405J52794_10086236 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Sinorhizobium/Ensifer group → Sinorhizobium → Sinorhizobium fredii group → Sinorhizobium fredii | 693
3300004633|Ga0066395_10183178 | All Organisms → cellular organisms → Bacteria | 1085
3300005172|Ga0066683_10906275 | Not Available | 505
3300005176|Ga0066679_10557420 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 747
3300005178|Ga0066688_10900386 | Not Available | 546
3300005179|Ga0066684_10585941 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella gemina | 749
3300005180|Ga0066685_11067751 | Not Available | 530
3300005181|Ga0066678_11003098 | Not Available | 541
3300005438|Ga0070701_10133567 | All Organisms → cellular organisms → Bacteria | 1412
3300005451|Ga0066681_10226307 | All Organisms → cellular organisms → Bacteria | 1127
3300005459|Ga0068867_100731363 | All Organisms → cellular organisms → Bacteria | 876
3300005471|Ga0070698_102059337 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 524
3300005557|Ga0066704_10475681 | Not Available | 825
3300006032|Ga0066696_10061240 | All Organisms → cellular organisms → Bacteria | 2149
3300006058|Ga0075432_10221966 | Not Available | 756
3300006196|Ga0075422_10591008 | Not Available | 513
3300006844|Ga0075428_100415333 | All Organisms → cellular organisms → Bacteria | 1442
3300006847|Ga0075431_100281512 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1684
3300006853|Ga0075420_100554868 | Not Available | 992
3300006853|Ga0075420_100964369 | Not Available | 733
3300006904|Ga0075424_100115971 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2829
3300006904|Ga0075424_101545605 | Not Available | 704
3300007255|Ga0099791_10397885 | Not Available | 663
3300007258|Ga0099793_10664800 | All Organisms → cellular organisms → Bacteria | 524
3300009090|Ga0099827_11354030 | All Organisms → cellular organisms → Bacteria | 619
3300009137|Ga0066709_101322986 | Not Available | 1055
3300009147|Ga0114129_10191697 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 2775
3300009147|Ga0114129_10664645 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 1343
3300009156|Ga0111538_11318717 | Not Available | 911
3300009156|Ga0111538_12117066 | Not Available | 707
3300009177|Ga0105248_10678966 | Not Available | 1162
3300009792|Ga0126374_11246961 | Not Available | 598
3300009822|Ga0105066_1108656 | Not Available | 616
3300010043|Ga0126380_10123068 | Not Available | 1606
3300010043|Ga0126380_10785472 | Not Available | 777
3300010047|Ga0126382_10698043 | Not Available | 851
3300010047|Ga0126382_11833339 | Not Available | 571
3300010071|Ga0127477_120399 | Not Available | 507
3300010083|Ga0127478_1064592 | Not Available | 550
3300010086|Ga0127496_1020130 | Not Available | 611
3300010103|Ga0127500_1075325 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 729
3300010106|Ga0127472_1086271 | Not Available | 637
3300010107|Ga0127494_1041118 | Not Available | 718
3300010114|Ga0127460_1016841 | Not Available | 677
3300010115|Ga0127495_1063808 | Not Available | 649
3300010119|Ga0127452_1038154 | All Organisms → cellular organisms → Bacteria | 613
3300010124|Ga0127498_1016994 | Not Available | 690
3300010124|Ga0127498_1028935 | Not Available | 792
3300010132|Ga0127455_1132458 | Not Available | 580
3300010132|Ga0127455_1159537 | Not Available | 625
3300010133|Ga0127459_1109502 | Not Available | 632
3300010139|Ga0127464_1003139 | Not Available | 573
3300010140|Ga0127456_1066732 | Not Available | 539
3300010145|Ga0126321_1110848 | Not Available | 630
3300010145|Ga0126321_1437750 | All Organisms → cellular organisms → Bacteria | 980
3300010145|Ga0126321_1463376 | Not Available | 615
3300010303|Ga0134082_10535695 | Not Available | 514
3300010359|Ga0126376_12879555 | Not Available | 531
3300010362|Ga0126377_10397587 | Not Available | 1388
3300010403|Ga0134123_11573687 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → Steroidobacter | 704
3300012203|Ga0137399_10156428 | All Organisms → cellular organisms → Bacteria | 1825
3300012205|Ga0137362_11053128 | Not Available | 692
3300012211|Ga0137377_11206807 | Not Available | 686
3300012212|Ga0150985_110933266 | All Organisms → cellular organisms → Bacteria | 583
3300012351|Ga0137386_10623588 | Not Available | 776
3300012355|Ga0137369_10573381 | Not Available | 788
3300012359|Ga0137385_10721593 | Not Available | 832
3300012360|Ga0137375_10193531 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 1932
3300012361|Ga0137360_10107018 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Salmonella → Salmonella enterica → Salmonella enterica subsp. enterica | 2151
3300012362|Ga0137361_11871676 | Not Available | 517
3300012376|Ga0134032_1068366 | Not Available | 663
3300012379|Ga0134058_1092109 | Not Available | 693
3300012383|Ga0134033_1064270 | Not Available | 565
3300012384|Ga0134036_1277772 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella gemina | 642
3300012393|Ga0134052_1223048 | Not Available | 509
3300012396|Ga0134057_1001676 | Not Available | 559
3300012397|Ga0134056_1252304 | Not Available | 535
3300012401|Ga0134055_1098169 | Not Available | 559
3300012402|Ga0134059_1073470 | Not Available | 847
3300012402|Ga0134059_1409871 | Not Available | 981
3300012403|Ga0134049_1310697 | Not Available | 540
3300012406|Ga0134053_1295222 | Not Available | 789
3300012410|Ga0134060_1491365 | All Organisms → cellular organisms → Bacteria | 761
3300012469|Ga0150984_103887024 | Not Available | 624
3300012948|Ga0126375_10123114 | Not Available | 1585
3300012948|Ga0126375_10186331 | Not Available | 1348
3300012975|Ga0134110_10423643 | Not Available | 594
3300013306|Ga0163162_10907889 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 994
3300013306|Ga0163162_11132896 | Not Available | 887
3300014157|Ga0134078_10618296 | Not Available | 521
3300015356|Ga0134073_10157533 | Not Available | 723
3300015358|Ga0134089_10340918 | Not Available | 630
3300017997|Ga0184610_1272095 | Not Available | 561
3300018056|Ga0184623_10154967 | All Organisms → cellular organisms → Bacteria | 1058
3300018466|Ga0190268_11926824 | Not Available | 539
3300019228|Ga0180119_1227665 | Not Available | 598
3300019279|Ga0184642_1104810 | Not Available | 504
3300019279|Ga0184642_1342093 | Not Available | 964
3300019279|Ga0184642_1657703 | Not Available | 515
3300019279|Ga0184642_1692549 | Not Available | 786
3300020081|Ga0206354_11688671 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → Steroidobacter | 690
3300021307|Ga0179585_1116887 | Not Available | 548
3300022195|Ga0222625_1082988 | Not Available | 654
3300022195|Ga0222625_1264418 | Not Available | 525
3300025927|Ga0207687_11136684 | All Organisms → cellular organisms → Bacteria | 671
3300025933|Ga0207706_11385980 | Not Available | 578
3300025941|Ga0207711_10618484 | Not Available | 1011
3300027873|Ga0209814_10332385 | Not Available | 663
3300027880|Ga0209481_10068413 | Not Available | 1675
3300027907|Ga0207428_11063013 | Not Available | 567
3300028587|Ga0247828_11085395 | Not Available | 529
3300028889|Ga0247827_10767667 | Not Available | 635
3300030829|Ga0308203_1015590 | Not Available | 936
3300030830|Ga0308205_1001095 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1863
3300030830|Ga0308205_1004342 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 1250
3300030830|Ga0308205_1054137 | Not Available | 540
3300030830|Ga0308205_1063995 | Not Available | 510
3300030902|Ga0308202_1004129 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 1717
3300030903|Ga0308206_1031118 | Not Available | 970
3300030903|Ga0308206_1047199 | Not Available | 843
3300030903|Ga0308206_1169337 | Not Available | 537
3300030905|Ga0308200_1181408 | Not Available | 504
3300030993|Ga0308190_1200868 | Not Available | 501
3300030998|Ga0073996_11758221 | Not Available | 600
3300031039|Ga0102760_10848453 | Not Available | 532
3300031054|Ga0102746_10727544 | Not Available | 515
3300031058|Ga0308189_10016055 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 1675
3300031058|Ga0308189_10118629 | Not Available | 865
3300031058|Ga0308189_10252541 | Not Available | 667
3300031058|Ga0308189_10272019 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 649
3300031089|Ga0102748_10949331 | Not Available | 561
3300031091|Ga0308201_10279948 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella palauensis | 586
3300031092|Ga0308204_10006935 | All Organisms → cellular organisms → Bacteria | 1889
3300031092|Ga0308204_10068467 | Not Available | 909
3300031092|Ga0308204_10102171 | All Organisms → cellular organisms → Bacteria | 792
3300031092|Ga0308204_10135944 | Not Available | 716
3300031093|Ga0308197_10295555 | Not Available | 595
3300031093|Ga0308197_10372466 | Not Available | 550
3300031114|Ga0308187_10096438 | Not Available | 911
3300031114|Ga0308187_10301974 | Not Available | 601
3300031114|Ga0308187_10395417 | Not Available | 545
3300031114|Ga0308187_10499811 | Not Available | 500
3300031847|Ga0310907_10501542 | Not Available | 649
3300031943|Ga0310885_10394866 | Not Available | 735
3300032013|Ga0310906_10012953 | All Organisms → cellular organisms → Bacteria | 3275
3300032075|Ga0310890_10987410 | Not Available | 677
3300032075|Ga0310890_11731213 | Not Available | 519
3300032180|Ga0307471_103700939 | Not Available | 541
3300034663|Ga0314784_122771 | Not Available | 564
3300034666|Ga0314788_064259 | Not Available | 756
3300034667|Ga0314792_038692 | Not Available | 1004
3300034667|Ga0314792_159987 | Not Available | 608
3300034678|Ga0314803_057300 | Not Available | 687
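
Each scaffold identifier in this table appears to join an IMG taxon OID and a scaffold name with `|`; the prefixes match the Taxon OID values listed under Associated Samples. The following is a minimal sketch of splitting an identifier and flagging short scaffolds, using four rows copied from the table and the 2000 bp cut-off quoted in the Quality Assessment block. The names `Scaffold` and `parse_scaffold` are hypothetical and not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple

# Cut-off used on the page for "short scaffolds (< 2000 bps)".
SHORT_SCAFFOLD_BP = 2000


class Scaffold(NamedTuple):
    taxon_oid: str      # part before "|", assumed to be the IMG taxon OID
    scaffold_name: str  # part after "|"
    length_bp: int


def parse_scaffold(identifier: str, length_bp: int) -> Scaffold:
    """Split 'taxonOID|scaffoldName' at the first '|' and attach the length."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return Scaffold(taxon_oid, scaffold_name, length_bp)


# Four (identifier, length) pairs copied from the Associated Scaffolds table.
rows = [
    ("2228664021|ICCgaii200_c0702754", 942),
    ("3300000550|F24TB_10333745", 2584),
    ("3300006904|Ga0075424_100115971", 2829),
    ("3300031114|Ga0308187_10499811", 500),
]

scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]
short = [s for s in scaffolds if s.length_bp < SHORT_SCAFFOLD_BP]

for s in short:
    print(s.taxon_oid, s.scaffold_name, s.length_bp)
```

Applied to all 160 rows, the same filter could be used to check the reported share of genes from short scaffolds against the listed lengths.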




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 21.25%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 16.88%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 9.38%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.12%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 7.50%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.62%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 4.38%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.50%
Soil | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil | 1.88%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.25%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.25%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.25%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.25%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.62%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.62%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.62%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.62%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.62%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.62%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.62%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.62%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.62%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.62%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.62%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.62%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.62%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.62%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2228664021 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300003911 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005438 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated
3300006196 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 | Host-Associated
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300009822 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010071 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010083 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010086 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010103 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010106 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010107 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010114 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010115 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010119 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010124 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010132 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010133 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010139 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_2_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010140 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010145 | Soil microbial communities from Hawaii, USA to study soil gas exchange rates - KP-HI-INT metaT (Metagenome Metatranscriptome) | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012376 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012379 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012383 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012384 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012393 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012396 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012397 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012401 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012402 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012403 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012406 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012410 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental
3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018466 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T | Environmental
3300019228 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT790_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019279 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30 (Metagenome Metatranscriptome) | Environmental
3300020081 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-3 (Metagenome Metatranscriptome) (v2) | Environmental
3300021307 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300022195 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome) | Environmental
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027880Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)Host-AssociatedOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300028889Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2EnvironmentalOpen in IMG/M
3300030829Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_357 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030830Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_368 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030902Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_356 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030903Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_369 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030905Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_204 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030993Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_185 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030998Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-3A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031039Forest soil microbial communities from USA, for metatranscriptomics studies - Jemez Pines PI 6C (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031054Forest soil microbial communities from USA, for metatranscriptomics studies - Jemez Pines PI 1C (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031058Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_184 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031089Forest soil microbial communities from USA, for metatranscriptomics studies - Jemez Pines PI 2B (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031091Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_355 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031092Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031093Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031114Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031847Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D4EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300032013Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D3EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034663Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034666Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R2 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034667Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034678Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48R4 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
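
The sample rows above run their columns together: a 10-digit identifier, a free-text sample description, an ecosystem category, and the "Open in IMG/M" link label. A minimal Python sketch for splitting such a row is given here. The field names (`sample_id`, `description`, `category`) and the function `parse_sample_row` are hypothetical, and the pattern assumes that only the two categories visible in this listing (Environmental, Host-Associated) occur; it is not part of NMPFamsDB or IMG/M.

```python
import re
from typing import NamedTuple, Optional


class SampleRow(NamedTuple):
    sample_id: str    # 10-digit identifier at the start of the row (assumed width)
    description: str  # free-text sample description
    category: str     # ecosystem category as printed in the listing


# Assumption: every row ends with the literal link label "Open in IMG/M",
# preceded by one of the two categories seen in this listing.
SAMPLE_PATTERN = re.compile(
    r"(?P<sample_id>\d{10})"
    r"(?P<description>.+?)"
    r"(?P<category>Environmental|Host-Associated)"
    r"Open in IMG/M"
)


def parse_sample_row(line: str) -> Optional[SampleRow]:
    """Split one fused sample row into fields; return None if it does not fit."""
    match = SAMPLE_PATTERN.fullmatch(line.strip())
    if match is None:
        return None
    return SampleRow(
        sample_id=match.group("sample_id"),
        description=match.group("description"),
        category=match.group("category"),
    )


if __name__ == "__main__":
    example = (
        "3300012469Combined assembly of Soil carbon rhizosphere"
        "Host-AssociatedOpen in IMG/M"
    )
    print(parse_sample_row(example))
```

Rows that do not fit the assumed layout return `None` rather than a partial guess, so they can be inspected by hand.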

Geographical Distribution
(Interactive map; powered by OpenStreetMap.)




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
ICCgaii200_070275412228664021SoilFVDDTGRGLRRQAQAVFATTRRPFHRQPGLGERLLTHAEQLGTTTGMVLVGCVGLGVGLGYLWACQGXPQQRXRWREKARIYWRPTERRDAHQREITRRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTREEAEAWAQAQGKP
INPhiseqgaiiFebDRAFT_10062524413300000364SoilPRGMPLGVALVGGVGLGAGLVALLEPQGGPQRRAWLREQARAYWRTXDTXHAPXQXLRRXWRIYQGRDTQRVDAHTRQPAQAEAWYAEPAHYESLVLYSEPFVTREEAEAWTQAHNKQ*
INPhiseqgaiiFebDRAFT_10062963423300000364SoilPRGMPLGVALVGGVGLGAGLVALLEPQGGPQRRAWLREQARAYWRTTDTSHAPRQALRRPWRIYQGRDTQRVDAHTRQPAQAEAWYAEPAHYESLVLYSEPFVTREEAEAWTQAHNKQ*
F24TB_1033374553300000550SoilGRQAQAVLATTRRPFRRQPGFGERLLAQAEQLGTTTGMVLVGCVGLGVGLGYLWARQGSSQQRARLREKARTYWRPAEPRHAQQQALSRRWRFYQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKP*
JGI1027J12803_10043872213300000955SoilQAVLATTRRPFQRQPGLGERLLAHAEQRRTTTGMILVGCVGLGVGLGYLLACQGSPQQRARLREKARTYWRPTQTRHAPQQELSRRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKP*
JGI1027J12803_10075134213300000955SoilGCVGLGVGLGYLWACQGRPQQRTRWREKARTSWRPTERRHGHQQELARRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPREYESPVLYSEPFVTREEAEAWAQAQGKP*
soilH2_1026075813300003324Sugarcane Root And Bulk SoilAVLATTRRPFQHQAGLGERLLTQAEQLGTTTGMVLMGCVGLGVGLGYLWACQGASPQRARWREKARTYWRPTERRQAHQPALARRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPREYESPVLYSEPFVTREEAEAWAQAQGKP*
JGI25405J52794_1008623613300003911Tabebuia Heterophylla RhizosphereQSGLGERLLTHAEXLGTPTGMVLVGCVGLGVGLGYLWAWQRRPQQRTWWREKARTYWRPTERRQAPQQELARRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPREYESPVLYSEPFVTRDEAEAWAQAQGQP*
Ga0066395_1018317823300004633Tropical Forest SoilHQPGFGERLLTQATQLGMTTGVILVGCVGLGAGLIALLEPQGGPQRRAWLREKARAYWHPTEPQQELRRRWRFYQGRDTQRVDAHTRKPAQAEAWYAEPTGYEGIVLYSEPFVTREEAEAWAKAQASPL*
Ga0066683_1090627513300005172SoilLDETRRTLGQQAQAVLATTRRPFQRQPGLGERLRTQAAQPGMTTGVALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHTTDTSHALHHELRRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAQAQGTQ*
Ga0066679_1055742023300005176SoilALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHPTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0066688_1090038613300005178SoilEPQGGPQRRAWLREHARASWHTTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0066684_1058594123300005179SoilLPFRHQPGFGERLRTQAEEMGLPLGLCLLGGVGLGAGLMYLLEPQGGPQRRARLRETVRAYWRTTETKHAPQQERRRAWRFFQGHETHRVDAHTRKPAQAQAWYAEPADYDSNVLYSEPFVTRDEAETWAKAQDQQ*
Ga0066685_1106775113300005180SoilGAGLVALLEPQGGPQRRAWLREYARAYWHTTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0066678_1100309813300005181SoilRRTLGRQAQAILATPRIPFRRQPGLRERLRTQAAQPGLTPGIVLVGCVGLGAGLVALLESQGGPPRRAWLRETARAYWHPTKTSHAPQQELRRAWRFYQGRDTHRVDPHTRKSAQAEAWYVEPAHYESPVLYSEPFVTRDEAEAWAKAQDQP*
Ga0070701_1013356713300005438Corn, Switchgrass And Miscanthus RhizosphereLLEPHGGPQRRARLYEKARAYWHPTATSDVSQQEWRKAWRFFQGHDTQRVDAHTRKPAQAEAWYAEPKNYESIVLYSEPFVTREDAEAWAKAQD*
Ga0066681_1022630713300005451SoilLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYVEPADYEGIVLYSEPFVTRDEAETWAQAQDTQ*
Ga0068867_10073136313300005459Miscanthus RhizosphereYEKARAYWHPTATSDVSQQEWRKAWRFFQGHDTQRVDAHTRKPAQAEAWYAEPKNYESIVLYSEPFVTREDAEAWAKAQD*
Ga0070698_10205933713300005471Corn, Switchgrass And Miscanthus RhizosphereLGAGLVYLLAPQGGPQRWAWLREKARAYWHRTEPQQELHRPWRFFQGHDTQRVDAHTRKPAQAQAWYAEPRDYDSNVLYSEPFVTREEAEAWAKAQEQP*
Ga0066704_1047568113300005557SoilQAQAVLATTRRPFQRQPGLGERLRTQAAQRGMTTGVALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHTTDTRHAPQHELRRAWRFYQGRETPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0066696_1006124033300006032SoilGRGERLHTQAAPRGMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPADYEGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0075432_1022196613300006058Populus RhizosphereLWARQGSAQKPARLREKARTYWRPAETRSAHQQALSRRWHFFQGHETSRVDAHTRKPAQAEAWYAEPRAYDNPVLYSEPFVTRDEAEAWAKAQDTP*
Ga0075422_1059100823300006196Populus RhizosphereRVYWHPAETRHIPRQELRGGWRFFQGRDTYRVDAHTRKPARAEAWYTEPREYDSIVLYSEPFVTREEAEAWARAQDKP*
Ga0075428_10041533313300006844Populus RhizosphereQAQAVLATTRRPFQRQPGLGERLLAHAEQRATTTGMILVGCIGLGVGLGYLWACQGSTQQRARLREKARTYWRPAETRHAPQQALSRRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKS*
Ga0075431_10028151213300006847Populus RhizosphereKARAYWHTKDTSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPADYEGIVLYSEPFVTRDEAEAWAKAQDQQ*
Ga0075420_10055486813300006853Populus RhizospherePFRRQPGLGERLRAQAEQLGTTTGMVLVGCIGLGVGLGYLLARQGGPQRRARLREQARAYWRPAATRHVPPQDLRRGWRFFQGRDTQRVNAHTRMPARAEAWYAEPREYDSNVLYSEPFVTREEAEAWARDQDKP*
Ga0075420_10096436923300006853Populus RhizospherePGLGERLLAHVEQRGMAIGMILVGCVGLGVGLGYLWSCQGSPQQRARLREKARTYWRPTETRHALHQELSRRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRNEAEAWAEAQEKL*
Ga0075424_10011597133300006904Populus RhizosphereATTPRPFRRQPGLGERLRTQAAPRGMTTGVVLVGGVGLGASLVALLEPQGGPQRRAWLREKARAYWHTKDTSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPADYEGIVLYSEPFVTRDEAEAWAKAQDQQ*
Ga0075424_10154560523300006904Populus RhizosphereQAQAVLATTRRPFRRPPGLGERLLAHVEQRGMAIGMILVGCVGLGVGLGYLWACQGSPQQRARLREKARTYWRPTETRHALHQELSRRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKL*
Ga0099791_1039788523300007255Vadose Zone SoilMQAAPCGMPTGLALVGGVGLGAGLVALLEPHGGPQRRAWLREQVRTYWHPTSTRPTPQPALRRAWRFYQGRETARVDAHTRQPAHAEAWYAEPVGDESLVLYSEPFVTREEAEAWAHAQGKP*
Ga0099793_1066480023300007258Vadose Zone SoilEPHGGPQRRAWLREQVRTYWHPTSTRPTPQPALRRAWRFYQGRETARVEAHTRQPAHAEAWYAEPVGDESLVLYSEPFVTREEAEAWAHAQGKP*
Ga0099827_1135403013300009090Vadose Zone SoilRPQSGRGERRRMQAEERGLPLGLCLLGCVGLGAGLVALLEPQGGPQRRARLRETVRGYWHPAATNHASQPELRRAWHFFQGRDTQRVDAHTRQPARAQAWYAEPAHYESDVLYS*
Ga0066709_10132298613300009137Grasslands SoilALVGGVGLGAGLVALLEPQGGPQRRAWLREHARASWHPTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYVEPAEYEGMVLYSEPFVTRDEAETWAKAQGKP*
Ga0114129_1019169743300009147Populus RhizosphereDTGRTLGRQAQAVLATTRRPFRRPPGLGERLLAHAEQRGMAIGMILVGCVGLGVGLGYLWACQGSPQQRARLREKARTYWRPTETRHALHQELSRRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKL*
Ga0114129_1066464523300009147Populus RhizosphereQRARLREKARTYWRPAETRHAPQQALSRRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKS*
Ga0111538_1131871723300009156Populus RhizosphereATTRRPFRRQPGLGERLLAQAEQLGTTTGMVLVGCIGLGVGLGYLLARQGGSQQRARLREQARVYWHPAETRHIPRQELRGGWRFFQGRDTYRVDAHTRKPARAEAWYVEPREYDSNVLYSEPFVTREEAEAWARAQDKP*
Ga0111538_1211706613300009156Populus RhizosphereFQRQPGLGERLLAHAAQRRTTTGMILVGCVGLGVGLGYLWACQGSPQQRARLREKARTYWRPTKTRHAPQQELSRRWRFFQGHDTWRVDAHTRKPAHAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKP*
Ga0105248_1067896613300009177Switchgrass RhizosphereQATVARAHMPFRRQPSLGERLRTQAGERGLSLGLCLLGSVGLGVGLGYLLEPHGGPQRRARLYEKARAYWHPTATSDVSQQEWRKAWRFFQGHDTQRVDAHTRKPAQAEAWYAEPKNYESIVLYSEPFVTREDAEAWAKAQD*
Ga0126374_1124696123300009792Tropical Forest SoilLAERLLAQAEQLGTTTGMVLVGCVGLGVGLGYLWACQSSPQQRARLREKARTYWRPTEMRHAHQQELSRRWRFFQGHDTWRVDAHTRKPAQAEAWYVEPREYDSHVLYSEPFVTRDEAEAWAEAQEKP*
Ga0105066_110865613300009822Groundwater SandLGLCLLGCIGLGAGLVYLLEPQGGPQRRARLREQVRAYWHPTETRHAPQQERRRAWRVFQGHETQRVDAHTRKPAQAEAWYAEPPDYDSNVLYSAPFVTREEAESWARAQEK*
Ga0126380_1012306813300010043Tropical Forest SoilMPTGVVLVGGVGLGAGLVALLDPHGGPQRRAWLREHVRTYWHSTATRHPPQPALRRAWRFYQGRETPRVDAYTRKPAHADAWYAEPARYDSLVLYSEPFVTREEAEAWAHAQDQP*
Ga0126380_1078547223300010043Tropical Forest SoilKRRARLRKQARAYWRPAETRHVPRQDLHRGWRFFQGRDTQRVNAHTRKPARAEAWYAEPREYDSHVLYSEPFVTREEAEAWARAQDKP*
Ga0126382_1069804313300010047Tropical Forest SoilMTTGLALVGGVGLGAGLVALLEPRGGPQRRAWLREHVRTYWHPTATRPTPPPALHQAWRFYQGRETSRVDAHTRQPAHAEAWYAEPVGDESLVLYSEPFVTREEAEAWAHAHGKP*
Ga0126382_1183333923300010047Tropical Forest SoilGCVGLGAGLVYLLEPQGGPQRRARLREKVRAYWRTTETSHAPQQGLRRAWRVFQGHETQRVDAHTRKSAQAQAWYAEPADYDSNVLYSEPFVTRDEAEAWAKAQDQQ*
Ga0127477_12039913300010071Grasslands SoilRRTLGRQAQAVLATTRLPFRHQPGFGERLRTQAEEMGLPLGLCLLGGVGLGAGLMYLLEPQGGPQRRARLRETVRAYWRTTETKHAPQQERRRAWRFFQGHETHRVDAHTRKPAQAQAWYAEPADYDSNVLYSEPFVTRDEAETWAKAQDQQ*
Ga0127478_106459213300010083Grasslands SoilRRTLSRQAQAVLATTRLPFRHQPGFGERLRTQAEEMGLPLGLCLLGGVGLGAGLMYLLEPQGGPQRRARLRETVRAYWRTTETKHAPQQERRRAWRFFQGHETHRVDAHTRKPAQAQAWYAEPADYDSNVLYSEPFVTRDEAETWAKAQDQQ*
Ga0127496_102013013300010086Grasslands SoilVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQERRRPWRFYQGRDTQRVDAHTRKPAQAEAWYVEPADYEGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0127500_107532513300010103Grasslands SoilALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHTTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQDKP*
Ga0127472_108627123300010106Grasslands SoilLDETRRTLVQQAQAVLATTRRPFQRQSGLGERLHTQAAQPGMPTGVALVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPADYEGMVLYSEPFVTRDEAETWAKAQGKP*
Ga0127494_104111823300010107Grasslands SoilLLAHAEQLGTTTGMVLVGCVGLGVGLGYLWASQGSPQQRVQLREKARTYWRPTETRQAHQQEISRGWRFFQGRDTPRVDAHTRKPARAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAAAQEKP*
Ga0127460_101684113300010114Grasslands SoilAQAVFATTRRPFQCQPGLGERLRTQAEEMGMPLGLCLLGCIGLGAGLVYLLEPQGGPQRRARLREKARAYWHPTETSPGLQQELRRAWRFFQGHETQRVDAHTRKPAQAQAWYAEPKDYDSNVLYSEPFVTREEAETWAKAQEKQ*
Ga0127495_106380813300010115Grasslands SoilVGLGAGLVALLEPQGGPRRRAWLREQARAYWHPTDTSHAPQPELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPAEYEGMVLYSEPFVTRDEAETWAKAQGKP*
Ga0127452_103815413300010119Grasslands SoilERLRTQAEEMGLPLGLCLLGGVGLGAGLMYLLEPQGGPQRRARLRETVRAYWRTTETKHAPQQERRRAWRFFQGHETHRVDAHTRKPAQAQAWYAEPADYDSNVLYSEPFVTRDEAETWAKAQDQQ*
Ga0127498_101699413300010124Grasslands SoilEPQGGPQRRARLREQVRAYWHPTETRHAPQQERRRAWRVFQGQETQRVDAHTRKPAQAEAWYAEPRDYDSNVLYSAPFVTREEAESWARAQEK*
Ga0127498_102893523300010124Grasslands SoilLWASQGSPQQRVQLREKARTYWRPTETRQAHQQEISRGWRFFQGRDTPRVDAHTRKPARAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAAAQEKP*
Ga0127455_113245823300010132Grasslands SoilWHPTETSPGLQQELRRAWRFFQGHETQRVDAHTRKPARAQAWYAEPKDYDSNVLYSEPFVTREEAETWAKAQEKQ*
Ga0127455_115953713300010132Grasslands SoilMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPADYEGIVLYSEPFVTRDEAEAWAQAQDTQRRLWRFYQGRDTQRVDAHTRK
Ga0127459_110950213300010133Grasslands SoilERLRTQAAQPGMTTGIALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHTTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0127464_100313913300010139Grasslands SoilTGALLDETRRTLGQQAQAVLATTRRPFQRQPGLGERLHTQAAQPGMTTGVALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHTTDTSHALHHELRRAWRFYQGRDTPRVDAHTRKPAQAQAWYAEPADYDSNVLYSEPFVTRDEAETWAKAQDQQ*
Ga0127456_106673213300010140Grasslands SoilLGTTTGMVLVGCVGLGVGLGYLWASQGSPQQRVQLREKARTYWRPTETRQAHQQEISRGWRFFQGRDTPRVDAHTRKPARAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAAAQEKP*
Ga0126321_111084813300010145SoilLGRQAQAVLATTRRPFRRQPGLGERLLAQAEQLGTTTGMVLVGCIGLGVGLGYLWARQGGSQQRARLREQARVYWHLAETRHIPRQDLRGGWRFFQGCDTQRVDAHTRKPARAEAWYAEPREYDSNVLYSEPFVTREEAEAWAKAQDKP*
Ga0126321_143775033300010145SoilLVALLEPQGGPQRRAWLREQARAYWHPTDTSHAPQQELLRAWCFYQGRDTQRVDAHTRKPAQAEAWYAEPADYEGIVLYSEPFVTRNEAETWAKAQGKP*
Ga0126321_146337613300010145SoilQAQAVFATTRRPFRRQPGRGERLLAQAEQLGTTTGMVLVGCLGLGVGLGYLWACQGRPEQRPRLWEKVRTYWHPLATRHAPQPALSRDWRFFQDRDAQRVDAHTLKPARAQAWYAEPADYESNMLYSAPFATRAEAEAWARTEAQRQCQAHTEV*
Ga0134082_1053569513300010303Grasslands SoilRRTLGQQAQAVLATARRPFQRQLGLGERLRTQAAQPGMTTGIALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHPTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0126376_1287955513300010359Tropical Forest SoilMPTGVVLVGGVGLGAGLVALLEPHGGPQRRAWLREQVRTYWHPTATRHPPQPALRRAWRFYQGRETPRVEAHTRKPAHADAWYAEPAGYDSLVLYSEPFVTREEAEAWAHAQDQQRGVWRFYQGRETSRVDAHTQK
Ga0126377_1039758713300010362Tropical Forest SoilMPTGVVLVGGVGLGAGLVALLEPHGGPQRRAWLREQVRTYWHPTATRHPPQPALRRAWRFYQGRETPRVEAHTRKPAHADAWYAEPAGYDSLVLYSEPFVTREEAEAWAHAQDQP*
Ga0134123_1157368723300010403Terrestrial SoilSLGERLRTQAGERGLSLGLCLLGSVGLGVGLGYLLEPHGGPQRRARLYEKARAYWHPTATSDVSQQEWRKAWRFFQGHDTQRVDAHTREPAQAEAWYAEPKNYESIVLYSEPFVTREDAEAWAKAQD*
Ga0137399_1015642823300012203Vadose Zone SoilMQAAPCGMPTGLALVGGVGLGAGLVALLEPHGGPQRRAWLREQVRTYWHPTSTRPTPQPALRRAWRFYQGRETARVEAHTRQPAHAEAWYAEPVGDESLVLYSEPFVTREEAEAWAHAQGKP*
Ga0137362_1105312813300012205Vadose Zone SoilRQAQAVFATTRRPFQRQPGLGERLRTQAEEMGMPLGLCLLGCIGLGAGLVYLLEPQGGPQRRAWLRETARAYWRPTETRHTPQQEQRRPWRFYQGRDTQRVDAHTRKPAQAEAWYVEPAGYESLVLYSEPFVTRDEAETWAKAQDQP*
Ga0137377_1120680713300012211Vadose Zone SoilMTTGVVLVGGVGLGAGLVALLEPQGGPRRRAWLREKARAYWHPTDTRHAPQPELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYVEPAEYEGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0150985_11093326623300012212Avena Fatua RhizosphereGLPLGLCLLGGIGLGAGLVYLLEPQSGPQRRARLREKVRAYWRTTETKHAPQQELRRAWRFFQGHATHRVDAHTRKPAQAQAWYAEPADYDSNVLYSEPFVTRDEAETWAKAQDQQ*
Ga0137386_1062358813300012351Vadose Zone SoilLDETRRTLGRPAQAVRATTRRPFQRQPGLGARRRTQAAPRGMTIGVALVGGVGLGAGLVALLEPQGGPQRRAWLREHARASWHTTDTRHAPQHELRRAWRFYQGRDTPRVDAHTRKPAQAEAWYVEPAEYEGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0137369_1057338113300012355Vadose Zone SoilMGMPLGLCLLGCIGLGAGLVYLLEPQGGPQRRARLREQVRAYWHPTETRHAPQQERRRAWRVFQGQETQRVDAHTRKPAQAEAWYAEPPDYDSNVLYSAPFVTREEAESWARAQEK*
Ga0137385_1072159323300012359Vadose Zone SoilQAAPRGMTTGVALVGGIGLGASLVALLEPQGGPRRRAWLREKARAYWHPTDTRHAPQPELRRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPADYESVVLYSEPFVTRDEAETWAKAQGKP*
Ga0137375_1019353133300012360Vadose Zone SoilEQVRAYWHPTETRHAPQQERRRAWRVFQGQETQRVDAHTRKPAQAEAWYAEPPDYDSNVLYSAPFVTREEAESWARAQEK*
Ga0137360_1010701833300012361Vadose Zone SoilHARAYWHPTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYVEPAEYEGVVLYSEPFVTRDEAETWATAQGKP*
Ga0137361_1187167613300012362Vadose Zone SoilRTQAEEMGMPLGLCLLGCIGLGAGLVYLLEPQGGPQRRARLREQVRAYWHPTETRHAPQQERRRAWRFFQGHETQRVDAHTRKPAQAEAWYAEPRDYDSNVLYSAPFVTREEAESWARAQEK*
Ga0134032_106836613300012376Grasslands SoilMTTGIALVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHTTDTSHALRHELRRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWATAQGKP*
Ga0134058_109210923300012379Grasslands SoilVVLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQHELRRAWRFYQGRETPRVDAHTRKPAQAEAWYVEPADYEGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0134033_106427013300012383Grasslands SoilMTTGIALVGGVGLGAGLVALLEPQGGPQRRAWLREHARASWHTTDTRHAPQHELRRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAEAWATAQDKQRRP
Ga0134036_127777223300012384Grasslands SoilTVRAYWRTTETKHAPQQERRRAWRFFQGHETHRVDAHTRKPAQAQAWYAEPADYDSNVLYSEPFVTRDEAETWAKAQDQQ*
Ga0134052_122304813300012393Grasslands SoilMTTGVALVGGVGLGAGLVALLEPQGGPQRRAWLREHARASWHTTDTRHAPQHELGRAWRFYQGRDTPRVDSHTRKPAQAEAWYAEPAEYDGIVLYSE
Ga0134057_100167623300012396Grasslands SoilLEPQGGPQRRAWLREHARAYWRTTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0134056_125230413300012397Grasslands SoilVVDAGRQTGALLDETRRTLGQQAQAVLATTRRPFQRQPGLGERLRTQAAQPGMTTGIALVGGVGLGAGLVALLEPQGGPQRRAWLREHARASWHTMDTSHAPQHELRRAWRFYQGRETPRVDAHTRKPAQAEAWYVEPAEYEGIVLYSEPFVT
Ga0134055_109816913300012401Grasslands SoilMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLRETARAYWHPTKTSPTPQQEWRRAWRLYQGRDTQRVDPHTRKPAQAEAWYVEPAHYESPVLYSEPFVTRDEAEAWAKAQDQ
Ga0134059_107347013300012402Grasslands SoilMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSPGLQQELRRAWRFFQGHETQRVDAHTRKPARAQAWYAEPKDYDSNVLYSEPFVTREEAETWAKAQEKQ*
Ga0134059_140987113300012402Grasslands SoilPGMTTGVALVGGVGLGAGLVALLEPQSGPQRRAWLREHARASWHTMDTSHAPQHELRRAWRFYQGRETPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0134049_131069713300012403Grasslands SoilMITGIALVGGVGLGAGLVALLEPQGGPQRRAWLREKACAYWHPTETSHTPQQERRRAWRFYQGRDTQRVDAHTRKPAQAEAWYVEPADYEGIVLYSEPFVTRDEAETWAQAQDTQSKPWR
Ga0134053_129522213300012406Grasslands SoilQAQAVLAKPRLPFRHQPGLGERLGTQAEEMGLPLGLSLLGCVGLGAGLVYLLEPQGGPQRRARLREKARAYWHPTETSPGLQQELRRAWRFFQGHETQRVDAHTRKPAQAQAWYAEPKDYDSNVLYSEPFVTREEAETWAKAQEKQ*
Ga0134060_149136523300012410Grasslands SoilEQLRTQAAPLGMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREHARAYWHTTDTRHAPQHELGRAWRFYQGRDTPRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0150984_10388702413300012469Avena Fatua RhizosphereLTQAEQLGTTSGMVLMGCVGLGVGLGYLWACQGASQQRARWREKARTYWRPTERRQAHQRELARRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPRAYESPVLYSEPFVTREEAEAWAQVQGKP*
Ga0126375_1012311413300012948Tropical Forest SoilMTTGLALVGGVGLGAGLVALLEPRGGPQRRAWLREHVRTYWHPTATRPTPPPALHQAWRFYQGRETSLVDAHTRQPAHAEAWYAEPVGDESLVLYSEPFVTREEAEAWAHAHGKP*
Ga0126375_1018633123300012948Tropical Forest SoilMPGRPVRGQPGRGTRQPTQVAPRGLPTGVVLVGGGGLGAGLVALLEPHGGPQRRAWLREQVRTYWHPTATRHPPQPALRRAWRFYQGRETPRVEAHTRKPAHADAWYAEPAGYDSLVLYSEPFVTREEAEAWAHAQDQP*
Ga0134110_1042364313300012975Grasslands SoilMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREHARASWHTTDTRHTPQHELRRAWRFYQGRDTPRVDAHTRKPAQAEAWYVEPDGDWGTFPSRQPSVPRDDAETWPAPQHEQRRAWRFYQGRDTQR
Ga0163162_1090788923300013306Switchgrass RhizosphereGLGYLWACQGRPQQRTRWREKARTSWRPTERRHGHQQELARRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPREYESPVLSSEPFVTREEAEAWAQAQGKP*
Ga0163162_1113289623300013306Switchgrass RhizosphereGLGERLLAHAEQRRTTTGMILVGCVGLGVGLGYLWAGQGSPQPRARLREKARTYWRPTKTRHAPQQELSRRWRFFQGHDTWRVDAHTRKPAQAKAWYAEPRTYDSPVLYSEPFVTRDEAEAWAEAQEKP*
Ga0134078_1061829613300014157Grasslands SoilTRRPFRRQPGRGEQLRTQAAPLGMTTGVVLVGGVGLGAGLVALLAPQGGPQRRAWLREKACAYWHPTETSHTPQQERRRPWRFYQGRDTQRVDAHTRKPAQAEAWYVEPAEYEGIVLYSEPFVTRDEAETWAKAAG*
Ga0134073_1015753313300015356Grasslands SoilMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQELRRAWRFYQGRDTQRVDAHTRKPAQAEAWYAEPAEYDGIVLYSEPFVTRDEAETWAKAQDQQR
Ga0134089_1034091813300015358Grasslands SoilMTTGVVLVGGVGLGAGLVALLEPQGGPQRRAWLREKARAYWHPTETSHAPQQERRRPWRFYQGRDTQRVDAHTRKPAQAEAWYVEPADYEGIVLYSEPFVTRDEAETWAKAQGKP*
Ga0184610_127209513300017997Groundwater SedimentVWLLEPQGGPQRRARLREKVRAYWHTTKTRHAPQQERRRAWRIFQGQDTQQVDAHTRKPAQAEAWYAEPPDYDSNVLYSAPFVTREEAETWAKAQDK
Ga0184623_1015496723300018056Groundwater SedimentGCVGLGAGLVWLLEPQGGPQRRARLREQVRAYWHTTKTRHAPQQERRRAWRIFQGQDTQRVDAHTRKPAQAEAWYAEPPDYDSNVLYSTPFVTREEAETWAKAQDK
Ga0190268_1192682423300018466SoilWACQGLPQQRARWREKARTSWRPTERRDAHQRELTRRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPREYESPVLYSEPFVTREEAEAWAQAHGKP
Ga0180119_122766513300019228Groundwater SedimentQAEEWGLPLGLGLLGCVGLGVGLGAFLEAQGGPQRRAWLCARLRAYWQPTATSHAPQQDIHKTWHFFQGHDTQRVDAHTRKPARAEAWYAEPKDYESPVLYSAPFVTRDEAETWAQAQDK
Ga0184642_110481013300019279Groundwater SedimentALLDDTTRALDRQAQAFLAKTRVPFRRQPGLGERLLMQAEQRGMSTGVVLLGCVGLGAGLIYLLEPQGGPQRRARLREKVRTYWHLTATSHAPQQELRRAWRFFQGHDTQRVDAHTRRPAQAEAWYAEPANYDSNVLYSEPFVTREEAEAWAKAQEKLKMPWRFFQG
Ga0184642_134209313300019279Groundwater SedimentMKMLWLRRQTNDLLDDTTRTLDRQARAFLAKTRVPFRRQPGLGERLLMQAEQLGMTTGIVLLGCVGLGAGLIYLLEPQGGPQRRARLREKARAYWHPTATSPGSQEALRRAWRFFQGHDTQRVDAHTRKPAQAEAWYAEPANYDSNVLY
Ga0184642_165770313300019279Groundwater SedimentAQAEQLGTTTGMVLVGCVGLGVGLGYLWACQGSPQQRARLREKVRTYWRPTETRYVPQQELSRDWRFFQGHDTQRVDAHTRKPAQAEAWYAEPHEYDSNVLYSEPFVTRDEAEAWAKAQGKP
Ga0184642_169254913300019279Groundwater SedimentTGMVLVGCVGLGVGLGYLWACQGSPQQRTRLREKARIYWHPTETRQAHQQELSRGWRFFQGRDTPRVDAHTRKPARAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAEAQEKP
Ga0206354_1168867123300020081Corn, Switchgrass And Miscanthus RhizosphereMPFRRQPSLGERLRTQAGERGLSLGLCLLGSVGLGVGLGYLLEPHGGPQRRARLYEKARAYWHPTATSDVSQQEWRKAWRFFQGHDTQRVDAHTRKPAQAEAWYAEPKNYESIVLYSEPFVTREDAEAWAKAQD
Ga0179585_111688713300021307Vadose Zone SoilMKMLWLRRQADDLLDDTARELNRQTQAFLAKTRVPFRRQPGLGERLLMQAEQLGMTTGVVLLGCVGLGAGLIYLLEPQGGPQRRARLREKVRTYWHPTATSPGSQEALRRAWRFFQGHDTQRVDAYTRKPAQASAWYAEPAHYDSN
Ga0222625_108298813300022195Groundwater SedimentPQRRARLREKVRTYWHPTATSHAPQQELRRAWRFFQGHDTQRVDAHTRRPAQAEAWYAEPANYDSNVLYSEPFVTRAEAETWARAQEKK
Ga0222625_126441823300022195Groundwater SedimentVGCVGVGAGLIYLLEPQGGPQRRARLREKVRAYWHPTAPQQELRRAWRFFQGHDTHRVDAHTRKPAQAEAWYAEPANYESNVLYSEPFVTRQEAEAWVKTQEKN
Ga0207687_1113668413300025927Miscanthus RhizosphereGYLLEPHGGPQRRARLYEKARAYWHPTATSDVSQQEWRKAWRFFQGHDTQRVDAHTRKPAQAEAWYAEPKNYESIVLYSEPFVTREDAEAWAKAQD
Ga0207706_1138598013300025933Corn RhizosphereLERVREPACLLMGGVGLGVGLGYLWACQGASQPRARWREQARTYWRPTARRQVHQPALARRWHFFQGRDTYRVDAHIRKPAQAEAWYAEPQAYESPVLYSEPFVTREEAEAWAQAQGQP
Ga0207711_1061848433300025941Switchgrass RhizosphereRGLSLGLCLLGSVGLGVGLGYLLEPHGGPQRRARLYEKARAYWHPTATSDVSQQEWRKAWRFFQGHDTQRVDAHTRKPAQAEAWYAEPKNYESIVLYSEPFVTREDAEAWAKAQD
Ga0209814_1033238513300027873Populus RhizosphereLGVGLGYLLACQSSPQLRGRLREKARTYWRPTETRHAHQQELSRRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPCEYDSHVLYSEPFVTRDEAEAWAEAQEKP
Ga0209481_1006841313300027880Populus RhizosphereGTTTGMVLVGCIGLGVGLGYLLARQGGPQRRARLREQARAYWRPAATRHVPPQDLRRGWRFFQGRDTQRVNAHTRMPARAEAWYAEPREYDSNVLYSEPFVTREEAEAWARDQDKP
Ga0207428_1106301313300027907Populus RhizosphereGERLRTQAAPRGMTTGVVLVGGVGLGASLVALLEPQGGPQRRARLREQARAYWRPAATRHVPPQDLRRGWRFFQGRDTQRVNAHTRMPARAEAWYAEPREYDSNVLYSEPFVTREEAEAWARAQDKP
Ga0247828_1108539513300028587SoilVDDTGRGLRRHAQAVFATTRRPFHRQPGLGERLLTHAEQLGTTTGMVLVGCVGLGVGLGYLWACQGRPQQRTRWREKARTYWRPTESRHAHQQDLARRWHFFQGRDTYRVDAHTRKPAQAEAWYAEPREYESPVFYSEPFVTREEAEAWAQAQGKP
Ga0247827_1076766713300028889SoilPGFGERLLAQAEQWGTTTGLVLVGCVGLGVGLGYLWARQGSAQKPARLREKARTYWRPAETRSAHQQALSRRWHFFQGHETSRVDAHTRKPAQAEAWYAEPRAYDNPVLYSEPFVTRDEAEAWAKAQDTP
Ga0308203_101559023300030829SoilERRRRQAEERSLPLGLCLLGCAGLGAGLVYLLEPQGGPQRRARLRETMGGYWHTAATTHTPEQELRRAWHFYQGRDLPRVDAHTRQPARAQAWYAEPADYESDVLYSAPFATRAEAEAWASAEAQRQSQEHTEI
Ga0308205_100109523300030830SoilVGLGYLWASQGSSKQHAQLRQKARTYWRPTETRHVPQQELSGDWRFFQGHDTQRVDAHTRKPAQADAWYAEPCEYDNKVLYSEPFVTRDQAEAWARTEEQRQSQEHTKV
Ga0308205_100434233300030830SoilGAGLVYLLEPQGGPQRRARLREQVRAYWHPTESRHAPDQERRRAWRFFQGQDTQRVDAHTRKPAQAEAWYAEPPNYDSNVLYSAPFVTREEAESWARAQEE
Ga0308205_105413723300030830SoilPGLGERLLAQAEQRGTSTGLVLMGCVGLGVGLGYLWACQGSPQQHARLREKARTYWHPTETRPVPQQELSQGWRFFQGHDTQRVDAHTRKPAQAEAWYAEPREYDSHVLYSEPFVTRDEAEAWAKTQGKP
Ga0308205_106399523300030830SoilRQAQAVFATTRRPFQRQPGLGERLLAQAEQLGTTTGMVLIGCVGLGVGLGYLWACQGNPQQRARLREKARIYWHPTETRHAPQQELSQGWRFFQGHDTQRVDAHTRKPAQAEAWYAEPREYDSNVLYSEPFVTRDEAEAWAKAQSKS
Ga0308202_100412933300030902SoilGSPPKRVQLREKARTYWRPTETRRAHQQELSRGWRFFQGRDTPRVDASTRKPARAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAEGQGTP
Ga0308206_103111813300030903SoilQAQAVFATTRKPFRRQPGLGERLLAQAEQRGTTTGMVLVGCIGLGVGLGYLWACQRSPQQRTRLRDKARTYWRPTETRHAPQQEFSRRWRIFQGHDTYRVDAHTRKPAQADAWYAEPCEYDNKVLYSEPFVTRDQAEAWARTEEQRQSQEHTEV
Ga0308206_104719913300030903SoilQRARLREHARTYWRSAKTRHAPQQEIARRWRFFQGHDTWRVDAHTRKPAQAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAEAQGKP
Ga0308206_116933713300030903SoilRGMSTGVALVGGVGLGAGLVALLEPQGGPQRRAWLREKVRAYWHPTDTSHAPQHTRRWAWRFYQGRDTPRVDAHTRKPAQADAWYAEPADYEHLVLYSEPFVTRDEAEAWAQAQDTQ
Ga0308200_118140813300030905SoilLGRQAQAVLATTRRPFRRQPGLGERLLAHAEQLGTTTGMVLVGCVGLGVGLGYLWAHQGSPEQRVQLREKARTYWRPTETRQAHQRELSRGWRFFQGRDTPRVDAHTRKPAQAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAQAQGKP
Ga0308190_120086813300030993SoilTGRTLGRQAQAVFATTRRPFRRQPGLGERLLAQAEQRGTSTGLVLVGCVGLGVGLGYLWACQGSPQQHARLREKARTYWHPTETRPVPQQELSQGWRFFQGHDTQRVDAHTRKPAQAEAWYAEPREYDSHVLYSEPFVTRDEAEAWVKAQEKP
Ga0073996_1175822113300030998SoilMTTGGVLLGCVGLGAGLLYLLEPQGGSQRRARLRKKVQAYWYPVATSSTPQLTQPRDWRFFQGRDTQRVEAHTREPAQAEAWYAEPGNYESPVLYSEPFVTRAEAEAWAKGQRKR
Ga0102760_1084845323300031039SoilGLGYLWAYQGSSRSRTQLREKARVYWHPTRTRHVPQQELSRDWRFFQGHETQRVDAHTRKPAQAEAWYAEPREYDSNVLYSEPFVTRDEAEAWAKAQGKL
Ga0102746_1072754413300031054SoilAEQLGTSTGMVLVGCVSLGVGLGYLLARQGGPRLLEQARVYWRPAETRHAPQQDLRRGWRFFQGRETQRVNAHTRKPAQAEAWYAEPRAYDSHVLYSEPFVTREEAEVWARDQDKL
Ga0308189_100160552	3300031058	Soil	LGCAGLGAGLVYLLEPQGGPQRRARLRETMRGYWHTAATTHTPEQELRRAWHFYQGRDLPRVDAHTRQPARAQAWYAEPADYESDVLYSAPFATRAEAEAWASAEAQRQSQEHPEI
Ga0308189_101186291	3300031058	Soil	LGAGLVYLLEPQGGPQRRARLREQVRAYWHPTETRHAPEQERRRAWRFFQGHETQRVDAHTRKPAQAEAWYAEPPDYDSNVLYSTPFVTREEAESWARAQEK
Ga0308189_102525411	3300031058	Soil	VLVGCVGLGVGLGYLWACQGSPQQRARLREKVRTYWRPTETRYVPQQEFSRDWRFFQGHDTQRVDAHTRKPAQAEAWYAEPHEYDSNVLYSEPFVTRDEAEAWAKAQGKP
Ga0308189_102720193	3300031058	Soil	ARTYWRPTETRQAHQRELARGWRFFQGRDSPRVDAHTRKPAQAEAWYAEPREYDSPVLYSEPFVTRDEAEVWAEAQGKP
Ga0102748_109493311	3300031089	Soil	RTLGRQAQAVLVTPRRPFRRQPGFGERLLAQAEQLGTPTGMVLVGCVSLGVGLGYLLARQGGPRLLEQARVYWRPAETRHAPQQDLRRGWRFFQGRETQRVNAHTRKPAQAEAWYAEPRAYDSHVLYSEPFVTREEAEVWARDQDKL
Ga0308201_102799482	3300031091	Soil	HQGSPEQRVQLREKARTYWRPTETRQAHQRELSRGWRFFQGRDTPRVDAHTRKPAQAEAWYAEPREYDSPVLYSEPFVTRDEAEVWAEAQGKP
Ga0308204_100069351	3300031092	Soil	SGMSQGVENGCARAEQLGTTTGIVLVGCVGLGVGLGYLWASQGSSKQHAQLRQKARTYWRPTETRHVPQQKLSGDWRFFQGHDTQRVDAHTRKPAQADAWYAEPCEYDNKVLYSEPFVTRDQAEAWARTEEQRQSQEHTEV
Ga0308204_100684671	3300031092	Soil	GMVLVGCVGLGVGLGYLWACQGRPQQRAQLREKARTYWRPKEPRHGPQQKLSRDWRFFQGHDTQRVDAHTRKPAQAEAWYAEPREYDSNVLYSEPFVTRDEAEAWAEAQGKP
Ga0308204_101021711	3300031092	Soil	TSRPFRRQPWLGGRLLAHAEQRGTTTGLVLVGCIGLGVGLGCLWAYQGSSQQRARLREKARTYWRPTATYYRPHQELSRGWRFFQGYDTQRLDAHTRKPAQAEAWYAEPREYDSTVLYSEPFVTRDEAEAWAKAQEKP
Ga0308204_101359442	3300031092	Soil	GSPQQRVQLREKARTYWRPTETRQAHQQELSRGWRFFQGRDTPRVDAHTRKPAQAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAAAQEKP
Ga0308197_102955551	3300031093	Soil	GPQRRARLREKAHAYWHPTAPSHALQQELQRAWRFFQGYDTQRVDAHTRQPAQAEAWYAEPANYDSNILYSEPFATREEAEAWAKAQEKKEGVIH
Ga0308197_103724661	3300031093	Soil	AVFATTRKPFRRQPGLGERLLAHAEQLGTTTGMVLVGCVGLGVGLGYLWACQRSPQQRARLREKVRTYWHPRETRPAPQQELSRDWRFFQGRDTQRVDAHTRKPAQAEAWYAEPRAYDGNVLYSEPFVTRDEAEAWAAAQEKP
Ga0308187_100964381	3300031114	Soil	ERLLAHAEQLGTTTGMVLVGCVGLGVGLGYLWASQGSPQQRVQLREKARTYWRPTETRQAHQQEISRGWRFFQGRDTPRVDAHTRKPAQAEAWYAEPRAYDSPVLYSEPFVTRDEAEAWAAAQEKP
Ga0308187_103019741	3300031114	Soil	RALDLQAQAFLARTRVPFRRQPGLGERLLMQAEQLGMTTGVVLLGCVGLGAGLIYLLEPQGGPQRRARLREKVRTYWHPTATSHAPQQELRRAWRFFQGHDTQRVDAHTRRPAQAEAWYAEPANYDSNVLYSEPFVTREEAEAWAKAQEKK
Ga0308187_103954171	3300031114	Soil	RVPFRRQPGLGERLLTQAEQLGMTTGIVLVGCVGLGAGLVWLLEPQGGPQRRARLREKVRAYWHTTKTRHAPQQERRRAWRIFQGQDTQQVDAHTRKPAQAEAWYAEPPDYDSNVLYSAPFVTREEAETWAKAQDK
Ga0308187_104998111	3300031114	Soil	ARLREKVRTYWRPKVTRHGPQQELSRDWRFFQGQDTQRVDAHTRKPAQAEAWYAEPREYDSNVLYSEPFVTRDEAEAWARAEAQRQRQEHT
Ga0310907_105015421	3300031847	Soil	MPFRHQPGPGERLRAQVEEWGLPLGLGLLGCVGLGVGLGALLEPQGGPQRRAWLCARLRAYWQPTATSHASEQESHKAWRFVQGHDTQRVDAHTRKPARAEAWYAEPKDYESPVLYSEPFVTRDEAETWAQAQDKP
Ga0310885_103948662	3300031943	Soil	MILVGCVGLGVGLGYLLACQGSPQQRARLREKARTYWRPTETRHAPQQELSRRWRFFQGHDTWRVDAHTQKPAHAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKP
Ga0310906_100129531	3300032013	Soil	TRRPFQRQAGLGKRLLTQVEQRGTTTGMVLLSCVGLGVGLGYLWACQGRPQQGASWREKARAYWRPTERRPAHQQELAQRWHFFQGRDTYRVDAHTRKPAQAEAWYVEPREYESPVLYSEPFVTREEAEAWAQAQGKP
Ga0310890_109874102	3300032075	Soil	RTYWRPAETRSAHQQALSRRWHFFQGHETSRVDAHTRKPAQAEAWYAEPRAYDNPVLYSEPFVTRDEAEAWAKAQDTP
Ga0310890_117312131	3300032075	Soil	VGLGALLEPQGGPQRRAWLCARLRAYWQPTATSHASEQESHKAWRFVQGHDTQRVDAHTRKPARAEAWYAEPKDYESPVLYSEPFVTRDEAETWAQAQDKP
Ga0307471_1037009392	3300032180	Hardwood Forest Soil	LGYLWAGQGSPQQRARLREKARTYWRPTKTRHAPQQELSRRWRFFQGHNTWRVDAHTRKPAHAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQEKP
Ga0314784_122771_295_564	3300034663	Soil	AQRRARLSAHLRAYWQPAATNHVSQRNLHRSWRFFQGHDTQRVDAHTRKPARAEAWYAEPKDYESPVLYSEPFVTRDEAETWAKAQGKP
Ga0314788_064259_406_741	3300034666	Soil	MVLLSCIGLGVGLGYLWACQGRPQQGASWREKARAYWRPTERRPAHQQELAQGWHFFQGRDTYRVDAHTRKPAQAEAWYVEPREYESPVLYSEPFVTREEAEAWAQAQGKP
Ga0314792_038692_656_961	3300034667	Soil	VGLGYLWARQGSAQKPARLREKARTYWRPAETLSAHQQALSRRWHFFQGHETSRVDAHTRKPAQAEAWYAEPRAYDNPVLYSEPFVTRDEAEAWAKAQDTP
Ga0314792_159987_328_606	3300034667	Soil	QGSAQKPARLREKARTYWRPAQTRSAHQQALSRRWHFFQGHDTSRVDAHTRKPAQAEAWYAEPRAYDSQVLYSEPFVTRDEAEAWAEAQDKP
Ga0314803_057300_434_685	3300034678	Soil	WREKARAYWRPTERRPAHQQELAQGWHFFQGRDTYRVDAHTRKPAQAEAWYVEPREYESPVLYSEPFVTREQAEAWAQAQGKP


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"