NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F028762

Metagenome / Metatranscriptome Family F028762

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F028762
Family Type Metagenome / Metatranscriptome
Number of Sequences 190
Average Sequence Length 82 residues
Representative Sequence MSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Number of Associated Samples 135
Number of Associated Scaffolds 190

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Archaea
% of genes with valid RBS motifs 56.32 %
% of genes near scaffold ends (potentially truncated) 45.26 %
% of genes from short scaffolds (< 2000 bps) 76.32 %
Associated GOLD sequencing projects 115
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (69.474 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(26.316 % of family members)
Environment Ontology (ENVO) Unclassified
(54.737 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(65.789 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 45.24%    β-sheet: 4.76%    Coil/Unstructured: 50.00%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 190 Family Scaffolds
PF00248Aldo_ket_red 48.95
PF01625PMSR 32.63
PF00583Acetyltransf_1 2.11
PF14520HHH_5 1.58
PF13349DUF4097 1.58
PF08423Rad51 1.58
PF08241Methyltransf_11 0.53
PF00291PALP 0.53
PF07452CHRD 0.53
PF00296Bac_luciferase 0.53
PF03417AAT 0.53
PF13345Obsolete Pfam Family 0.53
PF01042Ribonuc_L-PSP 0.53
PF12840HTH_20 0.53
PF01738DLH 0.53

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 190 Family Scaffolds
COG0225Peptide methionine sulfoxide reductase MsrAPosttranslational modification, protein turnover, chaperones [O] 32.63
COG0468RecA/RadA recombinaseReplication, recombination and repair [L] 1.58
COG0251Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 familyDefense mechanisms [V] 0.53
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 0.53
COG4927Predicted choloylglycine hydrolaseGeneral function prediction only [R] 0.53


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms84.74 %
UnclassifiedrootN/A15.26 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002557|JGI25381J37097_1002921All Organisms → cellular organisms → Archaea2888Open in IMG/M
3300002558|JGI25385J37094_10002983All Organisms → cellular organisms → Bacteria5746Open in IMG/M
3300002558|JGI25385J37094_10089133All Organisms → cellular organisms → Archaea938Open in IMG/M
3300002558|JGI25385J37094_10108743All Organisms → cellular organisms → Archaea808Open in IMG/M
3300002558|JGI25385J37094_10124002All Organisms → cellular organisms → Archaea729Open in IMG/M
3300002560|JGI25383J37093_10001854All Organisms → cellular organisms → Bacteria6009Open in IMG/M
3300002560|JGI25383J37093_10008028All Organisms → cellular organisms → Bacteria3420Open in IMG/M
3300002560|JGI25383J37093_10018978All Organisms → cellular organisms → Bacteria2299Open in IMG/M
3300002560|JGI25383J37093_10032459All Organisms → cellular organisms → Archaea1741Open in IMG/M
3300002561|JGI25384J37096_10001504All Organisms → cellular organisms → Bacteria7707Open in IMG/M
3300002561|JGI25384J37096_10074910All Organisms → cellular organisms → Archaea1234Open in IMG/M
3300002561|JGI25384J37096_10100928All Organisms → cellular organisms → Bacteria1005Open in IMG/M
3300002562|JGI25382J37095_10005250All Organisms → cellular organisms → Archaea4577Open in IMG/M
3300002562|JGI25382J37095_10232985All Organisms → cellular organisms → Archaea556Open in IMG/M
3300002562|JGI25382J37095_10276181All Organisms → cellular organisms → Archaea505Open in IMG/M
3300002908|JGI25382J43887_10192988Not Available988Open in IMG/M
3300002908|JGI25382J43887_10389194All Organisms → cellular organisms → Archaea589Open in IMG/M
3300002911|JGI25390J43892_10002326All Organisms → cellular organisms → Archaea4073Open in IMG/M
3300002911|JGI25390J43892_10050281All Organisms → cellular organisms → Archaea985Open in IMG/M
3300002912|JGI25386J43895_10139599All Organisms → cellular organisms → Archaea602Open in IMG/M
3300002916|JGI25389J43894_1000250All Organisms → cellular organisms → Bacteria8407Open in IMG/M
3300004139|Ga0058897_11044692All Organisms → cellular organisms → Archaea1037Open in IMG/M
3300005166|Ga0066674_10106290All Organisms → cellular organisms → Bacteria1306Open in IMG/M
3300005171|Ga0066677_10773827Not Available532Open in IMG/M
3300005172|Ga0066683_10232750All Organisms → cellular organisms → Bacteria1141Open in IMG/M
3300005172|Ga0066683_10454764All Organisms → cellular organisms → Bacteria784Open in IMG/M
3300005175|Ga0066673_10000053All Organisms → cellular organisms → Archaea22048Open in IMG/M
3300005178|Ga0066688_10607678All Organisms → cellular organisms → Archaea702Open in IMG/M
3300005178|Ga0066688_10734789All Organisms → cellular organisms → Archaea623Open in IMG/M
3300005181|Ga0066678_10749508All Organisms → cellular organisms → Archaea648Open in IMG/M
3300005186|Ga0066676_10004536All Organisms → cellular organisms → Bacteria6317Open in IMG/M
3300005447|Ga0066689_10641877All Organisms → cellular organisms → Archaea667Open in IMG/M
3300005454|Ga0066687_10345712All Organisms → cellular organisms → Archaea851Open in IMG/M
3300005468|Ga0070707_100000030All Organisms → cellular organisms → Archaea122924Open in IMG/M
3300005468|Ga0070707_100113719All Organisms → cellular organisms → Archaea2626Open in IMG/M
3300005536|Ga0070697_100003364All Organisms → cellular organisms → Archaea12291Open in IMG/M
3300005540|Ga0066697_10145858All Organisms → cellular organisms → Archaea1396Open in IMG/M
3300005540|Ga0066697_10399441All Organisms → cellular organisms → Archaea798Open in IMG/M
3300005542|Ga0070732_10015509All Organisms → cellular organisms → Archaea4246Open in IMG/M
3300005552|Ga0066701_10579865All Organisms → cellular organisms → Archaea685Open in IMG/M
3300005553|Ga0066695_10460595All Organisms → cellular organisms → Archaea783Open in IMG/M
3300005554|Ga0066661_10732373All Organisms → cellular organisms → Archaea579Open in IMG/M
3300005556|Ga0066707_10900300All Organisms → cellular organisms → Archaea542Open in IMG/M
3300005559|Ga0066700_10108739All Organisms → cellular organisms → Archaea1828Open in IMG/M
3300005559|Ga0066700_10139545All Organisms → cellular organisms → Archaea1631Open in IMG/M
3300005561|Ga0066699_10684283All Organisms → cellular organisms → Archaea732Open in IMG/M
3300005568|Ga0066703_10082279All Organisms → cellular organisms → Archaea1864Open in IMG/M
3300005569|Ga0066705_10262481All Organisms → cellular organisms → Archaea1095Open in IMG/M
3300005575|Ga0066702_10141973Not Available1414Open in IMG/M
3300005576|Ga0066708_10380568All Organisms → cellular organisms → Archaea906Open in IMG/M
3300005576|Ga0066708_10542133All Organisms → cellular organisms → Archaea749Open in IMG/M
3300005587|Ga0066654_10002578All Organisms → cellular organisms → Bacteria5903Open in IMG/M
3300006046|Ga0066652_100738067All Organisms → cellular organisms → Archaea939Open in IMG/M
3300006755|Ga0079222_11115226All Organisms → cellular organisms → Archaea695Open in IMG/M
3300006794|Ga0066658_10463352Not Available690Open in IMG/M
3300006794|Ga0066658_10685170All Organisms → cellular organisms → Archaea565Open in IMG/M
3300006794|Ga0066658_10775958Not Available537Open in IMG/M
3300006804|Ga0079221_11351125All Organisms → cellular organisms → Archaea564Open in IMG/M
3300006806|Ga0079220_11302550All Organisms → cellular organisms → Archaea609Open in IMG/M
3300006806|Ga0079220_11519643All Organisms → cellular organisms → Archaea575Open in IMG/M
3300006903|Ga0075426_11389459All Organisms → cellular organisms → Archaea533Open in IMG/M
3300007255|Ga0099791_10337308All Organisms → cellular organisms → Archaea721Open in IMG/M
3300007255|Ga0099791_10609698All Organisms → cellular organisms → Archaea534Open in IMG/M
3300007258|Ga0099793_10072268All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi1561Open in IMG/M
3300007258|Ga0099793_10199551All Organisms → cellular organisms → Archaea959Open in IMG/M
3300009038|Ga0099829_10002126All Organisms → cellular organisms → Archaea11280Open in IMG/M
3300009038|Ga0099829_10189836All Organisms → cellular organisms → Archaea1658Open in IMG/M
3300009038|Ga0099829_11494039All Organisms → cellular organisms → Archaea558Open in IMG/M
3300009088|Ga0099830_10212661All Organisms → cellular organisms → Archaea1517Open in IMG/M
3300009088|Ga0099830_10919644All Organisms → cellular organisms → Archaea723Open in IMG/M
3300009090|Ga0099827_10973580All Organisms → cellular organisms → Archaea736Open in IMG/M
3300010078|Ga0127487_102914All Organisms → cellular organisms → Bacteria1467Open in IMG/M
3300010133|Ga0127459_1033774Not Available1402Open in IMG/M
3300010301|Ga0134070_10425973All Organisms → cellular organisms → Archaea527Open in IMG/M
3300010303|Ga0134082_10432404All Organisms → cellular organisms → Archaea567Open in IMG/M
3300010304|Ga0134088_10105402All Organisms → cellular organisms → Bacteria1328Open in IMG/M
3300010304|Ga0134088_10532905Not Available580Open in IMG/M
3300010326|Ga0134065_10053020Not Available1254Open in IMG/M
3300010333|Ga0134080_10644885All Organisms → cellular organisms → Archaea520Open in IMG/M
3300010335|Ga0134063_10148273All Organisms → cellular organisms → Archaea1087Open in IMG/M
3300010336|Ga0134071_10470686Not Available646Open in IMG/M
3300010336|Ga0134071_10509350All Organisms → cellular organisms → Archaea622Open in IMG/M
3300010337|Ga0134062_10047453All Organisms → cellular organisms → Archaea1739Open in IMG/M
3300010360|Ga0126372_12772843All Organisms → cellular organisms → Archaea542Open in IMG/M
3300010905|Ga0138112_1077354All Organisms → cellular organisms → Archaea893Open in IMG/M
3300012189|Ga0137388_10349397All Organisms → cellular organisms → Archaea1363Open in IMG/M
3300012198|Ga0137364_10602796All Organisms → cellular organisms → Archaea828Open in IMG/M
3300012199|Ga0137383_10163971All Organisms → cellular organisms → Bacteria1628Open in IMG/M
3300012200|Ga0137382_10351739All Organisms → cellular organisms → Archaea1033Open in IMG/M
3300012200|Ga0137382_10952712Not Available617Open in IMG/M
3300012201|Ga0137365_10285486Not Available1224Open in IMG/M
3300012201|Ga0137365_10556022All Organisms → cellular organisms → Archaea842Open in IMG/M
3300012201|Ga0137365_10595557Not Available810Open in IMG/M
3300012206|Ga0137380_10037902All Organisms → cellular organisms → Bacteria4445Open in IMG/M
3300012206|Ga0137380_10130366All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon2296Open in IMG/M
3300012207|Ga0137381_10774106All Organisms → cellular organisms → Archaea833Open in IMG/M
3300012207|Ga0137381_11162492All Organisms → cellular organisms → Archaea663Open in IMG/M
3300012207|Ga0137381_11469875All Organisms → cellular organisms → Archaea573Open in IMG/M
3300012208|Ga0137376_10760046All Organisms → cellular organisms → Archaea835Open in IMG/M
3300012210|Ga0137378_10028911Not Available4899Open in IMG/M
3300012211|Ga0137377_10181596Not Available2020Open in IMG/M
3300012285|Ga0137370_10051713All Organisms → cellular organisms → Archaea2205Open in IMG/M
3300012349|Ga0137387_10001929All Organisms → cellular organisms → Archaea10438Open in IMG/M
3300012349|Ga0137387_10551282All Organisms → cellular organisms → Archaea836Open in IMG/M
3300012350|Ga0137372_10415828All Organisms → cellular organisms → Archaea1015Open in IMG/M
3300012351|Ga0137386_10317918Not Available1120Open in IMG/M
3300012351|Ga0137386_11020080All Organisms → cellular organisms → Archaea588Open in IMG/M
3300012356|Ga0137371_11266038All Organisms → cellular organisms → Archaea547Open in IMG/M
3300012357|Ga0137384_10284377Not Available1377Open in IMG/M
3300012357|Ga0137384_10800813Not Available762Open in IMG/M
3300012361|Ga0137360_10228926All Organisms → cellular organisms → Archaea1518Open in IMG/M
3300012362|Ga0137361_11711172All Organisms → cellular organisms → Archaea548Open in IMG/M
3300012391|Ga0134035_1076056Not Available1542Open in IMG/M
3300012399|Ga0134061_1042387All Organisms → cellular organisms → Archaea730Open in IMG/M
3300012400|Ga0134048_1031816Not Available1526Open in IMG/M
3300012917|Ga0137395_10490326All Organisms → cellular organisms → Archaea884Open in IMG/M
3300012922|Ga0137394_10293726All Organisms → cellular organisms → Bacteria1389Open in IMG/M
3300012927|Ga0137416_11414895All Organisms → cellular organisms → Archaea630Open in IMG/M
3300012944|Ga0137410_10255512All Organisms → cellular organisms → Archaea1375Open in IMG/M
3300012972|Ga0134077_10053061All Organisms → cellular organisms → Archaea1499Open in IMG/M
3300012975|Ga0134110_10088413All Organisms → cellular organisms → Archaea1244Open in IMG/M
3300012976|Ga0134076_10397783All Organisms → cellular organisms → Archaea613Open in IMG/M
3300014154|Ga0134075_10174930Not Available921Open in IMG/M
3300015245|Ga0137409_10093412All Organisms → cellular organisms → Archaea2797Open in IMG/M
3300015358|Ga0134089_10432705All Organisms → cellular organisms → Archaea567Open in IMG/M
3300017656|Ga0134112_10326428All Organisms → cellular organisms → Archaea621Open in IMG/M
3300017657|Ga0134074_1363080All Organisms → cellular organisms → Archaea536Open in IMG/M
3300017659|Ga0134083_10082887All Organisms → cellular organisms → Bacteria1247Open in IMG/M
3300018431|Ga0066655_10122883All Organisms → cellular organisms → Bacteria1495Open in IMG/M
3300018431|Ga0066655_10535008All Organisms → cellular organisms → Archaea781Open in IMG/M
3300018433|Ga0066667_10136766All Organisms → cellular organisms → Archaea1704Open in IMG/M
3300018433|Ga0066667_10172112All Organisms → cellular organisms → Bacteria1557Open in IMG/M
3300018433|Ga0066667_10201306All Organisms → cellular organisms → Archaea1463Open in IMG/M
3300018468|Ga0066662_10050457All Organisms → cellular organisms → Archaea2647Open in IMG/M
3300018468|Ga0066662_10188235All Organisms → cellular organisms → Archaea1615Open in IMG/M
3300018468|Ga0066662_10202316All Organisms → cellular organisms → Archaea1572Open in IMG/M
3300021046|Ga0215015_10141728All Organisms → cellular organisms → Archaea507Open in IMG/M
3300021046|Ga0215015_10600831All Organisms → cellular organisms → Bacteria1201Open in IMG/M
3300021046|Ga0215015_10607443Not Available850Open in IMG/M
3300021476|Ga0187846_10116768Not Available1141Open in IMG/M
3300021559|Ga0210409_10342125All Organisms → Viruses → Predicted Viral1345Open in IMG/M
3300022531|Ga0242660_1019129All Organisms → cellular organisms → Bacteria1278Open in IMG/M
3300022724|Ga0242665_10061320Not Available1028Open in IMG/M
3300024330|Ga0137417_1192030All Organisms → cellular organisms → Archaea1791Open in IMG/M
3300025922|Ga0207646_10000001All Organisms → cellular organisms → Archaea1242027Open in IMG/M
3300025922|Ga0207646_10025736All Organisms → cellular organisms → Archaea5381Open in IMG/M
3300025922|Ga0207646_10153091All Organisms → cellular organisms → Bacteria2080Open in IMG/M
3300026277|Ga0209350_1000524All Organisms → cellular organisms → Archaea20745Open in IMG/M
3300026295|Ga0209234_1034433All Organisms → cellular organisms → Archaea1927Open in IMG/M
3300026297|Ga0209237_1087554Not Available1405Open in IMG/M
3300026297|Ga0209237_1110130All Organisms → cellular organisms → Archaea1186Open in IMG/M
3300026298|Ga0209236_1051014Not Available2063Open in IMG/M
3300026298|Ga0209236_1102239All Organisms → cellular organisms → Bacteria1288Open in IMG/M
3300026306|Ga0209468_1053060All Organisms → cellular organisms → Archaea1372Open in IMG/M
3300026313|Ga0209761_1035189All Organisms → cellular organisms → Archaea2957Open in IMG/M
3300026313|Ga0209761_1057649All Organisms → cellular organisms → Archaea2133Open in IMG/M
3300026313|Ga0209761_1081542All Organisms → cellular organisms → Archaea1675Open in IMG/M
3300026315|Ga0209686_1001197All Organisms → cellular organisms → Archaea12987Open in IMG/M
3300026317|Ga0209154_1298446All Organisms → cellular organisms → Archaea531Open in IMG/M
3300026324|Ga0209470_1001506All Organisms → cellular organisms → Archaea17254Open in IMG/M
3300026325|Ga0209152_10006386All Organisms → cellular organisms → Archaea4250Open in IMG/M
3300026326|Ga0209801_1031658All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon2500Open in IMG/M
3300026327|Ga0209266_1154427All Organisms → cellular organisms → Archaea922Open in IMG/M
3300026328|Ga0209802_1187510All Organisms → cellular organisms → Archaea816Open in IMG/M
3300026329|Ga0209375_1037416All Organisms → cellular organisms → Archaea2549Open in IMG/M
3300026333|Ga0209158_1337309All Organisms → cellular organisms → Archaea523Open in IMG/M
3300026354|Ga0257180_1028502Not Available751Open in IMG/M
3300026528|Ga0209378_1058949All Organisms → cellular organisms → Bacteria1818Open in IMG/M
3300026532|Ga0209160_1001083All Organisms → cellular organisms → Archaea22742Open in IMG/M
3300026538|Ga0209056_10475801All Organisms → cellular organisms → Archaea676Open in IMG/M
3300026548|Ga0209161_10041080All Organisms → cellular organisms → Archaea3101Open in IMG/M
3300027548|Ga0209523_1064169All Organisms → cellular organisms → Archaea755Open in IMG/M
3300027643|Ga0209076_1000212All Organisms → cellular organisms → Archaea8765Open in IMG/M
3300027643|Ga0209076_1044455All Organisms → cellular organisms → Archaea1253Open in IMG/M
3300027643|Ga0209076_1169909Not Available606Open in IMG/M
3300027748|Ga0209689_1035914All Organisms → cellular organisms → Archaea2898Open in IMG/M
3300027765|Ga0209073_10365909All Organisms → cellular organisms → Archaea585Open in IMG/M
3300027842|Ga0209580_10003102All Organisms → cellular organisms → Bacteria7125Open in IMG/M
3300027846|Ga0209180_10016186All Organisms → cellular organisms → Archaea3904Open in IMG/M
3300027875|Ga0209283_10047630Not Available2722Open in IMG/M
3300028536|Ga0137415_10866997All Organisms → cellular organisms → Archaea713Open in IMG/M
3300028536|Ga0137415_11049245All Organisms → cellular organisms → Archaea627Open in IMG/M
3300028792|Ga0307504_10033037All Organisms → cellular organisms → Archaea1384Open in IMG/M
3300031720|Ga0307469_10244525All Organisms → cellular organisms → Bacteria1438Open in IMG/M
3300031820|Ga0307473_10062152Not Available1831Open in IMG/M
3300031820|Ga0307473_11066364All Organisms → cellular organisms → Archaea594Open in IMG/M
3300031962|Ga0307479_10053136All Organisms → cellular organisms → Archaea3899Open in IMG/M
3300032180|Ga0307471_100703566All Organisms → cellular organisms → Archaea1175Open in IMG/M
3300032180|Ga0307471_103305331All Organisms → cellular organisms → Archaea571Open in IMG/M
3300032205|Ga0307472_102555159All Organisms → cellular organisms → Archaea520Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil26.32%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil22.11%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil20.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil12.63%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.68%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.16%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.63%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.11%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.11%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.58%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.05%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.05%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.53%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.53%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.53%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300004139Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010078Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010133Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010905Grasslands soil microbial communities from Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012391Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012399Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012400Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022531Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026354Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-BEnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027548Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_100292133300002557Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
JGI25385J37094_1000298363300002558Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
JGI25385J37094_1008913323300002558Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
JGI25385J37094_1010874313300002558Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
JGI25385J37094_1012400213300002558Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
JGI25383J37093_1000185443300002560Grasslands SoilMFDIAKLGAMVRIMSETLEKQERWKQQKEEADRQRIGGAYIQGFMETAVALVGEEPLFSKGYWKCASGCKPFGLGLARTPRECPSCHKQLVLWVPGP*
JGI25383J37093_1000802833300002560Grasslands SoilMSETLDRQQRWKQQRDDADRQRIGGAYIQGFMETAAALVEQEPLFSKGYWRCAKGCRPFGIGLARTPRECPYCHSQLVAWVPGP*
JGI25383J37093_1001897833300002560Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSNGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
JGI25383J37093_1003245913300002560Grasslands SoilRWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
JGI25384J37096_1000150463300002561Grasslands SoilMSETLDRQQRWKQQRDDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGIGLARTPRECPYCHSQLVAWVPGP*
JGI25384J37096_1007491013300002561Grasslands SoilMSETLDRQERWKQQXEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
JGI25384J37096_1010092823300002561Grasslands SoilMSETLERQERWKQQKDEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPNCHKQLILWVPGP*
JGI25382J37095_1000525063300002562Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSXHKQLILWVPGP*
JGI25382J37095_1023298513300002562Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPNCHKQLILWVPGP*
JGI25382J37095_1027618113300002562Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
JGI25382J43887_1019298813300002908Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPS
JGI25382J43887_1038919413300002908Grasslands SoilQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
JGI25390J43892_1000232643300002911Grasslands SoilMVRIMSETLEKQERWKQQKEEADRQRIGGAYIQGFMETAVALVGEEPLFSKGYWKCASGCKPFGLGLARTPRECPSCHKQLVLWVPGP*
JGI25390J43892_1005028123300002911Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPXGFGLARTPRECPSCHKQLILWVPGP*
JGI25386J43895_1013959913300002912Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
JGI25389J43894_100025013300002916Grasslands SoilTLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0058897_1104469223300004139Forest SoilMSETLEKQQRWKEQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPNCHKQLILWVPGP*
Ga0066674_1010629023300005166SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPRP*
Ga0066677_1077382713300005171SoilKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLIIWVPGP*
Ga0066683_1023275023300005172SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0066683_1045476423300005172SoilMSETLDRQQRWKQQRDDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCHSQLVAWVPGP*
Ga0066673_10000053133300005175SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAVVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0066688_1060767823300005178SoilRWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLIIWVPGP*
Ga0066688_1073478913300005178SoilMSETLDRQERWKQQKEEADRQRMGGPYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066678_1074950823300005181SoilSRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066676_1000453653300005186SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPVGFGLARTPRECPSCHKQLILWVPGP*
Ga0066689_1064187713300005447SoilRWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066687_1034571213300005454SoilIMSETLDRQERWKQQKEEADRQRMGGPYIQGFIGTAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0070707_100000030743300005468Corn, Switchgrass And Miscanthus RhizosphereMSEALERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0070707_10011371923300005468Corn, Switchgrass And Miscanthus RhizosphereMSETLEKQQRWKDQKDEADRQRMGGAYIQGFIETAAALIGEEPLFSKGYWKCAGGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0070697_10000336473300005536Corn, Switchgrass And Miscanthus RhizosphereMSETLERQERWKQQKEEADRQRIGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0066697_1014585813300005540SoilSRIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066697_1039944113300005540SoilIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0070732_1001550943300005542Surface SoilMSENLDRQQRWKQQREDADRQRIGGAYIQGFMETASALVEEEPLFSKGYWTCAKGCKPFGMGLARTPRECPHCHSQLVAWVPGP*
Ga0066701_1057986523300005552SoilTLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066695_1046059523300005553SoilEWSRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066661_1073237323300005554SoilMSETLDRQDRWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0066707_1090030013300005556SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFDKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066700_1010873933300005559SoilLSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066700_1013954513300005559SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQ
Ga0066699_1068428323300005561SoilIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLIIWVPGP*
Ga0066703_1008227933300005568SoilKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066705_1026248123300005569SoilSRIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0066702_1014197313300005575SoilQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066708_1038056823300005576SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILW
Ga0066708_1054213323300005576SoilRQERWKQQKEEADRQRMGGAYIQGFIETAAAVVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0066654_1000257823300005587SoilMSEGLDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0066652_10073806713300006046SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQ
Ga0079222_1111522613300006755Agricultural SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPGCHKQLILWVPGP*
Ga0066658_1046335213300006794SoilQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLIIWVPGP*
Ga0066658_1068517023300006794SoilERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0066658_1077595823300006794SoilADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0079221_1135112513300006804Agricultural SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLIL
Ga0079220_1130255013300006806Agricultural SoilIMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0079220_1151964313300006806Agricultural SoilMSENLDRQQRWKQQREDADRQRICRAYMQGFMETASALVEEEPLFSRGYWTCAKGCKPFGMGLARTPRECPYCHSQLVAWVPGP*
Ga0075426_1138945913300006903Populus RhizosphereMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPNCHKQLILW
Ga0099791_1033730813300007255Vadose Zone SoilKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0099791_1060969813300007255Vadose Zone SoilTLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0099793_1007226823300007258Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0099793_1019955123300007258Vadose Zone SoilIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0099829_10002126133300009038Vadose Zone SoilMSESLEKQQRWREQKDEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPNCHKQLILWVPGP*
Ga0099829_1018983643300009038Vadose Zone SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPR
Ga0099829_1149403913300009038Vadose Zone SoilMSESLDRQENWKQRKEEADRQRMGGAYIQGFMETAATLVVEEPLFNKGYWKCAGGCKPFGLGLARTPRECPSCHKQLIEWVPGP*
Ga0099830_1021266113300009088Vadose Zone SoilERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0099830_1091964423300009088Vadose Zone SoilMSENLDRQENYKQRKEEADRQRMGGAYIQGFMETAATLVVEEPLFNKGYWKCAGGCKPFGLGLARTPRECPSCHKQLIEWVPGP*
Ga0099827_1097358013300009090Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPVGFGLAKTPRECPSCHKQLILWVPGP*
Ga0127487_10291433300010078Grasslands SoilQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0127459_103377423300010133Grasslands SoilDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0134070_1042597313300010301Grasslands SoilMSEGQDRQQRWKQQREDADRQRIGGPYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0134082_1043240423300010303Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKPPRDCPSCHKQLILWVPGP*
Ga0134088_1010540223300010304Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAAVVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0134088_1053290513300010304Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPREC
Ga0134065_1005302013300010326Grasslands SoilEDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0134080_1064488513300010333Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYVQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPNCHKQLILWVPGP*
Ga0134063_1014827323300010335Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAGAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPNCHKQLILWVPGP*
Ga0134071_1047068623300010336Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGC
Ga0134071_1050935023300010336Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0134062_1004745333300010337Grasslands SoilSRIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAVVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0126372_1277284313300010360Tropical Forest SoilMSDALEKQQRWKQQREDADRQRMGGAYIQGFMETAAALVEEEPLFSKGYWTCAKGCRPFGQGLARTPRECPSCHSQLVAWVPGP*
Ga0138112_107735413300010905Grasslands SoilMSEGPDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLV
Ga0137388_1034939713300012189Vadose Zone SoilIMSETLEGQERWRQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPNCHKQLILWVPGP*
Ga0137364_1060279613300012198Vadose Zone SoilLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137383_1016397123300012199Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCATGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137382_1035173913300012200Vadose Zone SoilEEWSRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137382_1095271223300012200Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSC
Ga0137365_1028548633300012201Vadose Zone SoilMSESLDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCKPFGMGLARTPRECPYCHSQLVAWVPGP*
Ga0137365_1055602213300012201Vadose Zone SoilETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137365_1059555713300012201Vadose Zone SoilEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137380_1003790263300012206Vadose Zone SoilMSDALERQERWKQQRQDADRQRMGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCKPFGMGLARTPRECPYCHSLLVAWVPGP*
Ga0137380_1013036633300012206Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGYGLARTPRECPGCHKQLI
Ga0137381_1077410613300012207Vadose Zone SoilMSETLDRQQRWKQQRDDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRDCPYCHSQLVAWV
Ga0137381_1116249213300012207Vadose Zone SoilWSRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137381_1146987513300012207Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGYGLARTPRECPGCHKQLILWVPGP*
Ga0137376_1076004613300012208Vadose Zone SoilTLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPRP*
Ga0137378_1002891163300012210Vadose Zone SoilMSETLDRQQRWKQQRDDADRQRIGGAYIQGFMESAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCHSQLVAWVPGP*
Ga0137377_1018159633300012211Vadose Zone SoilDRVADLMSETLDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0137370_1005171313300012285Vadose Zone SoilRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLIRWVPGP*
Ga0137387_10001929103300012349Vadose Zone SoilMSETLERQERWKQQKEEADRQRLGGGYIQGFIETTAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137387_1055128213300012349Vadose Zone SoilEELSRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137372_1041582813300012350Vadose Zone SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLASTPRECPSCHKQLILWVPGP*
Ga0137386_1031791813300012351Vadose Zone SoilKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137386_1102008023300012351Vadose Zone SoilERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGYGLARTPRECPGCHKQLILWVPGP*
Ga0137371_1126603813300012356Vadose Zone SoilDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0137384_1028437713300012357Vadose Zone SoilQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137384_1080081313300012357Vadose Zone SoilQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCQPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137360_1022892623300012361Vadose Zone SoilMSEILEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0137361_1171117213300012362Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASDCKPFGFGLAKTPRE
Ga0134035_107605623300012391Grasslands SoilLDRVADLMSEGPDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0134061_104238713300012399Grasslands SoilDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPHCRSQLVAWVPGP*
Ga0134048_103181613300012400Grasslands SoilCHSLDRVADLMSEGPDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP*
Ga0137395_1049032623300012917Vadose Zone SoilMSESLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWV
Ga0137394_1029372623300012922Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHNQLILWVPGP*
Ga0137416_1141489513300012927Vadose Zone SoilETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0137410_1025551213300012944Vadose Zone SoilMSESLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHRQLILWVPGP*
Ga0134077_1005306113300012972Grasslands SoilMSETLEKQERWKQQKEEADRQRIGGAYIQGFMETAVALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0134110_1008841313300012975Grasslands SoilMSETLDRQERWKQQKDEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP*
Ga0134076_1039778313300012976Grasslands SoilRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP*
Ga0134075_1017493023300014154Grasslands SoilMSESLDRQERWKQQKGEADRQRMGGAYIQGFIETAAAIVDDEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP*
Ga0137409_1009341223300015245Vadose Zone SoilMSESLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHNQLILWVPGP*
Ga0134089_1043270523300015358Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDDEPLFSKGYWKCATGCKPFGFGLAKTPRECPNCHKQLILWVPGP*
Ga0134112_1032642823300017656Grasslands SoilKQQKGEADRQRMGGAYIQGFIETAAAIVDDEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0134074_136308023300017657Grasslands SoilWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPRP
Ga0134083_1008288723300017659Grasslands SoilMSETLEKQERWKQQKEEADRQRIGGAYIQGFMETAVALVGEEPLFSKGYWKCASGCKPFGLGLARTPRECPSCHKQLVLWVPGP
Ga0066655_1012288323300018431Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0066655_1053500813300018431Grasslands SoilIMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRDCPSCHKQLILWVPGP
Ga0066667_1013676613300018433Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAVVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0066667_1017211223300018433Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0066667_1020130613300018433Grasslands SoilMSETLDRQERWKQQKDEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPREC
Ga0066662_1005045733300018468Grasslands SoilMFDIAKLGAMVRIMSETLEKQERWKQQKEEADRQRIGGAYIQGFMETAVALVGEEPLFSKGYWKCASGCKPFGLGLARTPRECPSCHKQLVLWVPGP
Ga0066662_1018823513300018468Grasslands SoilEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0066662_1020231633300018468Grasslands SoilSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0215015_1014172813300021046SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPGCHKQLILWVPGP
Ga0215015_1060083123300021046SoilMSETLEKQQRWKEQKEEADRQRMGGAYIQGFIETAAALVGEEPLFNKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0215015_1060744313300021046SoilMSETLEKQQRWKEQKDEADRQRMGGAYIQGFIETAAALVGDEPLFSKGSWKCASVCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0187846_1011676823300021476BiofilmMSDSFEKEQRWKEQRQDADRQRMGGAYIQGFIETAAALVHEEPLFSKGYWRCDKGCKPFGLGLARTPRECPHCHSQLVAWVPGP
Ga0210409_1034212523300021559SoilMSETLEKQQRWKEQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0242660_101912923300022531SoilMSETLEKQQRWKEQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPNCHKQLILWVPGP
Ga0242665_1006132033300022724SoilMSETLEKQQRWKEQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCH
Ga0137417_119203023300024330Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0207646_100000017703300025922Corn, Switchgrass And Miscanthus RhizosphereMSEALERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0207646_1002573633300025922Corn, Switchgrass And Miscanthus RhizosphereMSETLEKQQRWKDQKDEADRQRMGGAYIQGFIETAAALIGEEPLFSKGYWKCAGGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0207646_1015309123300025922Corn, Switchgrass And Miscanthus RhizosphereMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0209350_100052483300026277Grasslands SoilMSETLERQERWKQQKDEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPNCHKQLILWVPGP
Ga0209234_103443313300026295Grasslands SoilTLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209237_108755433300026297Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209237_111013013300026297Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCATGCKPFGFGLARTPRECPGCHKQLRFKVEIVVFKPAPVEWD
Ga0209236_105101423300026298Grasslands SoilMSETLDRQQRWKQQRDDADRQRIGGAYIQGFMETAAALVEQEPLFSKGYWRCAKGCRPFGIGLARTPRECPYCHSQLVAWVPGP
Ga0209236_110223923300026298Grasslands SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCATGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0209468_105306023300026306SoilMSEGPDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP
Ga0209761_103518933300026313Grasslands SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSNGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209761_105764913300026313Grasslands SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0209761_108154213300026313Grasslands SoilSRIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0209686_1001197153300026315SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLIIWVPGP
Ga0209154_129844613300026317SoilQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209470_100150633300026324SoilMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPVGFGLARTPRECPSCHKQLILWVPGP
Ga0209152_1000638613300026325SoilIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209801_103165813300026326SoilMFDIAKLGAMVRIMSETLEKQERWKQQKEEADRQRIGGAYIQGFMETAVALVGEEPLFSKGYWKCASGCKPFGLGLARTPRECP
Ga0209266_115442723300026327SoilMSEGLDRQQRWKQQREDADRQRIGGAYIQGFMETAAALVEEEPLFSKGYWRCAKGCRPFGMGLARTPRECPYCRSQLVAWVPGP
Ga0209802_118751013300026328SoilSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209375_103741623300026329SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPRP
Ga0209158_133730913300026333SoilLERQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0257180_102850223300026354SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSC
Ga0209378_105894923300026528SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETGAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209160_1001083103300026532SoilMSETLDRQERWKQQKEEADCQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209056_1047580123300026538SoilTEEEWSRIMSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209161_1004108033300026548SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPGCHKQLILWVPGP
Ga0209523_106416913300027548Forest SoilMSEALERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPACHKQLILWVPGP
Ga0209076_100021223300027643Vadose Zone SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0209076_104445513300027643Vadose Zone SoilVIMSETLDRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0209076_116990913300027643Vadose Zone SoilKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0209689_103591433300027748SoilLSETLEKQERWKQQKEEADRQRMGGAYIQGFMETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0209073_1036590923300027765Agricultural SoilRQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0209580_1000310233300027842Surface SoilMSENLDRQQRWKQQREDADRQRIGGAYIQGFMETASALVEEEPLFSKGYWTCAKGCKPFGMGLARTPRECPHCHSQLVAWVPGP
Ga0209180_1001618623300027846Vadose Zone SoilMSESLEKQQRWREQKDEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPNCHKQLILWVPGP
Ga0209283_1004763033300027875Vadose Zone SoilMSENLDRQENYKQRKEEADRQRMGGAYIQGFMETAATLVVEEPLFNKGYWKCAGGCKPFGLGLARTPRECPSCHKQLIEWVPGP
Ga0137415_1086699713300028536Vadose Zone SoilMSETLEKQQRWKEQKDEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPNCHKQLILWVPGP
Ga0137415_1104924513300028536Vadose Zone SoilMSETVEKQERWKQQKEEADRQRMGGAYIQGFIETAAALVGEEPLFSKGYWKCASGCKPFGFGLARTPRECPSCHKQLILWVPGP
Ga0307504_1003303713300028792SoilMSETLDRQERWKQQKEEADRQRMGGAYIQGFMETAAAIVDEEPLFSKGYWKCATGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0307469_1024452523300031720Hardwood Forest SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSRGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0307473_1006215233300031820Hardwood Forest SoilMSEALERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPGCHKQLILWVPGP
Ga0307473_1106636413300031820Hardwood Forest SoilWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0307479_1005313643300031962Hardwood Forest SoilMSEALERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFAFGLAKTPRECPSCHKQLILWVPGP
Ga0307471_10070356633300032180Hardwood Forest SoilMSETLERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVGEEPLFSKGYWKCASGCKPFGFGLAKTPRECPSCHKQLILWVPGP
Ga0307471_10330533123300032180Hardwood Forest SoilMSEALERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPGCHKQLILGSLDHNGPLQFKVEIVVF
Ga0307472_10255515923300032205Hardwood Forest SoilALERQERWKQQKEEADRQRMGGAYIQGFIETAAAIVDEEPLFSKGYWKCASGCKPFGFGLAKTPRECPGCHKQLILWVPGP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.