NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F079096

Metagenome Family F079096

Go to section:
Overview Alignments Structure & Topology Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F079096
Family Type Metagenome
Number of Sequences 116
Average Sequence Length 200 residues
Representative Sequence SLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDSGNARHAAADSSRGARGEALAAPCALDSTLWAEGLPSAYGALGRLAAEQRTLVIALGLDVGGPRTYSLGQEFCDAAWRSGLAVLTRAGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATGLRDRVLKQRRDLYFAFRLAQPPADWFTLGL
Number of Associated Samples 91
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 3.74 %
% of genes near scaffold ends (potentially truncated) 92.24 %
% of genes from short scaffolds (< 2000 bps) 92.24 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (92.241 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(30.172 % of family members)
Environment Ontology (ENVO) Unclassified
(54.310 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(61.207 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 45.50%    β-sheet: 7.66%    Coil/Unstructured: 46.85%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.47
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms92.24 %
UnclassifiedrootN/A7.76 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002562|JGI25382J37095_10212569All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptosporangiales → Thermomonosporaceae → Actinomadura → Actinomadura atramentaria584Open in IMG/M
3300002914|JGI25617J43924_10286966All Organisms → cellular organisms → Bacteria564Open in IMG/M
3300005166|Ga0066674_10460305All Organisms → cellular organisms → Bacteria579Open in IMG/M
3300005171|Ga0066677_10429419All Organisms → cellular organisms → Bacteria755Open in IMG/M
3300005174|Ga0066680_10642838All Organisms → cellular organisms → Bacteria659Open in IMG/M
3300005177|Ga0066690_10637252All Organisms → cellular organisms → Bacteria710Open in IMG/M
3300005406|Ga0070703_10532087All Organisms → cellular organisms → Bacteria533Open in IMG/M
3300005445|Ga0070708_101972486All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300005447|Ga0066689_10668777All Organisms → cellular organisms → Bacteria651Open in IMG/M
3300005536|Ga0070697_101426015All Organisms → cellular organisms → Bacteria618Open in IMG/M
3300005540|Ga0066697_10428180All Organisms → cellular organisms → Bacteria765Open in IMG/M
3300005546|Ga0070696_101294678All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Sedimentitalea → Sedimentitalea nanhaiensis618Open in IMG/M
3300005552|Ga0066701_10470224All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Sedimentitalea → Sedimentitalea nanhaiensis780Open in IMG/M
3300005552|Ga0066701_10805497All Organisms → cellular organisms → Bacteria559Open in IMG/M
3300005554|Ga0066661_10827584All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Sedimentitalea → Sedimentitalea nanhaiensis541Open in IMG/M
3300005555|Ga0066692_10817731All Organisms → cellular organisms → Bacteria572Open in IMG/M
3300005558|Ga0066698_10735551All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Sedimentitalea → Sedimentitalea nanhaiensis646Open in IMG/M
3300005566|Ga0066693_10504804All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptosporangiales → Nocardiopsaceae → Marinactinospora → Marinactinospora thermotolerans500Open in IMG/M
3300005576|Ga0066708_10674613All Organisms → cellular organisms → Bacteria656Open in IMG/M
3300005598|Ga0066706_11162011All Organisms → cellular organisms → Bacteria587Open in IMG/M
3300005598|Ga0066706_11461521All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300005878|Ga0075297_1035439All Organisms → cellular organisms → Bacteria578Open in IMG/M
3300005886|Ga0075286_1059512All Organisms → cellular organisms → Bacteria547Open in IMG/M
3300006173|Ga0070716_100954611All Organisms → cellular organisms → Bacteria675Open in IMG/M
3300006173|Ga0070716_101813125All Organisms → cellular organisms → Bacteria505Open in IMG/M
3300006175|Ga0070712_101101274All Organisms → cellular organisms → Bacteria689Open in IMG/M
3300006755|Ga0079222_12425862All Organisms → cellular organisms → Bacteria526Open in IMG/M
3300006755|Ga0079222_12688548All Organisms → cellular organisms → Bacteria501Open in IMG/M
3300006791|Ga0066653_10545160All Organisms → cellular organisms → Bacteria587Open in IMG/M
3300006791|Ga0066653_10786736All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Sedimentitalea → Sedimentitalea nanhaiensis500Open in IMG/M
3300006797|Ga0066659_11256253All Organisms → cellular organisms → Bacteria618Open in IMG/M
3300006800|Ga0066660_11013948All Organisms → cellular organisms → Bacteria665Open in IMG/M
3300006800|Ga0066660_11044937All Organisms → cellular organisms → Bacteria652Open in IMG/M
3300009012|Ga0066710_104629934All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300009089|Ga0099828_11659889All Organisms → cellular organisms → Bacteria562Open in IMG/M
3300009089|Ga0099828_11705806All Organisms → cellular organisms → Bacteria554Open in IMG/M
3300009137|Ga0066709_103039956All Organisms → cellular organisms → Bacteria615Open in IMG/M
3300010301|Ga0134070_10322936All Organisms → cellular organisms → Bacteria593Open in IMG/M
3300010303|Ga0134082_10273685All Organisms → cellular organisms → Bacteria703Open in IMG/M
3300010322|Ga0134084_10460101All Organisms → cellular organisms → Bacteria508Open in IMG/M
3300010326|Ga0134065_10242159All Organisms → cellular organisms → Bacteria669Open in IMG/M
3300010326|Ga0134065_10343134All Organisms → cellular organisms → Bacteria584Open in IMG/M
3300010335|Ga0134063_10322238All Organisms → cellular organisms → Bacteria747Open in IMG/M
3300010335|Ga0134063_10645911All Organisms → cellular organisms → Bacteria542Open in IMG/M
3300010336|Ga0134071_10400727All Organisms → cellular organisms → Bacteria699Open in IMG/M
3300012200|Ga0137382_10700314All Organisms → cellular organisms → Bacteria726Open in IMG/M
3300012200|Ga0137382_11043932All Organisms → cellular organisms → Bacteria585Open in IMG/M
3300012200|Ga0137382_11214020All Organisms → cellular organisms → Bacteria535Open in IMG/M
3300012205|Ga0137362_11416854All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300012206|Ga0137380_10926011All Organisms → cellular organisms → Bacteria748Open in IMG/M
3300012206|Ga0137380_11014338All Organisms → cellular organisms → Bacteria709Open in IMG/M
3300012206|Ga0137380_11733629All Organisms → cellular organisms → Bacteria508Open in IMG/M
3300012207|Ga0137381_11786541All Organisms → cellular organisms → Bacteria504Open in IMG/M
3300012208|Ga0137376_11487984All Organisms → cellular organisms → Bacteria568Open in IMG/M
3300012285|Ga0137370_10420647All Organisms → cellular organisms → Bacteria811Open in IMG/M
3300012285|Ga0137370_10521352All Organisms → cellular organisms → Bacteria729Open in IMG/M
3300012349|Ga0137387_11190339All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300012349|Ga0137387_11311691All Organisms → cellular organisms → Bacteria506Open in IMG/M
3300012353|Ga0137367_10692830All Organisms → cellular organisms → Bacteria710Open in IMG/M
3300012353|Ga0137367_10968396All Organisms → cellular organisms → Bacteria582Open in IMG/M
3300012359|Ga0137385_11109863All Organisms → cellular organisms → Bacteria651Open in IMG/M
3300012359|Ga0137385_11500392All Organisms → cellular organisms → Bacteria538Open in IMG/M
3300012922|Ga0137394_11394407All Organisms → cellular organisms → Bacteria562Open in IMG/M
3300012925|Ga0137419_11689996All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300012977|Ga0134087_10699953All Organisms → cellular organisms → Bacteria537Open in IMG/M
3300012977|Ga0134087_10799786All Organisms → cellular organisms → Bacteria510Open in IMG/M
3300014150|Ga0134081_10319749All Organisms → cellular organisms → Bacteria562Open in IMG/M
3300014154|Ga0134075_10464292All Organisms → cellular organisms → Bacteria564Open in IMG/M
3300014166|Ga0134079_10423046All Organisms → cellular organisms → Bacteria625Open in IMG/M
3300015359|Ga0134085_10383658All Organisms → cellular organisms → Bacteria629Open in IMG/M
3300015372|Ga0132256_102935074All Organisms → cellular organisms → Bacteria573Open in IMG/M
3300017654|Ga0134069_1374622All Organisms → cellular organisms → Bacteria513Open in IMG/M
3300017657|Ga0134074_1208583All Organisms → cellular organisms → Bacteria695Open in IMG/M
3300017657|Ga0134074_1376539All Organisms → cellular organisms → Bacteria527Open in IMG/M
3300018433|Ga0066667_11315467All Organisms → cellular organisms → Bacteria632Open in IMG/M
3300018468|Ga0066662_12107024All Organisms → cellular organisms → Bacteria591Open in IMG/M
3300018468|Ga0066662_12521701All Organisms → cellular organisms → Bacteria542Open in IMG/M
3300018482|Ga0066669_12065477All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300019789|Ga0137408_1161353All Organisms → cellular organisms → Bacteria712Open in IMG/M
3300020170|Ga0179594_10401706All Organisms → cellular organisms → Bacteria520Open in IMG/M
3300024317|Ga0247660_1049979All Organisms → cellular organisms → Bacteria690Open in IMG/M
3300025922|Ga0207646_11618410All Organisms → cellular organisms → Bacteria558Open in IMG/M
3300026298|Ga0209236_1303613All Organisms → cellular organisms → Bacteria516Open in IMG/M
3300026310|Ga0209239_1308539All Organisms → cellular organisms → Bacteria548Open in IMG/M
3300026314|Ga0209268_1197001All Organisms → cellular organisms → Bacteria500Open in IMG/M
3300026315|Ga0209686_1191349All Organisms → cellular organisms → Bacteria561Open in IMG/M
3300026316|Ga0209155_1172369All Organisms → cellular organisms → Bacteria708Open in IMG/M
3300026317|Ga0209154_1307051All Organisms → cellular organisms → Bacteria522Open in IMG/M
3300026323|Ga0209472_1319081All Organisms → cellular organisms → Bacteria500Open in IMG/M
3300026331|Ga0209267_1233102All Organisms → cellular organisms → Bacteria666Open in IMG/M
3300026332|Ga0209803_1245627All Organisms → cellular organisms → Bacteria620Open in IMG/M
3300026332|Ga0209803_1354335All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300026334|Ga0209377_1184154All Organisms → cellular organisms → Bacteria722Open in IMG/M
3300026527|Ga0209059_1274591All Organisms → cellular organisms → Bacteria568Open in IMG/M
3300026536|Ga0209058_1215974All Organisms → cellular organisms → Bacteria750Open in IMG/M
3300026540|Ga0209376_1269176All Organisms → cellular organisms → Bacteria712Open in IMG/M
3300027273|Ga0209886_1033833All Organisms → cellular organisms → Bacteria784Open in IMG/M
3300027787|Ga0209074_10410046All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300027875|Ga0209283_10970790All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300027882|Ga0209590_10766443All Organisms → cellular organisms → Bacteria615Open in IMG/M
3300028878|Ga0307278_10520013All Organisms → cellular organisms → Bacteria519Open in IMG/M
3300031740|Ga0307468_102169821All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300031754|Ga0307475_11512290All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300031820|Ga0307473_11251621All Organisms → cellular organisms → Bacteria553Open in IMG/M
3300031962|Ga0307479_11987750All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300032180|Ga0307471_104171744All Organisms → cellular organisms → Bacteria510Open in IMG/M
3300034817|Ga0373948_0121453All Organisms → cellular organisms → Bacteria632Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil30.17%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil21.55%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil15.52%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil8.62%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.90%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.31%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.59%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.59%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil1.72%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.86%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.86%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.86%
Rhizosphere SoilHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil0.86%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.86%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005878Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104EnvironmentalOpen in IMG/M
3300005886Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300024317Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK01EnvironmentalOpen in IMG/M
3300025326Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300027273Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027324Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034817Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1Host-AssociatedOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25382J37095_1021256913300002562Grasslands SoilDAVRRAWSDAVNRLQPTSVAWIPAFDPSDARWASGDSSRGPRGEAVPAPCAFDSTLWAGAFAPAYTALGRLAAEQRTLVIALGLDLRGARGGGGGLEFCDSAWRRGLARLARSGPPLDALPPGARYPALRDAGLLPLYYRALVDEVAERAAALRDRVLRQRRDLYFAFRLPQLPADWFTLGLLRGFALPDRPIL
JGI25617J43924_1028696613300002914Grasslands SoilRLQPTSVAWIPMLNLADGRWTPGDSSRGPRGEPVAAPCAFDSTLWAGTLAPAYAALARLAAEQRGLVIALGLDLAAWRTPGLEFCDAAWRRGLARILRSGPFDSLPYAERYPALRDAGLLALYYRALEDEVAERAAVLRDRALKQRRDIYFAFRLPQLPADWFTLGLLHGFALADRPILVFTSEVRTR
Ga0066674_1046030513300005166SoilPLFATRDGRRVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGLDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDPAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPR
Ga0066672_1040828613300005167SoilLQQAPIPVELVPPPLVPPVGVDTVALPLVPDSKLDRATSVPEWLRQQGMRVLWAPLLPARDGRHAVRPAASLDSLVTLLDVGGFNLLGGDADPAGADSVHARWEERDAVRRAWTAAVGRLEPTSVAWIPVLDYAHARRNPADSSRGARGEGLPVPCALDSALWAEGLAGAYAALGRLAAEQRTLVIALGLDIDGPHGYSMGQEFCDAAWQRGIAALGDRARLDSLPAGARYAALRDAGLLSRYYRALEDEVTARATALRDRILKERRDVYFAFRLPQPPADWFTLGLMRGFGLPD
Ga0066672_1081021013300005167SoilSRGPRGEALPAPCALDTTLWAAGFAPAYAALGRLAAEQRTLVIALGLDLGDPLRDARARGRSYGMGQDFCDGAWHRALVRLGRRGVLDSLPYAARYSTLREAGLLPLYYRALQDDVAERASTLRVRVLRQRRDLYFAFRLPQAPGDWFTLGLLRGFALPDRPLLLLTPEAKTREVLALLRARGMNGVHAVQLAPA
Ga0066677_1042941913300005171SoilLPAHDGHHVVRPGASLDSLVTLLDVGGFNLLGGDADPAGADSLHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYGHARRTPADSSRGTRGEGLAAPCALDSVLWAEGLSGAYGALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGTGARLDSLPPGARYGALREAGLLSRYYRALEDEIAARAAALRDRILKQRRDVYFAFRLPQPPADWFTLGLLRGFGLPDRPLLLFTPELRT
Ga0066680_1053601413300005174SoilLARATSAPEWLRQQGMRVLWMPLFATRDGRRAVRSSASLDSLVALLDAGGFNLFAGDAGPESADSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYRNVRHAAGDSSRGARGEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCNAAWRRGLAGLVRAGTAGGGWDSLPFAARYPALRDAGLLPRYYRTLEDEVAARAAVLRDRVLKQRRDLYFAF
Ga0066680_1064283813300005174SoilLFVTRDGRRAMRSGAALDSLVVLLDVGGFNLVAGDAGPESMDSLHVRWEERDAVRRAWGDAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCAFDSLLWAEGLPSAYGALGRLAAEQRTLVIALGLDLGGPRSYSMGQEFCDAAWRRGLAVLMRGGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLY
Ga0066690_1063725213300005177SoilVGGFNLLGGDADPAGADSVHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYDHVRRNPADSSRGARGEGLAAPCALDSVLWAEGLSGAYAALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGNRAELDSLPSRARYAALRDAGLLSRYYHALEDEVAGRAAALRDRILKQRRDVYFAFRLPQAPADWFALGLLRGFALPDRPLLLFTPELRTRQLLALYRS
Ga0066675_1056177913300005187SoilALPLVPDSKLDRATSVPEWLRQQGMRVLWAPLLPARDGRHAVRPAASLDSLVTLLDVGGFNLLGGDADPAGADSVHARWEERDAVRRAWTAAVGRLEPTSVAWIPVLDYAHARRNPADSSRGARGEGLPVPCALDSALWAEGLAGAYAALGRLAAEQRTLVIGLGLDIDGPHGYSMGQEFCDAAWRRGIAALGDRARLDSLPAGARYAALRDAGLLSRYYRAIEDEVTARAAALRDRILKERRDVYFAFRLPQPPADWFTLGLMRGFGLPDRPLLLFTPELRTR
Ga0070703_1053208713300005406Corn, Switchgrass And Miscanthus RhizosphereGAARHAVTDSSRGARGEALPMPCALDSTLWTDGLPSAYGALGRLATDERTLVIAIGLDLAGPHAYSMGQDFCDLAWRRGLAGLSRVGQGDARLDSLPYPARYAALRDAGLLPRYFRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPTDWFTLGLLRGFDLPDRPILLFTSEVR
Ga0070708_10197248613300005445Corn, Switchgrass And Miscanthus RhizosphereENTDSLHTRWVEREAVRRAWADAAKRLQPTSVAWIPVLDLGAARHTVTDSSRGARGEALPMPCALDSTLWTDGLPSAYGALGRLATDQRTLVIAIGLDLAGPHGYSMGQDFCDLAWRRGLAGLSRVGQGGPHLDSLPYPARYAALRDAGLLPRYYRALEDEVTARATVLRDRVLKQRRD
Ga0066689_1066877713300005447SoilARSTASLDSLVALLDAGGFNLLAGDAGPESTDSLHTYWEERLAVRRVWADAVKRLQPTSVAWIPVLDLDKARHAAADSSRGARGEPLAAPCALDSMLWTDGLATAYAALGRLAAEQRTLVIALGLDVGERSYSMGQEFCDAAWRRGLIALTLTGAAGRHLDSLPYAARYPTLRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDVYFAFRLAQP
Ga0066682_1064100513300005450SoilVRRAWSDAVKRLGPTSVAWMPAFDDAAARLPPGDSSRGPRGEALPAPCALDTTLWAAGFAPAYAALGRLAAEQRTLVIALGLDLGDPLRDARARGRSYGMGQDFCDGAWHRALVRLGRRGVLDSLPYAARYSTLREAGLLPLYYRALQDDVAERASTLRDRVLRQRRDLYFAFRLPQAPGDWFTLGLLRGFALPDRPLLLLTPEAKTREVLALLRARGMN
Ga0070697_10142601513300005536Corn, Switchgrass And Miscanthus RhizosphereDSLVVLLDAGGFNLLAGDAGPESMDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCALDSILWAEGLPSAYGALGRLAADQRTLVIAVGLDLGGLRSYSMGQEFCDAAWRRGLAVLIRGGAAGGGWDSLPHAARYPALRDAGLLPRYYRALEDEVAARAAGLRDRVLKQRRDLYFAF
Ga0066697_1042818013300005540SoilGRRVVRSGASLDSLVALLDAGGFNLLAGDAGPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEGLPAPCALDSTLWADGLPSAASALGRLAAEQRTLVIALGLDVAGPHSYSMGQEFCDAAWRRGLAGLIRAGAAGGGWDSLPFAARYPTLRDAGLLPRYYRALEDEVAARAMVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFELPDRPLLLFTPEIWTRDLLA
Ga0070696_10129467813300005546Corn, Switchgrass And Miscanthus RhizosphereVKRLQPTSVAWIPLLDYANVRHAAPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPVLRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGFGLPDRPLLLFTPELWTRDLLAFYRARG
Ga0066701_1047022413300005552SoilPLFATRDGRRVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGLDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDPAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGFGLPDRPLLLFTPELWTRD
Ga0066701_1080549713300005552SoilNRLQPTSVAWIPAFDPSDARWASGDSSRGPRGEAVPAPCAFDSTLWAGAFAPAYTALGRLAAEQRTLVIALGLDLRGARGGGGGLEFCDSAWRRGLARLARSGPPLDALPPGARYPALRDAGLLPLYYRALVDEVAERAAALRDRVLRQRRDLYFAFRLPQLPADWFTLGLLRGFALPDRPILVFT
Ga0066661_1082758413300005554SoilAVKRLQPTSVAWIPLLDYGNARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGF
Ga0066692_1081773113300005555SoilVHARWEERDAVRRAWSDAANRLQPTSVAWIPAFDPSDARWASGDSSRGPRGEAVPAPCAFDSTLWAGAFAPAYTALGRLAAEQRTLVIALGLDLRGARGGGGGLEFCDSAWRRGLARLARSGPPLDALPPGARYPALRDAGLLPLYYRALVDEVAERAAALRDRVLRQRRDLYFAFRLPQLPADWFTLGL
Ga0066698_1073555113300005558SoilVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRL
Ga0066693_1050480413300005566SoilTAAVGRLEPTSVAWIPVLDYAHARRNPADSSRGARGEGLPVPCALDSALWAEGLAGAYAALGRLAAEQRTLVIGLGLDIDGPHGYSMGQEFCDAAWRRGIAALGDRARLDSLPAGARYAALRDAGLLSRYYRALEDEVTARAAALRDRILKERRDVYFAFRLPQPP
Ga0066708_1067461313300005576SoilVHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYDHARRNPADSSRGARGEGLAAPCALDSALWAEGLSGAYAALGRLAAEQRTLVIALGLDIEGPRGYSMGQEFCDAAWRRGIAGFGNRAELDSLPSRTRYAALRDAGLLSRYYRALEDEVAGRAAALRDRILKERRDLYFAFRLPQAPADWFALGLLRGFALPDRPLLLFTPELRTRQLLALYRS
Ga0066706_1116201113300005598SoilAVRSSASLDSLVALLDAGGFNLFAGDAGPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYRNARHATGDSSRGARGEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCDAAWRRGLAGLVRAGTAGGGWDSLPFGARYPALRDAGLLPRYYRSLEDEVAARA
Ga0066706_1146152113300005598SoilADSVHARWEERDAVRRAWTAAVGRLEPTSVAWIPVLDYAHARRNPADSSRGARGEGLPVPCALDSALWAEGLAGAYAALGRLAAEQRTLVIGLGLDIDGPHGYSMGQEFCDAAWRRGTAALGDRARLDSLPAGARYAALRDAGLLSRYYRAIEDEVTARAAALRDRILKER
Ga0075297_103543913300005878Rice Paddy SoilVKRLQPTSVAWIPVLDYDNARHATADSSRGVRGEGLAAPCALDSTLWADGLASAYGALGRLAAEQRTLVIALGLEVDPSRAYSMGQEFCDAAWRRGVAGLPRAGTARSWDSLPYPARYPALRDAGLLPLYYRALEDEVAARATTLRDRVLKQRRDLYFAFQLAQPPADWFTLGLLRGFGLPDRPTLVFTPEV
Ga0075286_105951213300005886Rice Paddy SoilVLDYDNARHATADSSRGVRGEGLAAPCALDSTLWADGLASAYGALGRLAAEQRTLVIALGLEVDPSRAYSMGQEFCDAAWRRGVAGLPRAGTARSWDSLPYPARYPALRDAGLLPLYYRALEDEVAARATTLRDRVLKQRRDLYFAFQLAQPPADWFTLGLLRGFGLPDRPTLVFTPEVRTR
Ga0070716_10095461113300006173Corn, Switchgrass And Miscanthus RhizosphereASLDSMVSLLDAGGFNLLAGDAGPESTDSVHTRWDERDAVKRAWADAAKRLQPTSVAWIPVLDLVNARHAPADSSRGARGEALPVPCALDSTLWADGLASAYAALGRLAADERTLVIAIGLDLPASPHGYSMGQDFCDAAWRRGLAGLPRGAPGDQRLDTLPYAARYPALRDAGLLPRYYRALEDEIAARAAVLRDRVLKQRRDLYFAFRLAQPPTDWFALGLLR
Ga0070716_10181312513300006173Corn, Switchgrass And Miscanthus RhizosphereEERDAVRRAWAEAVKRLQPTSVAWIPLLDYRNVRHAPADSSRGARGEALPVPCALDSTLWAEGLPSAYGALGRLAVEQRTLVIALGLDVGGPRSYTMGQEFCDAAWRRGLTGLSRAGPAGGGWDSLPFAARYPALRDAGLLPRYYRVLEDEIAARATVLRDRVLKQRR
Ga0070712_10110127413300006175Corn, Switchgrass And Miscanthus RhizosphereGDAAPENTDSLHTRWVEREAVRRAWADAAKRLQPTSVAWIPVLDLGAARHTVTDSSRGARGEALPMPCALDSTLWTDGLPSAYGALGRLATDQRTLVIAIGLDLAGPHGYSMGQDFCDLAWRRGLAGLSRVGQGGPHLDSLPYPARYAALRDAGLLPRYYRALEDEVTARATVLRDRVLKQRRDLYFAFRLAQPPTDWFTLGLLRGFELPDRPILLFTPEVRARELVAL
Ga0079222_1242586213300006755Agricultural SoilMPCALDSTLWTDGLPSAYGALGRLATDERTLVIAIGLDLAGPHAYSMGQDFCDLAWRRGLAGLSRVGQGDARLDSLPYPARYAALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQTLTEGVTFGLLRGFERPDRPIRLCTLEVHSLELVVLNLADGLHLAHA
Ga0079222_1268854813300006755Agricultural SoilLAAPCALDSTLWADGLASAFAALARLAADQRTLVIALGLDLGASHSYSMGQEFCDLAWRRGVAGVSHGGEARAWDSLPYAARYPTLRDAGLLARYYRALEDEVAARATVLRDRVLRQRRDVYFAFHLEQPPADWFTIGLLRGFGLPDRPTLLFTPEPQARDLLALYR
Ga0066653_1054516013300006791SoilWTPLFATRDGRRVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGLDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDPAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRY
Ga0066653_1078673613300006791SoilASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFDALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARY
Ga0066659_1125625313300006797SoilGASLDSLVTLLDVGGFNLLGGDADPTGADSLHARWEEHDAVRRAWAAAVGRLEPTSVAWIPVLDYGHARRTPADSSRGTRGEGLAAPCALDSVLWAEGLSGAYGALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGTGARLDSLPPGARYGALREAGLLSRYYRALEDEIAARAAALRDRILKQRRDVYFAFR
Ga0066660_1101394813300006800SoilGDADPAGADSVHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYDHVRRNPADSSRGARGEGLAAPCALDSVLWAEGLSGAYAALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGNRAELDSLPSRARYAALRDAGLLSRYYHALEDEVAGRAAALRDRILKQRRDVYFAFRLPQAPADWFALGLLRGFALPDRPLLLFTPELRTRQ
Ga0066660_1104493713300006800SoilPAGADSVHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYDHARRNPADSSRGARGEGLAAPCALDSALWAEGLSGAYAALGRLAAEQRTLVIALGLDIEGPRGYSMGQEFCDAAWRRGIAGFGNRAELDSLPSGTRYAALRDAGLLSRYYRALEGEVAGRAAALRDRILKQRRDLYFAFRLPQAPADWFALGLLRGFALPDRPLLLFTPELRTRQ
Ga0066710_10462993413300009012Grasslands SoilLWAEGLPSAYGALGRLAAEQRTLVIALGLDVGGPRSYSMGQEFCDAAWRRGLAVLIRGGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFGLPDRPLLLFTPEVWTRDLLALYRARGFNLVHAVALV
Ga0099828_1165988913300009089Vadose Zone SoilRAVRSGAALDSLVVLLDVGGFNLVAGDAGPESMDSLHVRWEERDAVRRAWGDAVKRLQPTSVAWIPLLDYGNARHAPADSSRGARGEALAAPCAFDSLLWAEGLPSAYGALGRLAAEQRTLVIALGLDLGGPRSYSMGQEFCDAAWRRGLAVLMRGGAAGGGWDSLPYAARYPALRDAGLLPRYYR
Ga0099828_1170580613300009089Vadose Zone SoilESMDSLHVRWEERDAVRRAWGDAVKRLQPTSVAWIPLLDYGNARHTAADSSRGARGEALAAPCALDSLLWAEGLPSAYGALGRLAAEQRTLVIALGLDLAGPRSYSMGQEFCDAAWRRGLAVLMRGGAAGGGWDSLPYGARYPALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAF
Ga0066709_10303995613300009137Grasslands SoilMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVRRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVETARSYSMGQEFCDAAWRRGLAALIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEV
Ga0134070_1032293613300010301Grasslands SoilAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAF
Ga0134082_1027368513300010303Grasslands SoilHHVVRPGASLDSLVTLLDVGGFNLLGGDADPAGADSLHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYGHARRTPADSSRGTRGEGLAAPCALDSVLWAEGLSGAYGALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGTGARLDSLPPGARYAALREAGLLSRYYRALEDEIAARAAALRDRILKERRDVYFAFRLPQPPADWFTLGLLRGFGLPDR
Ga0134084_1040028513300010322Grasslands SoilVAGDADPEGTDSVHARWEERDAVRRAWSDAVNRLQPTSVAWIPAFDPSDARWASGDSSRGPRGEAVPAPCAFDSTLWAGAFAPAYTALGRLAAEQRTLVIALGLDLRGARGGGGGLEFCDSAWRRGLARLARSGPPLDALPPGARYPALRDAGLLPLYYRALVDEVAERAAALRDRV
Ga0134084_1046010113300010322Grasslands SoilPLLDYRNVRHAAGDSSRGARGEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCNAAWRRGLAGLVRAGTAGGGWDSLPFAARYTALRDAGLLPRYYRTLEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFGL
Ga0134065_1024215913300010326Grasslands SoilSLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLL
Ga0134065_1034313413300010326Grasslands SoilLAVRRVWADAVKRLQPTSVAWIPVLDLDKARHAAADSSRGARGEPLAAPCALDSMLWTDGLATAYAALGRLAAEQRTLVIALGLDVGERSYSMGQEYCDAAWRRGLIALTLTGAAGRHLDSLPYAARYPTLRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDVYFAFRLAQPPADWFSLGLLRGFGLPDRP
Ga0134063_1032223813300010335Grasslands SoilNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGFGLPDRPLLLFTPELWTRDLLAFYRARGLNIVHAVALV
Ga0134063_1064591113300010335Grasslands SoilVAWIPAFDPSDARWASGDSSRGPRGEAVPAPCAFDSTLWAGAFAPAYTALGRLAAEQRTLVIALGLDLRGARGGGGGLEFCDSAWRRGLARLARSGPPLDALPPGARYPALRDAGLLPLYYRALVDEVAERAAALRDRVLRQRRDLYFAFRLPQLPADWFTLGLLRGFALPDRPILVFTP
Ga0134071_1040072713300010336Grasslands SoilMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGADSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRGDLYFAFRLAQPPADWFTFGLLRGFGLP
Ga0137382_1070031413300012200Vadose Zone SoilMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGLDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDPAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGF
Ga0137382_1104393213300012200Vadose Zone SoilSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDSGNARHAAADSSRGARGEALAAPCALDSTLWAEGLPSAYGALGRLAAEQRTLVIALGLDVGGPRTYSLGQEFCDAAWRSGLAVLTRAGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATGLRDRVLKQRRDLYFAFRLAQPPADWFTLGL
Ga0137382_1121402013300012200Vadose Zone SoilASLDSLVTLLDVGGFNLLGGDADPAGADSVHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYDHARRNPADSSRGARGEGLAAPCALDSALWAEGLSGAYAALGRVAAEQRTLVIALGLDIEGPRGYSMGQEFCDAAWRRGIAGFGNRAELDSLPSRTRYAALRDAGLLSRYYRAL
Ga0137362_1141685413300012205Vadose Zone SoilDYRNVRHATGDSSRGARGEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCDAAWRRGLAGLVRAGTAGGGWDSLPFGARYPALRDAGLLPRYYRSLEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFGLPDRPLLLFTPEVWTRDLLALYRARGLN
Ga0137380_1092601113300012206Vadose Zone SoilTRDGRRVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRGDLYFAFRLAQPPADWFTFGLLRGFGLPDRPLLLFTP
Ga0137380_1101433813300012206Vadose Zone SoilESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVETARSYSMGQEFCDAAWRRGLAALIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGFGLPDRPLLLFTPELWTRDLLAFYRARGLNIVHA
Ga0137380_1173362913300012206Vadose Zone SoilRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANVRHAGPDSSRGARGEGLAAPCALDSILWGEGLPSAFGALARLAAEQRTLVIALGLDVGAARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQR
Ga0137381_1178654113300012207Vadose Zone SoilDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVETARSYSMGQEFCDAAWRRGLAALIRTGAAGGGWDSLPYTARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLY
Ga0137376_1148798413300012208Vadose Zone SoilMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYVNARHAGLDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDPAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAG
Ga0137370_1042064713300012285Vadose Zone SoilARATSTPEWLRQQGMRVLWTPLFATRDGRRVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGLDSSRGARGEGLAAPCALDSTLWGEGLPSAFDALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDPAWRRGLAVLIRTGAAGGGWDSLPYTARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGFGLPDRPLLL
Ga0137370_1052135213300012285Vadose Zone SoilAGGFNLFAGDAGPESADSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEGLPAPCALDSTLWADGLPSAYGALGRLAAEQRTLVIALGLDVAGPHSYSMGQEFCDAAWRRGLAGLIRAGAAGGGWDSLPFAARYPALRDAGLLPRYYRALEDEVAARAMVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFELPDRPLLLFTPEIWTRDLLALQRARGL
Ga0137387_1119033913300012349Vadose Zone SoilADSLHARWEERDAVRRAWADAVNRLQPTSVAWIPAFDAAGARWASGDSSRGPRGEAFATPCAFDSTLWNGSLASAYAALGRLAADQRNLVIALGLDLWDAHPRAAEFCDEAWRRTLARIRRGTVLDSLPPVARYPALRDAGLLPAYYRALEDEMAAHAAVLRDRVLKQRSDLYFAFRLR
Ga0137387_1131169113300012349Vadose Zone SoilLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAG
Ga0137367_1069283013300012353Vadose Zone SoilLLAGDARPESMDSLHVRWDERDAVRRAWAEAVKRLQPTSVAWVPLLEFGEARHGAADSSRGVRGEALPAPCALDSTLWADGLPSAYGALGRLAADQRTLVIAVGVEFGSARDYSMGQEFCDAAWRRGLVGLSRGGGEARAWEALPYQARYPTLRDAGLLPLYYRALEDEVAARATVLRDRVLKQRRDLYFAFQLPQPPADWLTLGLLRGFGLPDRPILLLTPELRTRELLALYRAR
Ga0137367_1096839613300012353Vadose Zone SoilTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCALDSTLWTEGLPAAYSALGRLAAEQRTLVIALGLDVGGPRSYSMGKEFCDAAWRRGLAVLIRAGAAGGGWDSLSYPARYPTLRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFGLPDRPLLLFTPEVWTRDLLA
Ga0137385_1110986313300012359Vadose Zone SoilERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYAALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLPQPPADWFTFGLLRGFGLPDRPLLLFTPELWTRDLLAFYRA
Ga0137385_1150039213300012359Vadose Zone SoilNARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVETARSYSMGQEFCDAAWRRGLAALIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRLAQPPADWFTFGLLRGFGLPDRPLLLFTPELWTRD
Ga0137394_1139440713300012922Vadose Zone SoilDSLVVLLDVGGFNLVAGDAAPESMDSLHVRWEERDAVRRAWGDAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCAFDSLLWAEGLPSAYGALGRLATEQRTLVIALGLDLGGPRSYSMGQEFCDAAWRRGLAVLMRGGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAAR
Ga0137419_1168999613300012925Vadose Zone SoilFNLLAGDAGPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLEYGNARHAALDSSRGARGEALAAPCALDSILWAEGLPSAYGALGRLAAEQRTLVIALGLDIGGPRSYSMGQEFCDPAWRRGLAVLIRAGAAGGGWDSLPYAARYPALRNAGLLPRYYRALEDEVAARATAL
Ga0134087_1069995313300012977Grasslands SoilADAVKRLQPTSVAWIPLLDYRNARHATGDSSRGARGEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCDAAWRRGLAGLVRAGTAGGSWDSLPFAARYPALRDAGLLARYYRTLEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGL
Ga0134087_1079978613300012977Grasslands SoilVRRVWADAVKRLQPTSVAWIPVLDLDKARHAAADSSRGARGEPLAAPCALDSMLWTDGLATAYAALGRLAAEQRTLVIALGLDVGERSYSMGQEFCDAAWRRGLIALTLTGAAGRHLDSLPYAARYPTLRDAGLLPRYYRALEDEVA
Ga0134081_1031974913300014150Grasslands SoilADSSRGARGEALAAPCALDSTLWAEGLPSAYGALGRLAAEQRTLIIALGLDVGALRSYSMGQEFCDAAWRRGLAVLIRSGAAGGGWDSLPHAARYPALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFGLPDRPLLLFAPEVWTRDLLALYRARGRNIVH
Ga0134075_1046429213300014154Grasslands SoilLLDVGGFNLLGGDADPAGADSVHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYGHARRNLADSSRGARGEGLAAPCALDSASWAEGLSGAYAALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIASFGNRAELDSLPSGARYATLRDAGLLSRYYRALEEEVAARAAALRDRILKQ
Ga0134079_1042304613300014166Grasslands SoilDVGGFNLLGGDADPAGADSLHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYGHARRTPADSSRGTRGEGLAAPCALDSVLWAEGLSGAYGALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGTGARLDSLPPGARYAALREAGLLSRYYRALEDEIAARAAALRDRILKQRRDVYSAFRLPQPPADWFTLGL
Ga0134085_1038365813300015359Grasslands SoilSLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFDALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRRDLYFAFRL
Ga0132256_10293507413300015372Arabidopsis RhizosphereGGFNLLAGDARPESMDSLHVRWDERDWVRRAWADAVKRLQPTSVAWIPLLDYGAARRNPADSSRGARGEGLAAACVLDSTLWAEGLPSAFAALGKLAGEQKTLVIALGVGLDGARDYSMGQEFCDPAWRRGLAGLALGNGGARAWDTLPYAARYPSLRDAGLLPLYYRALEDEVAARAAVLRDRVLRQRR
Ga0134069_137462213300017654Grasslands SoilDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDRVLRQRGDLYYAFRLAQPPADWFTFGLLRGFGLPDRPL
Ga0134074_120858313300017657Grasslands SoilTPLFATRDGRRAVRSGASLDSLVVLLDAGAFNLLAGDASPESMDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAVPCALDSTLWAEGLPSAYGALGRLAAEQRTLVIALGLDVGGPRTYSMGQEFCDAAWRSGLAVLTRAGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATGLRDRVLKQRRDLYFAFRLAQPPA
Ga0134074_137653913300017657Grasslands SoilRDAVRRAWAAAVGRLEPTSVAWIPVLDYDHARRNPADSSRGARGEGLAAPCALDSALWAEGLSGAYAALGRLAAEQRTLVIALGLDIEGPRGYSMGQEFCDAAWRRGIAGFGNRAELDSLPSGTRYAALRDAGLLSRYYRALEGEVAGRAAALRDRILKQRRDLYFAFRLPQAPA
Ga0066667_1131546713300018433Grasslands SoilDADPAGADSLHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYGHARRTPADSSRGTRGEGLAAPCALDSVLWAEGLSGAYGALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAAFGTGARLDSLPPGARYAALREAGLLSRYYRALEDEIAARAAVLRDRILKERRDVYFAFRLPQPPADWFTLGLLRGFGLPDRPLL
Ga0066662_1210702413300018468Grasslands SoilRDGRRVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVRRLQPTSVAWIPLLDYANARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVETARSYSMGQEFCDAAWRRGLAALIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEV
Ga0066662_1252170113300018468Grasslands SoilVVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANARHAGLDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEV
Ga0066669_1206547713300018482Grasslands SoilTSVAWIPVLDYGHARRNPADSSRGTRGEGLAAPCALDSALWAEGLSGAYGALGRLAAEQRTLVIALGLDIDGLRGYSMGQEFCDAAWRRGIVGLGNGATLDSLPYRARYAALRDAGLLPRYYRALEDEVTARAAALRDRILKQRRDVYFAFRLPQPPADWFTLGLLRGFALPDRPLLL
Ga0137408_116135313300019789Vadose Zone SoilSIRSSYCLLDAGGFNLLAGDAGPESMDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCALDSILWAEGLPSAYGALGRLAADQRTLVIALGLDVGGLRSYSMGQEFCDAAWRRGLAVLIRGGAAGGGWDSLPHAARYPALRDAGLLPRYYRALEDEVAARAAGLRDRVLKQRRDLYFAFRLSQPPADWFTLGLLRGFGLPDRPLLLFT
Ga0179594_1040170613300020170Vadose Zone SoilSLVVLLDVGGFNLVAGDAGPESMDSLHVRWEERDAVRRAWGDAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCAFDSLLWAEGLPAAYGVLGRLAAEQRTLVIALGLDLEGPRSYSMGQEFCDAAWRRGLAVLMRGGAAGAGWDSLPYAARYPALRDAGLLPR
Ga0247660_104997913300024317SoilTDSLHTRWDEREAVRRAWADAAKRLQPTSVAWIPVLDLGAARHAVTDSSRGARGEALPMPCALDSTLWTDGLPSAYGALGRLATDERTLVIAIGLDLAGPHAYSMGQDFCDLAWRRGLAGLSRVGQGDARLDSLPYPARYAALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPTDWFTLGLLRGFELPDRPILLFTPEVRARELVALYHADGLI
Ga0209342_1045151813300025326SoilWDELLAVRGAWSDAVKALQPTSVAWIPVLDYGNRLPPRADSSRGARGEPIPAPCALDSAFWGEGLAPGYGALMRLAAEQRTLVIALGLDLGRGYGMGQEFCDAAWRQALTRLTRRGELDSLPYPERYPTLRETGLLALYYRALEDLVAERAATLRDRALRQRRDLYFAFRLPLGSRSASCAASSYRTGRSSC
Ga0207646_1161841013300025922Corn, Switchgrass And Miscanthus RhizosphereVLDYAHARRNPADSSRGARGEGLAAPCALDSALWAEGISGAYAALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGLGDAARLDSLPSGARYAALRDAGLLSRYYRALEDEVATRAAAVRDRILKQRRDVYFAFRLPQPPADWFTLGLLRGFGLPDRPLLLFTPELRTRQLLSLY
Ga0209236_130361313300026298Grasslands SoilYWEERLAVRRAWADAVKRLQPTSVAWIPVHDLDKARHATADSSRGARGEALAAPCALDSILWAEGLPSAYGALGRLAAEQRTLVIALGLDIGGPRSYSMGQEFCDAAWRRGLAGLIRAGAAGGGWDSLAYAARYPALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRD
Ga0209239_130853913300026310Grasslands SoilTYWEERLAVRRVWADAVKRLQPTSVAWIPVLDLDKARHAAADSSRGARGEPLAAPCALDSMLWTDGLATAYAALGRLAAEQRTLVIALGLDVGERSYSMGQEFCDAAWRRGLIALTLTGAAGRHLDSLPYAARYPTLRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDVYFAFRLAQPP
Ga0209268_113775913300026314SoilVGGFNLVAGDADPEGTDSVHARWEERDAVRRAWSDAVNRLQPTSVAWIPAFDPSDARWASGDSSRGPRGEAVPAPCAFDSTLWAGAFAPAYTALGRLAAEQRTLVIALGLDLRGARGGGGGLEFCDSAWRRGLARLARSGPPLDALPPGARYPALRDAGLLPLYYRALVDEVAERAAALRDRVLRQRRDLYFAFRLPQLP
Ga0209268_119700113300026314SoilEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCDAAWRRGLAGLVRAGTAGGGWDSLPFGARYPALRDAGLLPRYYRSLEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFGLPDRPLLLFTPELWTRDLL
Ga0209686_119134913300026315SoilAGADSVHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYDHVRRNPADSSRGARGEGLAAPCALDSVLWAEGLSGAYAALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGNRAELDSLPSRARYAALRDAGLLSRYYHALEDEVAGRAAALRDRILKQRRDVYFAFRLPPAP
Ga0209155_117236913300026316SoilPLLPSRDGRHVVRPGASLDSLVTLLDVGGFNLLGGDADPAGADSVHARWEERDAVRRAWTAAVGRLEPTSVAWIPVLDYAHARRNPADSSRGARGEGLPVPCALDSALWAEGLAGAYAALGRLAAEQRTLVIGLGLDIDGPHGYSMGQEFCDAAWRRGIAALGDRARLDSLPAGARYAALRDAGLLSRYYRALEDEVTARAAALRDRILKERRDVYFAFRLPQPPADWFTLGLMRG
Ga0209154_130705113300026317SoilRHAVRPAASLDSLVTLLDVGAFNLLGGDADPAGADSVHARWEERDAVRRAWTAAVGRLEPTSVAWIPVLDYAHARRNPADSSRGARGEGLPVPCALDSALWAEGLAGAYAALGRLAAEQRTLVIALGLDIDGPHGYSMGQEFCDAAWQRGIAALGDRARLDSLPAGARYAALRD
Ga0209472_131908113300026323SoilLGGDADPAGADSLHARWEERDAVRRAWAAAVGRLEPTSVAWIPVLDYGHARRTPADSSRGTRGEGLAAPCALDSVLWAEGLSGAYGALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFGTGARLDSLPPGARYAALREAGLLSRYYRALEDEIA
Ga0209267_123310213300026331SoilLWMPLFATRDGRRAVRSSASLDSLVALLDAGGFNLFAGDAGPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYRNARHATGDSSRGARGEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCDAAWRRGLAGLVRAGTAGGGWDSLPFGARYPALRDAGLLPRYYRSLEDEVAARATVLRDRVLKQRRD
Ga0209803_124562713300026332SoilARSTASLDSLVALLDAGGFNLLAGDAGPESTDSLHTYWEERLAVRRVWADAVKRLQPTSVAWIPVLDLDKARHAAADSSRGARGEPLAAPCALDSMLWTDGLATAYAALGRLAAEQRTLVIALGLDVGERSYSMGQEFCDAAWRRGLIALTLTGAAGRHLDSLPYAARYPTLRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRD
Ga0209803_135433513300026332SoilWEERDVVRRAWADAVKRLQPTSVAWIPVLDYDKARHGAADSSRGPRGEPLAAPCALDSTLWAEGLGPAYAALGRLAAEQRTLVIALGLDIGGARGYSMGQEFCDEAWHRGVAGLSRGEGGTRSLDSLPNASRYSALRDAGLLPRYYRALEDEIAARAAVLRDRVLKQRR
Ga0209377_118415413300026334SoilGGFNLFAGDAGPESADSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYRNVRHAAGDSSRGARGEGLPAPCALDSTLWTEGLPSAFGALGRLAAEQHTLVIALGLDLGGLRSYSMGQEFCNAAWRRGLAGLVRAGTAGGGWDSLPFAARYPALRDAGLLPRYYRTLEDEVAARAAVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFGLPDRPLLLFTPEPWTRDLLALQRAR
Ga0209059_127459113300026527SoilVLLDAGGFNLLAGDASPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAGPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATTLRDR
Ga0209058_121597413300026536SoilGMRVLWTPLFATRDGRRAMRSGASLDSLVVLLDAGGFNLLASDAGPESTDSLHVRWEERDAVRRAWVDAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCALDSTLWAEGLPSAYGALGRLAAEQRTLIIALGLDVGALRSYSMGQEFCDAAWRRGLAVLIRSGAAGGGWDSLPHAARYPALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPADWFTLGLLRGFG
Ga0209376_126917613300026540SoilESTDSLHTRWDEREAVRRAWADAAKRLQPTSVAWIPVLDLGNARHAAADSSRGARGEALPEPCALDSTLWSDGLTSAYGALARLAADQRTLVIAIGLDLGGPRGYSMGQDFCDAAWRRGLAGLTRGIQGGERLDTLPFPARYPALRDAGLLPRYYRALEDEVAARAAVLRDRVLKQRRDLYFAFRLTQPPADWFTLGLLRGFALPDRPILLFTSEVRTRELLALYHAHGLNLAHAVA
Ga0209886_103383313300027273Groundwater SandLLAGDADPASADSLRARWDERLAVRRAWSDAVKALQPTSVAWIPVLDYGNAHPPRADSSRGARGEPIPAPCALDSAFWGEGLAPAYGALARLAAEQRTLVIALGLDLGRGYGMGQEFCDAAWRRALTRLTRRGELDSLPHPERYSTLRDTGLLSIYFRALEDLVAERAAVLRDRALRQRRDLLFAFRLAQPPADWFTLGVLRGFGLPDRPLLLLTPEVKMRELLTGLRARGVSAVHAVELSPAFLRSRDLAGLKRLVFEE
Ga0209845_107082113300027324Groundwater SandRLAVRRAWSDAVKALQPTSVAWIPVLDYGNAHPPRADSSRGARGEPIPAPCALDSAFWGEGLAPAYGALARLAAEQRTLVIALGLDLGRGYGMGQEFCDAAWLQALTRLTRRGELDSLPYPERYPTLREAGLLSLYYRALEDLVAERAAMLRDRALRQRRDLNFAFRLPQAPADWFTLGI
Ga0209074_1041004613300027787Agricultural SoilRGARGEALPVPCALDSTLWADGLASAYAALGRLAADERTLVIAIGLDLPASPHGYSMGQDFCDAAWRRGLAGLPRGAPGDQRLDTLPYAARYPALRDAGLLPRYYRALEDEIAARAAVLRDRVLKQRRDLYFAFRLAQPPTDWFTLGLLRGFGLPDRPILLFTPEIRTREFLALYHADGLNLAHAVALA
Ga0209283_1097079013300027875Vadose Zone SoilDGRRVMRSGASLDSLVVLLDAGGFNLLAGDASPESTDSLHVHWEERDAVRRAWADAVKRLQPTSVAWIPLLDYANVRHAAPDSSRGARGEGLAAPCALDSTLWGEGLPSAFGALARLAAEQRTLVIALGLDVGTARSYSMGQEFCDAAWRRGLAVLIRTGAAGGGWDSL
Ga0209590_1076644313300027882Vadose Zone SoilDAVRRAWADAVKRLQPTSVAWIPLLDYGNARHAAADSSRGARGEALAAPCALDSILWAEGLPSAYGALGRLAADQRTLVIALGLDVGGLRSYSMGQEFCDAAWRRGLAVLIRGGAAGGGWDSLPHAARYSALRDAGLLPRYYRALEDEVAARAAGLRDRVLKQRRDLYFAFRLSQPPADWFTLGLLRGFGLPDRPLLLFTPEVWT
Ga0307278_1052001313300028878SoilSVHTRWDERDAVRRVWADAVKHLQPTSVAWIPVLDYGNARHATADSSRGARGEALPAPCALDSTLWADGLASAYGALGRLAADQRTLVIALGLDIGAPRSYSMGQEFCDPAWRRGLAGLSRAGGGGALDALPYTARYPALRDAGLLPLYYRALEDEVAARAAVLRDRVLKQR
Ga0307468_10216982113300031740Hardwood Forest SoilNLLAGDAGPESTDSLHVRWEERDAVRRAWADAVKRLQPTSVAWIPLLDYGNARRAALDSSRGARGEGLAAPCALDSILWAEGLPSAYGALGRLAAEQRTLVIALGLDVGGPRSYSMGQEFCDAAWRRGLAVLMRGGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAVRATVL
Ga0307475_1151229013300031754Hardwood Forest SoilAWIPVLDFAHARRNPADSSRGARGEGLPAPCALDSALWAEGISGAYAALGRLAADQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGFSDAARLDSLPSGVRYAALRDAGLLSRYYRALEDEVATRAAAVRDRLLKQRRDVYFAFRLSQPPADWFALGLLRGFGLPDR
Ga0307473_1125162113300031820Hardwood Forest SoilVDSSRGARGEALAAPCAFDSLLWADGLPSAYSALGRLAADQRTLVIALGLDLGGPRSYSMGQEFCDAAWRRGLAVLMRGGAAGGGWDSLPYAARYPALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQTPSDWFTLGLLRGFGLPDRPLLLFTPEVWTRDVLALHRARGFN
Ga0307479_1198775013300031962Hardwood Forest SoilRAWTAAVGRLEPTSVAWIPVLDYAHARRNPADSSRGARGEGLAAPCALDSALWAEGISGAYAALGRLAAEQRTLVIALGLDIDGPRGYSMGQEFCDAAWRRGIAGLGDAARLDSLPPGARYAALRDGGLLSRYYRALEDEVATRAAAVRDRILKQRRDVYFAFRLPQPPADWFTLG
Ga0307471_10417174413300032180Hardwood Forest SoilWADAVKRLQPTSVAWIPVLDFEKARHGAADSSRGPRGEPLAAPCALDSTLWTEGLASAYGALGRLAAEQRTLVIALGLDIGGGRSYSMGQEFCDAAWRRGVAGMSRGEGGTRSLDSLPDASRYSALRDAGLLPRYYRALEDEIAARAAVLRDRVLKQRRDLYFAFRLEQ
Ga0373948_0121453_2_6313300034817Rhizosphere SoilFNLLAGDAAPENTDSLHTRWDEREAVRRAWADAAKRLQPTSVAWIPVLDLGAARHAVTDSSRGARGEALPMPCALDSTLWTDGLPSAYGALGRLATDERTLVIAIGLDLAGPHAYSMGQDFCDPAWRRGLAGLSRVGQGDPHLDSLPYPARYAALRDAGLLPRYYRALEDEVAARATVLRDRVLKQRRDLYFAFRLAQPPTDWFTLGLLR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.