NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F042235

Overview

Basic Information
Family ID F042235
Family Type Metagenome
Number of Sequences 158
Average Sequence Length 128 residues
Representative Sequence MLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN
Number of Associated Samples 107
Number of Associated Scaffolds 158

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 50.97 %
% of genes near scaffold ends (potentially truncated) 38.61 %
% of genes from short scaffolds (< 2000 bps) 63.92 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.48

Most Common Taxonomy
Group Archaea (69.620 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(36.709 % of family members)
Environment Ontology (ENVO) Unclassified
(53.797 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(59.494 % of family members)
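
The percentages in this overview appear to be fractions of the 158 family members, so they can be converted back into approximate member counts. The following is a minimal Python sketch under that assumption; the helper name `percent_to_count` is illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 158  # "Number of Sequences" from Basic Information

def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage of family members to the nearest whole count."""
    return round(percent * total / 100)

# Percentages copied from the Most Common Taxonomy / Ecosystem entries.
reported = {
    "Archaea (NCBI taxonomy)": 69.620,
    "Vadose Zone Soil (GOLD)": 36.709,
    "Unclassified (ENVO)": 53.797,
    "Soil, non-saline (EMPO)": 59.494,
}

for label, pct in reported.items():
    print(f"{label}: ~{percent_to_count(pct)} of {FAMILY_SIZE} members")
```

Rounding to the nearest integer is a design choice here: the page reports three decimal places, which is enough to recover a whole count when the denominator really is 158.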



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 62.66%, β-sheet: 0.00%, Coil/Unstructured: 37.34%

Predicted 3D Structure

pTM-score: 0.48

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
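
The cutoff in this note can be expressed as a one-line check. This is only a sketch: the 0.7 threshold comes from the note itself, while the constant and function names are hypothetical and do not reflect how the database implements the rule.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the note (pTM < 0.7)

def is_low_confidence(ptm: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff

# pTM-score reported for family F042235
print(is_low_confidence(0.48))
```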


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 158 Family Scaffolds
PF00857 | Isochorismatase | 26.58
PF01638 | HxlR | 3.80
PF04075 | F420H2_quin_red | 3.16
PF13243 | SQHop_cyclase_C | 2.53
PF12697 | Abhydrolase_6 | 2.53
PF04208 | MtrA | 2.53
PF07366 | SnoaL | 1.90
PF05378 | Hydant_A_N | 1.27
PF00892 | EamA | 1.27
PF09360 | zf-CDGSH | 1.27
PF04014 | MazE_antitoxin | 1.27
PF13424 | TPR_12 | 0.63
PF04185 | Phosphoesterase | 0.63
PF00561 | Abhydrolase_1 | 0.63
PF01436 | NHL | 0.63
PF13412 | HTH_24 | 0.63
PF02830 | V4R | 0.63
PF00801 | PKD | 0.63
PF03352 | Adenine_glyco | 0.63

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 158 Family Scaffolds
COG1335 | Nicotinamidase-related amidase | Coenzyme transport and metabolism [H] | 26.58
COG1535 | Isochorismate hydrolase | Secondary metabolites biosynthesis, transport and catabolism [Q] | 26.58
COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 3.80
COG0145 | N-methylhydantoinase A/oxoprolinase/acetone carboxylase, beta subunit | Amino acid transport and metabolism [E] | 2.53
COG4063 | Tetrahydromethanopterin S-methyltransferase, subunit A | Coenzyme transport and metabolism [H] | 2.53
COG1719 | Predicted hydrocarbon binding protein, contains 4VR domain | General function prediction only [R] | 0.63
COG2818 | 3-methyladenine DNA glycosylase Tag | Replication, recombination and repair [L] | 0.63
COG3511 | Phospholipase C | Cell wall/membrane/envelope biogenesis [M] | 0.63


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 83.54 %
Unclassified | root | N/A | 16.46 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002558|JGI25385J37094_10008376 | All Organisms → cellular organisms → Archaea | 3670
3300002558|JGI25385J37094_10063929 | All Organisms → cellular organisms → Archaea | 1196
3300002558|JGI25385J37094_10147712 | All Organisms → cellular organisms → Archaea | 634
3300002561|JGI25384J37096_10024353 | All Organisms → cellular organisms → Bacteria | 2359
3300002561|JGI25384J37096_10037379 | All Organisms → cellular organisms → Archaea | 1888
3300002561|JGI25384J37096_10092232 | All Organisms → cellular organisms → Archaea | 1075
3300002561|JGI25384J37096_10235981 | All Organisms → cellular organisms → Archaea | 534
3300002908|JGI25382J43887_10009143 | All Organisms → cellular organisms → Bacteria | 4965
3300002908|JGI25382J43887_10105871 | All Organisms → cellular organisms → Archaea | 1482
3300002909|JGI25388J43891_1005728 | All Organisms → cellular organisms → Archaea | 2475
3300002911|JGI25390J43892_10023036 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1500
3300002912|JGI25386J43895_10007215 | All Organisms → cellular organisms → Bacteria | 3103
3300002912|JGI25386J43895_10039422 | All Organisms → cellular organisms → Archaea | 1380
3300002912|JGI25386J43895_10043997 | All Organisms → cellular organisms → Archaea | 1298
3300005167|Ga0066672_10396844 | Not Available | 902
3300005167|Ga0066672_10639986 | All Organisms → cellular organisms → Archaea | 687
3300005172|Ga0066683_10028546 | All Organisms → cellular organisms → Bacteria | 3196
3300005172|Ga0066683_10247079 | All Organisms → cellular organisms → Archaea | 1106
3300005174|Ga0066680_10147000 | All Organisms → cellular organisms → Archaea | 1472
3300005176|Ga0066679_10017684 | All Organisms → cellular organisms → Archaea | 3711
3300005176|Ga0066679_10149905 | All Organisms → cellular organisms → Archaea | 1459
3300005177|Ga0066690_10145320 | All Organisms → cellular organisms → Archaea | 1555
3300005180|Ga0066685_10007568 | All Organisms → cellular organisms → Bacteria | 5838
3300005180|Ga0066685_10766233 | Not Available | 657
3300005181|Ga0066678_10100774 | All Organisms → cellular organisms → Archaea | 1748
3300005181|Ga0066678_10151914 | All Organisms → cellular organisms → Archaea | 1449
3300005446|Ga0066686_10664854 | Not Available | 706
3300005447|Ga0066689_10596564 | All Organisms → cellular organisms → Archaea | 697
3300005468|Ga0070707_100085165 | All Organisms → cellular organisms → Bacteria | 3055
3300005540|Ga0066697_10764311 | All Organisms → cellular organisms → Archaea | 526
3300005552|Ga0066701_10055305 | All Organisms → cellular organisms → Archaea | 2195
3300005554|Ga0066661_10371954 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 874
3300005555|Ga0066692_10027420 | All Organisms → cellular organisms → Bacteria | 2977
3300005557|Ga0066704_10202188 | All Organisms → cellular organisms → Archaea | 1343
3300005557|Ga0066704_10329293 | All Organisms → cellular organisms → Bacteria | 1027
3300005559|Ga0066700_10667524 | All Organisms → cellular organisms → Archaea | 717
3300005559|Ga0066700_10923392 | Not Available | 578
3300005559|Ga0066700_10985092 | All Organisms → cellular organisms → Archaea | 555
3300005568|Ga0066703_10388929 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 836
3300005586|Ga0066691_10035610 | All Organisms → cellular organisms → Bacteria | 2600
3300005586|Ga0066691_10159377 | All Organisms → cellular organisms → Archaea | 1301
3300005598|Ga0066706_10240724 | All Organisms → Viruses → Predicted Viral | 1405
3300006032|Ga0066696_10161188 | All Organisms → cellular organisms → Archaea | 1406
3300006034|Ga0066656_10335546 | Not Available | 977
3300006797|Ga0066659_10080338 | All Organisms → cellular organisms → Archaea | 2162
3300006797|Ga0066659_11464075 | Not Available | 571
3300006800|Ga0066660_10002326 | All Organisms → cellular organisms → Bacteria | 8643
3300006806|Ga0079220_10993930 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Natrialbales → Natrialbaceae → Natrinema → Natrinema pallidum | 665
3300006806|Ga0079220_11555018 | Not Available | 571
3300007255|Ga0099791_10341798 | Not Available | 716
3300007258|Ga0099793_10109009 | Not Available | 1286
3300007258|Ga0099793_10692482 | Not Available | 514
3300007265|Ga0099794_10154413 | Not Available | 1166
3300009038|Ga0099829_10005327 | All Organisms → cellular organisms → Archaea | 7810
3300009088|Ga0099830_10165347 | All Organisms → cellular organisms → Archaea | 1711
3300009088|Ga0099830_10444589 | All Organisms → cellular organisms → Archaea | 1052
3300009089|Ga0099828_10381141 | Not Available | 1271
3300009090|Ga0099827_10540654 | All Organisms → cellular organisms → Archaea | 1003
3300009137|Ga0066709_100577555 | All Organisms → Viruses → Predicted Viral | 1597
3300009137|Ga0066709_103049640 | All Organisms → cellular organisms → Archaea | 614
3300010304|Ga0134088_10230177 | All Organisms → cellular organisms → Archaea | 890
3300010304|Ga0134088_10429356 | All Organisms → cellular organisms → Archaea | 646
3300011269|Ga0137392_10747092 | All Organisms → cellular organisms → Archaea | 809
3300011270|Ga0137391_10261438 | Not Available | 1498
3300012096|Ga0137389_10968129 | All Organisms → cellular organisms → Archaea | 730
3300012189|Ga0137388_11421396 | All Organisms → cellular organisms → Archaea | 632
3300012189|Ga0137388_11667954 | All Organisms → cellular organisms → Archaea | 572
3300012198|Ga0137364_10039483 | All Organisms → cellular organisms → Archaea | 3077
3300012198|Ga0137364_10678777 | All Organisms → cellular organisms → Archaea | 777
3300012199|Ga0137383_10047025 | All Organisms → cellular organisms → Archaea | 3075
3300012199|Ga0137383_10241014 | All Organisms → cellular organisms → Archaea | 1325
3300012199|Ga0137383_10584707 | All Organisms → cellular organisms → Archaea | 817
3300012199|Ga0137383_10749343 | All Organisms → cellular organisms → Archaea | 713
3300012200|Ga0137382_10033578 | All Organisms → cellular organisms → Bacteria | 3089
3300012201|Ga0137365_10051245 | All Organisms → cellular organisms → Archaea | 3125
3300012202|Ga0137363_10316696 | All Organisms → cellular organisms → Archaea | 1284
3300012203|Ga0137399_10045700 | All Organisms → cellular organisms → Archaea | 3144
3300012205|Ga0137362_10446090 | All Organisms → cellular organisms → Archaea | 1120
3300012207|Ga0137381_10017128 | All Organisms → cellular organisms → Bacteria | 5674
3300012208|Ga0137376_10745946 | All Organisms → cellular organisms → Archaea | 844
3300012209|Ga0137379_10026499 | All Organisms → cellular organisms → Archaea | 5574
3300012209|Ga0137379_10778404 | All Organisms → cellular organisms → Archaea | 862
3300012210|Ga0137378_10008351 | All Organisms → cellular organisms → Archaea | 9004
3300012210|Ga0137378_10075324 | All Organisms → cellular organisms → Archaea | 3069
3300012210|Ga0137378_10882460 | All Organisms → cellular organisms → Archaea | 807
3300012211|Ga0137377_10806451 | All Organisms → cellular organisms → Archaea | 871
3300012285|Ga0137370_10366646 | All Organisms → cellular organisms → Archaea | 869
3300012354|Ga0137366_10162913 | All Organisms → cellular organisms → Archaea | 1675
3300012356|Ga0137371_10776854 | All Organisms → cellular organisms → Archaea | 731
3300012356|Ga0137371_11229181 | All Organisms → cellular organisms → Archaea | 558
3300012357|Ga0137384_10017245 | All Organisms → cellular organisms → Bacteria | 5873
3300012359|Ga0137385_10003359 | All Organisms → cellular organisms → Archaea | 13908
3300012361|Ga0137360_10086246 | Not Available | 2368
3300012361|Ga0137360_10110162 | All Organisms → cellular organisms → Archaea | 2124
3300012361|Ga0137360_10525431 | All Organisms → cellular organisms → Archaea | 1009
3300012361|Ga0137360_11070629 | All Organisms → cellular organisms → Archaea | 696
3300012361|Ga0137360_11506601 | Not Available | 577
3300012362|Ga0137361_10095286 | All Organisms → cellular organisms → Archaea | 2577
3300012362|Ga0137361_10203752 | All Organisms → cellular organisms → Archaea | 1790
3300012918|Ga0137396_10008168 | All Organisms → cellular organisms → Archaea | 6282
3300012918|Ga0137396_10105938 | All Organisms → cellular organisms → Archaea | 2013
3300012927|Ga0137416_11195675 | Not Available | 684
3300012944|Ga0137410_10156106 | Not Available | 1742
3300012972|Ga0134077_10319427 | All Organisms → cellular organisms → Archaea | 656
3300015245|Ga0137409_10071309 | All Organisms → cellular organisms → Bacteria | 3258
3300017656|Ga0134112_10407150 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 563
3300017659|Ga0134083_10163488 | All Organisms → cellular organisms → Archaea | 906
3300018431|Ga0066655_10227214 | Not Available | 1175
3300018468|Ga0066662_10031803 | All Organisms → cellular organisms → Archaea | 3141
3300018468|Ga0066662_10414924 | All Organisms → cellular organisms → Archaea | 1190
3300018468|Ga0066662_11110059 | All Organisms → cellular organisms → Archaea | 791
3300021046|Ga0215015_10214429 | Not Available | 1344
3300021088|Ga0210404_10000803 | All Organisms → cellular organisms → Archaea | 14639
3300021559|Ga0210409_10022276 | All Organisms → cellular organisms → Archaea | 6144
3300025922|Ga0207646_10068555 | All Organisms → cellular organisms → Archaea | 3168
3300026296|Ga0209235_1005907 | All Organisms → cellular organisms → Bacteria | 6859
3300026296|Ga0209235_1013236 | All Organisms → cellular organisms → Archaea | 4538
3300026296|Ga0209235_1083792 | Not Available | 1416
3300026297|Ga0209237_1083811 | All Organisms → cellular organisms → Archaea | 1450
3300026298|Ga0209236_1007238 | All Organisms → cellular organisms → Archaea | 6678
3300026298|Ga0209236_1019606 | All Organisms → cellular organisms → Bacteria | 3802
3300026301|Ga0209238_1115030 | All Organisms → cellular organisms → Archaea | 894
3300026309|Ga0209055_1100863 | All Organisms → cellular organisms → Archaea | 1152
3300026310|Ga0209239_1027090 | All Organisms → cellular organisms → Archaea | 2760
3300026313|Ga0209761_1017777 | All Organisms → cellular organisms → Archaea | 4507
3300026313|Ga0209761_1050355 | All Organisms → cellular organisms → Archaea | 2336
3300026313|Ga0209761_1101119 | All Organisms → cellular organisms → Archaea | 1441
3300026315|Ga0209686_1001085 | All Organisms → cellular organisms → Archaea | 13622
3300026315|Ga0209686_1019409 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 2717
3300026317|Ga0209154_1018446 | All Organisms → cellular organisms → Archaea | 3294
3300026318|Ga0209471_1128769 | All Organisms → cellular organisms → Archaea | 1070
3300026325|Ga0209152_10019942 | All Organisms → cellular organisms → Bacteria | 2289
3300026326|Ga0209801_1028097 | All Organisms → cellular organisms → Archaea | 2685
3300026326|Ga0209801_1043093 | All Organisms → cellular organisms → Archaea | 2082
3300026327|Ga0209266_1090668 | All Organisms → cellular organisms → Archaea | 1362
3300026328|Ga0209802_1003567 | All Organisms → cellular organisms → Archaea | 10403
3300026328|Ga0209802_1198503 | All Organisms → cellular organisms → Archaea | 775
3300026332|Ga0209803_1248357 | All Organisms → cellular organisms → Archaea | 615
3300026354|Ga0257180_1049612 | All Organisms → cellular organisms → Archaea | 595
3300026532|Ga0209160_1002484 | All Organisms → cellular organisms → Archaea | 15344
3300026532|Ga0209160_1098746 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1485
3300026538|Ga0209056_10025687 | All Organisms → cellular organisms → Archaea | 5661
3300026540|Ga0209376_1058040 | All Organisms → cellular organisms → Archaea | 2176
3300026551|Ga0209648_10696215 | All Organisms → cellular organisms → Archaea | 555
3300027671|Ga0209588_1001841 | All Organisms → cellular organisms → Archaea | 5652
3300027748|Ga0209689_1070565 | All Organisms → cellular organisms → Archaea | 1880
3300027862|Ga0209701_10119346 | All Organisms → cellular organisms → Archaea | 1634
3300027875|Ga0209283_10829466 | All Organisms → cellular organisms → Archaea | 566
3300027882|Ga0209590_10170911 | Not Available | 1360
3300028536|Ga0137415_10261368 | Not Available | 1538
3300028536|Ga0137415_11131943 | All Organisms → cellular organisms → Archaea | 596
3300031820|Ga0307473_10766330 | All Organisms → cellular organisms → Archaea | 685
3300031962|Ga0307479_10237109 | All Organisms → Viruses → Predicted Viral | 1799
3300032180|Ga0307471_100895792 | Not Available | 1054
3300032205|Ga0307472_100567538 | All Organisms → cellular organisms → Archaea | 994
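
Each scaffold identifier has the form `<number>|<scaffold name>`, and the leading number appears to match a Taxon OID in the Associated Samples table. Under that assumption, the sketch below splits identifiers, counts scaffolds per sample, and computes the share of scaffolds under 2000 bp for the first five rows of the table. The function name `split_scaffold_id` is hypothetical.

```python
from collections import Counter

SHORT_SCAFFOLD_BP = 2000  # threshold named in the Quality Assessment section

def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts (assumed layout)."""
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name

# First five rows of the Associated Scaffolds table: (scaffold ID, length in bp)
rows = [
    ("3300002558|JGI25385J37094_10008376", 3670),
    ("3300002558|JGI25385J37094_10063929", 1196),
    ("3300002558|JGI25385J37094_10147712", 634),
    ("3300002561|JGI25384J37096_10024353", 2359),
    ("3300002561|JGI25384J37096_10037379", 1888),
]

# Number of family scaffolds contributed by each sample (Taxon OID)
per_sample = Counter(split_scaffold_id(sid)[0] for sid, _ in rows)

# Fraction of these scaffolds shorter than the threshold
short_fraction = sum(length < SHORT_SCAFFOLD_BP for _, length in rows) / len(rows)

print(per_sample)
print(f"{short_fraction:.0%} of the sampled scaffolds are shorter than {SHORT_SCAFFOLD_BP} bp")
```

Run over all 158 rows, the same calculation could be compared with the 63.92 % reported for short scaffolds, although the page does not state exactly how that figure was derived.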



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 36.71%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 29.75%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 20.89%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.16%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.53%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.90%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.27%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.27%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.27%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.63%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.63%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002909 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300002912 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental
3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental
3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026354 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-B | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_1000837673300002558Grasslands SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPT
JGI25385J37094_1006392923300002558Grasslands SoilMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK*
JGI25385J37094_1014771213300002558Grasslands SoilMLFIQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHILSQTQTVR
JGI25384J37096_1002435313300002561Grasslands SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSLRN*
JGI25384J37096_1003737913300002561Grasslands SoilLKPARIVSNFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIIIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQP
JGI25384J37096_1009223223300002561Grasslands SoilLKPARIVSNFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIIIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRREHIPDSFTGYPA*
JGI25384J37096_1023598113300002561Grasslands SoilLKPARIIANLVGLMLFVQVILGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSST
JGI25382J43887_1000914343300002908Grasslands SoilMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSLRN*
JGI25382J43887_1010587143300002908Grasslands SoilMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGIAAIXDYIIQIILGFIALGTSSGVVVVVHLTNAFVLGVLVTYLISFADSADKASTHILSQTQAVR
JGI25388J43891_100572813300002909Grasslands SoilMLFIQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHILSQTQTVR*
JGI25390J43892_1002303623300002911Grasslands SoilLKTTRIIANLVGLMLFVQVILGGSALLLGVPLIYHLVWGAGTFIVLIVATFYAATELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVVVHLTNAFVLGVLVTYLISFADSADKASAHVLSQLKPAST*
JGI25386J43895_1000721563300002912Grasslands SoilAVFGFIDVLYHIIWGALTFVVLIFATVFAASELGSKSTLFRTGIASIAVYVVQIILGLIALGPNSGAVIVVHLTNAFVLGVIVTYLISFADSAEKASTSLHPRSPTASPGSTRALSRRK*
JGI25386J43895_1003942233300002912Grasslands SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNGGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
JGI25386J43895_1004399713300002912Grasslands SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNGGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPT
Ga0066672_1039684423300005167SoilMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFQKGMAAIADYVLQIILGFIALGANSGVVVVVHLTNAFVLGFSYLISFADSADKASTHILSQTQTVR*
Ga0066672_1063998613300005167SoilLKPARIIANLVGLMLFFQVILGGSAVFGYINVQYHLEWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKTSTSLHPQPPTRSPGTMRALSRGK*
Ga0066683_1002854623300005172SoilMLFVQVILGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKASTSLRPQPPTGSPGTMRALSRGK*
Ga0066683_1024707923300005172SoilLKPTRIITNLVGLMLFIQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHILSQTQTVR*
Ga0066680_1014700023300005174SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0066679_1001768433300005176SoilMLFFQVILGGSAVFGYINVQYHLEWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLANAFVLGVLVTYLITFADSAEKASTSLHPQPPTGSPGTMRALSRGK*
Ga0066679_1014990523300005176SoilMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGIAAIADYVVQTILGFIALGTNSGVVVVIHLTNAFVLGVLVTYLISFADSADKASTTYYHKLKPYAEGPGYLLSNMLL*
Ga0066690_1014532023300005177SoilMLFVQVILGGSAVFGYIDVLYHLVWGIVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKTSTSLHPQPPTRSPGTMRALSRGE*
Ga0066685_1000756883300005180SoilQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHILSQTQTVR*
Ga0066685_1076623323300005180SoilMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTGMAAIADYVLQIILGFIALGANSGVVVVVHLTNAFVLGVLVTYLISFADSADKASIHILSQTQT
Ga0066678_1010077423300005181SoilMLFVQVILGGSAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTRSPGTMRALSRGE*
Ga0066678_1015191423300005181SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0066686_1066485413300005446SoilMLFIQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSA
Ga0066689_1059656413300005447SoilILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNGGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0070707_10008516543300005468Corn, Switchgrass And Miscanthus RhizosphereMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGIVAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSADKASTSLHPQPSTVSPGAMRALPPRS*
Ga0066697_1076431113300005540SoilMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSA
Ga0066701_1005530553300005552SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0066661_1037195413300005554SoilMLFVQVILGGSAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLANAFVLGVLVTYLITFADSAEKASTSLHPQPPAGSPGTMRALSRGK*
Ga0066692_1002742023300005555SoilLKPARIIANLVGLMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK*
Ga0066704_1020218813300005557SoilLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0066704_1032929313300005557SoilMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTIRALSRRK*
Ga0066700_1066752413300005559SoilFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIIIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRREHIPDSFTGYPA*
Ga0066700_1092339233300005559SoilGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTRMAAIADYVLQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASIHILSQTQTVR*
Ga0066700_1098509213300005559SoilGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASTHILSQTQTVR*
Ga0066703_1038892923300005568SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIMIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRREHIPDSFTGYPA*
Ga0066691_1003561023300005586SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIIIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRREHIPDSFTGYPA*
Ga0066691_1015937713300005586SoilFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0066706_1024072443300005598SoilSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVVVGSSNKAFVLGVLVTYLISFADSADKASTHILSQTQTVR*
Ga0066696_1016118843300006032SoilMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTGMAAIADYVLQIILGFIALGTNSGVVVVIHLTNAFVLGVLVTYLISFADSADKASTHILSQTQTVR
Ga0066656_1033554633300006034SoilMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTGMAAIADYALQIILGFIALGANGGVVVVVHLTNAFVLGVLVTYLISFADSADKASIHILSQTQTVR*
Ga0066659_1008033813300006797SoilMLFVQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASSHILSQTQTVR*
Ga0066659_1146407513300006797SoilLKTTRIIANLVGLMLFVQVILGGSALLLGVPLIYHLVWGAGTFIVLIVATFYAATELGSKSTLFRTGIAAIADYVVQTILGFIALGTNSGVVVVIHLTNAFVLGVLVTYLISFA
Ga0066660_1000232633300006800SoilLKPARIIANLVGLMLFFQVILGGSAVFGYINVQYHLEWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLANAFVLGVLVTYLITFADSAEKASTSLHPQPPAGSPGTMRALSRGK*
Ga0079220_1099393013300006806Agricultural SoilMLFVQVILGGSAVFGLADVRIHLVWGIVTFGVLIIATVYAARELGSKSTLFRTGVAAIADYVVQIVLGVTALATNNNAVVVVVHLTNAFVLGVLVTYLISFADSAERASASLNPQARTPAPGTMRALSRDF*
Ga0079220_1155501813300006806Agricultural SoilLKPTRIIANIVGVMLFIQVILGGSALLLGVPIIYHLVWGAATFIVLIAATFYAATELGSKSTLFRTGVAAIADYIVQIILGFVALGLNGYVVVVHLTNAFVLGVLVTYLISFADGADKASPQVLSQLKPAST*
Ga0099791_1034179813300007255Vadose Zone SoilMLFVQVILGGRGILQGVPIIYQLVWGAATFIVLIVATFYAATELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVVVHLTNAFVLGVLVTYLISFADSADKTSAHVLSQLKPVST*
Ga0099793_1010900933300007258Vadose Zone SoilMLFVQVILGGSAPLLGVPIIYHLVWGALTFIVLIVATFYAATELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVVVHLTNAFVLGVLVTYLISFADSADKASAHVLSQLK
Ga0099793_1069248213300007258Vadose Zone SoilMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGMAAIADYVVQIILGFIALGTNSGVVVVHLTNGLCSAFSSLILSASPIAPTRPPPTYYHKLKP
Ga0099794_1015441323300007265Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIVDYVVQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPKGPTVAPGTMRAVSRDA*
Ga0066710_10385550513300009012Grasslands SoilDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTRMAAIADYVLHIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASIHILSQTQTVR
Ga0099829_1000532713300009038Vadose Zone SoilMLFVQVILGGSAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIIVHLTNAFVLGVLVTYLISFADSAEKASAHVLSQTQKVR*
Ga0099830_1016534733300009088Vadose Zone SoilMLFVQVILGGSAAFGYIDVQYHLAWGVATFGVLIIATVYAVMELGSKSTLFRTGIAAIADYVVQIVLGFIALGTNSGVVIVVHLTNAFVLGVLVTYLINFADSAEKASAHVLSQTQTVR*
Ga0099830_1044458923300009088Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPKGPTVAPGTMLSVSRDA*
Ga0099828_1038114123300009089Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPKGPTVAPGTMLSVSRDA*
Ga0099827_1054065423300009090Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADAAEKASTALNPKGPTVAPGTMLSVSRDA*
Ga0066709_10057755543300009137Grasslands SoilMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTRMAAIADYVLQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASIHILSQTQTVR*
Ga0066709_10304964023300009137Grasslands SoilMLFIQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHI
Ga0134088_1023017723300010304Grasslands SoilMLFVQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHILSQTQTVR*
Ga0134088_1042935613300010304Grasslands SoilITERCGFSRVLKPARIIANLVGLMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK*
Ga0126378_1324634613300010361Tropical Forest SoilATLLNFPVDPHLYWGLITFIVLVVATFYAAKELGAKSALFRTGVAAIVDFLIQAGLGLTALFTGSNVVVLVHLTNAFVLGVLVTYVISFADSADKAAVVASSAAKPM*
Ga0137392_1074709213300011269Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTALNPKGPTVAPGTIQAASRDA*
Ga0137391_1026143823300011270Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTALNPKGPTVAPGTMLSVSPDA*
Ga0137389_1096812923300012096Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFLLGVLVTSLISFADSAEKVSTPLNPKGPTVAPGTMQAVSR
Ga0137388_1142139613300012189Vadose Zone SoilMMLFVQVILGGLATVLGYPVIYHIIWGGLTFVVLIIATVYAVRELGSKSTLFRTGIVANADYVVQIILGFVALGLNNDAVVVVHLTNAFVLGVLVTYLISFADSAEKASASLHPHTPTISPGTM*
Ga0137388_1166795413300012189Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTALNPKGPTVAP
Ga0137364_1003948353300012198Vadose Zone SoilRLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVITVHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRRGQTPDSFTGYPA*
Ga0137364_1067877713300012198Vadose Zone SoilRLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRAGIAAIADYVVQIILGFIALGTSSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRTLSHRN*
Ga0137383_1004702513300012199Vadose Zone SoilNFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVITVHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRRGQTPDSFTGYPA*
Ga0137383_1024101413300012199Vadose Zone SoilVGMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGMAAIADYVVQIILGFIALGTNSGVIVVVHLTNAFVLGVLVTYLISFADSADKAATHILSQTQTVR*
Ga0137383_1058470713300012199Vadose Zone SoilNFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIAAVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGSNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASASSHPQSPTVSPRTMPALSRRK*
Ga0137383_1074934323300012199Vadose Zone SoilILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0137382_1003357833300012200Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVITVHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRRGQTPDSFTGYPA*
Ga0137365_1005124523300012201Vadose Zone SoilMLFVQVILGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRAGIAAIADYVVQIILGFIALGTSSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRTLSHRN*
Ga0137363_1031669613300012202Vadose Zone SoilMVANLVGFMLFVQVILGGSAVFGFVDVIYHIVWRALTFVVLIVATVYAARELGSKSTLFRTGIAATADYVVQIVLGFVALGLNNDAVVVVHLTNAFVLGVLVTYLISFADSAEKTSASLHPHTPISPGTM*
Ga0137399_1004570043300012203Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNVFVLGVLVTYLISFADSAEKASTSLNPKGPSVAPGTMRAFSPDA*
Ga0137362_1044609033300012205Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSGLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALS
Ga0137381_1001712873300012207Vadose Zone SoilMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGT
Ga0137376_1074594623300012208Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0137379_1002649913300012209Vadose Zone SoilPTRIITNLVGMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVAMAYAASELGSKSTLFRTGMAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASTHILSQTQTVH*
Ga0137379_1077840413300012209Vadose Zone SoilRLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIAAVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGSNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASASSHPQSPTVSPRTMPALSRRK*
Ga0137378_1000835143300012210Vadose Zone SoilMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGMAAIADYVVQIILGFIALGTNSGVIVVVHLTNAFVLGVLVTYLISFADSADKAATHILSQTQTVR
Ga0137378_1007532413300012210Vadose Zone SoilVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVITVHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRRGQTPDSFTGYPA*
Ga0137378_1088246033300012210Vadose Zone SoilLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRAGIAAIADYVVQIILGFIALGSNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASASSHPQSPTVSPRTMPALSRRK*
Ga0137378_1166193813300012210Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTY
Ga0137377_1080645133300012211Vadose Zone SoilAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0137370_1036664613300012285Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLNSQPPTASPGTMRALSHRN*
Ga0137366_1016291323300012354Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRAGIAAIADYVVQIILGFIALGTSSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRTLSHRN*
Ga0137371_1077685413300012356Vadose Zone SoilILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRAGIAAIADYVVQIILGFIALGTSSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRTLSHRN*
Ga0137371_1122918113300012356Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADGAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0137384_1001724523300012357Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRTLSHRN*
Ga0137385_10003359113300012359Vadose Zone SoilMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSSIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK*
Ga0137360_1008624643300012361Vadose Zone SoilMVANLVGFMLFVQVILGGSAVFGFVDVIYHIVWRALTFVVLIVATVYAARELGSKSTLFRTGIAATADYVVQIVLGFVALGLNNDAVVVIHLTNAFVLGVLVTYLISFADSAEKTSASLHPHTPISPGTM*
Ga0137360_1011016243300012361Vadose Zone SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNGFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0137360_1052543113300012361Vadose Zone SoilMMLFVQVVLGGLATVLGYPVIYHIIWGGVTFVVLIIATVYGVRELGSKSTLFRTGIAAIADYVVQIILGFVALGLNNDAVVVVHLTNAFVLGVLVTYLISFADSAEKATASLHPQSPITPGTM*
Ga0137360_1107062923300012361Vadose Zone SoilSRVLKPARIIANLVGLMLFVQVILGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTIRALSRRK*
Ga0137360_1150660113300012361Vadose Zone SoilMLFVQIILGGSAPLLGVPIIYHLVWGAATFIVLIVATFYAAAELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVVVHLTNAFVLGVLVTYLISFADSADKASAHVLSQLKPAST*
Ga0137361_1009528643300012362Vadose Zone SoilMLFIQVILGGSAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLMSFADSAEKASTSLHSQPPTASPGTMRALSHRN*
Ga0137361_1020375223300012362Vadose Zone SoilMLFVQVILGGSAPLLGVPIIYHLVWGAATFIVLIVATFYAATELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVVVHLTNAFVLGVLVTYLISFADSADKASAHVLSQLKPAST*
Ga0137396_1000816873300012918Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWRVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSA
Ga0137396_1010593853300012918Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFMALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPKGPSVAPGTMRAFSPDA*
Ga0137416_1119567513300012927Vadose Zone SoilVFGYVDVRYHLVWGVLAFGVLIIGTVYAASELGRKSTLFRTGIAAIADYVVQIILGFMALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPKGPTVAPGTMLSVSRDA*
Ga0137410_1015610643300012944Vadose Zone SoilMLFVQVILGGSAPLLGVPIIYHLVWGAATFIVLIVATFYAATELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVLVHLTNAFVLGVLVTYLISFADNADKASAHVLPQLKPAST*
Ga0134077_1031942713300012972Grasslands SoilGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK*
Ga0137409_1007130943300015245Vadose Zone SoilMLFVQVILGGSALLLGVPIIYHLVWGAATFVVLIVATFYAATELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVLVHLTNAFVLGVLVTYLISFADNADKASAHVLPQLKPAST*
Ga0134112_1040715023300017656Grasslands SoilGLMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK
Ga0134083_1016348813300017659Grasslands SoilISVIGAFRNVLKPTRIITNLVGLMLFIQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHILSQTQTVR
Ga0066655_1022721423300018431Grasslands SoilVNVVGLMLFVPVILRGSAVFGYIDVLYHLAWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVIVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK
Ga0066662_1003180353300018468Grasslands SoilMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK
Ga0066662_1041492413300018468Grasslands SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN
Ga0066662_1111005913300018468Grasslands SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIIIHLTNAFVLGVLATYLISFADSAGKASTSLHQKHT
Ga0215015_1021442913300021046SoilVRYHLVWGVLAFGVLIIATVYAASELGSKSTLFRTGIAAIGDYVVQIILGFIALGTNSGVIIVVHLTNAFVLGVLVTYLISFADSAEKTSTSLNPKGPTISPGTLQTVSGDA
Ga0210404_10000803103300021088SoilLKPARIVANIVGLMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVIHLTNAFVLGVLVTYLISFADSAEKTSTALNPQGPTLTPGTNRTVSRD
Ga0210409_1002227653300021559SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVIHLTNAFVLGVLVTYLISFADSAEKTSTALNPQGPTLNSGTNRTVSRDS
Ga0207646_1006855553300025922Corn, Switchgrass And Miscanthus RhizosphereMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGIVAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSADKASTSLHPQPSTVSPGAMRALPPRS
Ga0209235_100590783300026296Grasslands SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIVLGFIALGTNSGVVIIIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRREHIPDSFTGYPA
Ga0209235_101323643300026296Grasslands SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAERASTSLHSQPPTASPGTMRALSHRN
Ga0209235_108379233300026296Grasslands SoilMLFVQVILGGSAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGVAAIADYVVQIILGFVALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASAHMLSQT
Ga0209237_108381113300026297Grasslands SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNGGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN
Ga0209236_100723863300026298Grasslands SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSLRN
Ga0209236_101960653300026298Grasslands SoilLKPTRIITNLVGLMLFIQVVLGGSAAFGYIDVRYHLVWGVVTFGVLIVATAYAANELGSKSVLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKTSSHILSQTQTVR
Ga0209238_111503013300026301Grasslands SoilMLFVQVILGGSAVFGYIAVRYHLVWGVVTFGVHIVATAYAASELGTKSTLFRTGMAAIADYVLQIILGFIALGTNSGVVVVIHLTNAFVLGVLVTYLISFADSADKASTYILSRTQTVC
Ga0209055_110086323300026309SoilMLFFQVILGGSAVFGYINVQYHLEWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLANAFVLGVLVTYLITFADSAEKASTSLHPQPPAGSPGTMRALSRGK
Ga0209239_102709023300026310Grasslands SoilLKTTRIIANLVGLMLFVQVILGGSALLLGVPLIYHLVWGAGTFIVLIVATFYAATELGSKSTLFRTGVAAIADYIVQIILGVVALGLNGYVVVVHLTNAFVLGVLVTYLISFADSADKASAHVLSQLKPAST
Ga0209761_101777713300026313Grasslands SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIIIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRREHIPDSFTGYPA
Ga0209761_105035513300026313Grasslands SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSLRN
Ga0209761_110111923300026313Grasslands SoilMLFVQVILGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTIRALSRRK
Ga0209686_100108533300026315SoilLKPARIIANLVGLMLFFQVILGGSAVFGYINVQYHLEWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLANAFVLGVLVTYLITFADSAEKASTSLHPQPPTGSPGTMRALSRGK
Ga0209686_101940913300026315SoilAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTRSPGTMRALSRGE
Ga0209154_101844633300026317SoilMLFFQVILGGSAVFGYINVQYHLEWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLANAFVLGVLVTYLITFADSAEKASTSLHPQPPTGSPGTMRALSRGK
Ga0209471_112876913300026318SoilRIITNLVGMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATPYAASELGSKSTLFRTGIAAIADYVVQTILGFIALGTNSGVVVVIHLTNAFVLGVLVTYLISFADSADKASTTYYHKLKPYAEGPGYLLSNMLL
Ga0209152_1001994223300026325SoilMLFVQVILGGSAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKTSTSLHPQPPTRSPGTMRALSRGK
Ga0209801_102809713300026326SoilMLFVQVILGGSAVFGYIDVLYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTRSPGTMRALSRGE
Ga0209801_104309333300026326SoilLKPARLVANFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN
Ga0209266_109066823300026327SoilLKPARIIANLVGLMLFVQVILGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIVVHLTNAFVLGVLVTYLISFADSAEKASTSLRPQPPTGSPGTMRALSRGK
Ga0209802_1003567123300026328SoilLKPARIIANLVGLMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTMRALSRRK
Ga0209802_119850323300026328SoilMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSALFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN
Ga0209803_124835713300026332SoilILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARQLGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVIIVHLTNAFVLGVLVTYLISFADSAEKASTSLHSQPPTASPGTMRALSHRN
Ga0257180_104961213300026354SoilLKPARIVANLVGLMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFLLGVLVTYLISFADSAEKVSTPLNPKGPTVAPGTMQA
Ga0209160_1002484123300026532SoilLKPARIVSNFVGLMLFVQVILGGSAVFGYIDVQYHLVWGVVTFGVLIIATVYAARELGSKSTLFRTGLAAIADYVVQIVLGFIALGTNSGVVIIIHLTNAFVLGVLVTYLISFADSAEKASTSLHPQPPTGSPGTMRALSRREHIPDSFTGYPA
Ga0209160_109874623300026532SoilLKPARIIANLVGLMLFVQVVLGGSAVFGYINVQYHLVWGVVTFGVLIIATVYAAMELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGIVIVIHLTNAFVLGVLVTYLISFADSAEKASASLYPQSSTASPGTIRALSRRK
Ga0209056_1002568713300026538SoilNLVGMMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSTLFRTGIAAIADYVVQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASTTHILSQTQTVR
Ga0209376_105804013300026540SoilMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTGMAAIADYVLQIILGFIALGANSGVVVVVHLTNAFVLGVLVTYLISFADSADKASIHIL
Ga0209648_1069621513300026551Grasslands SoilLKPARIVANLVGLMLFVQVIQGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPNGPTVAPGTMLSVSRDA
Ga0209588_100184123300027671Vadose Zone SoilVGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPKGPTVAPGTMRAVSRDA
Ga0209689_107056543300027748SoilMLFVQVILGGSAVFGYIDVRYHLVWGVVTFGVLIVATAYAASELGSKSALFRTRMAAIADYVLQIILGFIALGTNSGVVVVVHLTNAFVLGVLVTYLISFADSADKASIHILSQTQTVR
Ga0209701_1011934633300027862Vadose Zone SoilMLFVQVILGGSAAFGYIDVQYHLAWGVATFGVLIIATVYAVMELGSKSTLFRTGIAAIADYVVQIVLGFIALGTNSGVVIVVHLTNAFVLGVLVTYLINFADSAEKASAHVLSQTQTVR
Ga0209283_1082946613300027875Vadose Zone SoilQVILGGLATVLGYPVIYHIIWGGLTFVVLIIATVYAVRELGSKSTLFRTGIVANADYVVQIILGFVALGLNNDAVVVVHLTNAFVLGVLVTYLISFADSAEKASASLHPHTPTISPGTM
Ga0209590_1017091123300027882Vadose Zone SoilLKPTRIVANLVGLMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVIQIILGFIALGTNSGAIIVVHLTNAFVLGVLVTYLISFADAAEKASTALNPKGPTVAPGTMLSVSRDA
Ga0137415_1026136823300028536Vadose Zone SoilLKSARIVANLVALMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAASELGRKSTLFRTGIAAIADYVVQIILGFMALGTNSGAIIVVHLTNAFVLGVLVTYLISFADSAEKASTSLNPKGPSVAPGTMRAFSPDA
Ga0137415_1113194313300028536Vadose Zone SoilMLFVQVILGGSAVFGYVDVRYHLVWGVLAFGVLIIATVYAANELGRKSTLFRTGIAAIADYVVQIILGLIALGTNSGAIIVVHLTNAFVLGVLVTYLISFA
Ga0307473_1076633013300031820Hardwood Forest SoilQPTRIVTNLVGMMLFVQVILGGSAVFGLADVRYHLVWGIATFGVLIVATVYAARELGSKSTLFRTGIAAIVDYVVQIILGVTALATNNNSIVVVVHLTNAFVLGVLVTYLISFADSAEKASASLNPQARTPAPGTMRALSRDS
Ga0307479_1023710943300031962Hardwood Forest SoilFGLADVWIHLVWGIVTFGVLIIATVYAARELGSKSTLFRTGVAAIADYVVQIVLGVTALATNNNAVVVVVHLTNAFVLGVLVTYLISFADSAEKASASLNPQARTPAPGTMRALSRDS
Ga0307471_10089579213300032180Hardwood Forest SoilLGGLATVLGFDVIYHIIWGGLTFIVLIVATLFAARKMGSKSTLFRTGIAAIADYVVQIILGFVAFGLNDDAVVVVHLTNAFVLAVLVTYLISFADSAEKVSVAMRPQTPSMSPGTM
Ga0307472_10056753823300032205Hardwood Forest SoilMLFVQVILGGSAVFGLTDVRYHLVWGIATFGVLIVATVYAARELGSKSTLFRTGIAAIVDYVVQIILGVTALATNNDPIVVVVHLTNAFVLGVLVTYLISFADSA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice