NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F066205

Overview

Basic Information
Family ID F066205
Family Type Metagenome
Number of Sequences 127
Average Sequence Length 222 residues
Representative Sequence TASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Number of Associated Samples 111
Number of Associated Scaffolds 127

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 11.81 %
% of genes near scaffold ends (potentially truncated) 39.37 %
% of genes from short scaffolds (< 2000 bps) 48.82 %
Associated GOLD sequencing projects 98
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.84

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (17.323 % of family members)
Environment Ontology (ENVO) Unclassified (28.346 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (46.457 % of family members)
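
The percentages in this section are shares of the family's 127 sequences, so each one can be turned back into a whole number of members. A minimal sketch, assuming every reported percentage is a rounded integer count over 127; the function name `members_from_percent` and the dictionary labels are illustrative and not part of NMPFamsDB:

```python
FAMILY_SIZE = 127  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole-number member count closest to a reported percentage."""
    return round(percent / 100 * total)


# Percentages copied from the Most Common Ecosystem block (labels shortened).
ecosystem_percent = {
    "GOLD: Forest Soil / Soil": 17.323,
    "ENVO: Unclassified": 28.346,
    "EMPO: Soil (non-saline)": 46.457,
}

counts = {label: members_from_percent(p) for label, p in ecosystem_percent.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} members")
```

Under that assumption the three ecosystem shares correspond to 22, 36 and 59 of the 127 members.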



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 39.73%, β-sheet: 17.86%, Coil/Unstructured: 42.41%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.84
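
The legend above splits per-residue pLDDT into four bands. A minimal sketch of that binning, assuming the band edges are exactly those listed (0-50, 51-70, 71-90, 91-100) and that a fractional score belongs to the first band whose upper edge it does not exceed. The descriptive labels follow the usual AlphaFold convention and do not appear on this page; the function name `plddt_band` is illustrative:

```python
from bisect import bisect_left

BAND_UPPER_EDGES = [50, 70, 90, 100]              # upper edge of each band
BAND_NAMES = ["0-50", "51-70", "71-90", "91-100"]  # as shown in the legend
# Assumed labels (common AlphaFold wording, not taken from this page).
BAND_LABELS = ["very low", "low", "confident", "very high"]


def plddt_band(score: float) -> tuple[str, str]:
    """Map a pLDDT score in [0, 100] to its legend band and an assumed label."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    # bisect_left finds the first upper edge that is >= score.
    i = bisect_left(BAND_UPPER_EDGES, score)
    return BAND_NAMES[i], BAND_LABELS[i]


for s in (42.0, 70.0, 88.5, 96.2):
    print(s, plddt_band(s))
```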

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
f.23.10.1: Photosystem II reaction centre subunit H, transmembrane region | d1rzhh2 | 1rzh | 0.78848
a.137.7.1: Proteinase A inhibitor IA3 | d1dpjb_ | 1dpj | 0.78177
f.23.25.1: PetM subunit of the cytochrome b6f complex | d1q90m_ | 1q90 | 0.77462
f.23.25.1: PetM subunit of the cytochrome b6f complex | d4ogqf_ | 4ogq | 0.75516
f.23.19.1: Subunit XII of photosystem I reaction centre, PsaM | d6k61m_ | 6k61 | 0.73245



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 127 Family Scaffolds
PF04773 | FecR | 25.20
PF02075 | RuvC | 22.83
PF13722 | CstA_5TM | 7.09
PF00398 | RrnaAD | 3.15
PF01875 | Memo | 3.15
PF00365 | PFK | 1.57
PF08323 | Glyco_transf_5 | 0.79
PF00501 | AMP-binding | 0.79
PF00171 | Aldedh | 0.79
PF00924 | MS_channel | 0.79
PF02452 | PemK_toxin | 0.79
PF13193 | AMP-binding_C | 0.79
PF07676 | PD40 | 0.79
PF11799 | IMS_C | 0.79
PF07228 | SpoIIE | 0.79
PF12867 | DinB_2 | 0.79
PF07638 | Sigma70_ECF | 0.79

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 127 Family Scaffolds
COG0817 | Holliday junction resolvasome RuvABC endonuclease subunit RuvC | Replication, recombination and repair [L] | 22.83
COG0030 | 16S rRNA A1518 and A1519 N6-dimethyltransferase RsmA/KsgA/DIM1 (may also have DNA glycosylase/AP lyase activity) | Translation, ribosomal structure and biogenesis [J] | 3.15
COG1355 | Predicted class III extradiol dioxygenase, MEMO1 family | General function prediction only [R] | 3.15
COG0205 | 6-phosphofructokinase | Carbohydrate transport and metabolism [G] | 1.57
COG0014 | Gamma-glutamyl phosphate reductase | Amino acid transport and metabolism [E] | 0.79
COG0297 | Glycogen synthase | Carbohydrate transport and metabolism [G] | 0.79
COG0438 | Glycosyltransferase involved in cell wall biosynthesis | Cell wall/membrane/envelope biogenesis [M] | 0.79
COG0668 | Small-conductance mechanosensitive channel | Cell wall/membrane/envelope biogenesis [M] | 0.79
COG1012 | Acyl-CoA reductase or other NAD-dependent aldehyde dehydrogenase | Lipid transport and metabolism [I] | 0.79
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.79
COG2337 | mRNA-degrading endonuclease MazF, toxin component of the MazEF toxin-antitoxin module | Defense mechanisms [V] | 0.79
COG3264 | Small-conductance mechanosensitive channel MscK | Cell wall/membrane/envelope biogenesis [M] | 0.79
COG4230 | Delta 1-pyrroline-5-carboxylate dehydrogenase | Amino acid transport and metabolism [E] | 0.79



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2228664022|INPgaii200_c0457930 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 554
3300000364|INPhiseqgaiiFebDRAFT_101677465 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 6360
3300000789|JGI1027J11758_12682957 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1366
3300000955|JGI1027J12803_101428668 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 820
3300001646|JGI20276J16322_100898 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 550
3300001867|JGI12627J18819_10000460 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 13408
3300005174|Ga0066680_10139203 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1512
3300005177|Ga0066690_10013937 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 4246
3300005178|Ga0066688_10048338 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2449
3300005332|Ga0066388_100646225 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium | 1673
3300005434|Ga0070709_10003903 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 8042
3300005435|Ga0070714_100123696 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2304
3300005436|Ga0070713_100280430 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 1529
3300005436|Ga0070713_101353201 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 690
3300005439|Ga0070711_100233521 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 1435
3300005439|Ga0070711_100459494 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1043
3300005454|Ga0066687_10026805 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2495
3300005526|Ga0073909_10001141 | All Organisms → cellular organisms → Bacteria | 8266
3300005529|Ga0070741_10004273 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 33604
3300005542|Ga0070732_10067555 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2080
3300005542|Ga0070732_10254190 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1052
3300005557|Ga0066704_10047402 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2702
3300005561|Ga0066699_10566270 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 815
3300005610|Ga0070763_10733778 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 580
3300005921|Ga0070766_10010846 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 4675
3300006050|Ga0075028_100182034 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1126
3300006059|Ga0075017_100581656 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 855
3300006163|Ga0070715_10026604 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2303
3300006163|Ga0070715_10070234 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1563
3300006172|Ga0075018_10048044 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1768
3300006173|Ga0070716_100525317 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 878
3300006174|Ga0075014_100001411 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium | 6577
3300006354|Ga0075021_10018118 | All Organisms → cellular organisms → Bacteria | 3910
3300006797|Ga0066659_10041488 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2843
3300006797|Ga0066659_10612190 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium TAA 166 | 885
3300006800|Ga0066660_10071802 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2351
3300009792|Ga0126374_10012111 | All Organisms → cellular organisms → Bacteria | 3451
3300010048|Ga0126373_10028629 | All Organisms → cellular organisms → Bacteria | 4795
3300010358|Ga0126370_11909059 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium | 578
3300010360|Ga0126372_10003195 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 7443
3300010361|Ga0126378_10506895 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1322
3300012200|Ga0137382_10040441 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2858
3300012202|Ga0137363_10000190 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 32955
3300012207|Ga0137381_10250948 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1538
3300012208|Ga0137376_10190725 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1772
3300012517|Ga0157354_1005007 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1143
3300012519|Ga0157352_1006417 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1121
3300012582|Ga0137358_10072035 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2320
3300012927|Ga0137416_10000704 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 15970
3300012944|Ga0137410_10288934 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1296
3300012958|Ga0164299_10968822 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 624
3300012961|Ga0164302_10438928 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 903
3300012971|Ga0126369_11048323 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 904
3300012984|Ga0164309_10749844 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 780
3300012989|Ga0164305_10791869 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 785
3300015371|Ga0132258_11441210 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1739
3300015373|Ga0132257_100000197 | All Organisms → cellular organisms → Bacteria | 41815
3300016387|Ga0182040_10101023 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1957
3300017927|Ga0187824_10003825 | All Organisms → cellular organisms → Bacteria | 4107
3300017930|Ga0187825_10004428 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4645
3300017936|Ga0187821_10024016 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2136
3300017936|Ga0187821_10078280 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1203
3300017993|Ga0187823_10012355 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2054
3300017994|Ga0187822_10007081 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2578
3300018032|Ga0187788_10006143 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3666
3300019877|Ga0193722_1033242 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1332
3300020580|Ga0210403_11123273 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 609
3300020581|Ga0210399_10049577 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3372
3300020582|Ga0210395_10048986 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3093
3300020583|Ga0210401_10041892 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4334
3300021178|Ga0210408_10486587 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 981
3300021406|Ga0210386_10028372 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4387
3300021420|Ga0210394_10059972 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3311
3300021432|Ga0210384_10004648 | All Organisms → cellular organisms → Bacteria | 15228
3300021432|Ga0210384_10475419 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1125
3300021479|Ga0210410_10011366 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 7664
3300024330|Ga0137417_1249177 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2248
3300025906|Ga0207699_10184536 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1404
3300025928|Ga0207700_10095521 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2358
3300025928|Ga0207700_10999655 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 749
3300025929|Ga0207664_10026515 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4382
3300026309|Ga0209055_1076685 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1386
3300026317|Ga0209154_1070609 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1519
3300026318|Ga0209471_1095328 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1294
3300026322|Ga0209687_1054765 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1293
3300026335|Ga0209804_1040069 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2335
3300026527|Ga0209059_1005141 | All Organisms → cellular organisms → Bacteria | 5896
3300026529|Ga0209806_1006076 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 6798
3300026552|Ga0209577_10008638 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 9341
3300027047|Ga0208730_1014296 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 873
3300027071|Ga0209214_1003737 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1601
3300027576|Ga0209003_1003233 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2074
3300027706|Ga0209581_1000004 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1127213
3300027842|Ga0209580_10044109 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2063
3300027889|Ga0209380_10018895 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3903
3300031545|Ga0318541_10006555 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4986
3300031546|Ga0318538_10220834 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1014
3300031564|Ga0318573_10098800 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1495
3300031640|Ga0318555_10261030 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 936
3300031680|Ga0318574_10129605 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1421
3300031681|Ga0318572_10043797 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2387
3300031718|Ga0307474_10014670 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 5663
3300031718|Ga0307474_10073571 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2539
3300031718|Ga0307474_10101090 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2155
3300031720|Ga0307469_10030092 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3135
3300031720|Ga0307469_10130063 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1841
3300031740|Ga0307468_101151129 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 695
3300031744|Ga0306918_10213925 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1458
3300031753|Ga0307477_10093026 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2094
3300031753|Ga0307477_10115304 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1871
3300031754|Ga0307475_10014123 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 5430
3300031754|Ga0307475_10066624 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2743
3300031754|Ga0307475_11246945 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 577
3300031792|Ga0318529_10037705 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2037
3300031796|Ga0318576_10031507 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2198
3300031797|Ga0318550_10286135 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 800
3300031820|Ga0307473_10627454 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 745
3300031823|Ga0307478_10001791 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 17955
3300031962|Ga0307479_10000548 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 33802
3300032044|Ga0318558_10051436 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1824
3300032063|Ga0318504_10028174 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2205
3300032180|Ga0307471_101246290 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 907
3300032205|Ga0307472_100012935 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4268
3300032205|Ga0307472_100400099 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1148
3300032205|Ga0307472_101033955 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 772
3300032829|Ga0335070_11382423 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 643
3300033289|Ga0310914_10049372 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3461
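
Each scaffold identifier in the table above has the form `<Taxon OID>|<scaffold name>`, and the part before the bar matches a Taxon OID in the Associated Samples table. Several scaffolds can share one sample, which is why 127 scaffolds come from only 111 samples. A minimal sketch of splitting identifiers and grouping scaffolds by sample, using three identifiers copied from the table; the helper name `split_scaffold_id` is illustrative and not part of NMPFamsDB or IMG/M:

```python
from collections import defaultdict


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<Taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, name = scaffold_id.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, name


# Three identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300005436|Ga0070713_100280430",
    "3300005436|Ga0070713_101353201",
    "3300005439|Ga0070711_100233521",
]

# Group scaffold names under the sample (Taxon OID) they came from.
by_sample: dict[str, list[str]] = defaultdict(list)
for sid in scaffold_ids:
    oid, name = split_scaffold_id(sid)
    by_sample[oid].append(name)

for oid, names in by_sample.items():
    print(oid, len(names), names)
```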




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 17.32%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 16.54%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 14.17%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.66%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 6.30%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 4.72%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.72%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 4.72%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.94%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.94%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 2.36%
Unplanted Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Unplanted Soil | 1.57%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.57%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 1.57%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.57%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.79%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.79%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.79%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2228664022 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000789 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001646 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF042 | Environmental
3300001867 | Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705) | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006172 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014 | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006174 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014 | Environmental
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012517 | Unplanted soil (control) microbial communities from North Carolina - M.Soil.6.yng.070610 | Environmental
3300012519 | Unplanted soil (control) microbial communities from North Carolina - M.Soil.7.yng.070610 | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300016387 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 | Environmental
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300017993 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3 | Environmental
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300018032 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MG | Environmental
3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1 | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027047Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF042 (SPAdes)EnvironmentalOpen in IMG/M
3300027071Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027576Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027706Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300031545Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26EnvironmentalOpen in IMG/M
3300031546Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f23EnvironmentalOpen in IMG/M
3300031564Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f21EnvironmentalOpen in IMG/M
3300031640Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23EnvironmentalOpen in IMG/M
3300031680Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22EnvironmentalOpen in IMG/M
3300031681Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031792Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f23EnvironmentalOpen in IMG/M
3300031796Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f24EnvironmentalOpen in IMG/M
3300031797Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f23EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032044Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f20EnvironmentalOpen in IMG/M
3300032063Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPgaii200_045793012228664022SoilPDHYVVSVSRDCKAGYDSTGKLTPQSDPGAPFHSEFTLSADTCKKVFDLAAKAKYFEGQIDSGNKKLASTGAKILSYTEGGRKNQATYNYSPNPAVQQITAIFQNISATLEFGRRLDFYHHHQKLALEEELKHMEEMAKSKSLDEIQAVGPILQAIVADHSVLNVTRARAQRLLNGAAVAAP
INPhiseqgaiiFebDRAFT_10167746543300000364SoilMRPSASSRKIVETMTNTVFRCLVVGALFGIPSVAFAQQSAASPATITFSLDFPESIPDHYVVSVSRDCKAGYDSTGKLTPQSDPGAPFHSEFTLSADTCKKVFDLAAKAKYFEGQIDSGNKKLASTGAKILSYTEGARKNQATYNYSPNPAVQQITAIFQNISATLEFGRRLDFYHHHQKLALEEELKHMEEMAKSKSLDEIQAVGPILQAIVADHSVLNVTRARAQRLLNGAAVAAPGGY*
JGI1027J11758_1268295723300000789SoilMRPSASSRKIVETMTNTVFRCLVVGALFGIPSVAFAQQSAASPATITFSLDFPESIPDHYVVSVSRDCKAGYDSTGKLTPQSDPGAPFHSEFTLSADTCKKVFDLAAKAKYFEGQIDSGNKKLASTGAKILSYTEGXRKNQATYNYSPNPAVQQITAIFQNISATLEFGRRLDFYHHHQKLALEEELKHMEEMAKSKSLDEIQAVGPILQAIVADHSVLNVTRARAQRLLNGAAVAAPGGY*
JGI1027J12803_10142866813300000955SoilMRPSASSRKIVETMTNTVFRCLVVGALFGIPSVAFAQQSAASPATITFSLDFPESIPDHYVVSVSRDCKAGYDSTGKLTPQSDPGAPFHSEFTLSADTCKKVFDLAAKAKYFEGQIDSGNKKLASTGAKILSYTEGGRKNQATYNYSPNPAVQQITAIFQNISATLEFGRRLDFYHHHQKLALEEELKHMEEMAKSKSLDEIQAVGPILQAIVADHSVLNVTRARAQRLLNGAAVAAPGGY*
JGI20276J16322_10089813300001646Forest SoilLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNR
JGI12627J18819_1000046053300001867Forest SoilMRGGFAQEAKAPAASVRFSLDFPQSIPDHYEIAVSSEGQASYDSTGKLTPESEPGDPFHLEFSISPASTHRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLSYKDATRNTRAEYNYTPIPAVQEITALFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAHEKDLEEIQAVAPILQRIVDDKSVLNVTRARAQRLLAGSSVRAAP*
Ga0066680_1013920323300005174SoilMNSVFVSATRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0066690_1001393743300005177SoilMNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0066688_1004833823300005178SoilMNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKILTYKDATRNNRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0066388_10064622513300005332Tropical Forest SoilRYLVFSVEVLSVLFFSLLVPPELGSAPAPVATVGFSLDFPQSIPDHYVVTVSSDGRASYDSTGKLTIDADPGDAFHMDFTVSPGTRERIFDLAAKAKYFEGKVDSGKRNIASTGEKVLSYKDEQRHTRAAYNYSPNAAVQELTTLFQNLSTTLEFGRRLDYYYHYQKLALDEELKRMEQMLKEKNLEEVQAIAPLLQKILADSSVINVTRGRAQRLLNGSGVTASR*
Ga0070709_1000390333300005434Corn, Switchgrass And Miscanthus RhizosphereMNVFSRLTLVALILAVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYDSTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAVVPILQRIAADHSVLNVTRARAQRLLSSANAAPVAGS*
Ga0070714_10012369613300005435Agricultural SoilMNSVFVPAIRSAALALLLISFCSIHVGFAQEGKSATGSVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTIAPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEEELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0070713_10028043023300005436Corn, Switchgrass And Miscanthus RhizosphereMNSVFVPAIRSAALALLLISFCSIHVGFAQEGKSATASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVD
Ga0070713_10135320113300005436Corn, Switchgrass And Miscanthus RhizosphereVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYDSTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAVVPILQRIAADHSVLNVTRARAQRLLSSANAAPVAGS*
Ga0070711_10023352113300005439Corn, Switchgrass And Miscanthus RhizosphereMNSVFVPAIRSAALALLLISFCSIHVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0070711_10045949423300005439Corn, Switchgrass And Miscanthus RhizospherePQSIPDHYVITVSSDRHATYDSTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0066687_1002680533300005454SoilMNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0073909_1000114163300005526Surface SoilMNVFSRLSLVALILAVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYESTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0070741_1000427363300005529Surface SoilMAAASEAPARISFSLDFPNSIPDHYVIAISSDGRASYDSTGKLTPDSEPGDPFHADFVLSSAACKRVFDLAAKAKYFEGKVDSGKSNLASTGAKVLTYTSGDRHNRAEYNYSPLPAVQQITAYFQNLSTTLEFGRRLDFYYRHQKLALEDELKRMEELARDKSLEEVQAVAPVLRQIAADKSVLNVSRARAQRLLLASGSELPH*
Ga0070732_1006755533300005542Surface SoilMTPTPVPRQASAIKAVLFALILSTSLAAFPQEAKTAPAVVGFSLDFPQSSPDHYEFSIASDGRASYDSTGKLTPQSDAGDPFHTDFTISAVNLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLDYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAPGAH*
Ga0070732_1025419013300005542Surface SoilMNSVFVPAIRSAALALLLISFCSIHVGFAQEGKSATASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPIPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGASW*
Ga0066704_1004740213300005557SoilMNSVFVSATRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0066699_1056627013300005561SoilMNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPALQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0070763_1073377813300005610SoilQASAIKTVLFALILSTSLAAFPQEAKTAPAIVGFSLDFPQSSPDHYEFSIASDGHASYDSTGKLTPESDAGDPFHTDFAISAANLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVTP
Ga0070766_1001084643300005921SoilMTPSPVPRQASAIKTVLFALILSTSLAAFPQEAKTAPAIVGFSLDFPQSSPDHYEFSIASDGHASYDSTGKLTPESDAGDPFHTDFAISAANLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVTPILEKIVADRSVLNVSRARAQRLLTAAGAAGAR*
Ga0075028_10018203413300006050WatershedsMRREAAVPTLLDIDRMAPQSLRTTMIAALWRLARAALLFSYALAGILMIAATAFAQESKPTATVSFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTPDSDPADPFHLDFTISAANRDKIFDLAAKAKYFEGKVDSGKKNLASTGAKVLTYKDAQRNTQAAYNYSPIPAVQELTALFQNISTTLEFGRQLDYYHHYQKLALDEQLKRMEQMANEKNLDEMQAVAPILQQILMDKSVINVTRARAQRLLASSGATASR*
Ga0075017_10058165623300006059WatershedsMRREAAVPTLLDIDRMAPQSLRTTMIAALWRLARGVLLFSYALAGLLMIAATAFAQESKPTATVSFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTPDSDPADPFHLDFTISAANRDKIFDLAAKAKYFEGKVDSGKRNLASTGAKVLTYKDAQRNTQAAYNYSPIPAVQELTALFQNISTTLEFGRQLDYYHHYQKLALDEQLKRMEQMANEKNLDEMQAVAPILQQILMDKSVINVTRARARVTLMTDL
Ga0070715_1002660413300006163Corn, Switchgrass And Miscanthus RhizosphereQSIPDHYVITVSSDRHATYDSTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAVVPILQRIAADHSVLNVTRARAQRLLSSANAAPVAGS*
Ga0070715_1007023423300006163Corn, Switchgrass And Miscanthus RhizosphereMTRISRPATWTTAIVLLLAILSSFRLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0075018_1004804423300006172WatershedsMRREAAVPTLLDIDRMAPQSLRTTMIAALWRLARGVLLFSYALAGLLMIAATAFAQESKPTATVSFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTPDSDPADPFHLDFTISAANRDKIFDLAAKAKYFEGKVDSGKRNLASTGAKVLTYKDAQRNTQAAYNYSPIPAVQELTALFQNISTTLEFGRQLDYYHHYQKLALDEQLKRMEQMANEKNLDEMQAVAPILQQILMDKSVINVTRARAQRLLASSGATASR*
Ga0070716_10052531723300006173Corn, Switchgrass And Miscanthus RhizosphereEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPVVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0075014_10000141163300006174WatershedsSKTAVATVTFSLDFPQSIPDHYVLTVSSDGHAGYDSTGKLTPESEPTDPFHLDFTVSPSTREKVFDLAAKAKYFEGKIDSGKRNIASTGAKVLSYRDSQRSAQSAYNYSPLPPVQELTALFQNISTTLEFGRRLDYYHHYQKLALEEELKRMEEMVREKSLDEVQAVAPILQQIVADQSVMNVTRARAQRLLNGTGAVASR*
Ga0075021_1001811853300006354WatershedsMRREAAVPTLLDIDRMAPQSLRTTMIAALWRLARAALLFSYALAGILMIAATAFAQESKPTATVSFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTPDSDPADPFHLDFTISAANRDKIFDLAAKAKYFEGKVDSGKKNLASTGAKVLTYKDAQRNTQAAYNYSPIPAVQELTALFQNISTTLEFGRELDYYHHYQKLALDEQLKRMEQMANEKNLDEMQAVAPILQQILMDKSVINVTRARAQRLLASSGATASR*
Ga0066659_1004148823300006797SoilMNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0066659_1061219013300006797SoilMFLFRSLRSCLFRALVLLLSGLFCLTWLASAQTSSQTDATVRFSLDFPQSIPDHYAVTVSSNGHATYDSTGKLTVDSDPGDPFHLDFTVSAGTRERIFNLAGKAKYFEGKVDSGKRNIASTGAKVLSYKDGARNTKATYNYSPVTAVQELTALFQNLSTTLEFGRRLDYYYRYQKLALEEELKRMEQMVKDKNLEEVQAVAPILQRIMADQSVINVTRARAQRLLNSSGVAANR*
Ga0066660_1007180243300006800SoilMNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0126374_1001211123300009792Tropical Forest SoilLLLIHHLFWIDVPPSFDLLALESLWSSMILRPGRYLAFSVAVLSVLFLTLLAPPELGSAPAAVATVGFSLDFPQSIPDHYVVTVSSDGRASYDSTGKLTIDADPGDAFHMDFTISPGTRERIFDLAARAKYFEGKVDSGKRNIASTGEKVLSYKDEQRNTRAAYNYSPNPAVQELTTLFQNLSTTLEFGRRLDYYYHYQKLALDEELKRMEQMLKEKNLEEVQAIAPVLQKILADSSVINVTRGRAQRLLNGSGVTASR*
Ga0126373_1002862933300010048Tropical Forest SoilMTRIPRPDTWTAALVLLLVILSSFRLACAQEAKAPTASVGFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS*
Ga0126370_1190905913300010358Tropical Forest SoilDFPQSIPDHYALKVSSDGHAEYNSTGKLTPDSEPPDPFHLEFTISPATLEKIFDLAAKAKYFEGKVDSGKRNIASTGNKVLAYHDAQRNTQAEYNYSPNPAVQELTSIFQNISTTLEFGRRLDYYHRYQKLALEAELKTMEEMVRSKSLEELQAVSPILEQIATDQSVINISRSRAQRLLALGGTPIASH*
Ga0126372_1000319533300010360Tropical Forest SoilLLLIHHLFWIDVPPSFDLLALESLWSSMILRPGRYLAFSVAVLSVLFLTLLAPPELGSAPAAVATVGFSLDFPQSIPDHYVVTVSSDGRASYDSTGKLTIDADPGDAFHMDFTISPGTRERIFDLAARAKYFEGKVDSGKRNIASTGEKVLSYKDEQRNTRAAYNYSPNPAVQELTTLFQNLSTTLEFGRRLDYYYHYQKLALDEELKRMEQMLKEKNLEEVQAIAPVLQKILADSSVINVTRGRAQRLLNGSGVTATR*
Ga0126378_1050689523300010361Tropical Forest SoilMIRTSRRSVTRKQVCSLIAPLFLAVMSCAQEAKPATVTFSLDFPQSIPDHYALSVSSDGHATYDSTGKLTPDSEPPDPFHLDFTMSAANRARVFDLAARAKYFDGKVDSGKTNLANTGAKVLTYQDGQRNTKAAYNYSPVPAVQELTTLFQNISTTLEFSRRLDYYHHYQKLALEEELKQMEQMVKEKNLDELLAVAPILKTILADQSVMNVTRARAQRLLYLMGSHP*
Ga0137382_1004044133300012200Vadose Zone SoilLTWLAWAQSASPSVATINFSLDFPQSIPDHYAVTVSSDGRATYDSTGKLTVDSDPGDPFHLDFTISPGTRERIFNLAGKAKYFEGKVDSGKRNIASTGAKVLSYKDGARNTKATYNYSPVPAVQELTALFQNLSTTLECGRRLDYYYRYQKLALEEELKRMEQMVKEKNLEEVQAVAPILQRIMVDQSVINVTRARAQRLLNGSGVAASR*
Ga0137363_10000190213300012202Vadose Zone SoilMISTPLGWSVASTRVLLCLLVAIAVSPLKASAQDSNPDVATVTFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTAESEPTDPFHIDFTLSPAAREKAFDLAAKAKYFQGKVDSGKRNIASTGAKVLSYRDRQRSTQASYNYSLLPAVQELTALFQNISTTLEFGRRLDYYHHYQKLALEEELKRMEEMVRERSLDELQAVAPILQRIVADQTVINVTRARAQRLLAAGGATASR*
Ga0137381_1025094813300012207Vadose Zone SoilMNSVRLAILALFLSLSAALLAQQPANPAPTITFSLDFPQSIPDHYVLTVSSDRHASYESTGKLTPQADPGDPFRHEFTMSAETCKRIFDLAAKAKYFEGQVDSGNKKLASTGVKILTYADGARKTEATYNYSPNPAVQQITTIFQSISSTLEFGHRLEFYHHHQKLALEEELKRMEEMAHAKSLEEVQALAPILQSIIADHSVLNVTRARAQRLLNGANVAAANGA*
Ga0137376_1019072513300012208Vadose Zone SoilLTWLAWAQSASPSVATISFSLDFPQSIPDHYAVTVSSDGRATYDSTGKLTVDSDPGDPFHLDFTISPGTRERIFNLAGKAKYFEGKVDSGKRNIASTGAKVLSYKDGARNTKATYNYSPVPAVQELTALFQNLSTTLECGRRLDYYYRYQKLALEEELKRMEQMVKEKNLEEVQAVAPILQRIMVDQSVINVTRARAQRLLNGSGVAASR*
Ga0157354_100500723300012517Unplanted SoilMNVFSRLSLVALILAVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYESTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEASYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0157352_100641713300012519Unplanted SoilMNVFSRLSLVALILAVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYESTGKLTPQSDPGDPFRDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEASYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0137358_1007203543300012582Vadose Zone SoilMISTPLGWSVASTRVLLCLLVAIAVSPLKASAQDSNPDVATVTFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTAESEPTDPFHIDFTLSPAAREKAFDLAAKAKYFQGKVDSGKRNIASTGAKVLSYRDRQRSTQASYNYSLLPAVQELTALFQNISTTLEFGRRLDYYHHYQKLALEEELKRMEEMVRERSLDELQAVAPILQRIVADQTVIN
Ga0137416_1000070493300012927Vadose Zone SoilMISTPLGWSVASTRVLLCLLVAIAVSPLKASAQDSNPDVATVTFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTAESEPTDPFHIDFTLSPAAREKAFDLAAKAKYFQGKVDSGKRNIASTGAKVLSYRDRQRSTQASYNYSLLPAVQELTALFQNISTTLEFGRRLDYYHHYQKLALEEELKRMEEMVRERSLDELQAVAPILQRIVADQTVINVTRARAQRLLAAGAATASR*
Ga0137410_1028893413300012944Vadose Zone SoilMNVFSRLTLIAQILVVTASLAQQSPAPATISFSLDFPQSIPDHYVITVSSDRHSTYESTGKLTPQSDPGDPFLAEFTVSPETCKKIFDLATKAKYFQAQIDSGNKKLASTGVKVLKYSEGSRKTEATYNYSPIPAVQQITAVFQNISTTLEYGRRLDFYYHHQKLALEDELKHMEEAAKSNNLGEIQAVAPILQSIVADHSVLNVTRARAQRLLAVANVTPVAGS*
Ga0164299_1096882213300012958SoilSSDRHATYESTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0164302_1043892813300012961SoilSRLSLVALILAVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYESTGKITPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0126369_1104832313300012971Tropical Forest SoilIDRSVMTSIPLSAMRRATLVLAFILCEMAPCFAQEAKPPAASVSFSLDFPQSIPDHYEIAVSSDRQASYDSTGKLTPESEPGDPFHLEFTISPSSTHRIFDLAAKAKYFDGKVDSGKRNLASTGAKVLSYKDATRNTRAEYNYSPIPAVQEITVLFQNMSSTLEFGRRLDYYHHHQKLALDEELKRMEEMARDKNLEEIPAVAPILQRIVDDKSVLNVTRARAQRLLVRSATGATQ*
Ga0164309_1074984413300012984SoilMNVLSRLSLVALILADTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYESTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0164305_1079186913300012989SoilPQSIPDHYVITVSSDRHATYESTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEASYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0132258_1144121023300015371Arabidopsis RhizosphereMNVFSRLSLVALILAVTASLAQQNPPATISFSLDFPQSIPDHYVITVSSDRHATYDSTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRQTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILQRIAADHTVLNVTRARAQRLLSTANAARIAGS*
Ga0132257_100000197203300015373Arabidopsis RhizosphereMNVFSRLSLVALILAVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYESTGKLTPQSDPGDPFRDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAIVPILRRIAADHSVLNVTRARAQRLLSTANAAPIAGS*
Ga0182040_1010102323300016387SoilRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNQDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0187824_1000382523300017927Freshwater SedimentMNLKLRPVVLAILAAVLLSVLLGGLESAASDPPSSISFSLDFPQSIPDHYQITVASDGRASYDSTGKLTPDSEPGDPYHLEFTLSPASCQRLFDLAAKAKYFEGKVDSGRRNLASTGAKVLTYSDGDRHSRAEYNYSPLPAVQEITSIFQGMSTTLEFGRRLDFFHHHQKLALEAELKRMEEMAKDKSLQEIQAIAPLLQQIATDQTVLNVSRARAQRLLLASGTAN
Ga0187825_1000442863300017930Freshwater SedimentMNLKLRPVVLAILAAVLLSVLLGGLESAASDPPSSISFSLDFPQSIPDHYQITVASDGRASYDSTGKLTPDSEPGDPYHLEFTLSPASCHRLFDLAAKAKYFEGKVDSGRRNLASTGAKVLTYSDGDRHSRAEYNYSPLPAVQEITSIFQGMSTTLEFGRRLDFFHHHQKLALEAELKRMEEMAKDKSLQEIQAIAPLLQQIATDQTVLNVSRARAQRLLLASGTAN
Ga0187821_1002401623300017936Freshwater SedimentMPSVPVRWYGLTFCLSILLCCALAAQPAPGAAGLPASVSFSLDFPNSIPDHYEISVSSDGHASYDSTGKLTPDSEPGDPFHTDFMISAANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAQYNYSPVPAVQEITSIFQGVSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEIQAVAPILQRIAADQSVLNVSRARAQRLLLAGGAGASH
Ga0187821_1007828023300017936Freshwater SedimentMNPLLRPVVLAICTATLFCLLLGSLESAASDPPSSISFSLDFPQSIPDHYQITVASDGHASYDSTGKLTPDSEPGDPYHLDFTLSPASCQHLFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYNNGERHSRAEYNYSPVPAVQEITSIFQGMSTTLEFGRRLDFYHHHQKLALEAELKRMEEMAKEKSLQEIQAIAPLLQQIASDQTVLNVSRARAQRLLLASGPSMPR
Ga0187823_1001235523300017993Freshwater SedimentMNLKLRPVVLAILAAVLLSVLLGGLESAAASDPPSSISFSLDFPQSIPDHYQITVASDGRASYDSTGKLTPDSEPGDPYHLEFTLSPASCQRLFDLAAKAKYFEGKVDSGRRNLASTGAKVLTYSDGDRHSRAEYNYSPLPAVQEITSIFQGMSTTLEFGRRLDFFHHHQKLALEAELKRMEEMAKDKSLQEIQAIAPLLQQIATDQTVLNVSRARAQRLLLASGTAN
Ga0187822_1000708123300017994Freshwater SedimentMNLKLRPVVLAILAAVLLSVLLGGLESAASDPPSSISFSLDFPQSIPDHYQITVASDGRASYDSTGKLTPDSEPGDPYHLEFTLSPASCQRLFDLAAKAKYFEGKVDSGRRNLASTGAKVLTYSDGDRHSRAEYNYSPLPAVQEITSIFQGMSTTLEFGRRLDFFHHHQKLALEAELKRMEEMAKDKSLQEIQAIAPLLQQIATDQTVLNVSRARAQRLLLASGTSN
Ga0187788_1000614333300018032Tropical PeatlandMSSLPVRWSTLAVCFVAFFCNASMLARPVPASEGPASISFSLDFPQSIPDHYVVSVSSDGHATYDSTGKLTPESEVGDPYHADFTLSSVNCKRVFDLAAKAKYFQGKIDSGKSNLASTGAKVLTYNNGGEHHRAEYNYSPVPAVQEITSFFQNLSATLEFGRRLDFYYHHQKLALEAELKRMEEMLNEKSLDEIQAVAPILQQIAADHSVLNVSRARAQRLLQGSGIGTTR
Ga0193722_103324223300019877SoilMTLAFRSVIPVLITLLCFPAWALAQDAQPAVATVTFSLDFPQSIPDHYVFSVSSDGHASYDSTGKLTPNSDPGDPFHHEFTVSAANNKKIFDLAAKAKFFEGQIDSGKRNLASTGKKVLTYTNGARNTQATYNYSPQPAVQELTAIFQNMSSTLEFGRRLEYFHRHQKLALEEELKRMEQMAKDKSLEEVQAVAPILEGIVADHSVLNVSRARAQRLLSASGVGAGH
Ga0210403_1112327313300020580SoilFPQSIPDHYEISVASDGEASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0210399_1004957733300020581SoilMTRISRPATWTAAIVLLLVILSSFRLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGEASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0210395_1004898623300020582SoilMTRISPPATWTAAIVLLLVILSSFPLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGASW
Ga0210401_1004189243300020583SoilMTPTPVPRQASAIKTVLFALILSTSLAAFPQEAKTAPAIVGFSLDFPQSSPDHYEFSITSDGHASYDSTGKLTPESDAGDPFHTDFAISAANLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAAGAR
Ga0210408_1048658713300021178SoilEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPVVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0210386_1002837223300021406SoilMDSVFLRAARTAAIVLVVISGCAMRGGFAQEAKTPGSVRFSLDFPQSIPDHYAIAVSSEGRASYDSTGKLTPESEPGDPFHLDFSISPASTHRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLSYKDATRNTRAEYNYSPIPAVQEITALFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAREKDLEEIQAVAPILQRIVDDKSVLNVTRARAQRLLAGSSVRAAP
Ga0210394_1005997233300021420SoilKTGLFALILSTSLAAFPQEAKTAPAIVGFSLDFPQSSPDHYEFSITSDGHASYDSTGKLTPESDAGDPFHTDFAISAANLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVTPILEKIVADRSVLNVSRARAQRLLTAAGAAGAR
Ga0210384_1000464823300021432SoilMTPTPVPRQASAIKTVLFALILSTSLAAFPQEAKTAPAIVGFSLDFPQSSPDHYEFSITSDGHASYDSTGKLTPESDAGDPFHTDFAISAANLRRVFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAAGAR
Ga0210384_1047541923300021432SoilMDSVFLRAARTAAIVLVVISGCAMRGGFAQEAKTPAGSVRFSLDFPQSIPDHYEIAVSSEGQASYDSTGKLTPESEPGDPFHLDFSISPASTHRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLSYKDATRNTRAEYNYSPIPAVQEITALFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAREKDLEEIQAVAPILQRIVDDKSVLNVTRARAQRLLAGSSVRVAP
Ga0210410_1001136653300021479SoilMTRISPPATWTAAIVLLLVILSSFPLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0137417_12491773 3300024330 Vadose Zone Soil MISTPLGWSVASTRVLLCLLVAIAVSPLKASAQDSNPDVATVTFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTAESEPTDPFHIDFTLSPAAREKAFDLAAKAKYFQGKVDSGKRNIASTGAKVLSYRDRQRSTQASYNYSLLPAVQELTALFQNISTTLEFGRRLDYYHHYQKLALEEELKRMEEMVRERSLDELQAVAPILQRIVADQTVINVTRARAQRLLAAGAATASR
Ga0207699_101845362 3300025906 Corn, Switchgrass And Miscanthus Rhizosphere MTRISRPAASTAAIVLLLAILSSFRLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTIAPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEEELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0207700_100955214 3300025928 Corn, Switchgrass And Miscanthus Rhizosphere MNSVFVPAIRSAALALLLISFCSIHVGFAQEGKSATASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0207700_109996551 3300025928 Corn, Switchgrass And Miscanthus Rhizosphere VTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYDSTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAVVPILQRIAADHSVLNVTRARAQRLLSSANAAPVAGS
Ga0207664_100265151 3300025929 Agricultural Soil MNSVFVPAIRSAALALLLISFCSIHVGFAQEGKSATGSVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTIAPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEEELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209055_10766852 3300026309 Soil MNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEI
Ga0209154_10706092 3300026317 Soil MNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209471_10953282 3300026318 Soil MNSVFVSATRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209687_10547651 3300026322 Soil SVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209804_10400692 3300026335 Soil MNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209059_10051414 3300026527 Soil MNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKILTYKDATRNNRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209806_10060764 3300026529 Soil MNSVFVSAIRSAALALLLISFFSIRVGFAQEGKSATASVSFFLDFPQSIPDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209577_1000863810 3300026552 Soil PDHYEISVASDGQASYDSTGKLIPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0208730_10142961 3300027047 Forest Soil DFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209214_10037372 3300027071 Forest Soil MTRISRPATWTAAIVLLLAILSSFRLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAARAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209003_10032332 3300027576 Forest Soil MTRISRPATWTTAIVLLLAILSSFRLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPVVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0209581_1000004331 3300027706 Surface Soil MAAASEAPARISFSLDFPNSIPDHYVIAISSDGRASYDSTGKLTPDSEPGDPFHADFVLSSAACKRVFDLAAKAKYFEGKVDSGKSNLASTGAKVLTYTSGDRHNRAEYNYSPLPAVQQITAYFQNLSTTLEFGRRLDFYYRHQKLALEDELKRMEELARDKSLEEVQAVAPVLRQIAADKSVLNVSRARAQRLLLASGSELPH
Ga0209580_100441092 3300027842 Surface Soil MTPTPVPRQASAIKAVLFALILSTSLAAFPQEAKTAPAVVGFSLDFPQSSPDHYEFSIASDGRASYDSTGKLTPQSDAGDPFHTDFTISAVNLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLDYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAPGAH
Ga0209380_100188951 3300027889 Soil LMTPTPVPRQASAIKTVLFALILSTSLAAFPQEAKTAPAIVGFSLDFPQSSPDHYEFSIASDGHASYDSTGKLTPESDAGDPFHTDFAISAANLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVTPILEKIVADRSVLNVSRARAQRLLTAAGAAGAR
Ga0318541_100065552 3300031545 Soil MPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0318538_102208342 3300031546 Soil LTIVRFRMPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0318573_100988001 3300031564 Soil LEAPSTKRRCGLTIVRFRMPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0318555_102610301 3300031640 Soil FPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0318574_101296051 3300031680 Soil MPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQS
Ga0318572_100437972 3300031681 Soil FSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0307474_100146703 3300031718 Hardwood Forest Soil MTPTPVPRQASAIKAVLFALILSTSLAAFPQEPKTAPAVVGFSLDFPQSSPDHYEFSITSDGRASYDSTGKLTPQSDAGDPFHTDFTISAVNLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTSGDRRTYAQYNYSLIPAVQDLTAIFQNMSTTLEFGRRLDYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAPGAR
Ga0307474_100735712 3300031718 Hardwood Forest Soil MTRISPPATWTAAIVLLLAILSSFRLACAQEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGVS
Ga0307474_101010902 3300031718 Hardwood Forest Soil MDSVFLRAARTAAIVLVVISGCAMRGGFAQEAKTPGSVRFSLDFPQSIPDHYAIAVSSEGQASYDSTGKLTPQSEPGDPFHLDFSISPASTHRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLSYKDATRNTRAEYNYSPIPAVQEITALFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAREKDLEEIQAVAPILQRIVDDKSVLNVTRARAQRLLAGSSVRAAP
Ga0307469_100300922 3300031720 Hardwood Forest Soil MRPSASSRKIVETMTNTVFRCLVVGALFGIPSVAFAQQSAASPATITFSLDFPESIPDHYVVSVSRDCKAGYDSTGKLTPQSDPGAPFHSEFTLSADTCKKVFDLAAKAKYFEGQIDSGNKKLASTGAKILSYTEGARKNQATYNYSPNPAVQQITAIFQNISATLEFGRRLDFYHHHQKLALEEELKHMEEMAKSKSLDEIQAVGPILQAIVADHSVLNVTRARAQRLLNGAAVAAPGG
Ga0307469_101300632 3300031720 Hardwood Forest Soil RHAAYDSTGKLTPQSDPGDPFHDDFTISADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKILKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAVVPILQRIAADHSVLNVTRARAQRLLSSANAAPVAGS
Ga0307468_1011511291 3300031740 Hardwood Forest Soil MRPSASSRKIVETMTNTVFRCLVVGALFGIPSVAFAQQSAASPATITFSLDFPESIPDHYVVSVSRDCKAGYDSTGKLTPQSDPGDPFHSEFTLSADTCKKVFELAGKAKYFEGQIDSGNKKLASTGVKILCYTEGARKNQATFNYSPNPAVQQITAIFQNISATLEFGRRLDFYHHHQKLALEEELKHMEEMAKSKSLDEIQAVGPILQAIV
Ga0306918_102139252 3300031744 Soil PAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0307477_100930261 3300031753 Hardwood Forest Soil TASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0307477_101153042 3300031753 Hardwood Forest Soil AIKTVLIALILSTSLAAFPQEAKTAPAVVGFSLDFPQSSPDHYEFSITSDGRASYDSTGKLTPQSDAGDPFHTDFTISAVNLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTSGDRRTYAQYNYSLIPAVQDLTAIFQNMSTTLEFGRRLDYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAPGAR
Ga0307475_100141233 3300031754 Hardwood Forest Soil MKFVLVRRPPAQLVFSFVLVSLLALPTFAQEAKPDVAAIGFSLDFPQSIPDRYQISVSSAGNASYDSTGKLTPDAEPGDPFHFDFTISAANLKRIFDLAAKAKFFEGQVDSGKHNLASTGAKVLTYTNGSRHTQAAYNYSPIPAVQEITALFQNMSTTLEFGRRLDYYHHHQKLALDDETKRMEQMANEKSLEEIQAVAPILQRIVSDQSVLNITRARAQRLLNGRAIGAAH
Ga0307475_100666242 3300031754 Hardwood Forest Soil MTPTPVPSQASAIKTVLIALILSTSLAAFPQEAKTAPAVVGFSLDFPQSSPDHYEFSITSDGHASYDSTGKLTPQSDAGDPFHTDFTISAANLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYTNGDRKTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAPGAR
Ga0307475_112469451 3300031754 Hardwood Forest Soil LDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATRNSRAEYNYSPVPAVQEITAIFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNRPQAGAS
Ga0318529_100377051 3300031792 Soil MPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQ
Ga0318576_100315071 3300031796 Soil RMPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0318550_102861351 3300031797 Soil SSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLASGSGAAR
Ga0307473_106274541 3300031820 Hardwood Forest Soil MTRISRPATWTTAIVLLLAILSSFRLACAEEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLAYKDATRNSRAEYNYSPVPAVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNLPQAGAS
Ga0307478_1000179114 3300031823 Hardwood Forest Soil MTPTSAPRQASAIKALLFALILSTSLAAFPQEAKTAPAVVGFSLDFPQSSPDHYEFSISSDGRASYDSTGKLTPQSDAGDPFHTDFAISAANLRRIFDLAAKAKYFDGPIDSGKRNLASTGAKVLSYSNGDRNTHAEYNYSLIPAVQDLTAIFQNMSTTLEFGRRLEYYQRHQKLALEDELKRMEEMANEKSLQEIQAVAPILEKIVADRSVLNVSRARAQRLLTAAGAPGAR
Ga0307479_100005484 3300031962 Hardwood Forest Soil MAVPLSSVAIVDCGRAIRRAFCLSYALTGWLLICATTVAQESKPAATITFSLDFPQSIPDHYVLTVSSDGRAGYDSTGKLTPDSEPTDPFHLDFTISPANREKIFDLAVKAKYFQGKVDSGKKNLASTGAKVLAYKDGERSTRAEYNYSPVPAVQELTALFQNISTTLEIGRQLEYYHRYQKLALDEHLKQMEQMLSEKNLDEVQAVAPILQRILADQSVMNVSRARAQRLLLRSKAMVS
Ga0318558_100514364 3300032044 Soil MPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLL
Ga0318504_100281741 3300032063 Soil MPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIAADQSVLNVSRARAQRLLLAGGT
Ga0307471_1012462901 3300032180 Hardwood Forest Soil ATYDSTGKLTPQSDPGDPFHDDFTVSADTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAVVPILQRIAADHSVLNVTRARAQRLLSSANAAPVAGS
Ga0307472_1000129351 3300032205 Hardwood Forest Soil MTRISRPATWTTAIVLLLAILSSFRLACAEEAKAPTASVSFSLDFPQSIPDHYEISVASDGQASYDSTGKLTPDSEPGDPFHLEFTISPASTKRIFDLAAKAKYFDGKIDSGKRNLASTGAKVLTYKDATHNSRAEYNYSPVPVVQEITAVFQNMSSTLEFGRRLDYYHHHQKLALEDELKRMEEMAKEKNLEEIQAVAPILQRIVDDKSVLNITRARAQRLLNLPQAGA
Ga0307472_1004000991 3300032205 Hardwood Forest Soil FPESIPDHYVVSVSRDCKAGYDSTGKLTPQSDPGDPFHSEFTLSADTCKKVFDLAAKAKYFEGQIDSGNKKLASTGAKILSYTEGARKNQATYNYSPNPAVQQITAIFQNISATLEFGRRLDFYHHHQKLALEEELKHMEEMAKSKSLDEIQAVGPILQAIVADHSVLNVTRARAQRLLNGAAVAAPGGY
Ga0307472_1010339551 3300032205 Hardwood Forest Soil LILAVTASLAQQSPPATISFSLDFPQSIPDHYVITVSSDRHATYDSTGKLTPESDPGDSFHDEFTVSPDTCKKIFDLAAKVKYFEGQIDSGNKKLASTGVKVLKYEEGTRKTEATYNYSPIPAVQQITAVFQNISTTLEFGRRLDFFYRHQKLALEDELKHMEESAKSNNLGELQAVVPILQRIAADHSVLNVTRARAQRLLSTANAMTVAGS
Ga0335070_113824231 3300032829 Soil LLVFSCSLVFCFAIFFCVASSVAQPARVSEPPASISFSLDFPQSIPDHYMISISSDGHATYDSTGKLTPESEAGDPYHSDFTLSSANCKRVFELAAKARYFQGKVDSGRSNIASTGAKVLSYTSGDQHHRAEYNYSPVPAVQEITSFFQNLSATLEFGRRLDFYYHHQKLALEAELKRMEEMANEKSLDELQAVAPLLQQIAADHSVLNVSRAR
Ga0310914_100493721 3300033289 Soil MPSFPVRWFALAFSVAVLLSSALAQPAHAAEPPASVSFSLDFPNSIPDHYMISVASDGHASYDSTGKLTPDAEPGDPYHTDFMLSSANCKRIFDLAAKAKYFEGKVDSGRSNLASTGVKVLTYTGGDRHTRAEYNYSPVPAVQEITTIFQSMSTTLEFGRRLDFYYHHQKLALEAELKRMEEMAKEKNLDEVQAVAPILQRIA


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice