NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F075273

Overview

Basic Information
Family ID F075273
Family Type Metagenome / Metatranscriptome
Number of Sequences 119
Average Sequence Length 203 residues
Representative Sequence MKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVTAK
Number of Associated Samples 97
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 49.58 %
% of genes near scaffold ends (potentially truncated) 61.34 %
% of genes from short scaffolds (< 2000 bps) 91.60 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.64
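
The percentages in the quality assessment can be turned back into approximate gene counts. The sketch below assumes that each percentage is taken over all 119 family sequences and that counts round to the nearest integer; the page does not state the denominator, so both points are assumptions. The function name `members_from_percent` is hypothetical.

```python
# Figures are copied from the page; the denominator (119 sequences) and the
# rounding rule are assumptions made for illustration only.
FAMILY_SIZE = 119


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of family members for a percentage."""
    return round(percent * total / 100)


reported = {
    "valid RBS motif": 49.58,
    "near scaffold end": 61.34,
    "short scaffold (< 2000 bp)": 91.60,
}

# Map each quality label to an estimated number of genes.
counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under these assumptions the three figures correspond to roughly 59, 73 and 109 genes.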


Most Common Taxonomy
Group Unclassified (85.714 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(32.773 % of family members)
Environment Ontology (ENVO) Unclassified
(34.454 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(63.025 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 10.61%, β-sheet: 26.53%, Coil/Unstructured: 62.86%

Predicted 3D Structure

pTM-score: 0.64

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 115 Family Scaffolds
PF01259    SAICAR_synt    2.61
PF00211    Guanylate_cyc    2.61
PF04295    GD_AH_C    1.74
PF00535    Glycos_transf_2    0.87
PF00756    Esterase    0.87
PF10590    PNP_phzG_C    0.87
PF03544    TonB_C    0.87
PF00873    ACR_tran    0.87
PF00583    Acetyltransf_1    0.87
PF02530    Porin_2    0.87
PF13358    DDE_3    0.87
PF00155    Aminotran_1_2    0.87
PF06339    Ectoine_synth    0.87

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 115 Family Scaffolds
COG0152    Phosphoribosylaminoimidazole-succinocarboxamide synthase    Nucleotide transport and metabolism [F]    2.61
COG2114    Adenylate cyclase, class 3    Signal transduction mechanisms [T]    2.61
COG2721    Altronate dehydratase    Carbohydrate transport and metabolism [G]    1.74
COG0810    Periplasmic protein TonB, links inner and outer membranes    Cell wall/membrane/envelope biogenesis [M]    0.87
COG3637    Opacity protein LomR and related surface antigens    Cell wall/membrane/envelope biogenesis [M]    0.87



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
Unclassified    root    N/A    85.71 %
All Organisms    root    All Organisms    14.29 %


Associated Scaffolds


Scaffold    Taxonomy    Length
3300000567|JGI12270J11330_10135740    Not Available    961
3300001356|JGI12269J14319_10083646    All Organisms → cellular organisms → Bacteria    1670
3300004092|Ga0062389_102426173    Not Available    694
3300004100|Ga0058904_1369590    Not Available    635
3300004101|Ga0058896_1004815    Not Available    814
3300004104|Ga0058891_1517108    Not Available    742
3300004119|Ga0058887_1466738    Not Available    569
3300004120|Ga0058901_1555807    Not Available    861
3300004139|Ga0058897_10951824    Not Available    508
3300004139|Ga0058897_10992487    Not Available    762
3300005332|Ga0066388_107106633    Not Available    563
3300005445|Ga0070708_100255440    Not Available    1647
3300005468|Ga0070707_100899302    Not Available    850
3300005471|Ga0070698_101782320    Not Available    568
3300005542|Ga0070732_10918653    Not Available    535
3300005556|Ga0066707_10089200    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1875
3300005598|Ga0066706_10711280    Not Available    798
3300006050|Ga0075028_100663314    Not Available    625
3300006162|Ga0075030_100396910    Not Available    1099
3300006172|Ga0075018_10582284    Not Available    593
3300006796|Ga0066665_10255955    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1388
3300006861|Ga0063777_1081210    Not Available    582
3300006903|Ga0075426_10220510    Not Available    1377
3300006914|Ga0075436_100080752    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    2253
3300006914|Ga0075436_101414123    Not Available    527
3300006954|Ga0079219_10935134    Not Available    707
3300009137|Ga0066709_101608117    Not Available    930
3300009523|Ga0116221_1202312    Not Available    861
3300009839|Ga0116223_10563296    Not Available    660
3300010048|Ga0126373_11089368    Not Available    865
3300010048|Ga0126373_11178092    Not Available    832
3300010358|Ga0126370_10313928    Not Available    1249
3300010360|Ga0126372_11025712    Not Available    838
3300010360|Ga0126372_11993966    Not Available    627
3300010361|Ga0126378_10006037    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales    9465
3300010366|Ga0126379_10267449    Not Available    1695
3300010379|Ga0136449_100003216    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales    49036
3300010379|Ga0136449_100029708    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0088    13305
3300011066|Ga0138524_1053451    Not Available    582
3300011120|Ga0150983_12004368    Not Available    1554
3300011120|Ga0150983_14200995    All Organisms → cellular organisms → Bacteria → Proteobacteria    2610
3300011271|Ga0137393_10980590    Not Available    720
3300012096|Ga0137389_10653373    Not Available    904
3300012202|Ga0137363_10181357    Not Available    1677
3300012202|Ga0137363_10552599    Not Available    970
3300012207|Ga0137381_11115996    Not Available    678
3300012357|Ga0137384_10288467    All Organisms → cellular organisms → Bacteria    1366
3300012361|Ga0137360_11144151    Not Available    673
3300012923|Ga0137359_11384844    Not Available    591
3300012929|Ga0137404_12042912    Not Available    535
3300017934|Ga0187803_10155935    Not Available    898
3300017944|Ga0187786_10066266    Not Available    1132
3300017955|Ga0187817_10369881    All Organisms → cellular organisms → Bacteria    914
3300018012|Ga0187810_10326175    Not Available    638
3300018060|Ga0187765_10023177    All Organisms → cellular organisms → Bacteria → Proteobacteria    2981
3300020579|Ga0210407_11058871    Not Available    616
3300020579|Ga0210407_11156705    Not Available    584
3300020580|Ga0210403_10440138    Not Available    1062
3300020581|Ga0210399_10060831    All Organisms → cellular organisms → Bacteria → Proteobacteria    3047
3300020583|Ga0210401_10381923    Not Available    1271
3300021151|Ga0179584_1097905    Not Available    605
3300021168|Ga0210406_10488359    Not Available    975
3300021170|Ga0210400_10168690    Not Available    1769
3300021178|Ga0210408_10558898    Not Available    908
3300021432|Ga0210384_10093855    Not Available    2694
3300021432|Ga0210384_10280941    All Organisms → cellular organisms → Bacteria → Proteobacteria    1499
3300021432|Ga0210384_10283885    Not Available    1490
3300021432|Ga0210384_10547180    Not Available    1041
3300021478|Ga0210402_10957518    Not Available    783
3300021479|Ga0210410_10325737    Not Available    1378
3300021479|Ga0210410_11296206    Not Available    621
3300021559|Ga0210409_10070306    All Organisms → cellular organisms → Bacteria → Proteobacteria    3258
3300021559|Ga0210409_11258793    Not Available    616
3300022506|Ga0242648_1025868    Not Available    785
3300022507|Ga0222729_1005106    Not Available    1212
3300022507|Ga0222729_1005106    Not Available    1212
3300022508|Ga0222728_1006311    Not Available    1425
3300022509|Ga0242649_1023140    Not Available    756
3300022523|Ga0242663_1096846    Not Available    583
3300022531|Ga0242660_1006277    Not Available    1868
3300022532|Ga0242655_10013165    Not Available    1637
3300022533|Ga0242662_10037807    Not Available    1196
3300022533|Ga0242662_10037807    Not Available    1196
3300022712|Ga0242653_1004915    Not Available    1458
3300022712|Ga0242653_1004915    Not Available    1458
3300022717|Ga0242661_1007448    Not Available    1480
3300022718|Ga0242675_1023438    Not Available    887
3300022722|Ga0242657_1072778    Not Available    801
3300022722|Ga0242657_1234670    Not Available    520
3300022724|Ga0242665_10089345    Not Available    896
3300022726|Ga0242654_10017534    Not Available    1703
3300025906|Ga0207699_11249095    Not Available    550
3300025916|Ga0207663_10417156    Not Available    1030
3300027641|Ga0208827_1105636    Not Available    831
3300027854|Ga0209517_10194384    Not Available    1257
3300027857|Ga0209166_10374964    Not Available    741
3300027905|Ga0209415_10049176    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    5467
3300027911|Ga0209698_10323356    Not Available    1217
3300028047|Ga0209526_10337627    Not Available    1012
3300029636|Ga0222749_10652131    Not Available    576
3300030730|Ga0307482_1119855    Not Available    740
3300030743|Ga0265461_13779296    Not Available    514
3300030937|Ga0138302_1719883    Not Available    506
3300031057|Ga0170834_100112762    Not Available    1316
3300031057|Ga0170834_100180828    Not Available    752
3300031122|Ga0170822_13280447    Not Available    799
3300031128|Ga0170823_14496502    Not Available    788
3300031128|Ga0170823_17009266    Not Available    1012
3300031231|Ga0170824_117838821    Not Available    756
3300031231|Ga0170824_120518519    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1787
3300031231|Ga0170824_120518519    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1787
3300031446|Ga0170820_13301915    Not Available    547
3300031474|Ga0170818_104331894    Not Available    501
3300031720|Ga0307469_12015198    Not Available    560
3300031754|Ga0307475_10537612    Not Available    938
3300031754|Ga0307475_10746141    Not Available    779
3300031823|Ga0307478_10285300    Not Available    1349
3300032009|Ga0318563_10761507    Not Available    519
3300032180|Ga0307471_102280511    Not Available    683
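
Each scaffold identifier in the table above appears to pair a ten-digit taxon OID (the same IDs listed under Associated Samples) with a scaffold name, joined by "|". The sketch below splits an identifier on that separator and flags short scaffolds using the 2000 bp cutoff quoted in the quality assessment. The `ScaffoldRow` type and `parse_scaffold` helper are hypothetical names, and the "taxon OID | scaffold" reading is inferred from the page rather than documented by it.

```python
from typing import NamedTuple

SHORT_SCAFFOLD_BP = 2000  # cutoff quoted in the quality assessment


class ScaffoldRow(NamedTuple):
    taxon_oid: str   # assumed: part before "|" (sample / taxon OID)
    scaffold: str    # assumed: part after "|" (scaffold name)
    length_bp: int


def parse_scaffold(identifier: str, length_bp: int) -> ScaffoldRow:
    """Split 'taxonOID|scaffold' into its two parts and attach the length."""
    taxon_oid, scaffold = identifier.split("|", 1)
    return ScaffoldRow(taxon_oid, scaffold, length_bp)


# Three rows copied from the table above.
rows = [
    parse_scaffold("3300000567|JGI12270J11330_10135740", 961),
    parse_scaffold("3300010379|Ga0136449_100003216", 49036),
    parse_scaffold("3300006914|Ga0075436_100080752", 2253),
]

# Keep only scaffolds below the short-scaffold cutoff.
short = [r for r in rows if r.length_bp < SHORT_SCAFFOLD_BP]
print(f"{len(short)} of {len(rows)} example scaffolds are shorter than {SHORT_SCAFFOLD_BP} bp")
```

Applied to every row of the table, the same filter gives a way to cross-check the short-scaffold percentage reported in the overview.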




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    32.77%
Peatlands Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil    9.24%
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    8.40%
Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil    8.40%
Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil    8.40%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil    5.88%
Hardwood Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil    5.04%
Corn, Switchgrass And Miscanthus Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere    4.20%
Watersheds    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds    3.36%
Freshwater Sediment    Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment    2.52%
Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil    2.52%
Populus Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere    2.52%
Surface Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil    1.68%
Tropical Peatland    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland    1.68%
Bog Forest Soil    Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil    0.84%
Agricultural Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil    0.84%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil    0.84%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil    0.84%




Associated Samples

Taxon OID    Sample Name    Habitat Type
3300000567    Peat soil microbial communities from Weissenstadt, Germany - SII-2010    Environmental
3300001356    Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007    Environmental
3300004092    Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3    Environmental
3300004100    Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF244 (Metagenome Metatranscriptome)    Environmental
3300004101    Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF228 (Metagenome Metatranscriptome)    Environmental
3300004104    Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF218 (Metagenome Metatranscriptome)    Environmental
3300004119    Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF210 (Metagenome Metatranscriptome)    Environmental
3300004120    Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF238 (Metagenome Metatranscriptome)    Environmental
3300004139    Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)    Environmental
3300005332    Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)    Environmental
3300005445    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG    Environmental
3300005468    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG    Environmental
3300005471    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG    Environmental
3300005542    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1    Environmental
3300005556    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156    Environmental
3300005598    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155    Environmental
3300006050    Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014    Environmental
3300006162    Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012    Environmental
3300006172    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014    Environmental
3300006796    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114    Environmental
3300006861    Peat soil microbial communities from Weissenstadt, Germany - Metatranscriptome 3 (Metagenome Metatranscriptome)    Environmental
3300006903    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5    Host-Associated
3300006914    Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5    Host-Associated
3300006954    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control    Environmental
3300009137    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158    Environmental
3300009523    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_8_FC metaG    Environmental
3300009839    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_a_PC metaG    Environmental
3300010048    Tropical forest soil microbial communities from Panama - MetaG Plot_11    Environmental
3300010358    Tropical forest soil microbial communities from Panama - MetaG Plot_3    Environmental
3300010360    Tropical forest soil microbial communities from Panama - MetaG Plot_6    Environmental
3300010361    Tropical forest soil microbial communities from Panama - MetaG Plot_23    Environmental
3300010366    Tropical forest soil microbial communities from Panama - MetaG Plot_24    Environmental
3300010379    Sb_50d combined assembly    Environmental
3300011066    Peat soil microbial communities from Weissenstadt, Germany - Metatranscriptome 3 (Metagenome Metatranscriptome) (version 2)    Environmental
3300011120    Combined assembly of Microbial Forest Soil metaT    Environmental
3300011271    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG    Environmental
3300012096    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG    Environmental
3300012202    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG    Environmental
3300012207    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG    Environmental
3300012357    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG    Environmental
3300012361    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG    Environmental
3300012923    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG    Environmental
3300012929    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)    Environmental
3300017934    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3    Environmental
3300017944    Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MG    Environmental
3300017955    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2    Environmental
3300018012    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5    Environmental
3300018060    Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MG    Environmental
3300020579    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M    Environmental
3300020580    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M    Environmental
3300020581    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M    Environmental
3300020583    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M    Environmental
3300021151    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_06_16RNAfungal (Metagenome Metatranscriptome)    Environmental
3300021168    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M    Environmental
3300021170    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M    Environmental
3300021178    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M    Environmental
3300021432    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M    Environmental
3300021478    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M    Environmental
3300021479    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M    Environmental
3300021559    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M    Environmental
3300022506    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-26-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022507    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-M (Metagenome Metatranscriptome)    Environmental
3300022508    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-19-M (Metagenome Metatranscriptome)    Environmental
3300022509    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-O (Metagenome Metatranscriptome) (v2)    Environmental
3300022523    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2)    Environmental
3300022531    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022532    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022533    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022712    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022717    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-11-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022718    Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O (Metagenome Metatranscriptome) (v2)    Environmental
3300022722    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022724    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022726    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)    Environmental
3300025906    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)    Environmental
3300025916    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)    Environmental
3300027641    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_8_FC metaG (SPAdes)    Environmental
3300027854    Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes)    Environmental
3300027857    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)    Environmental
3300027905    Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes)    Environmental
3300027911    Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes)    Environmental
3300028047    Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)    Environmental
3300029636    Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)    Environmental
3300030730    Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome)    Environmental
3300030743    Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VCO Co-assembly    Environmental
3300030937    Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A4_MS_spring Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2)    Environmental
3300031057    Oak Coassembly Site 11 - Champenoux / Amance forest    Environmental
3300031122    Oak Spring Coassembly Site 11 - Champenoux / Amance forest    Environmental
3300031128    Oak Summer Coassembly Site 11 - Champenoux / Amance forest    Environmental
3300031231    Coassembly Site 11 (all samples) - Champenoux / Amance forest    Environmental
3300031446    Fir Summer Coassembly Site 11 - Champenoux / Amance forest    Environmental
3300031474    Fir Coassembly Site 11 - Champenoux / Amance forest    Environmental
3300031720    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515    Environmental
3300031754    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515    Environmental
3300031823    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05    Environmental
3300032009    Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f19    Environmental
3300032180    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515    Environmental





Family Sequences

Protein ID    Sample Taxon ID    Habitat    Sequence
JGI12270J11330_101357402    3300000567    Peatlands Soil    QSETIACSQSRLQPPAGLDVQCGRYALSNSHCSKQGFVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRSDMVTSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICAAPGQTLTKAQAEDLIKSFKVTSK*
JGI12269J14319_100836461    3300001356    Peatlands Soil    MRSPMTALSWEEVMRAAIFAAVFALAFAPLPASVSQSARAQGVNFSNSEPRDEQSETIACSQSRLQPPAGLDVQCGRYALSNSHCSKQGFVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRSDMVTSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYGGDAAP
Ga0062389_1024261731    3300004092    Bog Forest Soil    NFSNSEPRDEQSQPIACSQSRLQPPAGLDVECGQYPLSNSHCSKQGYVVQSKAVGGPSIFVFAMTKQGGGKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPKTDRDGKCVAFYQPGAPVVNYGQDAAPSHSWLRDYYYRGWICVPPGQTLTRAQAEDLIKSFKVTGK*
Ga0058904_13695901    3300004100    Forest Soil    ITALASEEIMKGVIVATVLALVFAPVPASVMLSARAQGVNFSNSEPRDEQSETIACSQSRLQPPGGLDVQCGRYPVSNSNCYKQGYVVESKAGGPSMFVFAMTQGGGSRYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSSKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPITSFGGPAQPPHGYLREYYMRGWICVPPGQTLTK
Ga0058896_10048151    3300004101    Forest Soil    FSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK*
Ga0058891_15171081    3300004104    Forest Soil    MKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKAQAADLIKSFKVTSK*
Ga0058887_14667381    3300004119    Forest Soil    ARAQGVNFSNSEPRDEQSETIACSQSRLQPPGGLDVQCGRYPVSNSNCYKQGYVVESKAGGPSMFVFAMTQGGGSRYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPKTDKDGKCVAFYQPGPPITSFGGPAQPPHGYLRDYYYRGWICVPPGQTLTKDQAVDLIKSFK
Ga0058901_15558071    3300004120    Forest Soil    MKGVIVATVLALVFAPVPASVMLSARAQGVNFSNSEPRDEQSETIACSQSRLQPPGGLDVQCGRYPVSNSNCYKQGYVVESKAGGPSMFVFAMTQGGGSRYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPKTDKDGKCVAFYQPGPPITSFGGPAQPPHGYLRDYYYRGWICVPP
Ga0058897_109518241    3300004139    Forest Soil    FSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTL
Ga0058897_109924871    3300004139    Forest Soil    MKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK*
Ga0066388_1071066331    3300005332    Tropical Forest Soil    MNGAIVTLAAAVALLLAPLPASVSSSARAQGVNFSNSEPRDAQSEKIDCSQSRLRPPGGLNIQCGRYPVSNSHCLKQGYVVESTGNGPSIFVFAMTQRHTSKYCGIAAPQSVRSDMVMAIKKYRPFVHDDADNWSKKPFELKHAGMALFFDSPKKDRDGKCVAFYQPGPPVVNYGGD
Ga0070708_1002554402    3300005445    Corn, Switchgrass And Miscanthus Rhizosphere    MKSAIFALAALMLAPLPDSVSFSARAQGVNFSNSEPRDAQSQTIACSQSRLQPPAGLDVACGKYPLSNSHCSKQGYVVESKASGPSIFVFAMTQQRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWTKKPFELKHAGLALFFDSPKTDRDGQCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKDQALDLIKSFKVTSK*
Ga0070707_1008993022    3300005468    Corn, Switchgrass And Miscanthus Rhizosphere    MKVVIVAAVLALMLAPLPASVSLSARAQGVNFSNSEPRDEHSETIACSQSRLQPPAGLDVECGRYPLSNSHCTKQGYVAESKAGGPSIFVYAMTEGRGSKYCGIAAPQSVRTDMVMAVKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDSDGKCVAFYQPGPP
Ga0070698_1017823201    3300005471    Corn, Switchgrass And Miscanthus Rhizosphere    SLSARAQGVNFSNSEPRDEHSETIACSQSRLQPPAGLDVECGRYPLSNSHCYKQGYVVESKAGGPSIFVFAMTQGRSNKYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPLELKHAGLALFFDSPNTDRDGKCVAFYQPGPPIVSRGGEAQPPHGYLRDYYYRGWICVPPGQTLTKDAAHNLIK
Ga0070732_109186531    3300005542    Surface Soil    ALFAAALALVFAPLPASVSHSARAQGVNFSNSEPRDAQSETIACSQSRLQPPAGLDVVCGRYPVSNPHCQKQGYAVESKAGQPAIFVFAMTQGRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKETDGKCVAFYQPGPPVVNYGGENAPTHS
Ga0066707_100892002    3300005556    Soil    MKGAIFTAVLALMLAPLPASESLSARAQGVNFSNSEPRDEQSERIACPQSRLQPAAGLDVECGRYPLSNSHCSKQGYVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWTKKPFELKHAGLALFFDSPNASRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYVRGWICAPPGQTLTKDQAVDLIKSFKVSAK*
Ga0066706_107112801    3300005598    Soil    MKGAIFTAVLALMLAPLPASESLSARAQGVNFSNSEPRDEQSERIACPQSRLQPAAGLDVECGRYPLSNSHCSKQGFVVESKASGPSIFVFAMTQRRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDAINWSKKPFELKHAGLALFFDSPNANRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICAPPGQTLTKDQAEDLIKSFKVTSK*
Ga0075028_1006633141    3300006050    Watersheds    MKAAIFPAVLALVFAPLSASVSFSAHAQGINFSNSEPRDEQSETIACSQSRLQPPGGLDVECGKYPLSNSHCSKQGYVVQSKASGPSIFVFAMTQQRSSKYCGIAAPQSVRTDMMMATKKYRPFVHDDATNWTKKPFELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICA
Ga0075030_1003969102    3300006162    Watersheds    MKAAIFAAVLAVAFAPLPASVSQSARAQGMNFSNSEPRDEQSETIACSQSRLQPPGGLDVECGRYPVNNSHCSKQGFVVQSKTNGPSIFVFAMTQQRSSRYCGIAAPQSVRTDMVMSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPNKDRDGRCVAFYQPGPPVVNYGAEAAPTHSYLRDYYYRGWICAAPGQTMTKDQALDLIKSFKVTAK*
Ga0075018_1058228413300006172WatershedsIMKRVIVAAGLALMLAPLPASAAGKDFANSEPRNEQSEKIACAQSRLQPPAGLDVDCGRYPVSSSHCYKRGYVVESKAGQPSIFVFAMTQGRSSTTCGIAAPQSVRTDMVMAVKKYRPFVRDDATNWTKKPLELKHAGLALLFDSPNKDRDGKCVAFYQPGPPVVNYGSDAAPTHSYERDYYYRGWICAPPGQTLTQ
Ga0066665_1025595523300006796SoilMKGAIFTAVLALMLAPLPASESLSARAQGVNFSNSEPRDEQSERIACPQSRLQPAAGLDVECGRYPLSNSHCSKQGYVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWTKKPFELKHAGLALFFDSPNASRDGKCVAFYQPGPPVVNYGGEAAPTHSYLRDYYVRGWICAPPGQTLTKDQAVDLIKSFKVSAK*
Ga0063777_108121013300006861Peatlands SoilMRSPMTALSWEEVMRAAIFAAVFALAFAPLPASVSQSARAQGVNFSNSEPRDEQSETIACSQSRLQPPAGLDVQCGRYALSNSHCSKQGFVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRSDMVTSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYG
Ga0075426_1022051023300006903Populus RhizosphereMRGAILAAVLALMLAPLPDSVSFSARAQGINFSNSEPRDEQSQTIACSQSRLQPPAGLDVECGRYPINNSHCSKQGYVVQSKASGPSIFVFAMTQLRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWTKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKDQALDLIKSFKVTSK*
Ga0075436_10008075233300006914Populus RhizosphereMRGAILAAVLALMLAPLPDSVSFSARAQGINFSNSEPRDEQSQTIACSQSRLQPPAGLDVECGRYPINNSHCSKQGYVVQSKASGPSIFVFAMTQLRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWTKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKDQALDLIKSFKVTS
Ga0075436_10141412313300006914Populus RhizosphereALAAGVLVLAPLPVSVSSSAHAQGVNFSNSEPRDAQSEPIDCSQSRLQPPGGLNVQCGRYPISNSHCLKQGYVVETKGSGPSIFVFAMTQRHSSKYCGIAAPQSVRSDMVMAVKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNAKRDGRCVAFYQPGPPITSFGGEAQPPH
Ga0079219_1093513413300006954Agricultural SoilARAAGADFSNSEPRNEQSERIACPASRLQPPAGLDVECGRYPVSNSHCYKQGYVVESKAGGPSIFVFAMTQGRGSRYCGIAAPQSVRTDMVMSIKKYRPFVHDEVMTWDKKPFDLKHAGLALLFDSPNSQRDGKCVAFYQPGPPVVNYGAEAAPTHSYERDYYYRGWICAPPGQTLTKDAANNLIKSFKVTAK*
Ga0066709_10160811713300009137Grasslands SoilMTALSSEEVMKAAIFAAVFALAFAPLPASVSLSARAQGVDFSNSEPRNEQSEKIACSQSLLQPPAGLDVECGRYPLSNSHCSKQGYVVESKASGPSIFVFAMTQRRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKSFELKHAGLALFFNSPKTDRDGQCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWVCAPAGQTLTKDQAADLIKSFKVTAK*
Ga0116221_120231213300009523Peatlands SoilMTALSWEEVMRAAIFAAVFALAFAPLPASVSQSARAQGVNFSNSEPRDEQSETIACSQSRLQPPAGLDVQCGRYALSNSHCSKQGFVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRSDMVTSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICAAPGQTLTKAQAEDLIKSFKVTSK*
Ga0116223_1056329613300009839Peatlands SoilTALSSEENMKGVIFAAVLALMFAPLPASVSHSARAQGVNFSNSEPRDEQSERIACSQSLLRPPAGLDVECGRYPLSNAHCLKQGYAVESKAGGPSIFVFAMTQGGRSSKFCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTDRDGKCVAFYQPGAPVVNYGQDAAPSHSYLRDYYYRGWICVPPGQTLTKAQAVDLIK
Ga0126373_1108936813300010048Tropical Forest SoilMKGAVFALASAFSLILASLPASVSLSARAAGTDFSNSEPREAQSEKIACSQSRLQPPAGLDVECGRYPISNSHCLKQGYVVESKGSGPSIFVFAMTQRNTGRLCGIAAPQSVRTDMVMAVKKYRPFVHDDATNWTKKPFDLKHAGLALFFDSPNKDRDGRCVAFYQPG
Ga0126373_1117809213300010048Tropical Forest SoilMKGALFAFTSVLAVMLAPLPGSVSLSARAQGVNFSNSEPRDAQAEKIACSQSRLQPPTGLNVECGKYPISNSHCLKQGYGVESKAGAPSVFVFAMTQRRTARNCGIAAPQSVRTDMVMAIKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNKDRDGQCVAFYQPGPQDPYTGGPAAR
Ga0126370_1031392813300010358Tropical Forest SoilMRLLKTDREGIMKVAIVAFAVAFALMLAPPPASVSLSAHAQGVDFSNSEPRNAQSETIACAQSRLKPPAGLDVQCGKYPISNSHCLKQGYVVESKAGGPSIFVYAMTQRPSSRYCGIAAPQSVRTDMVMAVKKYRPFVHDDATNWAKKPFELKHAGLALFFDSPNTQRDGKCVAFYQPGPPVVNFGADAAPTHSYERDYYMRGWICAPPGQNLTKDAAYDLIKSFKVTAK*
Ga0126372_1102571223300010360Tropical Forest SoilPLFPAGGEIMQVTVFAFAAAFTLMLAPLPASAVGTDFSNSEPRDEQSERIACSQSRLQPPAGLDVECGRYPLSNAHCYKQGYGIESKAGGPSIFVFAMTGRTGRNCGIAAPQSARTDLVMAIKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNKNRDGQCVAFYQPGPPVTHFGGDSQLPHGYLRDYYIRGWVCAPPGQTLSKDAAYNLIKSFKVTSK*
Ga0126372_1199396613300010360Tropical Forest SoilGVNFSNSEPRDAQSERIACSQSRLQPPAGLDVDCGKYPLSNPHCYKQGYGVESKAGRPSVFVFAMTGRSGRNCGISAPQSVRTDMVMAIKKYRPFVHDDATNWSKKPFELKHAGLALLFDSPNKDRDGQCVAFYQPGPAVTPIGGEAQVPGARLRDYYMRGWICTPPGQTLSKDAAYNLIKSLKVTSK*
Ga0126378_1000603743300010361Tropical Forest SoilMRLLKTDREGIMKVAIVAFAVAFALMLAPPPASVSLSAHAQGVDFSNSEPRNAQSETIACAQSRLKPPAGLDVQCGKYPISNSHCLKQGYVVESKAGGPSIFVYAMTQRPSSRYCGIAAPQSVRTDMVMAVKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGTPVVNYGAEAAPTHSYLRDYYMRGWICAPPGQTLSKDAAYDLIKSFKVASR*
Ga0126379_1026744923300010366Tropical Forest SoilMQVTVFAFAAAFTLMLAPLPASAVGTDFSNSEPRDEQSERIACSQSRLQPPAGLDVECGRYPLSNAHCYKQGYGIESKAGGPSIFVFAMTGRTGRNCGIAAPQSARTDLVMAIKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNKDRDVQCVAFYQPGPPVTHFGGESQLPHGYLRDYYVRGWICAPPGQTLSKDAAYNLIKSFKVTSR*
Ga0136449_100003216493300010379Peatlands SoilMTALSWEEVMRAAIFAAVFALAFAPLPASVSQSARAQGVNFSNSEPRDEQSETIACSQSRLQPPAGLDVQCGRYALSNSHCSKQGFVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRSDMVTSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWIYAAPGQTLTKAQAEDLIKSFKVTSK*
Ga0136449_10002970873300010379Peatlands SoilMKGVIFAAVLALMFAPLPASVSHSARAQGVNFSNSEPRDEQSERIACSQSLLRPPAGLDVECGRYPLSNAHCLKQGYAVESKAGGPSIFVFAMTQGGRSSKFCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTDRDGKCVAFYQPGAPVVNYGQDAAPSHSYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKVTSK*
Ga0138524_105345113300011066Peatlands SoilMTALSWEEVMRAAIFAAVFALAFAPLPASVSQSARAQGVNFSNSEPRDEQSETIACSQSRLQPPAGLDVQCGRYALSNSHCSKQGFVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRSDMVTSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYG
Ga0150983_1200436813300011120Forest SoilVKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR*
Ga0150983_1420099523300011120Forest SoilVRNGHCTIQREKAITALASEEIMKGVIVATVLALVFAPVPASVMLSARAQGVNFSNSEPRDEQSETIACSQSRLQPPGGLDVQCGRYPVSNSNCYKQGYVVESKAGGPSMFVFAMTQGGGSRYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPKTDKDGKCVAFYQPGPPITSFGGPAQPPHGYLRDYYYRGWICVPPGQTLTKDQAVDLIKSFKVAVK*
Ga0137393_1098059013300011271Vadose Zone SoilMKSAIFALAAAFALMLAPLPASVSLSARAQGVNFSNSEPRDAQSERIACSQSLLQPAAGLDVECGRYPLSNSHCSKQGYVVESKAGGPSIFVFAMTQRRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDAINWSKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKDAADNLI
Ga0137389_1065337313300012096Vadose Zone SoilMKGAVFALAAAFTLMLAPLPASVSLSARAQGVDFSNSEPRDEQSERIACPESRLQPPAGLDVECGRYPLSNSHCTKQGYVAESKAGGPSVFVYAMTQGRGSRYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDRDGQCVAFYQPGPPIVSRGGEAQPPHGYLRDYYYRGWICVPPGQTLNKAQAVDLIKSFKVTSK*
Ga0137363_1018135723300012202Vadose Zone SoilMKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNANRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKAQAEDLIKSFKVTSK*
Ga0137363_1055259923300012202Vadose Zone SoilMKGAIFAAVLALMFAPLPASVSLSARAQGVDFSNSEPRNEQSERIACSQSRLQPSAGLDVECGRYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGPPITSFGGRGQPTHGYLRDYYYRGWICVPSGQT
Ga0137381_1111599613300012207Vadose Zone SoilAQGVNFSNSEPRDQQSEKIACPQSRLQPAAGLDVECGRYPLSNSHCSKQGYVVEGKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNANRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKAQAEDLIKSFKVTSK*
Ga0137384_1028846723300012357Vadose Zone SoilMTALSSEEIMKAVIVAAVLALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERITCSQSLLQPAAGLDVECGRYPLSNSHCSKQGFVVESKASGPSIFVFAMTQRRSSRYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNANRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKAQAEDLIKSFKVTSK*
Ga0137360_1114415113300012361Vadose Zone SoilEEIMKSAMFAAALAVMLAPLPASAVGTDFSNSEPRDAQSERIACSQSLLQPAAGLDVECGRYPLSNSHCSKQGFVVESKASGPSIFVFAMTQRRSSKYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNANRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKAQAEDLIKSFKVTSK*
Ga0137359_1138484413300012923Vadose Zone SoilALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECGRYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQT
Ga0137404_1204291213300012929Vadose Zone SoilLPPSVSLSARAQGVNFSNSEPRDEQSEKIACSQSRLQPTSGLDVECGRYPLSNSHCSKQGYVVESKAGGPSIFVFAMTQGRSSKYCGIAAPQSVRTDMVTATKKYRPFVHDDATNWSKKPFELKHAGLAMFFDCSNTNREGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWIC
Ga0187803_1015593513300017934Freshwater SedimentMKGAIFAAVLALMFAPLPASVSLSARAQGVNFSNSEPRDEQSEKIACSQSLLHPPAGLDVECGRYPVSNSHCRKQGYAVESKAGGPSIFVFAMTQGRGSRYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPITSFGGTAQPPDGYLREYYMRGWICVPPGQTLTKAQAVDLIKSFKVA
Ga0187786_1006626613300017944Tropical PeatlandMKIAIFAFVAAFAFTLAPLSTSGSFSARAAGVDFSNSEPRDEQSEKIACTESRLQPPAGLAVECGRYPLSNPNCYKRGYVVEGKAGGPSIFVFAMTEGQSSGTCGIAAPQSVRSDMVMAIKKYRPFVRDDATNWSPKPFELKHAGQALLFDSPNKGRDGQCVAFYQPGSPVLSSHSEGKAPLSYERDYYYRGWICVPPGQTLTKAQAVDLIKSFKVTRK
Ga0187817_1036988123300017955Freshwater SedimentMKGVVVAVALALMFAPLPASVSHPARAYGVDFSNSEPRDEQSERIACSQSLLQPPAGLDVECGRYPVSSSHCQKQGYAVESKAGGPAIFVFAMTQGRSSKNCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDKDGKCVAFYQPGPPVIVKGGEASPGE
Ga0187810_1032617513300018012Freshwater SedimentMKGVVVAVALALMFAPLPASVSHPARAYGVDFSNSEPRDEQSERIACSQSLLQPPAGLDVECGRYPVSSSHCQKQGYAVESKAGGPAIFVFAMTQGRSSKNCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDKDGKCVAFYQPGPPVVNYGAEAAPTHSYLRDYYYR
Ga0187765_1002317733300018060Tropical PeatlandMKFAIFAFVAASTFMLAPLPTSGSFSARAAGVDFSNSEPRDEQSEKIACAESRLRPPAGLAVECGRYPLSNPNCYKQGYVVEGKAGGPSIFLFAMTEGQSSGTCGIAAPQSVRSDMVMAIKKYRPFVRDDATNWSPKPFELKHAGLALFFDSPNKGRDGQCVAFYQPGSPVLSSHSEGKSPLSYERDYYYRGWICVPPGQTLTKAQAVDLIKSFKVTRK
Ga0210407_1105887113300020579SoilPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVATK
Ga0210407_1115670513300020579SoilMRGAIFALAAAFALMLAPLPASAAGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAE
Ga0210403_1044013823300020580SoilMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWISVPPGQTLTKDQAVDLIKSFKVAAK
Ga0210399_1006083133300020581SoilMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIATPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWISVPPGQTLTKDQAVDLIKSFKVAAK
Ga0210401_1038192323300020583SoilMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIATPQSVRTDMVMATKKYRPFVHDDATNWSSKPFELKHTGLALFFDSPNKGRDGKCVAFYQPGPAVVNYGAEAAPTHSYLRDYYYRGWICAPPGQTLTKDAAYDLIKSFKVTSK
Ga0179584_109790513300021151Vadose Zone SoilIMKGAIFAAVLALMFAPLPASVSLSARAQGVNFSNSEPRDEQSEKIACSQSRLQPTSGLDVECGRYPLSNSHCSKQGYVVESKAGGPSIFVFAMTQGRSSKYCGIAAPQSVRTDMVTATKKYRPFVHDDATNWSKKPFELKHAGLAMFFDSPNTNREGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICAPPGQTL
Ga0210406_1048835923300021168SoilMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0210400_1016869023300021170SoilVKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR
Ga0210408_1055889823300021178SoilVSLSARAQGVNFSNSEPRDEQSETIACSRSRLQPPGGLDVQCGRYPVSNSNCYKQGYVVESKAGGPSMFVFAMTQGGGSRYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPITSFGGPAQPPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR
Ga0210384_1009385523300021432SoilVKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPITGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR
Ga0210384_1028094123300021432SoilMKGVIVATVLALVFAPVPASVMLSARAQGVNFSNSEPRDEQSETIACSQSRLQPPGGLDVQCGRYPVSNSNCYKQGYVVESKAGGPSMFVFAMTQGGGSRYCGIAAPQSVRTDMVMATKKYRPFVHEDATNWSKKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPITSFGGPAQPPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKVTSK
Ga0210384_1028388523300021432SoilVNFSNSEPRDEQSERLACSQSLLQPPAGLDVECGRYPVSNPHCHKQGYVVESKAGGPSVFVFAMTQGRSTRYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSSKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPVVNYGAEAAPTHSYLRDYYYRGWICVPPGQTLTKAQAADLIKSFKVTSK
Ga0210384_1054718023300021432SoilMRGVIVAAALALMLAPLPASAAGVDFSNSEPRDEHSEAIACSQSLLHPAAGLDVECGRYPVSNSHCYKQGYVVEGKAGGPSMFVFAMTQGRSNRYCGIAAPQSVRSDMVMAIKKYRPFVRDDATNWSSKPFELKHTGLALFFDSPNKGRDGKCVAFYQPGPA
Ga0210402_1095751823300021478SoilVKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVP
Ga0210410_1032573713300021479SoilTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR
Ga0210410_1129620613300021479SoilMLAPLPASAAGVDFSNSEPRDEHSEAIACSQSLLHPAAGLDVECGRYPVSNSHCYKQGYVVEGKAGGPSMFVFAMTQGRSNRYCGIAAPQSVRSDMVMAIKKYRPFVRDDATNWSSKPFELKHTGLALFFDSPNKGRDGKCVAFYQPGPAVVNYGAEAAPTHSYLRDYYYRGWICAPPGQTLTKDAAYDLIKSFKVTSK
Ga0210409_1007030653300021559SoilMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIATPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0210409_1125879313300021559SoilMKGVIVAAVLALMFAPLPASVSLSARAQGVNFSNSEPRDEQSETIACSRSRLQPPGGLDVQCGRYPVSNSNCYKQGYAVESKVGGPSIFVFAMTQGQGSRYCGIAAPQSVRTDMVMATKKYRPFVHEDATNWSKKPFELKHAGLALFFDSANKDRDGKCVAFYQPGPPITSFGGPAQPPHGYLRDYYYRGWI
Ga0242648_102586823300022506SoilMKVALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0222729_100510613300022507SoilPLSPGEEIMKGAIFAAVLALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0222729_100510623300022507SoilVKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPH
Ga0222728_100631123300022508SoilMKGAIFAAVLALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIATPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0242649_102314013300022509SoilPLSGEEIMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGVALFFDSPKTDREGKCVAFYQPGAPVVNYGQDAAPSHSYLRDYYYRGWICVPPGQTLTKDQAVDLIKSFKIAAR
Ga0242663_109684613300022523SoilGEEIMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIATPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYR
Ga0242660_100627713300022531SoilPTGISVVREVHLPNPPIEVDQFPDTLPLSGEEIMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0242655_1001316513300022532SoilPSVLELEWLLGPVGRVVMHERPIFLGRHHHVASNALWCRDPAIFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0242662_1003780713300022533SoilKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0242662_1003780723300022533SoilVKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYL
Ga0242653_100491523300022712SoilMKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR
Ga0242653_100491533300022712SoilEEIMKGVIVAAVLALMFAPLSASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIATPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVIKYSQDAAPSHSYLRNYYYRGWISVPPGQTLTKDQAVDLIKSFKVTAK
Ga0242661_100744813300022717SoilMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVTAK
Ga0242675_102343823300022718SoilAITALASEEIMKGVIVATVLALVFAPVPASVMLSARAQGVNFSNSEPRDEQSETIACSQSRLQPPGGLDVQCGRYPVSNSNCYKQGYVVESKAGGPSMFVFAMTQGGGSRYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGPPITSFGGPAQPPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKVAAK
Ga0242657_107277813300022722SoilPLSGEEIMKAALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0242657_123467013300022722SoilEHSETIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFK
Ga0242665_1008934523300022724SoilMKGIIFAAAFALMLAPLPASVSLSAHAQGVNFSNSEPRDEQSERLPCSQSLLQPPAGLDVECGRYPVSNSNCYKQGYAVESKVGGPSIFVFAMTQGQGSRYCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPITSFGGPAQPRHRYLREYYMRGRICSQPGRTL
Ga0242654_1001753433300022726SoilMKGAIFATVLALMFAPLPASVSLSARDQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0207699_1124909513300025906Corn, Switchgrass And Miscanthus RhizosphereGEIMKSAILAGAFALMLAPLPASAVGTDFSNSEPRDEQSERIACSQSRLQPPANLDVECGRYPLSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDLVMAIKKYRPFVHDDGTNWSKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPIYIESDVLPHGYLRDYYVR
Ga0207663_1041715623300025916Corn, Switchgrass And Miscanthus RhizosphereMKVALFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECGRYPLSNSHCEKQGYVVESKAGGPAIFVFAMTGRRGKGCGIAAPQSVRTDLVMAIKKYRPFVHDDGTNWSKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICVPPGQTLTKDQAVDLIKSFKVAVK
Ga0208827_110563613300027641Peatlands SoilMRSPMTALSWEEVMRAAIFAAVFALAFAPLPASVSQSARAQGVNFSNSEPRDEQSETIACSQSRLQPPAGLDVQCGRYALSNSHCSKQGFVVESKAGGPSIFVFAMTQLRSSKYCGIAAPQSVRSDMVTSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPKTDRDGRCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWICAAPGQTLTKAQAEDLIKSFKVTSK
Ga0209517_1019438423300027854Peatlands SoilMKGVIFAAVLALMFAPLPASVSHSARAQGVNFSNSEPRDEQSERIACSQSLLRPPAGLDVECGRYPLSNAHCLKQGYAVESKAGGPSIFVFAMTQGGRSSKFCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTDRDGKCVAFYQPGAPVVNYGQDAAPSHSYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKVTSK
Ga0209166_1037496413300027857Surface SoilMKGALVAFTAAFALTFAPLPTSVSSTARAQGVNFSNSEPRDAQSEPIACEQSRLRPPGGLDVQCGRYPLSNSHCTKQGYVVESKGSGPSIFVYAMTQGRGSKYCGIAAPQSVRSDMVMSIKKYRPFVHDDATNWAKKPFELKHAGMALFFDSPNTQRDGKCVAFYQPGPPVVNYGNDAAPTHAWLRDYYIRG
Ga0209415_1004917663300027905Peatlands SoilMKAAIFAAVLALSFAPLPASVSQSARAQGVNFSNSEPRDEQSQPIACSQSRLQPPAGLDVECGQYPLSNSHCSKQGYVVQSKAAGGPSIFVFAMTKRGGGKFCGIAAPQSVRTDMVMSVKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPKTDRDGKCVAFYQPGAAVVNYGQDAAPSHSWLRDYYYRGWICVPPGHTLTKAQAEDLIKSFKVTAR
Ga0209698_1032335613300027911WatershedsMKAAIFAAVLAVAFAPLPASVSQSARAQGMNFSNSEPRDEQSETIACSQSRLQPPGGLDVECGRYPVNNSHCSKQGFVVQSKTNGPSIFVFAMTQQRSSRYCGIAAPQSVRTDMVMSVKKYRPFVHDDATNWTKKPVELKHAGLALFFDSPNKDRDGRCVAFYQPGPPVVNYGAEAAPTHSYLRDYYYRGWICAAPGQTMTKDQALDLIKSFKVTAK
Ga0209526_1033762713300028047Forest SoilMKGAIFALAAAVALMLGPLPASAVGVDFSNSEPRDEQSETIACSQSRLQPHGGLDVQCGRYSVSNSHCFKQGYVVESKAGGPSIFVFAMTQGRGSRYCGIAAPQSVRTDMVMAIKKYRPFVHDDAADWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPITSFGGEAQPPHGYLRDYYMRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0222749_1065213113300029636SoilMKGIIFAAAFALMLAPLPASVSLSAHAQGVNFSNSEPRDEQSERLPCSQSLLQPPAGLDVECGRYPVSNSHCYKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIATPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDRDGQCVAFYQPGPAVVNYGAEAAPTHSY
Ga0307482_111985513300030730Hardwood Forest SoilMKGATFALAAAFALMLAPLPASVSSSARAAGVDFSNSEPRDEQSEKIACPQSRLQPPAGLDVECGRYPLSNSHCTKQGYVVESKAGGPSIFVFAMTQGRGSKYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNTQRDGKCVAFYQPGPPIVSMGGEAQPPHGYLRDYYYRGWICVP
Ga0265461_1377929613300030743SoilGLVAALALTLAPPSARAAGIDFSNSEPRDAQSEPVACTQSLLHPPGGLDVECGRYPLSNEHCYKQGYVVQSKANGPSIFVYAMTGHGGRGCGIATPQSVRTDMVMSIKKYRPFVHDDATDWAKKPFELKHTGFALFFESPNKARDGKCVAFYQPGPPVIAKGGDSETNEPY
Ga0138302_171988313300030937SoilAPLPASVSLSARAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNKGRDGKCVAFYQPGPAVVNYGSEAAPTHSY
Ga0170834_10011276213300031057Forest SoilGEEIMKVAIFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACPASRLQPPAGLDVECGRYPLSNSNCTKQGYGVESKAGGPSIFVFAMTQGRGSKYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPIVSRGGPDQPPHGYLRDYYYRGWICVPPGQTLSKDAAYNLIKSFKVTSK
Ga0170834_10018082813300031057Forest SoilMKGAIFALAAVLALMFAPLPASVSLSAHAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICV
Ga0170822_1328044713300031122Forest SoilIMKAALFAAALALMFAPLPASVSLSADAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0170823_1449650213300031128Forest SoilMKVAIFAAALALMFAPLPASVSLSARAQGVNFSNSEPRDEQSERIACPASRLQPPAGLDVECGRYPLSNSNCTKQGYGVESKAGGPSIFVFAMTQGRGSKYCGIAAPQSVRSDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPIVSRGGPDQPPHGYLRDYYYRGWICVPPGQTLSKDAAYNLIKSFKVTSK
Ga0170823_1700926613300031128Forest SoilMKGAIFALAAVLALMLAPLPASADFSFSNSEPRDEQSERIACSQSRLQPPAGLDVECARYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0170824_11783882113300031231Forest SoilIMKVAIFAAALALMFAPLPASVSLSTRAQGVNFSNSEPRDEQSERIACAQSLLQPPAGLDVECGRYPVSNSHCYKQGYGVESKPGQPSIFVFAMSHAKISRNCAIDPPQSARENMVVAIKKYRPFVHDDATNWSEKPFELKHRGLALFFDSPNKDRDGKCVAFYQPGPPLIGKSKDPHSLYLRHYYVRGWICVPPGQTLTKDEAYDLIKSFKVTASYSS
Ga0170824_12051851923300031231Forest SoilVKSAIFTAALAVMLAPLPASAVGTDFSNSEPRDEHSVTIACSQSRLQAPAGLDVECGRYPVSSSHCYKQGYVVESKSGGPYIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR
Ga0170824_12051851933300031231Forest SoilMKGAIFALAAVLALMFAPLPASVSLSAHAQGVNFSNSEPRDEQSERIACSQSRLQPPAGLDVECGRYPLSNSHCEKQGYVVEIKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0170820_1330191513300031446Forest SoilFSNSEPRDEQSERIACAQSLLQPPAGLDVECGRYPLSNSHCSKQGYGVENKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFK
Ga0170818_10433189413300031474Forest SoilFSNSEPRDEQSERIACAQSLLQPPAGLDVECGRYPLSNSHCSKQGYGVENKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATKWSKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPAVTNYGGDAAPTHSYLRDYYYRGWICVPPGQ
Ga0307469_1201519813300031720Hardwood Forest SoilPLPASVSSSARAAGADFSNSEPRNEQSERIACPASRLQPPAGLDVECGRYPVSNSHCYKQGYVVESKAGGPSIFVFAMTQGRGSRYCGIAAPQSVRTDMVMAVKKYRPFVHDDATNWTKKPFELKHAGLALFFDSPKTDRDGKCVAFYQPGPPVVNYGGDAAPTHSYLRDYYYRGWVCAPPGQTLT
Ga0307475_1053761213300031754Hardwood Forest SoilMKSAIFAAAFALMLAPLPAAFGTDFSFSNSEPRDEQSEKIACSQSLLHPPAGLDVECGRYPVSSSHCYKQGYVVESKAGGPSIFVFAMTGRRGKGCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSRKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGPPVTGYIGNAELPHGYLRDYYYRGWICVPPGQTLTKAQAVDLIKSFKIAVR
Ga0307475_1074614113300031754Hardwood Forest SoilIFALAGAFALMLAPLPTSAVGTDFSNSEPRDEQSQAIACSQSRLQPPAGLDVECGRYPLSNSHCEKQGYVVESKAGGPAIFVFAMTQGRSSKFCGIAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHTGLALFFDSPNTNRDGKCVAFYQPGAPVINYSQDAAPSHSYLRNYYYRGWICVPPGQTLTKDQAVDLIKSFKVAAK
Ga0307478_1028530023300031823Hardwood Forest SoilMKVALFAATLALVLAPLPASVSHSARAQGVNFSNSEPRDAQSETIACSQSRLQPPAGLDVECGRYPVSNSHCLKQGYVVQSKASGPSIFVFAMTQRHTSKFCGVAAPQSVRTDMVMATKKYRPFVHDDATNWSKKPFELKHAGLALFFDSPNSQRDGKCVAFYQPGPPVVNYGQDAAPTHSYERDYYYRGWICAAPGQNLTREAAEDLIKSFKVAAK
Ga0318563_1076150713300032009SoilSETIACAQSRLQPPAGLDVQCGRYPISNSHCLKQGYVVESKASGPSIFVYAMTQRPSSRYCGIAAPQSVRSDMVMAVKKYRPFVHGDATNWSKKPFELKHAGLALFFDSPNKDRDGKCVAFYQPGTPVVNYGAEAAPTHSYLRDYYMRGWICAPPGQTLTKDAAYDLIKSFK
Ga0307471_10228051123300032180Hardwood Forest SoilMKGVISTLAAAAALVLAPLPASVSSSAHAQGVNFSNSEPRDAQSETIACAQSRLQPPGGLDVQCGRYPISNSHCLKQGYVVESKATGPSMFVFAMTQRHSSKYCGIAAPQSVRSDMVMAIKKYRPFVHDDATNWSKKPFELKHASLALLFDSPNAQRDGKCVAFYQPGP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice