NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F104854

Metagenome Family F104854

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104854
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 237 residues
Representative Sequence MPALTGTMVPPQNKLTASVPAMSDALAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHQKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVES
Number of Associated Samples 83
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 67.00 %
% of genes near scaffold ends (potentially truncated) 36.00 %
% of genes from short scaffolds (< 2000 bps) 48.00 %
Associated GOLD sequencing projects 71
AlphaFold2 3D model prediction Yes
3D model pTM-score0.28

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(37.000 % of family members)
Environment Ontology (ENVO) Unclassified
(56.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(81.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 30.43%    β-sheet: 7.97%    Coil/Unstructured: 61.59%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.28
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF00221Lyase_aromatic 7.00
PF07732Cu-oxidase_3 6.00
PF09346SMI1_KNR4 4.00
PF07704PSK_trans_fac 2.00
PF13414TPR_11 2.00
PF00326Peptidase_S9 2.00
PF01478Peptidase_A24 1.00
PF01761DHQ_synthase 1.00
PF00486Trans_reg_C 1.00
PF13701DDE_Tnp_1_4 1.00
PF06750DiS_P_DiS 1.00
PF00589Phage_integrase 1.00
PF01850PIN 1.00
PF00248Aldo_ket_red 1.00
PF01628HrcA 1.00
PF13560HTH_31 1.00
PF07676PD40 1.00
PF01987AIM24 1.00
PF00924MS_channel 1.00
PF00682HMGL-like 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG2986Histidine ammonia-lyaseAmino acid transport and metabolism [E] 7.00
COG2132Multicopper oxidase with three cupredoxin domains (includes cell division protein FtsP and spore coat protein CotA)Cell cycle control, cell division, chromosome partitioning [D] 6.00
COG1989Prepilin signal peptidase PulO (type II secretory pathway) or related peptidaseCell motility [N] 3.00
COG4423Uncharacterized conserved proteinFunction unknown [S] 2.00
COG0668Small-conductance mechanosensitive channelCell wall/membrane/envelope biogenesis [M] 1.00
COG1420Transcriptional regulator of heat shock responseTranscription [K] 1.00
COG2013AIM24 protein, required for mitochondrial respirationEnergy production and conversion [C] 1.00
COG3264Small-conductance mechanosensitive channel MscKCell wall/membrane/envelope biogenesis [M] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001471|JGI12712J15308_10052395All Organisms → cellular organisms → Bacteria → Acidobacteria1035Open in IMG/M
3300001593|JGI12635J15846_10012406All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis7148Open in IMG/M
3300002245|JGIcombinedJ26739_100566732All Organisms → cellular organisms → Bacteria → Acidobacteria1013Open in IMG/M
3300002909|JGI25388J43891_1004231All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2928Open in IMG/M
3300002915|JGI25387J43893_1038344All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Pseudonocardiales → Pseudonocardiaceae → Saccharopolyspora → Saccharopolyspora erythraea650Open in IMG/M
3300005167|Ga0066672_10066545All Organisms → cellular organisms → Bacteria → Acidobacteria2117Open in IMG/M
3300005167|Ga0066672_10697651All Organisms → cellular organisms → Bacteria → Acidobacteria651Open in IMG/M
3300005172|Ga0066683_10187869All Organisms → cellular organisms → Bacteria → Acidobacteria1274Open in IMG/M
3300005175|Ga0066673_10002981All Organisms → cellular organisms → Bacteria6370Open in IMG/M
3300005175|Ga0066673_10200171All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Blastococcus → Candidatus Blastococcus massiliensis1139Open in IMG/M
3300005176|Ga0066679_10073157All Organisms → cellular organisms → Bacteria → Acidobacteria2031Open in IMG/M
3300005177|Ga0066690_10029665All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3167Open in IMG/M
3300005178|Ga0066688_10004084All Organisms → cellular organisms → Bacteria6460Open in IMG/M
3300005179|Ga0066684_10039061All Organisms → cellular organisms → Bacteria → Acidobacteria2650Open in IMG/M
3300005184|Ga0066671_10421464All Organisms → cellular organisms → Bacteria → Acidobacteria856Open in IMG/M
3300005187|Ga0066675_10106341All Organisms → cellular organisms → Bacteria1861Open in IMG/M
3300005446|Ga0066686_10514906All Organisms → cellular organisms → Bacteria → Acidobacteria814Open in IMG/M
3300005450|Ga0066682_10028549All Organisms → cellular organisms → Bacteria → Acidobacteria3245Open in IMG/M
3300005450|Ga0066682_10042463All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2731Open in IMG/M
3300005540|Ga0066697_10001760All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium9263Open in IMG/M
3300005554|Ga0066661_10009974All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4682Open in IMG/M
3300005555|Ga0066692_10102488All Organisms → cellular organisms → Bacteria → Acidobacteria1698Open in IMG/M
3300005561|Ga0066699_10061394All Organisms → cellular organisms → Bacteria → Acidobacteria2355Open in IMG/M
3300005568|Ga0066703_10046896All Organisms → cellular organisms → Bacteria → Acidobacteria2406Open in IMG/M
3300005575|Ga0066702_10030639All Organisms → cellular organisms → Bacteria2748Open in IMG/M
3300005576|Ga0066708_10074165All Organisms → cellular organisms → Bacteria → Acidobacteria1966Open in IMG/M
3300005586|Ga0066691_10159084All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1302Open in IMG/M
3300005598|Ga0066706_10032316All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3355Open in IMG/M
3300006031|Ga0066651_10075947All Organisms → cellular organisms → Bacteria1640Open in IMG/M
3300006032|Ga0066696_10163433All Organisms → cellular organisms → Bacteria → Acidobacteria1398Open in IMG/M
3300006796|Ga0066665_10199991All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1550Open in IMG/M
3300006800|Ga0066660_10935671All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Micromonospora → Micromonospora eburnea703Open in IMG/M
3300009012|Ga0066710_100151071All Organisms → cellular organisms → Bacteria3229Open in IMG/M
3300010321|Ga0134067_10000711All Organisms → cellular organisms → Bacteria6715Open in IMG/M
3300010322|Ga0134084_10087713All Organisms → cellular organisms → Bacteria → Acidobacteria976Open in IMG/M
3300010329|Ga0134111_10023184All Organisms → cellular organisms → Bacteria → Acidobacteria2110Open in IMG/M
3300010333|Ga0134080_10071947All Organisms → cellular organisms → Bacteria → Acidobacteria1388Open in IMG/M
3300011271|Ga0137393_10068357All Organisms → cellular organisms → Bacteria → Acidobacteria2826Open in IMG/M
3300011271|Ga0137393_11283853All Organisms → cellular organisms → Bacteria → Acidobacteria620Open in IMG/M
3300012198|Ga0137364_10004261All Organisms → cellular organisms → Bacteria7518Open in IMG/M
3300012199|Ga0137383_10469199All Organisms → cellular organisms → Bacteria → Acidobacteria922Open in IMG/M
3300012200|Ga0137382_10249328All Organisms → cellular organisms → Bacteria → Acidobacteria1230Open in IMG/M
3300012208|Ga0137376_10154481All Organisms → cellular organisms → Bacteria → Acidobacteria1975Open in IMG/M
3300012285|Ga0137370_10119611All Organisms → cellular organisms → Bacteria → Acidobacteria1501Open in IMG/M
3300012917|Ga0137395_10019864All Organisms → cellular organisms → Bacteria → Acidobacteria3884Open in IMG/M
3300012918|Ga0137396_10056148All Organisms → cellular organisms → Bacteria → Acidobacteria2714Open in IMG/M
3300012922|Ga0137394_11071285All Organisms → cellular organisms → Bacteria → Acidobacteria668Open in IMG/M
3300012924|Ga0137413_10204030All Organisms → cellular organisms → Bacteria → Acidobacteria1331Open in IMG/M
3300012927|Ga0137416_10212068All Organisms → cellular organisms → Bacteria → Acidobacteria1561Open in IMG/M
3300012975|Ga0134110_10098920All Organisms → cellular organisms → Bacteria → Acidobacteria1177Open in IMG/M
3300014166|Ga0134079_10085207All Organisms → cellular organisms → Bacteria → Acidobacteria1180Open in IMG/M
3300014166|Ga0134079_10141743All Organisms → cellular organisms → Bacteria → Acidobacteria961Open in IMG/M
3300017657|Ga0134074_1143297All Organisms → cellular organisms → Bacteria → Acidobacteria834Open in IMG/M
3300018431|Ga0066655_10189963All Organisms → cellular organisms → Bacteria → Acidobacteria1260Open in IMG/M
3300018433|Ga0066667_10103085All Organisms → cellular organisms → Bacteria → Acidobacteria1904Open in IMG/M
3300018433|Ga0066667_10141607All Organisms → cellular organisms → Bacteria → Acidobacteria1681Open in IMG/M
3300018482|Ga0066669_10007277All Organisms → cellular organisms → Bacteria5331Open in IMG/M
3300020022|Ga0193733_1026360All Organisms → cellular organisms → Bacteria → Acidobacteria1644Open in IMG/M
3300020199|Ga0179592_10012140All Organisms → cellular organisms → Bacteria → Acidobacteria3741Open in IMG/M
3300020579|Ga0210407_10007398All Organisms → cellular organisms → Bacteria → Acidobacteria8291Open in IMG/M
3300020579|Ga0210407_10058929All Organisms → cellular organisms → Bacteria → Acidobacteria2874Open in IMG/M
3300020579|Ga0210407_11033903All Organisms → cellular organisms → Bacteria → Acidobacteria625Open in IMG/M
3300020579|Ga0210407_11205620All Organisms → cellular organisms → Bacteria → Acidobacteria569Open in IMG/M
3300020580|Ga0210403_10368983All Organisms → cellular organisms → Bacteria → Proteobacteria1174Open in IMG/M
3300020581|Ga0210399_10021399All Organisms → cellular organisms → Bacteria5121Open in IMG/M
3300020581|Ga0210399_10111482All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_40CM_4_58_42243Open in IMG/M
3300021086|Ga0179596_10383088All Organisms → cellular organisms → Bacteria → Acidobacteria708Open in IMG/M
3300021088|Ga0210404_10007176All Organisms → cellular organisms → Bacteria → Acidobacteria4503Open in IMG/M
3300021088|Ga0210404_10392685All Organisms → cellular organisms → Bacteria → Acidobacteria775Open in IMG/M
3300021171|Ga0210405_10004000All Organisms → cellular organisms → Bacteria → Acidobacteria14056Open in IMG/M
3300021178|Ga0210408_10970540All Organisms → cellular organisms → Bacteria → Acidobacteria659Open in IMG/M
3300021478|Ga0210402_10112085All Organisms → cellular organisms → Bacteria → Acidobacteria2459Open in IMG/M
3300021478|Ga0210402_10160964All Organisms → cellular organisms → Bacteria → Acidobacteria2048Open in IMG/M
3300021478|Ga0210402_10427015All Organisms → cellular organisms → Bacteria → Acidobacteria1231Open in IMG/M
3300021478|Ga0210402_10827915All Organisms → cellular organisms → Bacteria → Acidobacteria851Open in IMG/M
3300021559|Ga0210409_10008776All Organisms → cellular organisms → Bacteria10261Open in IMG/M
3300025939|Ga0207665_10000086All Organisms → cellular organisms → Bacteria → Acidobacteria60660Open in IMG/M
3300026277|Ga0209350_1000431All Organisms → cellular organisms → Bacteria → Acidobacteria24464Open in IMG/M
3300026295|Ga0209234_1005578All Organisms → cellular organisms → Bacteria4762Open in IMG/M
3300026300|Ga0209027_1023005All Organisms → cellular organisms → Bacteria → Acidobacteria2361Open in IMG/M
3300026301|Ga0209238_1002472All Organisms → cellular organisms → Bacteria7294Open in IMG/M
3300026309|Ga0209055_1014693All Organisms → cellular organisms → Bacteria3945Open in IMG/M
3300026309|Ga0209055_1067855All Organisms → cellular organisms → Bacteria → Acidobacteria1495Open in IMG/M
3300026316|Ga0209155_1022829All Organisms → cellular organisms → Bacteria → Acidobacteria2601Open in IMG/M
3300026334|Ga0209377_1061427All Organisms → cellular organisms → Bacteria → Acidobacteria1644Open in IMG/M
3300026342|Ga0209057_1002986All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium12710Open in IMG/M
3300026343|Ga0209159_1151903All Organisms → cellular organisms → Bacteria → Acidobacteria903Open in IMG/M
3300026515|Ga0257158_1034160All Organisms → cellular organisms → Bacteria → Acidobacteria903Open in IMG/M
3300026537|Ga0209157_1259945All Organisms → cellular organisms → Bacteria → Acidobacteria670Open in IMG/M
3300026538|Ga0209056_10047303All Organisms → cellular organisms → Bacteria3879Open in IMG/M
3300026538|Ga0209056_10071365All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia2979Open in IMG/M
3300026547|Ga0209156_10018943All Organisms → cellular organisms → Bacteria → Acidobacteria4183Open in IMG/M
3300026548|Ga0209161_10011358All Organisms → cellular organisms → Bacteria6688Open in IMG/M
3300026548|Ga0209161_10071162All Organisms → cellular organisms → Bacteria → Acidobacteria2185Open in IMG/M
3300027674|Ga0209118_1001461All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis10958Open in IMG/M
3300027748|Ga0209689_1004720All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium9846Open in IMG/M
3300027908|Ga0209006_10056397All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3522Open in IMG/M
3300028536|Ga0137415_10052842All Organisms → cellular organisms → Bacteria → Acidobacteria3934Open in IMG/M
3300031231|Ga0170824_105940207All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1860Open in IMG/M
3300032180|Ga0307471_100000617All Organisms → cellular organisms → Bacteria18295Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil37.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil17.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil5.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.00%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001471Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2EnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002909Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cmEnvironmentalOpen in IMG/M
3300002915Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12712J15308_1005239513300001471Forest SoilMAIPTGTIPPPQAKVAVRGPTMSDALAGPVPNRSKPLRKPTHAEATSKLQSGLVGFVIGVLVGGALLYFGFKILGGAALLFFCGGSLLFLIGRKDEVTDCPFCGAAIHNLPQPDANGVPKPVQCRKCWEYAGLQKGFVSPYNPNAVEERPTFRSPLAQSVIWPNGCVQCGATPTRFDETGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVELKTEI
JGI12635J15846_1001240623300001593Forest SoilMPAPTGTIVPPQNKAAGNVPAMSDAVAPPVPSRSVPLRKPTSSEVASKLTTGLGILAFGVFLGGPLLYFGWIVAGIAALFFFGGGGLLIVFGSKDEVASCPFCGAALDNLPKPNADGVPRPVQCKKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQTVVWPRGCVQCGAEPTRFDEVGTFNVNKGLLVVGAVRVKTFKLQGVPYCSAHKKAVEMQTGIGNKLFLDWRSLAMMRRYLAANRGRFAE*
JGIcombinedJ26739_10056673213300002245Forest SoilMAIPTGTIPPPQAKVAVRGPTMSDALAGPVPNRSKPLRKPTHAEATSKLQSGLVGFVIGVLVGGALLYFGFKILGGAALLFFCGGSLLFLIGRKDEVTDCPFCGAAIHNLPQPDANGVPKPVQCRKCWEYAGLQKGFVSPYNPNAVEERPTFRSPLAQSVIWPNGCVQCGATPTRFDETGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVELK
JGI25388J43891_100423153300002909Grasslands SoilMVDVAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQRVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNTHKKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFA
JGI25387J43893_103834413300002915Grasslands SoilMPALTGTMVPPENKLTASVPAMSDAVAAPVPTRSVPLRKPTSAEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKDFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEM
Ga0066672_1006654523300005167SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0066672_1069765113300005167SoilAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANR
Ga0066683_1018786923300005172SoilMPSPPGTTAPPQTKTLESRVPGSVMVDIAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0066673_1000298143300005175SoilMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE*
Ga0066673_1020017113300005175SoilMPALTGTMVPPENKLTTSVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNPDGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0066679_1007315723300005176SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE*
Ga0066690_1002966513300005177SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQRGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0066688_1000408443300005178SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQRGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS*
Ga0066684_1003906143300005179SoilMPALTGTMVPPENKLTTSVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0066671_1042146413300005184SoilMPALTGTMVPPQNKLTASVPAMSDALAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHQKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVES
Ga0066675_1010634123300005187SoilMPALTGTMVPPENKLTTSVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNPDGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS*
Ga0066686_1051490613300005446SoilVPGSVMVDIAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0066682_1002854923300005450SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVSGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE*
Ga0066682_1004246313300005450SoilMPALTGTMVPPQNKLTASVPAMSDAVATPVPTRSVPLRKPTSSEVASKIVSGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS*
Ga0066697_10001760133300005540SoilMPALTGTMVPPQNKLTASVPAMSDAVATPVPTRSVPLRKPTSSEVASKIVSGLGVFALAVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS*
Ga0066661_1000997463300005554SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQRGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEM
Ga0066692_1010248813300005555SoilMAIPTGTIAPPQEKVALRGPAMSEAVAATVPNRSKPLRKPTHAETTSKLKSGVGGLVFGALVGGALLYFGFKVLGGAALLFFCGGSLLILFGSKDEVSDCPFCGAPLHNLPKPDTNGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQSVVWPNGCVQCGAPPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCSAHKKAVEMSTEIGNKLFLDWRSLAMMRRYMAANRGRFAE*
Ga0066699_1006139423300005561SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQRGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0066703_1004689623300005568SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE*
Ga0066702_1003063943300005575SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0066708_1007416513300005576SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQAVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGWTAPLK*
Ga0066691_1015908413300005586SoilKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE*
Ga0066706_1003231653300005598SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQRVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNTHKKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0066651_1007594713300006031SoilTGTMVPPQNKLTASVPAMSDADAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNPDGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPVAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0066696_1016343313300006032SoilMSDAVATPVPTRSVPLRKPTSSEVASKIVSGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS*
Ga0066665_1019999123300006796SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRACVQCGAEPTRFEEVGTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE*
Ga0066660_1093567113300006800SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTE
Ga0066710_10015107133300009012Grasslands SoilMPALTGTMVPPQNKLTASVPAMSDAVATPVPTRSVPLRKPTSSEVASKIVSGLGVFALGVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQRGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0134067_1000071133300010321Grasslands SoilMSDAVAAPVPTRSVPLRKPTSSEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNPDGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0134084_1008771323300010322Grasslands SoilMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDDVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE*
Ga0134111_1002318433300010329Grasslands SoilPGSVMVDIAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0134080_1007194723300010333Grasslands SoilMVDIAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0137393_1006835723300011271Vadose Zone SoilMSDVPAPPVPSRSVPLRKPTHAETSSKLTSGLIILIFGLVVGGTLLYFGFKVLGGAALVIVGGFGLLMVLSRKDEMSACPFCEAVLDGLPKPNANGVPRRVQCGKCWEYSGLQKGFVSAYDPNAMEEKPTFRAPLAQSVIWPHGCAQCGAEPTRFEEVSTSSLSAGFLVVGTVRVKTFKLSGVPYCEAHKKAVEMNTGLGNTLYLDWRSLAMMRRYLAANRRRFEE*
Ga0137393_1128385313300011271Vadose Zone SoilMSDVAAPPVPSRSVPLRKPTHAEISSNLTSGLIILIFGLVVGGTLLYFGFKVLGGAALVIVGGFGLLMVLMRKDEVSACPFCGEVLDGLPKPNANGVPRRVQCKKCWEYSGLQKGFVSAYNPNALEEKPTFRAPLAQSVIWPHGCVQCGAEPTRFEEVSTSSLSAGFLVVGTIRVKTFKLSSVPYCEAHKKAVEMNTGLG
Ga0137364_1000426173300012198Vadose Zone SoilMSDADAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRTCWEYSGLQKGFVSPYNPNAVEERPTFRSPVAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKNAVEMATEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0137383_1046919913300012199Vadose Zone SoilMSDAVAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGN
Ga0137382_1024932823300012200Vadose Zone SoilMSDADAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPVAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0137376_1015448133300012208Vadose Zone SoilMSDAVAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRTCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKNAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS*
Ga0137370_1011961113300012285Vadose Zone SoilTTAPPQTKTLERRVPGSVMVDVAAPPVPNHSNPLPKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGVALLFFGGGGLATVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFDEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRGRFAE*
Ga0137395_1001986423300012917Vadose Zone SoilMSDAVAAPVPSRSNPLRKPTHAEATSKLKSGLGILIFGLLVGGALFYFGFKVLGGAALLFFGGGGLLIVLGRKDEVSACPFCGAVLDNLPKPNSDNVPRPVQCKKCWEYSGVQKGFVSPYNPNAMEERPTFRSPLAQSVVWPNGCVQCGAEPTRFDEVGTYNVNKGLLVVGAVRVKTFKLQGVPYCDAHKKAVEMNTGLGNKLYLEWRSLAMMRRYLAANRGRFAE*
Ga0137396_1005614833300012918Vadose Zone SoilMTSQPPNKLAAAVPAMSDAVAPPVPSRSNPLRKPTHAETTSKLKSGLGILIFGVVVGGALLYFGFKVIGGAALLFFGGAGLLIVFTRKDEVSACPFCRAVLDTLPKPSSDNAPRPVQCKKCWEYSGVRKGSVSPYNPNAMEERPTFRSPLAQSVIWPRGCVQCGAEPTRFEEVSTSSLNAGFLVVGALRVKTFKMSGVPYCNAHKKAVELSKGTGDKLYLDWRSLGMMRRYLAANRGRFAE*
Ga0137394_1107128513300012922Vadose Zone SoilMSDAVAAPVPSCSNPLRKPTHAEATSKLKSGLGILIFGLLAGGALFYFGFKVLGGAALLFFGGGGLLIVLGRKDEVSACPFCGAVLDNLPKPNSDNVPRPVQCKKCWEYSGVQKGFVSPYSPVEERPTFRSPLAQSVVWPNGCVQCGAEPTRFDEVGTYNVNKGPLVVGAVRVKTFKLQGVPYCNAYKKAQPQLRRHSPPESGAAQPQA
Ga0137413_1020403023300012924Vadose Zone SoilMSDAAAAPVPGRSNPLRKPTHAEATSKLKSGLGILIFGLVVGGALFYFGFKALGGAALLFFGGGGLLIVLGRKDEVSACPFCGAVLDNLPKPNSDNVPRPVQCKKYWEYSGVQKGFASTYNPYAMEERPTFRAAGAERGVAERMRAVRGRADAVDEVGTYNVNKGLLVVGAARVKTFKLQGVPYCNAHKKAVEMNTGIGNKLYLDWRSLATMRTYLAANHGRFVE*
Ga0137416_1021206813300012927Vadose Zone SoilMTSQPPNKLAAAVPAMSDAIAPPVPSRSNPLRKPTHAETTSKLKSGLGILIFGVVVGGALLYFGFKVIGGAALLFFGGAGLLIVFTRKDEVSACPFCGAVLDTLPKPSSDNAPRPVQCKKCWEYSGVRKGSVSPYNPNAMEERPTFRSPLAQSVIWPRGCVQCGAEPTRFEEVSTSSLNAGFLVVGALRVKTFKMSGVPYCNAHKKAVELSKGTGDKLYLDWRSLGMMRRYLAANRGRFAE*
Ga0134110_1009892013300012975Grasslands SoilMSDAVAAPVPTRSVPLRKPTSSEVASKIVSGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS*
Ga0134079_1008520713300014166Grasslands SoilMVPPENKLTASVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNPDGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKA
Ga0134079_1014174323300014166Grasslands SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLPKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFELSGVPYCNAHKKAVEMSTGIGNKLSLDWRSLAMMRRYLAANRGRF
Ga0134074_114329713300017657Grasslands SoilTSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRYFAANRGRFAE
Ga0066655_1018996323300018431Grasslands SoilMPSPPGTTAPPQTKTLESRVPGSVMVDIAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRYLAANRGRFAE
Ga0066667_1010308513300018433Grasslands SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSQYNPNAVEERPTFRSPLAQNVVWPRGRVQCEVEPTRFDKVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0066667_1014160713300018433Grasslands SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEVTGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFDEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRS
Ga0066669_1000727743300018482Grasslands SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRGCVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE
Ga0193733_102636013300020022SoilMPALTGTMVPPQNKLTASVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGIAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNTDGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHQKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0179592_1001214033300020199Vadose Zone SoilMTAPSGTSVPPQNKTSESKAPGSVMADVAAPPVPNRSNPLRKPTHAEAASKLKSGLGILIFGVVVGGALLYFGFKVLGGAALLFFGGAGLLIVFTRKDEVSACPFCGAVLDTLPKPSSDNVPRPVQCKKCWEYSGVQKGSVSPYNPNAMEERPTFRSPLAQSVIWPRGCAQCGAEPTRFEEVSTSSLNAGFLVVGAVRVKTFKLSGVPYCNAHKKAVELSKGIGDKLYLDWRSLGMMRRYLAANRGRFAE
Ga0210407_1000739863300020579SoilMAIPTGTIAPPQAKVALRGPAMSDVVPAPVPNRSKPLRKPTHAEKSSKLQSGFAGMVIGVLVGGALFYFGFKILGGAALFVFCGLGLAIVFGHKDEVSDCPFCGAALPNLPQPDADGAPKPVQCRKCWEYSGLQKGFVSPYNPNAVEEQPTFRSPLAQSVVWPNGCVQCGAQPTRFDEVSTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMMRRYMAANRGRFAE
Ga0210407_1005892933300020579SoilMAVPTGTIAPPPAKTALRGPAMSDAVAAPVPNRSKPLRKPTHAEKTSKLKSGLGGLIVGALIGGPLLYFGFYFWGGAALLLLCGGGLLILFGSKDEVTDCPFCGAPIHHLPQADASGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQSVVWPNGCVQCGAAPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLGMMRRYMAANRGRFAE
Ga0210407_1103390313300020579SoilSEAISAPVPNRSKPLRKPTHAEKTSKLQSGFAGMVIGVLVGGALFYFGFKILGGAALFVFCGLGLAILFGHKDEVSDCPFCGAALPNLPQPDANGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVAEQPTFRSPLAQSVVWPNGCVQCGAPPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWR
Ga0210407_1120562013300020579SoilPVPNRSKPLRKPTHAEKTSKLKSGLGGLIVGAVIGGPLLYFGFYFWGGAVLLFLCGGGLLILFGSKDDVTDCPFCGAAIHNLPQPGANGAPKPVQCRKCWEYSGLQKGFVSPYNPNAIEEQPTFRSPLAQSVVWPNGCVQCGAAPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMK
Ga0210403_1036898323300020580SoilLFRRWRKLRSAVRRCQMRLRAGAEPKQTAAKAHPCRGNQQAAVGLVGFVIGALVGGALLYFGFKILGDAALLFFCGGSLLILFGSKDEVSDCPFCGVAMHNLPQPDANDAPKPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQSVVWPNGCVQCGATPTRFDEAGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMMRRYMAANRGRFAE
Ga0210399_1002139953300020581SoilMAIPTGTIAPPAAKVAPRGPAMSEAISAPVPNRSKPLRKPTHAEKTSKLQSGFAGMVIGVLVGGALFYFGFKILGGAALFVFCGLGLAILFGHKDEVSDCPFCGAALPNLPQPDANGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVAEQPTFRSPLAQSVVWPNGCVQCGAPPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMMRRYMAANRGRFAE
Ga0210399_1011148213300020581SoilTALRGPAMSDAVAAPVPNRSKPLRKPTHAEKTSKLKSGLGGLIVGALIGGPLLYFGFYFWGGAALLLLCGGGLLILFGSKDEVTDCPFCGAPIHHLPQADASGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQSVVWPNGCVQCGAAPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLGMMRRYMAANRGRFAE
Ga0179596_1038308813300021086Vadose Zone SoilTMAVPTGTIAPPPAKVAVRGPAMSDAVAAPVPNRSKPMRKPTHAEATSKLQSGLVGMVIGVLVGGALLYFGFPILGGAALLFFCGGSLLVLFGSKDEVSDCPFCGAALHNLPKADANGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEEQPAFRSPLAQSVVWPNGCVQCGAAPTRFDEVGTYAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMM
Ga0210404_1000717653300021088SoilMPTPTGTIAPPENKVALRGPAMSDAVAAPVPNRSKPLRKPTHAEANSKLRSGLGGLIVGALIGGPLLYFGFYFWGGVAVLLLCGGGLLILSGHKDDVSDCPFCGAALHNLPQPDPNGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEEQPTFRSPLAQSVVWPNGCAQCGASPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMMRRYIAANRSRFVE
Ga0210404_1039268513300021088SoilMAVPTRTTAPSPIKVAGLRPAMSDVAAPPVPSRSVPLRKPTHAETTSKLRVGLIFLLFGLAVGGALLYFGFNVLGGAALVIFGGFGLLIVLMRKDEVSACPFCGAAMDGLPKPNASGLPRRVQCGKCWEYSGLQKGFVSPYNPNALEEKPTFRAPLAQSVIWPHGCVQCGAEPTRFEEVSTSSLSAGFLVVGTVRVKTFKLSGVPYCEAHKKAVEMNTGLGNTLYLDWRSLGMMRRYLAA
Ga0210405_10004000133300021171SoilMAIPTGTTAPPPIRTAGPRPGMSDVTAPPVPSRSKPVRKPTHAEATSKLTSGLFGLIFGALVGGALLYFGFKIVGGGVLLLFCGIGLAMIFSRNAEVSDCPFCGAALTQLRQPDANGAPRPVQCRKCWEYSGLQKGFVSPYNPNAVEDEPTFRSPLAQDVVWPHGCVQCGAEPARFDEVSTFSVSKGLLVVGALRVSTFKLRGVPYCNVHKDAVAMKTEVGNKLFLDWRSLGMMRRYLAANRGRFVE
Ga0210408_1097054013300021178SoilKPTHAEASSKLTSGLIILIFGLVVGGTLLYFGFKVLGGAALVIIGGFGLLMVLMRKDEVSACPFCGEILDGLPKPNANGLPRRVQCGKCWEYSGLQKGFVSAYDPNAMEEKPTFRAPLAQSVVWPHGCVQCGAVPTRFEEVSTSSLSAGFLVVGTVRVKTFKLSGVPYCEAHKKAVEMNTGLGNTLYLDWRSLAMMRRYLAANRGRFAE
Ga0210402_1011208533300021478SoilMAIPTGTIAPPPTKVALRGPAMSDAVAAPVPNRSKPLRKPTHAEKTSKLRSGLGGFVAGAVIGGPLLYFGFYFWGGAVLLFLCGGGLLILFGSKDDVTDCPFCGASIHNLPQPGANGAPKPVQCRKCWEYSGLQKEFVSPYDPNAVEEQPTFRSPLAQSVVWPNGCVQCGAPPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEM
Ga0210402_1016096443300021478SoilMAIPPGTIAPPQAKVAVRGPAMSDAVAGPVPNRSKPLRKPTHAEATSKLQSGLVGFVIGVLVGGALLYFGFKILGGAALLFFCGGSLLILVGRKDEVSDCPFCGAAIHNLPRPDANGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQSVVWPNGCVQCGATPTRFGEAGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKT
Ga0210402_1042701523300021478SoilMAIPTGTIAPPPAKVALRGPAMSEAVAAPVPNRSKPLRKPTHAEKTSKLQSGFAGMVIGVLVGGALFYFGFKILGGAALFVFCGLGLAILFGHKDEVSDCPFCGAALPNLPQPDANGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVAEQPTFRSPLAQSVVWPNGCVQCGAPPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMMRRYIAANRGRFAE
Ga0210402_1082791513300021478SoilMAVPNGMIVPPPGKVVRRDPAMSDIAASPVPSRSKPVRKPTHAEATSKLTSGVGGLIFGVFVGGALLYFGFKIVGGGVLLLFGGIGLAMIFSRNAEVSDCPFCGAALTQLRQPDANGNPRPVQCRKCWEYSGLQKGFVSPYNPNAVEDQPTFRSPLAQGVVWPRGCVQCGVEPTRFDEVSTFNVSKGLLVVGAVRVKSFKLGGVPYCNAHKDAVELKTEVGNKLFLDWRSLGMMRRYMAANRGRFAE
Ga0210409_1000877673300021559SoilMAIPTGTIAPPQAKVALRGPAMSDVVPAPVPNRSKPLRKPTHAEKSSKLQSGFAGMVIGVLVGGALFYFGFKILGGAALFVFCGLGLAIVFGHKDEVSDCPFCGAALPNLPQPDANGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEEQPTFRSPLAQSVVWPNGCVQCGAQPTRFDEVSTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMMRRYMAANRGRFAE
Ga0207665_10000086473300025939Corn, Switchgrass And Miscanthus RhizosphereMPTPTGTTAPTQTKTLESRVPGSVMVEVAPPPVPSRSNPLRKPTHAEAKSKLKTGMGILIFGLVVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPVAQTVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRGRF
Ga0209350_1000431173300026277Grasslands SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQRVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNTHKKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFAE
Ga0209234_100557823300026295Grasslands SoilMPALTGTMVPPQNKLTASVPAMSDALAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHQKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0209027_102300523300026300Grasslands SoilMPALTGTMVPPQNKLTASVPAMSDAVAAPVPTRSVPLRKPTSAEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKDFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0209238_100247213300026301Grasslands SoilVPLRKPTSAEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQKDFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0209055_101469343300026309SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFAE
Ga0209055_106785523300026309SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQRGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0209155_102282933300026316SoilMPALTGTMVPPENKLTTSVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNPDGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0209377_106142713300026334SoilPTHAEATSKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRACVQCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFAE
Ga0209057_100298633300026342SoilMPALTGTMVPPQNKLTASVPAMSDAVATPVPTRSVPLRKPTSSEVASKIVSGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS
Ga0209159_115190313300026343SoilPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRYLAANRGRFAE
Ga0257158_103416013300026515SoilMAIPTGTIAPPPVKVALRGPAMSDAVAAPVPNRSKPLRKPTHAEKTSKLQSGFAGMVIGVLVGGALFYFGFKILGGAALFVFCGLGLALLFGHKDEVSDCPFCGTPLPNLPQPDANGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEEQPTFRSPLAQSVVWPNGCVQCGAPPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGN
Ga0209157_125994513300026537SoilRVPGSVMVDIAAPPVPNRSNPLRKPTHAEATSKLKTGMGILIFGLAVGGALLYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDSLPKPNSDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAEEERPTFRAPLAQTVMWPRGCVQCGAEPTQFDEVSTSNLSAGLLVIGTVRVKTFKLSGVPYCNAHKKAVEMNTGIANKLYLDWRSLAMMRRY
Ga0209056_1004730333300026538SoilMPALTGTMVPPQNKLTASVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKNAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS
Ga0209056_1007136533300026538SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVLDNLPKPNPDNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRACVQCGAEPTRFEEVGTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHNKAVEMNTGIGNKLYLDWRSLAMMRRYLAANRGRFAE
Ga0209156_1001894353300026547SoilMPALTGTMVPPENKLTTSVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIGAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSECPFCGAALDNLPKPNPDGVPRPVQCRKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQNVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS
Ga0209161_1001135853300026548SoilMPALTGTMVPPQNKLTASVPAMSDAVAAPVPTRSVPLRKPTSSEVASKIVAGLGVFALGVVLGGPLLYFGWTVAGTAALIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQCGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMKTEIGNKLFLDWRSLAMMRRYMAANRGQFVESSSSGKS
Ga0209161_1007116223300026548SoilMPTPTGTTAPPQTKTLESRVPGSVMVDVAAPPVPNRSNPLRKPTHAEATGKLKTGMGILIFGLVVGGALFYFGFKILGGAALLFFGGGGLAIVLGRKDEVAACPFCGAVVDNLPKPNADNVPRPVQCKKCWEYSGLQKGSVSPYNPNAMEERPTFRAPLAQTVVWPRACVLCGAEPTRFEEVSTSNLSAGFLVVGTVRVKTFKLSGVPYCNAHKKAVEMSTGIGNKLYLDWRSLAMMRRYLAANRRRFAE
Ga0209118_100146183300027674Forest SoilMPAPTGTIVPPQNKAAGNVPAMSDAVAPPVPSRSVPLRKPTSSEVASKLTTGLGILAFGVFLGGPLLYFGWIVAGIAALFFFGGGGLLIVFGSKDEVASCPFCGAALDNLPKPNADGVPRPVQCKKCWEYSGLQKGFVSPYNPNAVEERPTFRSPLAQTVVWPRGCVQCGAEPTRFDEVGTFNVNKGLLVVGAVRVKTFKLQGVPYCSAHKKAVEMQTGIGNKLFLDWRSLAMMRRYLAANRGRFAE
Ga0209689_100472033300027748SoilMPGFNGNDGPPQNKLTASVPAMSDTVAAPVPSRSVPLRKPTSSEVASKIVAGLGVLALAVVLGGPLLYFGWTVAGIAGLIFFGGGGLLVMLFSRDEVSACPFCGAALDNLPKPNADGVPRPVQCRKCWEYSGLQRGFVSPYNPNAVEERPTFRSPLAQSVVWPRGCVQRGVEPTRFDEVGTFSVNKGLLVVGAVRVKTFKLPGVPYCAAHKKAVEMTTEIGNKLFLDWRSLAMMRRYMAANRGRFVESSSSGKS
Ga0209006_1005639733300027908Forest SoilMAIPTGTIPPPQAKVAVRGPTMSDALAGPVPNRSKPLRKPTHAEATSKLQSGLVGFVIGVLVGGALLYFGFKILGGAALLFFCGGSLLFLIGRKDEVTDCPFCGAAIHNLPQPDANGVPKPVQCRKCWEYAGLQKGFVSPYNPNAVEERPTFRSPLAQSVIWPNGCVQCGATPTRFDETGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVELKTEIGNKLFLDWRSLPMMRRYMAANRGR
Ga0137415_1005284233300028536Vadose Zone SoilMSDAVAPPVPSRSNPLRKPTHAETTSKLKSGLGILIFGVVVGGALLYFGFKVIGGAALLFFGGAGLLIVFTRKDEVSACPFCGAVLDTLPKPSSDNAPRPVQCKKCWEYSGVRKGSVSPYNPNAMEERPTFRSPLAQSLIWPRGCVQCGAETTRFEEVSTSSLNAGFLVVGALRVKTFKMSGVPYCNAHKKAVELSKGTGDKLYLDWRSLGMMRRYLAANRGRFAE
Ga0170824_10594020733300031231Forest SoilMAIPTGTIAPPQAKVALRGPAMSDTVAPPVPNRSKPLRKPTHAEKTGKLKSGLGGVVVGAVIGGPLLYFGFYFWGGAALLLLCGGGLLILFGSKDDVTDCPFCGAAIHNLPQPNANGAPKPVQCRKCWEYSGLQKGFVSPYNPNAVEEQPTFQSPLAQSVVWPNGCVQCGAPPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLYLDWRSLPMMRRYMAANRGRFAE
Ga0307471_100000617123300032180Hardwood Forest SoilMPTPTDTIAPPQNKVALRGPAMSDAVAAPVPNRSKPLRKPTHAEANSKLKSGLGGLIVGALIGGPLLYFGFYFWGSVAVLLLCGGSLLILSGHKDEVSDCPFCGAALHNLPQPDPNGVPKPVQCRKCWEYSGLQKGFVSPYNPNAVEEQPTFRSPLAQSVVWPNGCVQCGAAPTRFDEVGTFAVNKGLLVVGAVRVKTFKLGGVPYCDAHKKAVEMKTEIGNKLFLDWRSLPMMRRYIAANRSRFVE


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.