NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F100767

Metagenome / Metatranscriptome Family F100767

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F100767
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 198 residues
Representative Sequence ASEVSRALVSALPTAARDRVHVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKIAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTARAFTDTVFHNLSFRDLILLSGQLNEREAGDSAAKSRPRVRARTRRPS
Number of Associated Samples 86
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 2.38 %
% of genes near scaffold ends (potentially truncated) 39.22 %
% of genes from short scaffolds (< 2000 bps) 34.31 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score0.58

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (58.824 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(23.529 % of family members)
Environment Ontology (ENVO) Unclassified
(41.176 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(48.039 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 35.22%    β-sheet: 21.74%    Coil/Unstructured: 43.04%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.58
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF03576Peptidase_S58 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG3191L-aminopeptidase/D-esteraseAmino acid transport and metabolism [E] 1.96


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A58.82 %
All OrganismsrootAll Organisms41.18 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10151882All Organisms → cellular organisms → Bacteria622Open in IMG/M
3300005093|Ga0062594_101319655All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae → Zavarzinella → Zavarzinella formosa725Open in IMG/M
3300005172|Ga0066683_10363939All Organisms → cellular organisms → Bacteria897Open in IMG/M
3300005440|Ga0070705_100329619All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1105Open in IMG/M
3300005468|Ga0070707_100550209All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1116Open in IMG/M
3300005518|Ga0070699_100647800All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes964Open in IMG/M
3300005586|Ga0066691_10029454All Organisms → cellular organisms → Bacteria2822Open in IMG/M
3300005586|Ga0066691_10258009All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1024Open in IMG/M
3300006914|Ga0075436_101212443All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300009012|Ga0066710_100172884All Organisms → cellular organisms → Bacteria3040Open in IMG/M
3300009012|Ga0066710_103921313All Organisms → cellular organisms → Bacteria557Open in IMG/M
3300009837|Ga0105058_1111361All Organisms → cellular organisms → Bacteria648Open in IMG/M
3300010320|Ga0134109_10316598All Organisms → cellular organisms → Bacteria605Open in IMG/M
3300010323|Ga0134086_10473508All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300010323|Ga0134086_10479605All Organisms → cellular organisms → Bacteria512Open in IMG/M
3300010329|Ga0134111_10406375All Organisms → cellular organisms → Bacteria584Open in IMG/M
3300010333|Ga0134080_10515705All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300010362|Ga0126377_11411187All Organisms → cellular organisms → Bacteria769Open in IMG/M
3300010364|Ga0134066_10005878All Organisms → cellular organisms → Bacteria2286Open in IMG/M
3300011271|Ga0137393_11046806All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → Streptomyces somaliensis694Open in IMG/M
3300011422|Ga0137425_1178654All Organisms → cellular organisms → Bacteria531Open in IMG/M
3300012202|Ga0137363_11796143All Organisms → cellular organisms → Bacteria506Open in IMG/M
3300012207|Ga0137381_10727784All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300012207|Ga0137381_11356824All Organisms → cellular organisms → Bacteria603Open in IMG/M
3300012211|Ga0137377_10101490All Organisms → cellular organisms → Bacteria2726Open in IMG/M
3300012351|Ga0137386_10432200All Organisms → cellular organisms → Bacteria948Open in IMG/M
3300012685|Ga0137397_11042711All Organisms → cellular organisms → Bacteria599Open in IMG/M
3300012918|Ga0137396_11060925All Organisms → cellular organisms → Bacteria582Open in IMG/M
3300012918|Ga0137396_11093462All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300014154|Ga0134075_10030890All Organisms → cellular organisms → Bacteria2176Open in IMG/M
3300014166|Ga0134079_10000683All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes8977Open in IMG/M
3300025922|Ga0207646_10329321All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1380Open in IMG/M
3300026295|Ga0209234_1125337All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes941Open in IMG/M
3300026296|Ga0209235_1193758All Organisms → cellular organisms → Bacteria725Open in IMG/M
3300026298|Ga0209236_1238610All Organisms → cellular organisms → Bacteria610Open in IMG/M
3300026530|Ga0209807_1265179All Organisms → cellular organisms → Bacteria580Open in IMG/M
3300027748|Ga0209689_1188545All Organisms → cellular organisms → Bacteria922Open in IMG/M
3300027846|Ga0209180_10129359All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1450Open in IMG/M
3300027909|Ga0209382_11420769All Organisms → cellular organisms → Bacteria696Open in IMG/M
3300031421|Ga0308194_10000915All Organisms → cellular organisms → Bacteria3405Open in IMG/M
3300031720|Ga0307469_11347113All Organisms → cellular organisms → Bacteria679Open in IMG/M
3300032180|Ga0307471_101068183All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes974Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil23.53%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil16.67%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil14.71%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil9.80%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.82%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil7.84%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere6.86%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.94%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.98%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.98%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.98%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.98%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.98%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.98%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand0.98%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.98%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009837Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011422Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT640_2EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019255Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2EnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026312Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027681Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027821Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028718Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194EnvironmentalOpen in IMG/M
3300028796Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1015188213300002558Grasslands SoilMMEKDRTKRFQTAAEVSRALVGALPTAARDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALFAYQPAGPEDSTLLLLTRRRTVVVTPHQVRSYARDSVRRDMDLEIHGGLSFRLVIYVPGASPGGRELGDTVYRSLSFRDMMQLRPHLNASVEGSTAKRRP
Ga0062594_10131965513300005093SoilRPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRIRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVDLDFRGGLAFRLTIYGKQSSGPADTVFRNLSFRDMVTLAPRLNDLEKDSSGRRVRVRTRAP*
Ga0066683_1036393923300005172SoilPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPDSLAQRLRSRRALVSGDVVEYVYQPAGPEDTTLLLLTRRRLAVVTPRQVRSYLRDSIRTDFDLDFRGGLAFRLAIYDKRPSQLADTVFRNLSFRDMVTLAPRLNDLNRDAAGRRIRVRSRAGRT*
Ga0066690_1100766613300005177SoilDVSEEMAVVLERMLDKDREKRFQTAGDVSRALVDALPTAARNKVRVPFRRRVTSMFYKSLLGLSVAGCLLSIAFVAGAAVVAYTVFSKAPRVAIRPPVPDSLTRAMRARRALAPDDVALYAYQPAGQEDTTLLLLTRRRTVVVTPHGVRSYSRDSVRTKLGFDVRGGLVFRLVI
Ga0070703_1054306013300005406Corn, Switchgrass And Miscanthus RhizosphereQITEAPEPIRRHRPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRIRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVD
Ga0070705_10032961923300005440Corn, Switchgrass And Miscanthus RhizosphereQITEAPEPIRRHRPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRIRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVDLDFRGGLAFRLTIYGKQSSGPADTVFRNLSFRDMVTLAPRLNDLEKDSSGRRVRVRTRAP*
Ga0070694_10071363413300005444Corn, Switchgrass And Miscanthus RhizosphereVAYTVFSKAPKVAIRPPVPDSLTRVMRARRALNPGDVALFAYQPAGQEDTTLLLLTRRRTVVVTPHSVRSYARDSVRTKMGFDIRGGLVFRLVILGTNTRELTDTVFHNLSFRDMIQLGGQINEREAGDSAKNRPRVRSRTRRTS*
Ga0070694_10124500213300005444Corn, Switchgrass And Miscanthus RhizosphereASILAQQITEAPEPIRRHRPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRIRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVDLDFRGGLAFRLTIYGKQSSGPADTVFR
Ga0066681_1092324913300005451SoilEEMAVVLERMMEKTRNKRYQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPVSLATMLRARRALLPGDVAMFAYQPAGPEDSTLLLLTRRRTVVVTPHQVRSYARDSVRRDMDLEIHGGLSFRLVIYVP
Ga0070707_10055020923300005468Corn, Switchgrass And Miscanthus RhizosphereLGVVLYHMLAGWPPFQGPDSASILAQQITEAPEPIRRHRPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRIRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVDLDFRGGLAFRLTIYGKQSSGPADTVFRNLSFRDMVTLAPRLNDLEKDSSGRRVRVRTRAP*
Ga0070698_10190889213300005471Corn, Switchgrass And Miscanthus RhizospherePFDGDSSASILAKQITQAPSPIRRSRSDVPEELAFVLERMLQKNPARRLQTAAELSRALVDALPTAARDRVRVPLRRRLRAMAFKSLIGLGVAGCLAFVAFVAGAAVVAYTVFSKKPKIIATEPIPDSLARALRERRALAAGEIAEYAFQPGGQEDTTLLLVTRRRMVIVTPGQVRSYARDS
Ga0070699_10064780023300005518Corn, Switchgrass And Miscanthus RhizosphereRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRIRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVDLDFRGGLAFRLTIYGKQSSGPADTVFRNLSFRDMVTLAPRLNDLEKDSSGRRVRVRTRAP*
Ga0070704_10229960913300005549Corn, Switchgrass And Miscanthus RhizosphereRQLTEVPEPIRRLRADVPDEMAVVLDRMLDKKRNKRFQMASEVSRALVGALPTAARDRVHIPLRRRIKTVFYRSLVGLGVAVLLLSIAAAVGAGLVAYYVFSKPPRVSARLPLPDSLTRGLRARRALLPGDVGLFAYRPSGQEDTTLLLLTRRRTVVVTTHDVRSYA
Ga0066707_1067782423300005556SoilRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDNVARMLRARGALLPGDVALSAYQPAGPEDSTLLLLTRRRTVVVTPHQVRSYARDSVRTKMGLDVHGGLAFRLVIYGTRSKELADTVFRNLSFRDMVQLDKQLNRRDAADSVRPASAAPRRVPAKKPRKGSPRPRVRHRPG*
Ga0066691_1002945413300005586SoilMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYKSLVGIGVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPDSLAQRLRSRRALVSGDVVEYVYQPAGPEDTTLLLLTRRRLAVVTPRQVRSYLRDSIRTDFDLDFRGGLAFRLAIYDKRPSQLADTVFRNLSFRDMVTLAPRLNDLNRDAAGRRILVRNRAGRT*
Ga0066691_1025800913300005586SoilQAPVPIRRHRPDVPEEMAVVLERMMEKSRSKRFQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALSAYQPAGPEDSTLLLLTRRRTVVVTPHQMRSYARDSVRTQMGLDVHGGLAFRLVIYGTRSKELADTVFRNLSFRDMVQLDKQLNRRDAEDSVRPASDAPRPARRVPAKKPRKGSPRLRARHRPA*
Ga0066706_1050132213300005598SoilLPTAARDRVHVPFRRRLTAMFYKSLLGLSVAGCLLFIAFVAGAAVVAYTVFSKAPEVAIRPPVPDSLTRALRARRALNPGDVALYAYQPAGQEDTTLLLLTRRRTVVITPHGVRSYARDSVHTNMGFDVRGGLTFRLVIRGTESRELADTVFRSLSFRDMILLSGQLNERDAMDRARKGRPRVRSRTRRSL*
Ga0066659_1032934023300006797SoilMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALSAYQPAGPEDSTLLLLTRRRTVVVTPHQMRSYARDSVRTQMGLDVHGGLAFRLVIYGTRSKELADTVFRNLSFRDMVQLDKQLNRRDAEDSVRPASDAPRPARRVPAKKPRKGSPRPRVRRRPG*
Ga0066659_1125920313300006797SoilAGDVSRALVDALPTAARNKVRVPFRRRVTSMFYKSLLGLSVAGCLLSIAFVAGAAVVAYTVFSKAPRVAIRPPVPDSLTRAMRARRALAPDDVALYAYQPAGQEDTTLLLLTRRRTVVVTPHGVRSYSRDSVRTKLGFDVRGGLVFRLVILGKDTQAHTDTVFRSLSFRDMIQLGGQLNEREAGDSAAKHRPRVRTRTRRPS*
Ga0066660_1058307413300006800SoilVSRALVGALPTAASDRVRVPLSRRLRLMFYRSLVGLTVAGCLLSIAFAGGAAAVAYYVFSKPPRIAALAPLPESLVRPLRARGALASGDAALSAYEPAGQEDSTLLLLTRRRTVVVTPHEVRSYPRDSTRTQMGWDIHGGLAFRLVIYGTRAKQLADTVFRSLSFRDMIQLRKQLNRRDGEDSLRAASGGSPSPRRVPATKPRKGSPRPRARRRPG*
Ga0066660_1082663113300006800SoilGCLLSIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPDDVALYAYQPAGQEDTTLLLLTRRRTVVVTPHGVRSYSRDSVRTKLGFDVRGGLVFRLVILGKDTQAHTDTVFRSLSFRDMIQLGGQLNEREAGDSAAKHRPRVRTRTRRPS*
Ga0075421_10087852713300006845Populus RhizosphereGALPTAARDRVRVPFRRRLRMMFYKSLVGLGVAGCLLFIAFVAGAAVVAYTVFSKAPRIAVERPIPESLAQRLRNRRALASGDVVEYVYQPAGQEDTTILLVTQRRLAVVTPRQVRSYARDSIRADWDFDFRGGLTFRLAILEKRSNGTVDTVFRHLSFRDMVTLAPRLNAIEQDAAGQRVRVRSRTRPT*
Ga0075431_10062909813300006847Populus RhizosphereGALPTAARDRVRVPFRRRLRMMFYKSLVGLGVAGCLLFIAFVAGAAVVAYTVFSKAPRIAVERPIPESLAQRLRNRRALASGDVVEYVYQPAGQEDTTILLVTQRRLAVVTPRQVRSYARDSIRADWDFDFRGGLTFRLAILEKRSNGTVDTVFRHLSFRDMVTLAPRLNAIEQDAAGQRVRVRSRARPT*
Ga0075425_10226421413300006854Populus RhizosphereAGSPPFEGLSSASVLAQQIAEPPEPIRRRRPDVPEEMAVVLERMMEKNRSKRFQMASDVSRALVGVMPTAARDRVVVPLRRRLRLMFYKSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRVAARAPLPDSLTLMLRTRHALLPGDIARYAYRPAGQEDTTLLLLTRRRTVVVTPNQVRSYARDSVQRDMDLILHGGLAF
Ga0075425_10316758613300006854Populus RhizosphereKRFQMASEVSRALVSVMPTAARDRVVVPLRRRLRLMFYKSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRVAARAPLPDSLTRMLRTRRALLPGDIARYAYRPAGQEDTTLLLLTRGRTVVVTPSQVRSYARDSVRRDIDLILHGGLAFRLVIYGKNSESFADT
Ga0075424_10158410513300006904Populus RhizosphereAVVAYTVFSKAPKIAIRPPVPDSLTRAMRAHRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTNRAFTDTVFHNLSFRDLILLSGQLNEREAGDSAAKGRSRVRARTRRPS*
Ga0075436_10121244313300006914Populus RhizosphereRALVGALPTAASDRVHVPLSRRLRLMFYKSLVGLSVAGCLLFIAFAGGAAVVAYYVFSKPPRVAARAPFPDSLSRMLRARRALLTGDAPVYAYRPAGQEDTTLLLLTRRRTVVVTPTQVRSYARDSVRRDMDLILHGGLAFRLVIYGRHSDVVADTVYRGLSFRDMVQLRPQLNRSQTAATPTRRPSSV
Ga0066710_10017288413300009012Grasslands SoilRAVVGALPTAASERVRVPLRRRLRLMFYKSRVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPDSLAQRLRSRRALVSGDVVEYVYQPAGPEDTTLLLLTRRRLAVVTPRQVRSYLRDSIRTDFDLDFRGGLAFRLAIYDKRPSQLADTVFRNLSFRDMVTLAPRLNDLNRDAAGRRIRVRNRAGRT
Ga0066710_10392131313300009012Grasslands SoilRGKRFQMASEVSRALVGALPTAASDRVHVPLRRRLQVMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPLVAAHAPIPDSLSRMLRARRALLIGDAALYVYRPAGQEDTTLLLLARRRTVVVTPHQVRSYARDSVRRNMDLEIHGGLSFRLVIYVLGTTPGAGELADTVYRSLSFRDMMQL
Ga0099792_1010553733300009143Vadose Zone SoilFYKSLLGLSVAGCLLSVAFVAGAAVVAYTVFSKAPKVAVRPPVPDSLARAMRARRALAPGDIALYAYQPAGQEDTTLLLLTRRRTVVVTPHGTRSYARDSVETKMGFDVRSGLAFRLVIHGKDPSRLADTVFRNLSFRDMILLGGQLNERDELDSAAKNRPRVRARTRRPS*
Ga0105058_111136113300009837Groundwater SandSLVGLSVAGCLLSIAAAGGAALVAYYVFSKPPRVSARSPLPDSLTRGLRARRALFPGDVGLFAYRPSGQEDTTLLLLTRRRTVVVTPHEVRSYARDSVKREMDLVLHGGLAFRLVIYGRRSSELADTVYRNLSFRDMMQIRSQLNREVELPPAARPAPPPSRSAPAVKARPNQRRRARP*
Ga0134109_1020393213300010320Grasslands SoilAPPFEGPSSASILAEQLTQAPVPIRRHRPDVPEEMAVVLERMMEKTRTKRFQMASEVSRALVGALPTAARDQVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPVSLASMLRARRALLPGDIAMFAYQPAGPEDSTLLLLTRRRTVVVSPHQVRSYARDSVRRDMDLILHGGLAFRLVIDGRHSATVAETRGCGRRYPPARGRPAKHVAHKSPAETTIR
Ga0134109_1031659813300010320Grasslands SoilEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPDSLAQRLRSRRALVSGDVVEYVYQPAGPEDTTLLLLTRRRLAVVTPRQLRSYLRDSIRTDFDLDFRGGLAFRLAIYDKRPSQLADTVFRNISFRDMVTLAPRLND
Ga0134086_1047350813300010323Grasslands SoilGALPTAARDRVHLPLSRRIKAMFYRSLVGLSVAGCLLFLAFAGGGAVVAYYVFSKPPRVAAHAPLPDSLTRVLRARRALLPGDIARYAYRPAGQEDTTLLLLTRRRTVVVTPNQVRSYARDSVRRDMDLILHGGLAFRLVIYGKNSAPVADTVYRSLSFRDMMLLRSQLNR
Ga0134086_1047960513300010323Grasslands SoilVSRALVGAMPTAARDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPHIAARAPLPGSLTRMLRARRALLPGDVGLFAYRPTGEEDTTLLLLTRRRTVVVTPGEVRSYARDSVRRDIDLILHGGLAFRLAIYGKHSSGVSDTVYRNLSFRDMMQ
Ga0134111_1019319913300010329Grasslands SoilNALPTAARDRVHVPFRRRLTAMFYKSLLGLSVAGCLLFIAFVAGAAVVAYTVFSKAPEVAIRPPVPDSLTRALRARRALNPGDVALYAYQPAGQEDTTLLLLTRHHTVVVTPHGVRSYARDSVDTRMGFDVRGGLTFRLVIRGTESRELADTVFRSLSFRDMILLSGQLNERDAMDRARKGRPRVRSRTRRSL*
Ga0134111_1040637513300010329Grasslands SoilASEVSRALVGALPTAARDRVHIPLRRRLRSMFYRSLVGLSVAGCLLSIAFAGGAAAVAYYVFSKPPRIAAHAPLSDTLARMLRARGALVAGDVASYAYQPAGQEDSTLLLLTRHRTVVVTPHQVRSYARDSVRRDIDLEIHGGLSFRLMIYAPRIADTVYRSLSFRDMMQLRPELNRTVEAAATKRRAVAPPAA
Ga0134080_1051570513300010333Grasslands SoilRIKRFQRASEVSRALVGALPTAARDRVRIPLRRRLRSMVYRSLVGLSVAGCLLSIAFAGGAAAVAYYVFSKPPRIAAHAPLPDSLARLLRARGALVTGDVASYAYQPAGTEDSTLLLLTRRRTVVVTPHQVRSYARDSVRRHLNLEIHGGLSFRLVIYAARIADTVYRSLSFRDMMQLRPQLNRTEDAI
Ga0134080_1067418613300010333Grasslands SoilEVSRGLVNALPTAARDRVRVPFRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRHRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTYARALTDTVFRNLSFRDL
Ga0134063_1043723613300010335Grasslands SoilVVLERMLEKDRKKRFQMASEVSRGLVNALPTAARDRVRVPFRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRHRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTYARALTDTVFRNLSFRDLILLSGQLNEREAGDSAAKSRPRVRART
Ga0126377_1141118713300010362Tropical Forest SoilSSASILAQQITEAPDPIRRHRPDVPEEMVVVLDRMMEKSRNKRFQMASEVSRALVGALPTAARDRVRVPLGRRLRSMFYKSLLGISLLLIAFAGGAAAVAYYVFSTPPLIAAQRPIPDSLAQRLRSRRALASDDVAEYVYRPGGREDTTLLLLTQRRLAVVTPRQVRSYSRDSIRADFDLDLRGGLTFRLAIRGKGSNGLADTVFRNLSFRDMVTLAPRLSSVEQDAGGRVRVRSRVRPT
Ga0134066_1000587813300010364Grasslands SoilRVRVPLRRRLRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPDSLAQRLRSRRALVSGDVVEYVYQPAGPEDTTLLLLTRRRLAVVTPRQVRSYLRDSIRTDFDLDFRGGLAFRLAIYDKRPSQLADTVFRNLSFRDMVTLAPRLNDLNRDAAGRRIRVRNRAGRI*
Ga0137393_1104680613300011271Vadose Zone SoilAILAQILTETPEPIRQLRPDVPEEMAVVVERMMEKNRGKRFQMASEVSRALVGALPTAASDRVHVPLRRRLQVMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPLVAARAPIPDSLSRMLRARRALLIGDAALYVYRPAGQEDTTLLLLARRHTVVVTPHQVRSYARDSVRRDMDLEIHGGLSFRLVINVLGTTPGAGELADTVYRSLSFRDMMQLRPELNRAGAV
Ga0137425_117865413300011422SoilTAARDRVRIPLRRRIKTVFYRSLVGLGVAVLLLSIAAAFGAGLVAYYVFSKPPRVSARLPLPDSLTRGLRARRALLPGDVGLFAYRPSGQEDTTLLLLTRRRTVVVTSHEVRSYMRDSVQRDMDLVLHGGLAFRLVIYGRRSSAVADTVYRNLSFRDMMEIRSELNRKVAPAPAPA
Ga0137382_1102201313300012200Vadose Zone SoilGTSSASILAKQLTQPPAPIRRERPDVSEEMAVVLERMLEKDRKKRFQMASEVSRGLVNALPTAARDRVRVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKVPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLV
Ga0137363_1179614313300012202Vadose Zone SoilPLRRRLKAMFYRSLVGLSVAGCLLFIAFAGGGAVVAYYVFSKPPQVAARSPFPDSLARMLRDRRALLPGDLGMFAYRPAGQEDTTMLLLTRRRTVVVTPHEVRSYARDSVRRDMDLILHGGLAFRLLIYGRHSATVADTVYRSLSFRDMVQLRTQLNRAPEPAPATRP
Ga0137381_1072778423300012207Vadose Zone SoilIRRERPDVPEEMAVVLERMLEKDRKKRFQMASEVSRALVNALPTAARDRVRVPFRRRVTAMFYKSLLGLSVAGCLLFIAFVAGAAVVAYTVFSKAPEVAIRPPVPDSLTRALRARRALNPGDVALYAYQPAGQEDTTLLLLTRRRTVVITPHGVRSYARDSVHTNMGFDVRGGLTFRLVIRGTESRELADTVFRSLSFRDMILLSGQLNERDAMDRARKGRPRVRSRTRRSL*
Ga0137381_1135682413300012207Vadose Zone SoilMAVVLDRMMEKTRTKRFQMAGEVSRALVGALPTAASDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALSAYQPAGPEDSTLLLLTRRRTVVVTPHQMRSYARDSVRTQMGLDVHGGLAFRLVIYGTRSKELADTVFRNLSFRDMVQLDKQLNRR
Ga0137377_1010149043300012211Vadose Zone SoilASEVSRALVGALPTAARDRVHIPLRRRLRSMFYRSLVGLSVAGCLLSIAFAGGAAAVAYYVFSKPPRIAAHAPLPESLARMLRARGALVAGDVASYAYQPAGQEDSTLLLLTRNRTVVVTPHQVRSYARDSVRRDIDLEIHGGLSFRLMIYAPRIADTVYRSLSFRDMMQLRPELNRTVEAAATKRRAVAAPAARPRPRSTTPSKPPPRTRPRTGTRRRP*
Ga0137370_1091503513300012285Vadose Zone SoilIRRLRPDAPEEMALVLDRMLDKKRANRFQMASEVSRALVGALPTAARDRVHLPLSRRIKAMFYRSLVGLSVAGCLLFLAFAGGGAVVAYYVFSKPPRVAARLPLPDSLTRMLRGRRALLPGDVGLYAYRPAGQEDTTLLLLTRRRTVVVTPHDVRSYARDSVRRDMDLMLHGGLAFRLVI
Ga0137387_1092363423300012349Vadose Zone SoilGAAVVAYTVFSKVPKVAIRPPVPDSLTRALRARRALNPGDVALYAYQPAGQEDTTLLLLTRRRTVVITPHGVRSYARDSVHTNMGFDVRGGLTFRLVIRGTESRELADTVFRSLSFRDMILLSGQLNERDAMDRARKGRPRVRSRTRRSL*
Ga0137387_1125296613300012349Vadose Zone SoilTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKVPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTYARALTDTVFRNLSFRDLILLSGQLNEREAGDSAAKSRPRVRARTRTR
Ga0137386_1043220023300012351Vadose Zone SoilMLQASSAGAHALRSASSASILAQQLTQTPPPIRRERPDVPEEMAVVLERMLEKDRKKRFQMASEVSRALVNALPTAARDRVHVPFRRRLTAMFYKSLLGLSVAGCLLFIAFVAGAAVVAYTVFSKAPEVAIRPPVPDSLTRALRARRALNPGDVALYAYQPAGQEDTTLLLLTRRRTVVITPHGVRSYARDSVHTNMGFDVRGGLTFRLVIRGTESRELADTVFRSLSFRDMILLSGQLNERDAMESAPKGRPRVRVRTRRSR*
Ga0137385_1057003313300012359Vadose Zone SoilLSIAFVAGAAVVAYTVFSKVPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTYARALTDTVFRNLSFRDLILLSGQLNEREAGDSAAKSRPRVRARTRTRRSS*
Ga0137360_1011126313300012361Vadose Zone SoilMLEKDRKKRFQMASEVSRALVNALPTAARDRVRVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKTPKVVIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTTARAFTDTVFRNLSFRDMILLSGQLNEREAGDSAAKSRPRARARTRRPS*
Ga0137373_1080139213300012532Vadose Zone SoilPDVPEEMAVVLERMLEKDPKKRFQMASEVSRGLVNALPTAARDRVRVPFRRRLTAMFYKSLLGLSVAGCLLFIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRALRARRALNSDDVALYAYQPAGQEDTTLLLLTRHRTVVVTPHAVRSYARDSVDTHMGFDVGGGLTFRLVIRGSESRELADTVFRSLSFRDMILLSGQLNERDAMESAPKGRPRVRVRTRRS*
Ga0137397_1104271113300012685Vadose Zone SoilVLERMLDKKRAKRFQMASEVSRALIGALPTAARDRVRIPLRRRIKTVFYRSLVGLSVAGCLLSIAATGGAALVAYYVFSKPPRVAALAPLPISLSRELRARRALLPGDVGLFAYRPAGQEDTTLLLLTRRRTVVVTPHEVRSYARDSVRRDIDLILHSGLAFRLVIYGRHSSGVADTVYRNLSFRDMMQIRSQLNREPP
Ga0137396_1106092513300012918Vadose Zone SoilAARDQVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALFVYQPAGPEDSTLLMLTRRRTVVVTPHQVRSYARDSVRRDMDLEIHGGLSFRLVIYVPGASPGARELGDTVYRSLSFRDMMQLRPQLNASVEGSTANRRPTSVRQAPPTRAP
Ga0137396_1109346213300012918Vadose Zone SoilAARDQVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRTAARAPLPDSLARMLRARRALLPGDVALFVYQPAGPEDSTLLLLTRRRTVVVTPHQVRSYARDSVRRDMDLEIHGGLSFRLVIYVPGASPGGRELGDTVYRSLSFRDMMQLRPQLNASVEGSTAKRRPTSVRQAPP
Ga0137396_1127358513300012918Vadose Zone SoilAAVVAYTVFSKAPKIAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTIVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTARAFTDTVFRNLSFRDMILLSGQLNEREAGDSAAKSRPRVRARTRRPS*
Ga0137404_1024782323300012929Vadose Zone SoilVSRALVNALPTAARDRVRVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFGKTPKVVIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTTARAFTDTVFRNLSFRDMILLSGQLNEREAGDSAAKSRPRARARTRRPS*
Ga0137407_1051730523300012930Vadose Zone SoilMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKTPKVVIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTTARAFTDTVFRNLSFRDMILLSGQLNEREAGDSAAKSRPRARARTRRPS*
Ga0137407_1152756313300012930Vadose Zone SoilSASILAQQIAEPPEPIRRHRPDLPEEMAVVLERMMEKTRNKRYQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARPPLPVSLATMLRARRALLPGDVAMFAYQPAGPEDSTLLLLTRRRTVVVTPHDVRSYARDSVRRDMDLILHGVLAFRLVIYGRHSATFADTVYRSL
Ga0137410_1139651213300012944Vadose Zone SoilFDGPSSANILAQQLTELPEPIRRRRPDVPEEMAVVLERMLDKKRAKRFQMASEVSRALIGALPTAARDRVRIPLRRRIKTVFYRSLVGLSVAGCLLSIAATGGAALVAYYVFSKPPRVAALAPLPISLLRELRARRALLPGDVGLFAYRPAGQEDTTLLLLTRRRTVVVTPHEVRSYARDSVRRDIDLILHSGLAFRLVIYG
Ga0134077_1043117713300012972Grasslands SoilRPDVSEEMAVVLERMLEKDRKKRFQMASEVSRGLVNALPTAARDRVRVPFRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRHRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTYARALSDTVFRN
Ga0134075_1003089033300014154Grasslands SoilMFYRSLVGLSVAGCLLSIAFAGGAAAVAYYVFSKPPRIAAHAPLPDSLARLLRARGALVTGDVASYAYQPAGTEDSTLLLLTRRRTVVVTPHQVRSYARDSVRRHLNLEIHGGLSFRLVIYAPRIADTVYRSLSFRDMMQLRPELNRTVEAATPTRRAVAPPTARPRPRATTPSQPAPRTRPRTGTRRRP*
Ga0134079_1000068313300014166Grasslands SoilQGPSSASILAQQITEAPEPIGSHRPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPDSLAQRLRSRRALVSGDVVEYVYQPAGPEDTTLLLLTRRRLAVVTPRQVRSYLRDSIRTDFDLDFRGGLAFRLAIYDKRPSQLADTVFRNLSFRDMVTLAPRLNDLNRDAAGRRIRVRNRAGRI*
Ga0137420_117635423300015054Vadose Zone SoilARDRVRVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGLEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTARAFTDTVFRNLSFRDMILLSGQLNEREAGDSAAKSRPRVRARTRRPS*
Ga0134069_139318213300017654Grasslands SoilFQMASEISRALVNALPTAARDRVHVPFRRRLTAMFYKSLLGLSVAGCLLFIAFVAGAAVVAYTVFSKAPEVAIRPPVPDSLTRALRARRALNPGDVALYAYQPAGQEDTTLLLLTRRRTVVITPHGVRSYARDSVHTNMGFDVRGGLTFRLVIRGTESRELADTVF
Ga0134083_1022337123300017659Grasslands SoilEVSRGLVNALPTAARDRVRVPFRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRHRTVVLTPHGVRSYSRDSVRTQMGFDLRGGLIFRLVIRGTYARALTDTVFRNLSFRDLILLSGQLNEREAGDSAAKSRPRVRARTRTRRSS
Ga0184618_1025298513300018071Groundwater SedimentTPIRRERPDVPEEMAVVLERMLEKDRKKRFQMASEVSRGLVNALPTAARDRVRVPFRRRLTAMFYKSLLGLSVAGCLLFIAFAAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPGDVPLFAYQPAGQEDTTLLLLTRRRTVVVTPHGMRSYSRDSVRTKLGFDVRGGLVFRLVIREAEGRALADTVFRSLSFRDMILLGGQLKEREAVDSAAKGRPRVRPRTRRPS
Ga0066662_1300761913300018468Grasslands SoilQTAAEVSRALVGALPTAARDQVHVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIATHAPLPDSLTRMLRERRALLPGDIARYAYQPAGQEDSTLLLLTRRRTVVVTPHQMRSYARDSVRREMDLEIHGGLSFRLVIHVPGTSPHAGELAD
Ga0066669_1155958113300018482Grasslands SoilLTEQPAPIRRLRADVPEEMAVVLDRMLAKQRNQRFQMASEVSRALVGALPTAARDRVHIPLGRRLKAMFYRSLVGLSVAGCLLGIAFVGGAAVVAYYVFSKPPRVAARAPLPPSLTSMLRARRALLPGDAGLFAYQPTGQEDTTMLLLTRRRTVVVTPHEVRSYARDSVRRDMDLILHGGLAFRLVLYGRHSPTLAHTAS
Ga0184643_115092643300019255Groundwater SedimentRFQMASEVSRALVNALPTAARDRVRVPFRRRLTAMFYKSLLGLSVAGCLLFIAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAPGDVPLFAYQPAGQEDTTLLLLTRRRTVVVTPHGVRSYSRDSVRTKLGFDVRGGLVFRLVIRESEGRALTDTVFRSLSFRDMILLGGQLNERDAVDSAAKGRPRVRPRTRRPS
Ga0193715_111917713300019878SoilALVSALPTAARDRVHVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKIAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTARAFTDTVFHNLSFRDLILLSG
Ga0193713_110558913300019882SoilGVAGCLLSIAFVAGAAVVAYTVFSKAPKIAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTARAFTDTVFHNLSFRDLILLSGQLNEREAGDSAAKSRPRVRARTRRPS
Ga0193695_105449113300021418SoilMAVVLERMLEKDREKRFQTASDVSRALVDALPTAARNKVRVPFRRRLTAMFYKSLLGLSVAGCLLSIAFVAGAAVVAYTVFSKAPRVAIRPPVPDSLTRAMRARHALAPDDVALYAYQPAGQEDTTLLLLTRRRTVVVTPHGVRSYSRDSVRTKLGFDVRGGLVFRLVILGMDTRALADTVFRSLSFRDMILLGGQLNEREAGDSVAKHRPRGRARTRTRRPS
Ga0207646_1032932113300025922Corn, Switchgrass And Miscanthus RhizosphereLGVVLYHMLAGWPPFQGPDSASILAQQITEAPEPIRRHRPDVPEEMAVVLDRMMEKSRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRIRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVDLDFRGGLAFRLTIYGKQSSGPADTVFRNLSFRDMVTLAPRLNDLEKDSSGRRVRVRTRAP
Ga0209234_112533713300026295Grasslands SoilVPEEMAVVLERMMEKNRAKRFQTAAEVSRALVGALPTAARDQVHVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSVARMLRARRALLPGDVALSAYQPAGPEDSTLLLLTRRRTVVVTPHQVRSYARDSVRTKMGLDVHGGLAFRLVIYGTHSKELADTVFRNLSFRDMVQLDKQLNRRDAADSVRPASAAPRRVPAKKPRKGSPRPRVRHRPG
Ga0209235_119375813300026296Grasslands SoilQLTQAPVPIRRHRPDVPEEMAVVLERMMEKSRTKRFQMASEVSRALVGALPTAARDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALFAYQPAGPEDSTLLLLTRRRTVVVTPHQVRSYARDSVRRDMDLEIHGGLSFRLVIYVPGASLGGRELGDTVYRSLSFRDMMQLRPHLNASVEGSTAKRRPSVRQAPP
Ga0209236_123861013300026298Grasslands SoilTAAEVSRALVGALPTAARDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALFAYQPAGPEDSTLLLLTRRRTVVVTPHQVRSYARDSVRRDMDLEIHGGLSFRLVIYVPGASPGGRELGDTVYRSLSFRDMMQLRPHLNASVEGSTAKRRPSVRQAPP
Ga0209027_124767613300026300Grasslands SoilRALVGALPTAASDRVRVPLSRRLRLMFYRSLVGLTVAGCLLSIAFAGGAAAVAYYVFSKPPRIAALAPLPESLVRPLRARGALASGDAALSAYEPAWQEDSTLLLLTRRRTVVVTPHEVRSYPRDSTRTQMGWDIHGGLAFRLVIYGTRAKQLADTVFRSLSFRDMIQLRKQLNRRDGEDSLRAASGGSP
Ga0209469_115304413300026307SoilERPDVSDEVAVVLDRMLEKEPERRYQRAAEVSRALVNALPTAARDQVHVPLRRRLRAMALKSLIGLGVAGCLAFIAFVAGAAVVAYTVFSKRPKIVAMEPIPDSLARALRQRRALASGDTAEYAYQPGGEEDTTLLLVTRRRVVVVTPREVRSYVRDSIQPVLSPDLG
Ga0209153_119849913300026312SoilERMLEKNRSRRFQMAGEVSRALVGALPTAASDRVRVPLSRRLRLMFYRSLVGLTVAGCLLSIAFAGGAAAVAYYVFSKPPRIAALAPLPESLVRPLRARGALASGDVALYAYEPAGQEDSTLLLLTRRRTVVVTPHEVRSYPRDSTRTQMGWDIHGGLAFRLVIYGTRAKQLADTVFRSLSFRDMIQLRKQLNRRDGEDSQRAVSGGSPSPRRVPATKPRKGSPRPRARRRPG
Ga0209647_127342413300026319Grasslands SoilFEGTSSASILAQQITQPPEPIRRHRPDVPEEMTVVLERMMEKDRAKRFQMASEVSRALVGAMPTAARDRVRVPLRRRLRLMFYKSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRVAAHSPLPDSLTRLLRARRALLPGDIARYAYRPAGQEDTTLLLLTRGRTVVVTPNQVRSYARDSVRRDMD
Ga0209158_105875013300026333SoilQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDSLARMLRARRALLPGDVALSAYQPAGPEDSTLLLLTRRRTVVVTPHQMRSYARDSVRTQMGLDVHGGLAFRLVIYGTRSKELADTVFRNLSFRDMVQLDKQLNRRDAEDSVRPASDAPRPARRVPAKKPRKGSPRLRARHRPA
Ga0209807_126517913300026530SoilRALVGALPTAASDRVHVPLRRRLQVMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPLVAAHAPIPDSLSRMLRARRALLIGDAALYVYRPAGQEDTTLLLLARRRTVVVTPHQVRSYARDSVRRNMDLEIHGGLSFRLVIYVLGTTPGAGELADTVYRSLSFRDMMQLRPELNRAGAVSPAKHRAVRP
Ga0209474_1046788613300026550SoilTAASDRVRVPLSRRLRLMFYRSLVGLTVAGCLLSIAFAGGAAAVAYYVFSKPPRIAALAALPESLVRPLRARGALASGDVALYAYEPAGQEDSTLLLLTRRRTVVVTPHEVRSYPRDSTRTQMGWDIHGGLAFRLVIYGTRAKQLADTVFRSLSFRDMIQLRKQLNRRDGEDSLRAASGGSPSPRRVPATKPHKGSPRPRARRRPG
Ga0208991_121393913300027681Forest SoilKSLLGLSVAGCLLSVAFVAGAAVVAYTVFSKAPRVALRPPLPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTMLLLTRRRTVVVTPRGVRSYSRDSVRTKMGFNVRGGLVFRLVILGTEGKELTDTVFRSLSFRDMILLGGQLNEREAVDSAAKHRPRARVRTRRPS
Ga0209689_118854523300027748SoilLAQQLTQEPTPIRRQRSDVSEELSVVLERMLAKEPGKRFQTARDVSRALVDALPTAARNRIHVPLRRRLAAMFYKSLLGLSVAGCLLFIAFIAGAAVVAYTVFSKPARIEVQPPVPDSLVRSLRARRALATGDVALYAYEPAGQEDSTLLLLTRRRTVVVTPHEVRSYARDSTRTQMGLDVHGGLAFRLVIYGTRAKELADTVFRSLSFRDMIQLGKQLNRRDAEDSLRAASGGPPPRRRVPATKPRRGSPRLRVRRRPG
Ga0209811_1014451523300027821Surface SoilPTAARDRVHVPLKRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKIALRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVHTKMGFDLRGGLIFRLVIRGTINRAFTDTVFRNLSFRDMILLSGQISEREAGDSAAKSRPRVRARTRRPS
Ga0209180_1012935923300027846Vadose Zone SoilFEGPSSAAILAQQLTEPPEPIRRHRPDVPEEMAVVLERMMEKNRAKRFQTAAEVSRALVGALPTAARDQVRVPLRRRLRLMFYRSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRIAARAPLPDRLTRMLRARRALLVGDVALSAYEPAGQEDSTLLLLTRRRTVVVTPHEVRSYARDSVRTQMGLDVHGGLAFRLVIHGTRSKELADTVFRSLSFRDMVQLDKQLNRGDAEDSVRPASASPRRVPAKKPRKGSPRPRVRRRPA
Ga0209382_1142076913300027909Populus RhizosphereRNKRFQMASEVSRALVGALPTAARDRVRVPFRRRLRMMFYKSLVGLGVAGCLLFIAFVAGAAVVAYTVFSKAPRIAVERPIPESLAQRLRNRRALASGDVVEYVYQPAGQEDTTILLVTQRRLAVVTPRQVRSYARDSIRADWDFDFRGGLTFRLAILEKRSNGTVDTVFRHLSFRDMVTLAPRLNAIEQDAAGQRVRVRSRARPT
Ga0307307_1023287613300028718SoilERPDVSEEMAVVLERMLEKDRKKRFQMASEVSRGLVSALPTAARDRVRVPFRRRLQTMFYKSLLGLGVAGCLLSVAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAVGDVALYAYQPAGQEDTTLLLLTRRRTVVVTPHSVRSYSRDSASTHMGFDLRGGLVFRLVIRGIAARPLTDTVFRSLSF
Ga0307287_1004820713300028796SoilAFVAGAAVVAYTVFSKAPKVAFRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVVTPHGVRSYSRDSSRTHMGFDLRGGLVFRLVIRGTPARTLTDTVFRSLSFRDWILLGGQLSEREEGDSTAKRRPRVRARTRRPS
Ga0307308_1041031413300028884SoilAVVLERMLEKDRKKRFQMASEVSRGLVSALPTAARDRVRVPFRRRLQTMFYKSLLGLGVAGCLLSVAFVAGAAVVAYTVFSKAPKVAIRPPVPDSLTRAMRARRALAVGDVALYAYQPAGQEDTTLLLLTRRRTVVVTPHSVRSYSRDSASTHMGFDLRGGLVFRLVIRGIAARPLTDTVFRSLSFRDMILLGGQLSEREEGDSTAKRRTRVRAR
(restricted) Ga0255311_111348513300031150Sandy SoilASILAQQITQPPEPIRRHRPDVPEEMTVVLERMMEKNRANRFQMASEVSRALVGAMPTAARDRVRVPLRRRLRLMFYKSLVGLSVAGCLLFIAFAGGAAAVAYYVFSKPPRVAAHSPLPDSLTRLLRARRALLPGDIARYAYRPAGQEDTTLLLLTRRRTVVVTPNQVRSYARDSVRRDMDLILHGGLAFRLVIYGT
Ga0307495_1020230913300031199SoilPTAARDRVHVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKIAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTARAFTDTVFHSLSFRDLILLSGQLNEREAGDSAAKSRP
Ga0308194_1000091513300031421SoilASEVSRALVSALPTAARDRVHVPLRRRLTAMFYKSLLGLGVAGCLLSIAFVAGAAVVAYTVFSKAPKIAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTTARAFTDTVFHNLSFRDLILLSGQLNEREAGDSAAKSRPRVRARTRRPS
Ga0307469_1134711313300031720Hardwood Forest SoilMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYKSLVGIGVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIADSLAQRLRSRRALVSGDVVEYVYQPAGQEDTTLLLLTRRRLAVVTPRQVRTYLRDSIRTDFDLDFRGGLAFRLAIYGKRPNQLADTVFRNLSFRDMVTLAPRLNDLNRDAAGRRIRVRNRAGRI
Ga0307469_1204801113300031720Hardwood Forest SoilVAGAAVVAYTVFSKAPRVAIRPPVPDSLTRAMRARRALAPGDVALYAYQPAGQEDTTLLLLTRRRTVVLTPHGVRSYSRDSVRTHMGFDLRGGLIFRLVIRGTINRAFTDTVFHNLSFRDMILLSGQINEREAGDSAAKSRPRPRARTRRPS
Ga0307471_10106818323300032180Hardwood Forest SoilPEPIRRHRPDVPEEMAVVLDRMMEKNRNKRFQMASEVSRALVGALPTAASDRVRVPLRRRLRLMFYKSLVGISVAGCLLFIAFAGGAAAVAYYVFSKPPRIAAQRPIPESLLQRLRQRRALASGDVAQYVYQPAGQEDTTLLLLTRRRIAVVTPRQVRSYSRDSIRTDVDLDFRGGLAFRLTIYGKQSSGPADTVFRNLSFRDMVTLAPRLNDLEKDSSGRRVRVRTRAP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.