NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F102797

Metagenome / Metatranscriptome Family F102797

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102797
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 124 residues
Representative Sequence MGHDFAKLNADTLDRALQARRRDLQGVDQEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPVAPHEGDVLDFAGFGDWRIEGSQRVGVRPTGKPPREFFVCAPAAV
Number of Associated Samples 91
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 20.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (96.040 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(25.743 % of family members)
Environment Ontology (ENVO) Unclassified
(38.614 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(46.535 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 30.13%    β-sheet: 25.00%    Coil/Unstructured: 44.87%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF01625PMSR 36.63
PF01828Peptidase_A4 4.95
PF05157T2SSE_N 3.96
PF13561adh_short_C2 2.97
PF00583Acetyltransf_1 2.97
PF12802MarR_2 1.98
PF00756Esterase 0.99
PF14307Glyco_tran_WbsX 0.99
PF03459TOBE 0.99
PF08734GYD 0.99
PF01894UPF0047 0.99
PF02653BPD_transp_2 0.99
PF00347Ribosomal_L6 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG0225Peptide methionine sulfoxide reductase MsrAPosttranslational modification, protein turnover, chaperones [O] 36.63
COG0097Ribosomal protein L6P/L9ETranslation, ribosomal structure and biogenesis [J] 0.99
COG0432Thiamin phosphate synthase YjbQ, UPF0047 familyCoenzyme transport and metabolism [H] 0.99
COG4274Uncharacterized conserved protein, contains GYD domainFunction unknown [S] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A96.04 %
All OrganismsrootAll Organisms3.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005529|Ga0070741_10009302All Organisms → cellular organisms → Bacteria19187Open in IMG/M
3300005529|Ga0070741_10056692All Organisms → cellular organisms → Bacteria4633Open in IMG/M
3300005532|Ga0070739_10000141All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria155773Open in IMG/M
3300015195|Ga0167658_1003417Not Available5913Open in IMG/M
3300027773|Ga0209810_1000007All Organisms → cellular organisms → Bacteria894281Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil25.74%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil9.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil8.91%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost7.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil5.94%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere5.94%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.95%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil2.97%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.97%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil2.97%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere2.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.98%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.98%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere1.98%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.99%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.99%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.99%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Soil0.99%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil0.99%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.99%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.99%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.99%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.99%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2140918007Permafrost microbial communities from permafrost in Bonanza Creek, Alaska - Active_allEnvironmentalOpen in IMG/M
2170459007Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect in plug lysis (for fosmid construction) 10-21cmEnvironmentalOpen in IMG/M
3300001535Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A3-PF-15A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300001536Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A15-65cm-8A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300003321Sugarcane bulk soil Sample H1EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005530Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaGEnvironmentalOpen in IMG/M
3300005532Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen14_06102014_R1EnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300007740Permafrost core soil microbial communities from Svalbard, Norway - sample 2-9-2 SoapdenovoEnvironmentalOpen in IMG/M
3300007821Permafrost core soil microbial communities from Svalbard, Norway - sample 2-10-2 SoapdenovoEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010880Boreal forest soil eukaryotic communities from Alaska, USA - C5-1 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300012008Permafrost microbial communities from Nunavut, Canada - A39_80cm_12MEnvironmentalOpen in IMG/M
3300012010Permafrost microbial communities from Nunavut, Canada - A7_35cm_12MEnvironmentalOpen in IMG/M
3300012014Permafrost microbial communities from Nunavut, Canada - A10_80cm_6MEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300013105Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300013763Permafrost microbial communities from Nunavut, Canada - A15_65cm_0MEnvironmentalOpen in IMG/M
3300013772Permafrost microbial communities from Nunavut, Canada - A10_80_0.25MEnvironmentalOpen in IMG/M
3300013832Permafrost microbial communities from Nunavut, Canada - A3_5cm_0MEnvironmentalOpen in IMG/M
3300014497Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-129_1 metaGHost-AssociatedOpen in IMG/M
3300015089Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G8A, Adjacent to main proglacial river, end of transect (Watson river))EnvironmentalOpen in IMG/M
3300015195Arctic soil microbial communities from a glacier forefield, Storglaci?ren, Tarfala, Sweden (Sample st-6c, vegetation/snow interface)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015261Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-104_1 MetaGHost-AssociatedOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300019887Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c2EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300020070Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024347Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025919Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300027738Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027773Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen14_06102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031469Fir Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034125Peat soil microbial communities from wetlands in Alaska, United States - Sheep_creek_tus_01_15EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
A_all_C_004607602140918007SoilMTRGVGHDFAKLNADTLDRALQARRLDLQSVDQEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSVDEKQLLAAPHEGDVLDFAGFGDWRIQGSQRVGVRPSGKPPREFFVCAPAVV
L02_003738202170459007Grass SoilVEFSKLNADALDRALAARRRDLDDIDPEVRERREYALAAIEGYAALLREPAVTAPTRNPMRYSFKLVFPDGRWSLDEKQLAAAPQEGDVLDFEGYGDWCIKSSESV
A3PFW1_1031400623300001535PermafrostMTRGVGQDFAKLNADTLDRALQARRRDLQGVDQEERERREFALAAIEGYTALLREPMEPAPSRNLVRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIEGSQRVGVRPAGKPPREFFVCAPAPF*
A1565W1_1045314823300001536PermafrostMGHDFAKLNADTLDCALQARRRDLQGVDGEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIEGSQQVGVRPAGKPPREFFVCAAV*
C688J35102_11865074223300002568SoilVEFSKLNADALDRALAARRRDLDRIDPEVRARREYALAAIEGYTALLREPGVPTPSRNAIRYCFKLVFPDGRWSLDEKQLVAAPQEGDVLDFDGYGDWCIQGSQQVGVKPVGKPPREFFVCAPVAA*
soilH1_1006762013300003321Sugarcane Root And Bulk SoilMTRVVGPDHTAANLEALERALRVCREALPSIESEDERERRAAALSAFEGYTALLREPNRPAPSRNAIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIRGAQRVRVKPSGKPPRTFFVCSPV*
Ga0062595_10012107923300004479SoilMVEFSKLNADALDRALAARRRDLDGIDPELRERREYALAAIEGYTALLREPVVPAPTRDPIRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGDWCIQSSQRVGVRPAGKPPREFFVCAPVAA*
Ga0066672_1071934813300005167SoilMTSGVGPDFAKLNADTLDRALQARRRDLRGVGKEERERREFALAAIEGYTALLREPMEPAPSLNLMRYCFKLVFPDGRWSIDEKQLPTAPHEGDILDFAGYGDWLIQGSQRVGVRPAGKPPREFFVCAPAAV*
Ga0066679_1030309823300005176SoilMGAHSPSWGRQKRRLAAMTRGMGDFAKLNADTLDRALQARRRALQGVDREERERAEFALAAIEGYTALLREPLEPAPSRNLSLYSFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIQGSQRVGVRPAGKPPREFFVCAPAG*
Ga0066684_1037031123300005179SoilALSERRAALAGIESEDERERRAAALSAFEGYTALLREPPRPAPSRNPIRYAFKLVFPDGRWSVDECELTAAPREGDVLGFDGIGNWRIQGAQRVGVKPAGKPPRTFFVCSPAG*
Ga0066675_1127718413300005187SoilMTRGVGPDFAKLNADTLDRALTARRRDLHAVDPEMRERREFALAAIEGYAALLREPMEPAPSSRLIRYCFKLVFPDGRWSIDEKHLPAAPHEGDVLEFAKYGDWRIQGSQQVGVRPAGKPPRQFFVCAPAAV*
Ga0070680_10101705723300005336Corn RhizosphereRHSLRKGRRIGIRWVMTRVVGPDYTAANLEALERALSARREALPSIEGEDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG*
Ga0070660_10054053033300005339Corn RhizosphereMTRVVGPDYTAANLEALERALSARREALPSIEGDDERERRAAALSAFEGYTALLREPNRAVPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG*
Ga0070691_1097776313300005341Corn, Switchgrass And Miscanthus RhizosphereERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG*
Ga0070714_10050594423300005435Agricultural SoilVGDFSKRNADALERALAARRRELDGVDEDVRARRESALKAIEGYTALLREPVEQTIPSISPSRYCFKLVFPDGRWSLGEKQLVAAPREGDVIEIDGYGDWRIQGSQSVGVRPAGKPPREFFVCAPVAA*
Ga0070713_10160182123300005436Corn, Switchgrass And Miscanthus RhizosphereVQRDDENPVGDFSKRNADALERALAARRRELDGVDEDVRARRESALKAIEGYTALLREPVEQTIPSISPSRYCFKLVFPDGRWSLGEKQLAAAPREGDVIEIDGYGDWRIQGSQSVGVRPAGKPPREFFVCAPVAA*
Ga0070681_1016527423300005458Corn RhizosphereMTRVVGPDYTAANLEALERALSARREALPSIEGEDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG*
Ga0070699_10114090023300005518Corn, Switchgrass And Miscanthus RhizosphereVGDFSKRNADALERALAARRRELDGVDEDVRARRESALKAIEGYTALLREPVEQTIPSISPSRYCFKLVFPDGRWSLGEKQLAAAPREGDVIEIDGYGDWRIQGSQSVGVRPAGKPPREFFVCAPVAA*
Ga0070741_1000930293300005529Surface SoilMTLVVGHDYARENAAALERALAARRSELGALGSEDERERRAGALRAIEGYMELLREPSEPPATRNLIRYCFKLVFADGRWDVGEKQLPAAPREGDVLGFEGIGEWRIEGAQRIGTRPAGKPPRTLFVCAPVS*
Ga0070741_1005669283300005529Surface SoilVGQDFSKANLAALERALEARREALDTIDGEERERRAAALAAIEGYTALLREPMQPTPARSAIRYCFKLVFPDGRWSIDEKQLVSTPREGDVLGFEGIGTWRILGAQRVGVKPAGKPPRTFFVCSPVA*
Ga0070741_1038741923300005529Surface SoilMTLDMEFAKANAEALERALAARRASLGGIESDSERERREAALTAFEGYTALLREPVPATPVRDSIRYCFKLVFPDGRWSIDEKQLSAAPREGDVLGFEGGNWRIQGAQRVGVKPNGKPPRTFFVCAPVA*
Ga0070679_10027849523300005530Corn RhizosphereMTRVVGPDYTAANLEALERALSARREALPSIEGDDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG*
Ga0070739_10000141163300005532Surface SoilMTLDMELARANAEALERALAARRRALATIDGDAERERRAAALSAFEGYTALLREPVPPAPTRNPIRYCFKLVFPDGRWSIDEKQLRVVPRAGDVVGFDGCGRWRIEGAQRVGVKPAGKPPRTFFVCAPVV*
Ga0070731_1003448963300005538Surface SoilMTRVVGPDYAAANLDALERALRERRQALPGIESEDERERRAAALSAFEGYTELLREPTRPAPSRNPIRYAFKLVFPDGRWSVDERELTAAPREGDVLGFDGIGDWRIQGAQRVRVKPAGKPPQTFFVCSPVG*
Ga0070732_1030335923300005542Surface SoilVGQDFSKHNADALERALAARRRELSGVDEDVRARRESALKAIEGYTALLREPAEQTTPSLSPIRYCFKLVFPDGRWSLDEKQLAAEPREGDVIEIDGYGDWRIQSSQSVGVKPSGKPPREVFVCAQVAA*
Ga0070732_1065671823300005542Surface SoilLARGSDKSVERDDDNPVGDFSSYNADALERALAARRRELAGVDEDVRARRESALKAIEGYTALLREPAEQTIPSITPSRYCFKLVFPDGRWSLDEKQLVAAPQEGDVIEIDGYGDWRIQGSQSVGVRPAGKPPREFFVCAPVAA*
Ga0066692_1091273513300005555SoilLGSVDQETRERRELALAAIEGYTALLREPSSQGASRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLAFAGYGDWRIQGSQLVGVKPAGKPQREFFVCAPAA*
Ga0079220_1059509313300006806Agricultural SoilETSLRKGTRAPQDSPMTIDMEFAKANADALERALAARRQSLGEIESETERDRRAAALAAFEGYTALLREPLPAAPVRNSIRYCFKLVFPDGRWSIDEKQLPAAPREGDVLGFENGSWRIQGAQRIGVKPAGKPPRTFFVCAPVA*
Ga0104326_13318843300007740SoilMTRGVGHDFAKLNADTLERALQARRRDLQGVDQEERERREFALAAIEGYTALLREPMEPTPNRNLMRYCFKLVFPDGRWSIDEKQLPAAPHQGDVLDFAGFGDWRIEGSQRVGVRPAGKPPREFFVCAPAAV*
Ga0104323_11979533300007821SoilMTRDVGHDFAKLNADTLDRALQARRCDLLGVGKEERERREFALAAIEGYTALLREPMEPAPNRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIQGSQRVGVRPAGKPPREFFVCAPAAV*
Ga0066710_10088204323300009012Grasslands SoilVLERVLEARRRDLGSVDKDVRERRELALAAIEGYAALLREPSSQTPSGSLIRYCFKLVFPDGRWSVDEKQLAAAPHEGDVLAFAGYGDWRIQESQRVGVAPAGKPQREFFVCAPAA
Ga0099829_1048687323300009038Vadose Zone SoilMGELWKQNADVLERVLAARRLELGTVGTEERERRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIEGSQRVGVKPAGKPQREFFVCAPAA*
Ga0099830_1030767223300009088Vadose Zone SoilMGELWKQNADVLERVLAARRLELGTVGTEERERRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIQGSQRVGVKPAGKPQREFFVCAPAA*
Ga0099827_1055617323300009090Vadose Zone SoilMGELWKQNADVLERVLAARRLELGTAGTEERERRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIQGSQRVGVKPAGKPQREFFVCAPAA*
Ga0099827_1062550323300009090Vadose Zone SoilVEFSKLNADALDRALAARRRELDGIEPEARARRESALAAIEGYTALLREPVMPAPSRNPIRYCFKLVFPDGRWSLDEKQLAAAPRAGDVLDFEGYGDWCIQSSQSVGVKPAGKPPREFFVCAPAA*
Ga0105240_1196471523300009093Corn RhizosphereMTRVVGPDYTDANLEALERALSARREALPSIEGDDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSG
Ga0066709_10062105523300009137Grasslands SoilVGAPPDFSKRNADVLERVLEARRRDLGSVDKDVRERRELALAAIEGYAALLREPSSQTPSGSLIRYCFKLVFPDGRWSVDEKQLAAAPHEGDVLAFAGYGDWRIQGSQRVGVAPAGKPQREFFVCAPAA*
Ga0066709_10428717013300009137Grasslands SoilVEFSKLNADALDRALAARRRDLDGIEPEVRARRESALAAIEGYAALLREPGVAAPARSPIRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGWCIQGSQSIGVRPSGKPPREFFVCAPVAA*
Ga0099792_1095359413300009143Vadose Zone SoilMVEFSKLNADALDRALAARRRDLDGIDPELRERREYALAAIEGYTALLREPGVPVPTRDPVRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGDWCIQSSQRVGVRPAGKPPREFFVCAPVAA*
Ga0134128_1010607653300010373Terrestrial SoilVGHDYAAANLDALERALAERRAALAGIASEDERERRAAALSAFEGYTALLREPTEPSAGRNPIRYAFKLVFPDGRWSVDERELLAAPREGDVLGFDGIGDWRIQGAQRVGVKPAGKPPRMFFICSPVA*
Ga0126350_1181901913300010880Boreal Forest SoilMTIGVGQDFSRQNADALERALEARRRDLPAVGDKDRARRESALAAIEGYTALLREPAPSRKLMRYCFKLVFPDGRWSVDEKQLPAAPRAGDVLGFAGYGDWRIESSEHVGAKPAGKPPREFFVCAPAA*
Ga0120174_106989013300012008PermafrostMGHDFAKLNADTLDRALQARRRDLQGVDGEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIEGSQQVGVRPAGKPPREFFVCAAV*
Ga0120118_102774123300012010PermafrostMGHDFAKLNADTLDRALQARRRDLQGVDQEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPVAPHEGDVLDFAGFGDWRIEGSQRVGVRPTGKPPREFFVCAPAAV*
Ga0120159_102351533300012014PermafrostMTRGVGQDFAKLNADTLDRALQARRRDLQGVDQEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIEGSQRVGVRPAGKPPREFFVCAPAPF*
Ga0137389_1147701813300012096Vadose Zone SoilRRAEALSLIEGYTALLREPALPSPLRRPVRYCFKLVLPDGSWSINEKQLASPPCEGDVVGFEGCGDWRIQGSQFVGVKPAGKPPREFFVCAPAA*
Ga0137388_1043736223300012189Vadose Zone SoilMGELWKQNADVLERVLAARRLELGTAGTEERERRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIEGSQRVGVKPAGKPQREFFVCAPAA*
Ga0137382_1012041323300012200Vadose Zone SoilMTRGVGPDFAQLNADTLDRALRARRLDLLGVDQEERERREYALAAIEGYTALLREPMEAPPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGGDWRIEGSQRIGVRPAGKPPREFFVCAPAAV*
Ga0137382_1098162413300012200Vadose Zone SoilVEFSRLNADALDRALAARRRDLDGIDPEVRERREYALAAIEGYTALLREPGVPGPTRDPIRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGDWCIQSSQSVGARPAGKPSREFFVCAPVAA*
Ga0137381_1017569733300012207Vadose Zone SoilVGARPDFSKQNADVLERVLEARRRDLGSVDQETRERRELALAAIEGYTALLREPSSQGPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLAFPGYGDWRIQGSQRVGVKPAGKPQREFFVCAPAA*
Ga0137385_1129664423300012359Vadose Zone SoilEARARRESALAAIEGYTALLREPVMPAPSRNPIRYCFKLVFPDGRWSIYEKLLHAAPHDGDVLAFPGYGDWLHQGSQRAGVKPGGKPPREFFVCAPAA*
Ga0137361_1061796813300012362Vadose Zone SoilVEFSRLNADALDRALAARRRDLDGIDPEVRERREYALAAIEGYTALLREPGVPVPTRDPVRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGDWCIQSSQSVGARPAGKPSREFFVCAPVAA*
Ga0137390_1044633533300012363Vadose Zone SoilMPAGNDDKTMGELWKQNADVLERVLAARRLELGTVGTEERERRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDV
Ga0137358_1067790513300012582Vadose Zone SoilMTRGVGPDFAKLNADTLDRALRARRLDLQGVDQEERERREYALAAIEGYAALLREPMEAPPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGGDWRIEGSQRIGVRPAGKPPREFFVCAPAAV*
Ga0137398_1081351023300012683Vadose Zone SoilMVEFSKLNADALDRALAARRRDLDGIDPEIRERREYALAAIEGYTALLREPGVPVPTRDPVRYSFKLVFPDGRWSLDEKQLVAAPREGDVLVFEGYGDWCIQSSQRVGVRPAGKPPREFFVCAPV
Ga0137395_1039458123300012917Vadose Zone SoilVEFSKLNADALDRALAARRRDLDGIDPELRERREYALAAIEGYTALLREPGVPGPTRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGDWCIQSSQSVGARPAGKPSREFFVCSPVAA*
Ga0137395_1128284113300012917Vadose Zone SoilMPAGNDDKTMGELWKQNADVLERVLAARRLELGTAGTEERERRESALAAMEGYTALMREASPASSGNPIRYCFKLVFPDGLLSIDEKQLPATPHEGDVLDFAGFGDWRIEGSQRVGVAPAGKPPREFFV
Ga0137359_1089795513300012923Vadose Zone SoilDTLDRALQARRLDLQGFDQEERERHEYALAAFEGYTALLREPMEAPPNRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIEGSQRVGVAPAGKPPREFFVCAPAAV*
Ga0137404_1045224623300012929Vadose Zone SoilMPAGNDDKTMGPPPDFSKQNADVLDRVLEARRRELAGVDEEERERSEFALAAIEGYAALMREPGPAASSRNPIRYFFKLVFPDGRWSVDEKQLSTVPHEGDVVDFPGLGEWRIEGSQRVGVKPAGKPQREFFVCAPAA*
Ga0164304_1129752523300012986SoilMRPGHDDKTMGELWKQNADVLERVLAARRLELGSVGAEERERRELALAAMEGYTALMREASPASSRNPILYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIEGSQRVGVKPAGKPQREFFVCAPAA*
Ga0157369_1069601123300013105Corn RhizosphereMTRVVGPDYTAANLEALERALRARREALPSIESEDERERRAAALSAFEGYTALLREPNRPAPSRNAIRYAFKLVFPDGRWSVDEQELQAAPRQGDVLGFDGIGRWRIQGAQRVRVKPAGKPPRTFFVCSPV*
Ga0157372_1293330813300013307Corn RhizosphereRRAPALSAFEGSTALPREPSRPAPSRNPIRYAFKLVFPDGRWSVDERELTSAPREGDVLGFDGIGDWRIQGAQRVRVKPAGKPPLTFFVCSPIL*
Ga0120179_100775053300013763PermafrostMTRGVGPDFAKLNADTLDRALQSRRRDLQGVAQEERERREFALAAIEGYTALLREPMEHAPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRIEGSQRVGVRPAGKPPREFFVCAPAAV*
Ga0120158_1017172813300013772PermafrostPGARPLQARRRDLQGVDQEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPNAPHEGDILDFAGYGDWRIEGSQXXXXSSSARPPQSESDFAQRFGQSAH*
Ga0120132_111205123300013832PermafrostSQHNAEVLDRVLEARRRDLDGVDEAERERREFALAAIEGYAALMREAGPQASSRNPIRYCFKLVFPDGRWSVDEKQLPTAPHEGDVLAFPGYGEWRIEGSERVGVKPAGKPQREFFVCAPAA*
Ga0182008_1006855533300014497RhizosphereVGHDYAAANLDALERALRERRAALPGITSEDERERRAAALSAFEGYTALLREPTQPAAARNPIRYAFKLVFPDGRWSVDERELLSAPREGDVLGFDGIGDWRIQGAQRVGVKPAGKPPRTFFVCSPVI*
Ga0182008_1047127423300014497RhizosphereRERRAAALAAFEGYTALLREPTAVSAPVRASIRYCFKLVFPDGRWSIDEKQLPAAPREGDVLGFDGAGRWRIQGAQRVGVKPNGKPPRTFFVCAPVA*
Ga0167643_104397923300015089Glacier Forefield SoilRESALLAVEGYAALLREPLTQTASRNPIRYCFKLVFPDGRWSLDEKQLAAAPQEGDVLGFDGYGDWCIQGSQRVGVKPAGKAPREFFVCAPAA*
Ga0167658_1003417103300015195Glacier Forefield SoilMTRGVGPDFAKLNADTLDRALLARRRDLQGVGQEERERREFALAAIEGYTALLREPMEHAPSRNLIRYCFKLVFPDGRWSIDEKQLSVVPHEGDVLDFAGFGDWRIEGSQRVGVRPAGKPPREFFVCAPAAV*
Ga0167658_104613223300015195Glacier Forefield SoilMTRGVGHDFAKLNADTLDRALQARRRDLQGVGKEERERSEFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPAAPHEGDVVDFAGFGDWRIEGSQRVGVRPAGKPPREFFVCAPAAV*
Ga0137412_1059633213300015242Vadose Zone SoilAARRRDLDQVEPEVRARREYALAAIEGYTALLREPGVPAPSRSAIRYCFKLVFPDGRWSLDEKQLVAAPREGDVLDFDGYGDWCIQGSQRVGVKPSGKPAREFFVCAPVAA*
Ga0182006_108427723300015261RhizosphereVGHDYAAANLDALERALRERRAALPGITSEDERERRAAALSAFEGYTALLREPTQPAAARNPIRYAFKLVFPDGRWSVDERELLSAPREGDVLGFDGIGDWRIQGAQRVGVKPQGKPPRTFFVCSPVA*
Ga0137403_1059246613300015264Vadose Zone SoilDLDGIDPELRERREYALAAIEGYTALLREPGVPVPTRDPVRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGDWCIQSSQRVGVRPAGKPPREFFVCAPVA*
Ga0193729_107624123300019887SoilVEFSKLNADALDRALAARRRDLDGIEPEARARRESALAAIEGYTALLREPVALAPSRNPIRYCFKLVFPDGRWSLDEKQLAAAPREGDVLGFEGYGDWCIQGSQRVGVKPAGKAPREFFVCAPAA
Ga0193751_105673223300019888SoilMGPPQDFSQHNADVLDRVLEARRRDLGSVDQEERERREFALSAVEGYAALMRESGPQASSRNPIRYCFKLVFPDGRWSVDEKQLPTAPHEGDVLAFPGYGEWRIEGSERVGVKPAGKPQREFFVCAPAA
Ga0193728_120980323300019890SoilVEFSKLNADALDRALAARRRDLDGIEPEARARRESALAAIEGYTALLREPVALAPSRNPIRYCFKLVFPDGRWSLDEKQLAAAPRQGDVLGFEGYGDWCIQGSQRVGVKPAGKAPREFFVCAPAA
Ga0206356_1142427813300020070Corn, Switchgrass And Miscanthus RhizosphereKSGPKSAPLPRHSLRKGRRIGIRWVMTRVVGPDYTAANLEALERALSARREALPSIEGDDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG
Ga0179591_110292423300024347Vadose Zone SoilVEFSQLNADALDRALAARRRDLDQVEPEVRARREYALAAIEGYTALLREPGVPAQSRSAIRYCFKLVFPDGRWSLDEKQLVAAPREGDVLDFDGYGDWCIQGSQRVGVKPSGKPAREFFVCAPVAA
Ga0207707_1008567333300025912Corn RhizosphereMTRVVGPDYTAANLEALERALSARREALPSIEGEDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG
Ga0207663_1164864023300025916Corn, Switchgrass And Miscanthus RhizosphereEVERTRRAAALTAFEGYTALLREPTQPAAGRNPIRYAFKLVFPDGRWSVDERELPAAPREGDVVGFEGIGNWRIEGAQRVGVKPAGKPPRMFFVCSPAA
Ga0207660_1066129513300025917Corn RhizosphereERALSARREALPSIEGDDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG
Ga0207657_1083594613300025919Corn RhizosphereMTRVVGPDYTAANLEALERALSARREALPSIEGDDERERRAAALSAFEGYTALLREPNRAVPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG
Ga0207652_1012322543300025921Corn RhizosphereMTRVVGPDYTAANLEALERALSARREALPSIEGDDERERRAAALSAFEGYTALLREPNRAAPSRNNPIRYAFKLVFPDGRWSVDEQELPAAPREGDVLGFDGIGTWRIHGAQRVRVKPSGKPPRTFFVCSPVG
Ga0207700_1081105313300025928Corn, Switchgrass And Miscanthus RhizosphereVGDFSKRNADALERALAARRRELDGVDEDVRARRESALKAIEGYTALLREPVEQTIPSISPSRYCFKLVFPDGRWSLGEKQLAAAPREGDVIEIDGYGDWRIQGSQSVGVRPAGKPPREFFVCAPVAA
Ga0209152_1035304813300026325SoilMTIGVGQDFSKANLDALERALEARRAALPGIDGDDARERRAAALSAFEGYTALLREPIQPTPSRNPIRYCFKLVFPDGRWAIDEKQLPAAPREGDVLGFEGIGQWRIQGAQRVGVKPAGKPPRTFFVCAPAA
Ga0208989_1018974023300027738Forest SoilGVAQEERERREFALAAIEGYTALLREPMEPAPSRNLIRYCFKLVFPDGRWSIDEKQLPAEPHEGDVLDFAGYGDWRIQGSQRVGVKPPGKPPREFFVCAPAAV
Ga0209810_10000077073300027773Surface SoilMTLDMELARANAEALERALAARRRALATIDGDAERERRAAALSAFEGYTALLREPVPPAPTRNPIRYCFKLVFPDGRWSIDEKQLRVVPRAGDVVGFDGCGRWRIEGAQRVGVKPAGKPPRTFFVCAPVV
Ga0209580_1022230213300027842Surface SoilVGQDFSKHNADALERALAARRRELSGVDEDVRARRESALKAIEGYTALLREPAEQTTPSLSPIRYCFKLVFPDGRWSLDEKQLAAEPREGDVIEIDGYGDWRIQSSQSVGVKPSGKPPREVFVCAQVAA
Ga0209701_1065193623300027862Vadose Zone SoilRRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIQGSQRVGVKPAGKPQREFFVCAPAA
Ga0209579_1055326413300027869Surface SoilMTRVVGPDYAAANLDALERALRERRQALPGIESEDERERRAAALSAFEGYTELLREPTRPAPSRNPIRYAFKLVFPDGRWSVDERELTAAPREGDVLGFDGIGDWRIQGAQRVRVKPAGKPPQTFFVCSPVG
Ga0209590_1047845623300027882Vadose Zone SoilMGELWKQNADVLERVLAARRLELGTAGTEERERRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIQGSQRVGVKPAGKPQREFFVCAPAA
Ga0209590_1066276113300027882Vadose Zone SoilPEDDDRHRRAEALSLIEGYTALLREPALPSPLRRPVRYCFKLVLPDGSWSINEKQLASPPCEGDVVGFEGCGDWRIQGSQFVGVKPAGKPPREFFVCAPAA
Ga0209488_1122223713300027903Vadose Zone SoilMVEFSKLNADALDRALAARRRDLDGIDPELRERREYALAAIEGYTALLREPGVPVPTRDPVRYSFKLVFPDGRWSLDEKQLVAAPQEGDVLDFEGYGDWCIQSSQRVGVRPAGKPPREFFVCAPVAA
Ga0307282_1023612613300028784SoilDFSKQNADVLDRVLEARRRDLAGVDEDERERREFALAAIEGYAALMREPGPPASSRNPIRYCFKLVFPDGRWSVDEKQLSTVPHKGDVVDFPGYGEWRIEGSQRVGVKPAGKPQREFFVCAPAA
Ga0307504_1000704643300028792SoilRRLELGTAGTEERERRESALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPREGDVIAFPGYGDWRIEGSQRVGVKPAGKPQREFFVCAPAA
Ga0307312_1029124313300028828SoilMPAGDDDKRMGPPQDFSKQNADVLDRVLEARRRDLAGVDEDERERREFALAAIEGYAALMREPGPPASSRNPIRYCFKLVFPDGRWSVDEKQLSTVPHKGDVVDFPGYGEWRIEGSQRVGVK
Ga0170824_10312499723300031231Forest SoilVGEEERERRELALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSLVPHEGDVIAFPGYGDWRIEGSQRVGVKPAGKPQREFFVCAPAA
Ga0170819_1267244523300031469Forest SoilLERVLAARRLELGSVGAEERDRRELALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIEGSQRVGVKPAGKPQREFFVCAPAA
Ga0170818_10198038623300031474Forest SoilAGVLERVLAARRLELGSVGAEERDRRELALAAMEGYTALMREASPASSRNPIRYCFKLVFPDGRWSIDEKQLSIVPHEGDVIAFPGYGDWRIEGSQRVGVKPAGKPQREFFVCAPAA
Ga0307479_1079366423300031962Hardwood Forest SoilMTRVMGDFAELNADMLDRALQARRRALQGVDREERERGEFALAAIEGYTALLREPMEPAPSRNLSLYSFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRVQGSQRVGVRPAGKPPREFFVCAPAAV
Ga0307471_10274044623300032180Hardwood Forest SoilMTRVMGDFAELNADMLDRALQARRRALQGVDREERERGEFALAAIEGYTALLREPMKPAPSRNLSLYSFKLVFPDGRWSIDEKQLPAAPHEGDVLDFAGFGDWRVQGSQRVGVRPAGKPPREFFVCAPAAG
Ga0370484_0105657_307_6933300034125Untreated Peat SoilVGHDFAKLNADTLDRALQARRLDLQSVDQEERERREFALAAIEGYTALLREPLEPAPSRNLIRYCFKLVFPDGRWSVDEKQLLAAPHEGDVLDFAGFGDWRIQGSQRVGVRPSGKPPREFFVCAPAVV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.