NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F104867

Metagenome Family F104867

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104867
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 93 residues
Representative Sequence MAVLSSPVVRRALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVATTGLREAIAWVQMPADTPMVWRIVTGLAYLA
Number of Associated Samples 84
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 2.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(30.000 % of family members)
Environment Ontology (ENVO) Unclassified
(48.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(53.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 65.04%    β-sheet: 0.00%    Coil/Unstructured: 34.96%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.47
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF03454MoeA_C 89.00
PF03205MobB 7.00
PF12804NTP_transf_3 2.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG0303Molybdopterin Mo-transferase (molybdopterin biosynthesis)Coenzyme transport and metabolism [H] 89.00
COG1763Molybdopterin-guanine dinucleotide biosynthesis proteinCoenzyme transport and metabolism [H] 7.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A98.00 %
All OrganismsrootAll Organisms2.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005450|Ga0066682_10005609All Organisms → cellular organisms → Bacteria → Proteobacteria6100Open in IMG/M
3300015054|Ga0137420_1108652All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium 13_2_20CM_2_66_51090Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil30.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil20.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil17.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.00%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil3.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere2.00%
Wetland SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment1.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.00%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.00%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002886Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cmEnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012172Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT366_2EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014880Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT660_16_10DEnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015254Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10DEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019866Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1m1EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300021972Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2m2EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027546Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027778Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1003935223300002558Grasslands SoilMALLSSPVVRRALRLTGYIXLTLFVYWVNVVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVATTGFREAIAWVQMPADTPMVWRIVTGLAYIAVLGLAMAGLQTL
JGI25385J37094_1012882323300002558Grasslands SoilMAVLGSPAVRRALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAW
JGI25383J37093_1016686913300002560Grasslands SoilMAVLGSPAVRRALRLTGYILLTLVVYWVNAVTPXAARLGILYIIPVLLVTWTXGLAWGILFAVVTTGFREAIAWVQMPADTPMLWRIVTALAYLAVLG
JGI25384J37096_1009906713300002561Grasslands SoilMAILGSPAVRRALRLTGYIILSLLVYWANAVTPPSARLGILYIIPVLLVTWTEGLAWGIVFAVATTGFREAIAWVQLPADTPLVWRIANAGAYVAVLGVAMAGLQTLRHREA
JGI25384J37096_1018859713300002561Grasslands SoilMAVLGSPAVRRALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMLWRIVTALAYLAVLGV
JGI25612J43240_102453113300002886Grasslands SoilMPALNFSSPTLRRVLRLTGYVLLTLLVYWLNAVTPXTARLGILYIIPVLLVTWTEGLVWGIVFAAATTGLRETIAWVQMPADTPMVWRVVTGLAYLAVL
JGI25613J43889_1019293923300002907Grasslands SoilMPALNFSSPTLRRALRLMGYILLTLVVYWLNAVTPATARLGILYIIPVLLVTWTEGLVWGIVFAVATTGLREAVAWVQMPPDTPMV
JGI25382J43887_1040714223300002908Grasslands SoilMPALASPAVRQALRLAGYVLLTLLVYWANALTPSTARFGILYTIPVLLVTWTEGLAWGIVFAVATTVFREAIAWIQMPTDTPTLWRILN
JGI25382J43887_1051834523300002908Grasslands SoilMAVLGSPAVRRALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMLWRIVTALAYLAVLGVAMAGLQ
JGI25617J43924_1029282613300002914Grasslands SoilMTPALRRGLRLAGYVLLTLVVYWANVLTPSAARLGILYIVPVLLVTWTEGLPWGIVFAVATTALREFVAWDQMPPETPLLWRVANGASYV
Ga0063356_10644482923300004463Arabidopsis Thaliana RhizosphereLKPALRLLGYVLLILLVYQANAHTPPEVRLGILYIIPVLLVTWTEGIVWGIVFAVATITFREVVALEQLPAG
Ga0066674_1019859113300005166SoilMRVPSSPALRQVLRLTGYVLLTLFVYWLNAITPPDARLGILYIVPVLLVTWTEGLGWGIVFAVVTTGFRETIAWVQMPLDTPIVWRVLTALAYLVVLG
Ga0066683_1015036223300005172SoilMTASPTVRRALRLVGYVLLTLLVYWANALTPPAARLGILYIIPVLLVTWTDGLRWGIVAGIASIALRETVAWDQMPADTPLGWRIA
Ga0066686_1062806613300005446SoilMTASPTVRRALRLVGYVLLTLLVYWANALTPPAARLGILYIVPVLLVTWTDGLRWGIVIGIASIALRETVAWDQMPADTPLGWRIAN
Ga0066682_10005609103300005450SoilMAALPSPGVRQALRLVGYVFLTLLVYWVNAVTPPAARFGILYTIPVLLVTWTEGLAWGIVFAATTTVFREAIAWVQMPADMPMLWRILNGLAYLAVLGVAMAGLQSLRRSQAQ
Ga0066682_1063147823300005450SoilMPALNFSSPTLGRALRLTGYVLLTLLVYWLNAVTPSTARLGILYIIPVLLVTWTEGLVWGIVFAAATTGLRETIAWVQMPADTPM
Ga0070706_10056285913300005467Corn, Switchgrass And Miscanthus RhizosphereMATLSSPLARRALRLTGYILLTLLVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGYRETIAWVQMPPDTPMVWRI
Ga0070698_10135915223300005471Corn, Switchgrass And Miscanthus RhizosphereMATLSSPLARRALRLTGYILLTLLVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGYRETIAWVQMPPDTPMVWRIVTGLAYLAVLGV
Ga0066692_1035500713300005555SoilMAALPSPGVRQALRLVGYVFLTLLVYWVNAVTPPAARFGILYTIPVLLVTWTEGLAWGIVFAATTTVFREAIAWVQMPADTPMLWRILNGLAYLAVLGV
Ga0066651_1017409013300006031SoilMTASPTVRRALRLVGYVLLTLLVYWANALTPPAARLGILYIVPVLLVTWTDGLRWGIVAGIASIALRETVAWDQMPADTPLGWRIANGASYVAVVAVAM
Ga0066656_1030233813300006034SoilMAALPSPGVRQALRLVGYVFLTLLVYWVNAVTPPAARFGILYTIPVLLVTWTEGLAWGIVFAATTTVFREAIAWVQMPADMPMLWRILNGLAYLAVL
Ga0066658_1058434913300006794SoilMAALSSPNVRRALRLAGYLLLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPA
Ga0099794_1067655213300007265Vadose Zone SoilMAVLSSPVVRRALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVATTGLREAIAWVQMPADTPMVWRIVTGLAYLA
Ga0099827_1089098523300009090Vadose Zone SoilMPALSPPSPTLRRALRVTGYVLLTLLVYWANALTPPAARFGILYTIPVLLVTWTEGLAWGIVFAVSTTVFREAIAWVQMPAD
Ga0099827_1197075413300009090Vadose Zone SoilMAVLSSPAVRRALRLTAYILLTLLVYWINAVTPPAARLGTLYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMVWRIITALAYLAVLGVAMAGLQTLR
Ga0134109_1006450623300010320Grasslands SoilMTASPTVRRALRLVGYVLLTLLVYWANALTPPAARLGILYIVPVLLVTWTDGLRWGIVAGIASIALRETVAWDQMPADT
Ga0134109_1026018913300010320Grasslands SoilMRVPALSSPVVRQVLRLTGYVLLTLFVYWLNVITPPDARLGILYIIPVLLVTWTEGLLWGLVFAVVTTGFRETIAWVQMPAETAMVWRVVTSLAYLAVLGV
Ga0134084_1042136013300010322Grasslands SoilMAASPTLRRVLRLSAYIALTLLVYWANAVTPSSARLGVLYIVPVLLVTWTEGLTWGIVFAVATTVFREATAWDQMPADTPLL
Ga0134065_1007116413300010326Grasslands SoilMIAASPVLRQVLRLTGYVLLTLLVYWFNAVTPPEARLGILYIIPVLLVTWTEGLAWGILFALVTTVYREATAWVQVPPDTPLVWRVVTGLAYLAVLGVAMAGLQTL
Ga0134111_1007881223300010329Grasslands SoilMAFLSSPVVRRALRLTGYLLLTLVVYWFNAVTPPSARLGILYIIPVLLVTWTEGLAWGLVFAVATTGLREAIAWVQMPADTPMVWRV
Ga0134080_1003309533300010333Grasslands SoilMAALASPTVRQALRVAGYVLLTLLVYWVNAVTPSTARFGILYTIPVLLVTWTEGLAWGILLAAATTVFREATAWVQM
Ga0134080_1052399813300010333Grasslands SoilMPALASPALGRGVRLTGYVLLTLLVYWVNALTPSTARFCILYTIPVLLVTWTEGLAWGIVFAATTTVFREAIAWVQMPADMPM
Ga0134071_1004595813300010336Grasslands SoilMAAIASPAVRQALRLVGYVLLTLLVYWLNAVTPSTARFGILYTIPVLLVTWTEGLAWGIVFAAATTVFREAIAWVEMPADTPTLWRILNGLAYLAVLGLAMAAAL
Ga0134128_1025238113300010373Terrestrial SoilMPLVASPLLRRTLRVAAYLLLTLAVYWINVTTTPAARLGVLYVIPVLLVTWTEGLTWGIVFGIASIALRETVAWSQMPGDTPLLWRIGNAAAYVLVVAV
Ga0137392_1029388323300011269Vadose Zone SoilMAILSSPAIRRALRLTGYILLSLVVYWANAVTPPSARLAILYIIPVLLVTWTEGLAWGIVFAVATTGFREAIAWVQMPADTPLVWRLANAGAYVAVLGV
Ga0137320_107055823300012172SoilMPVLASPVLRRTLRLTGYVALTLAVYWLNVSTPSTARLAVLYIIPVLLVTWTEGLVWGIVFGLASITLREAVVWDRMPAEAPIQWRIG
Ga0137382_1017960723300012200Vadose Zone SoilMAVLSSPVVRRALRLTGYLLLTLVVYWLNVVTPATARLGILYIIPVLLVTWTEGLAWGLVFAVATTGLREVIAWVQMPADTPMVWRVVTGLAYLAVLGLAMAGLQTLRRR
Ga0137382_1020867323300012200Vadose Zone SoilMALLSSPVVRRALRLTGYLLLTLVVYWFNAVTPPSARLGILYIIPVLLVTWTEGLAWGLVFAVATTGLREATAWVQMPADTPMVWRVVTGLAYLAVLGLAMAGLQT
Ga0137365_1023327823300012201Vadose Zone SoilMTVLSSPVVRRALRLTGYLLLTLVVYWFNAVTPPSARLGILYIIPVLLVTWSEGLAWGLVFAVGTTGLREAIAWVQMPADTPMVWRVVTGLAYLAVL
Ga0137399_1012302413300012203Vadose Zone SoilMAALSPNVRRALRLAGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMVWRIVTALAYLAVL
Ga0137399_1066911923300012203Vadose Zone SoilMTASPTVRRALRLVGYVLLTLLVYWANALTPPAARLGILYIIPVLLVTWTDGLRWGIVIGIASIALRETV
Ga0137399_1159666923300012203Vadose Zone SoilMTLPASPALRRALRLAGYVALMLLVYWANVFTPSTARLAILYVVPVLLVTWTDGVVWGIVFGVASIGLREMVAWDQMPPDTPLGWRIANGAAYVAVVA
Ga0137380_1041403413300012206Vadose Zone SoilMTPALRRGLRLAGYVLLTLVVYWANVLTPSAARLGILYVVPVLLVTWTEGLTWGIVFAVATTALREFVAWDQMPPD
Ga0137379_1025579913300012209Vadose Zone SoilMTPALRRGLRLAGYVLLTLVVYWANVLTPSAARLGILYIVPVLLVTWTEGLTWGIVFAVATTALREFVAWDQMPPDTPLVWRVANGASYVVVLG
Ga0137370_1056840913300012285Vadose Zone SoilMSLLSSPAVRRALRLTGYILLTLLVYWINAVTPPAARLGTLYIIPVLLVTWTEGLAWGIVFAVVTTGLREAIAWVQ
Ga0137387_1055244923300012349Vadose Zone SoilMAILTSPAIRRALRLAGYIILSLAVYWANAVTPSSARLAILYIIPVLLVTWTEGLAWGIVFAVATTGFREAIAWVQMPADTPLAWRIANA
Ga0137366_1009554843300012354Vadose Zone SoilMAALPSPGVRQALRLVGYVFLTLLVYWVNAVTPPAARFGILYTIPVLLVTWTEGLAWGIVFAATTTVFREAIAWVQMTS
Ga0137361_1132730513300012362Vadose Zone SoilMPALSPAVRQALRLIGYVLVTLLVYWANALSPSTARLGILYTIPVLLVTWTEGLAWGIVFAVATTVFREAIAWVQMPADTPMLW
Ga0137390_1090894923300012363Vadose Zone SoilMAALDSPVLRRALRLTGYILLTLGVYWVNVLTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGFREAIAWVQMPADTPMVWRVVTGVAYLAVLGVAMAGLQTLRRS
Ga0150984_10430414923300012469Avena Fatua RhizosphereVAASPAVRRVLRLVGYVLLTLLVYWGNVLTPPSARLGVLYIVPVLLVTWTDGVVWGIVFGIASIALRETVAWDQMPADTALAWRIAN
Ga0137398_1023175423300012683Vadose Zone SoilMPALSSPSPTLRRALRFTGYVLLTLVVYWLNAVTPPTARLGILYIIPVLLVTWTEGLVWGIVFAAATTGLREATAFVQMPADTPMVWRVVTGLAYLA
Ga0137398_1099725923300012683Vadose Zone SoilMPTLNFSSPTLGRALRLTGYVLLTLLVYWLNAVTPPTARLGILYIIPVLLVTWTEGLVWGIVFAAATTGLRETIAWVQMPADTPMVWR
Ga0137394_1086061913300012922Vadose Zone SoilMATLTSPLARRALRLTGYIVLTLLVYWVNVVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGYRETIAWVQMPPDTPMVWRMITALAYLAVLGVAM
Ga0137413_1174116823300012924Vadose Zone SoilMATLTSPLARRALRLTGYIVLTLLVYWVNVVTPPAARLGVLYIIPVLLVTWTEGLVWGIVFAVATTGLREATAWVQMPVET
Ga0137419_1043768223300012925Vadose Zone SoilMSVLSSPVVRRALRLTGYLLLTLVVYWLNAVTPASARLGILYIIPVLLVTWTEGLAWGLVFAVATTGLREAIAWVQMPEDTPMVWRIVTGLAYLA
Ga0137419_1188438613300012925Vadose Zone SoilMATLTSPLARRALRLTGYIVLTLLVYWVNVVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGYRETIAWVQMPPDTPMVWRMITALAYL
Ga0137416_1140099513300012927Vadose Zone SoilMPALGSSSPTLRRALRLTGYVLLTLVVYWLNAVTPPTARLGILYIIPVLLVTWTEGLVWGIVFAAATTGLREATAWVQMPADTPMVWRVVTGLAYLAVLGLAMAGLQTLR
Ga0137416_1143904823300012927Vadose Zone SoilMALPSSPVLRRALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGFREATAWV
Ga0137410_1170157113300012944Vadose Zone SoilMPALSFSSPTLRRALRLVGYILLTLVVYWLNAVTPPTARLGILYIIPVLLVTWTEGLVWGIVFAVATTGLREAVAWVQMPADTPMV
Ga0134075_1029942313300014154Grasslands SoilMAALSSPNVRHALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGFREAIAWVQMPADTPMVWRVLTGLAYLAVLGLAMAGLQTLRR
Ga0180082_103058823300014880SoilMPVLASPVLRRTLRLTGYVALTLAVYWLNVSTPSTARLAVLYIIPVLLVTWTEGLVWGIVFGLASITLREAVVWDRMPAEAPIQWRIGNAVAYIAV
Ga0137420_110865213300015054Vadose Zone SoilMAMLSSPAVRRALRLTGYLLLTLVVYWLNAVTPASARLGILYIIPVLLVTWTEGLAWGLVFAVATTGLREAIAWVQMPADTPMVWRVVTGLAYLAVLGLAMGLAMAGLQTLRRR
Ga0137409_1130472023300015245Vadose Zone SoilMPALSFSSPTLRRALRLVGYILLTLVVYWLNAVTPPTARLGILYIIPVLLVTWTEGLVWGIVFAVATTGLREAVAWVQMPADTPMVWRIVT
Ga0180089_101025423300015254SoilMPLPASATLRRALRLAGYVLLTLLVYWINAVTPAVARLGVLYIIPVLLVTWTEGLIWGIVFGVASIGMREAVAWDQMPADTPLGWRIGNAAAYVAVVAVAMAGLQTLRRS
Ga0134069_119478623300017654Grasslands SoilMLALASPAVRRVLRLTGYVLLTLLVYWANAVTPSTARLGILYTIPVLLVTWTEGLAWGIVFAVATTGFREAIAWVQMPADTPMLW
Ga0134083_1014494323300017659Grasslands SoilMTASPSVRRALRLVGYVLLTLLVYWANAVTPPAARLGILYIVPVLLVTWTDGLRWGIVFGIASIALRETVAWDQMPADTPLGWRIANGASYVAVVAV
Ga0184623_1004825513300018056Groundwater SedimentMPFTASATLRRALRLVGYVLLTLLVYWVNAVTPPEARLGVLYIIPVLLVTWTEGLIWGIVFGVASIGMREAVAWDQMPADTPLGWRIGNAAAYVAVVAVAMAG
Ga0066662_1074356523300018468Grasslands SoilMAILTSPAIRRALRLTGYIILSLAVYWANAVTPSSARLAILYIIPVLLVTWTEGLAWGIVFAVATTGFREAIAWVQMPADT
Ga0066669_1075388713300018482Grasslands SoilMIAASPVLRQVLRLTGYVLLTLLVYWFNAVTPPEARLGILYIIPVLLVTWTEGLAWGILFALVTTVYRETTAWVQMPPDTPL
Ga0066669_1089145613300018482Grasslands SoilMAFLSSPVVRRALRLTGYLLLTLVVYWFNAVTPPSARLGILYIIPVLLVTWTEGLAWGLVFAVATTGLREAIAWVQMPADTPMVWRVVTGLAYLAVLGLAMACLQTLRRRVS
Ga0193756_103865913300019866SoilMPALSSPSPTLRRALRLTGYVLLTLLVYWANALTPPAARFGILYTVPVLLVTWTEGLAWGIVFAVSTTVFREAIAWVQMPADTPM
Ga0193733_101489443300020022SoilMARALRLTGYVLLTLLVYSINAVTPPAARLGILYTIPVLLVTWTEGLAWGILFAVVTTGFREVIAWEQLPADTPMVWRVITGAAYLA
Ga0193737_100885623300021972SoilMAAPMSPEVRRALRLAGYVALTVLVYWANDHTPVEIRLGILYIVPVVLVTWTEGLGWGIGFAVATIGLREIVAWEQ
Ga0207684_1121212123300025910Corn, Switchgrass And Miscanthus RhizosphereMATLSSPLARRALRLTGYILLTLLVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGYRETIAWVQMPPDTPMVWRIVTGL
Ga0207689_1140804413300025942Miscanthus RhizosphereMPLVASPLLRRTLRVAAYLLLTLAVYWINVTTPPAARLGVLYVIPVLLVTWTEGLTWGIVFGIASIALRETVAWSQMPGDTPLLWRVGNAAAYVLVVAVAMAGLQK
Ga0207648_1030234923300026089Miscanthus RhizosphereMPLVASPLLRRALRVAAYLLLTLAVYWINVTTPPAARLGVLYVIPVLLVTWTEGLTWGIVFGIASIALRETVAWSQMPGDTPLLWRIGNAAAYVLVVAVAMAGLQK
Ga0209438_110645923300026285Grasslands SoilMSVLSSPVVRRALRLTGYLLLTLVVYWLNAVTPASARLGILYIIPVLLVTWTEGLAWGLVFAVATTGLREAIAWVQMPEDTPTVWRIVTGLAYLAVLGLAMAGLQTLRRR
Ga0209235_119391213300026296Grasslands SoilMAVLGSPAVRRALRLTGYILLTLVVYWVNAVTPGAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMLWRIVTALAY
Ga0209237_108513013300026297Grasslands SoilMAVLGSPAVRRALRLTGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMLWRIVTALAYLAVLGVAMAGLQT
Ga0209237_119878913300026297Grasslands SoilMPALASPAVRQALRLAGYVLLTLLVYWANALTPSTARFGILYTIPVLLVTWTEGLAWGIVFAVATTVFREAIAWIQMPTDTPTLWRILNG
Ga0209236_111383313300026298Grasslands SoilMPALSPAVRQALRLIGYVLVTLLVYWANALSPSTARLGILYTIPVLLVTWTEGLAWGIVFAVATTVFREAIAWVQMPADTPMLWRIV
Ga0209027_102322313300026300Grasslands SoilMAGRALRLTGYLLLTLLVYWVNAVTPPAARLGILYIIPVLLVTWTEGLTWGIVFAVTTIGFREATAWVQMPADT
Ga0209239_103611953300026310Grasslands SoilMPALASPAVRQALRLAGYVLLTLLVYWANALTPSTARFGILYTIPVLLVTWTEGLAWGIVFAVATTVFREAIAWIQMPTDTPTLWRI
Ga0209375_108246513300026329SoilMAALPSPGVRQPLRLVGYVFLTLLVYWVNAVTPPAARFGILYTIPVLLVTWTEGLAWGIVFAATTTVFREAIAWVQMPADMPMLWRILNGLAYLAVLGVAMAGLQSLRR
Ga0209375_121381423300026329SoilMPALNFSSPTLGRALRLTGYVLLTLLVYWLNAVTPSTARLGILYIIPVLLVTWTEGLVWGIVFAAATTGLRETIAWVQMPADTPMVWRVVTGLAYLAVL
Ga0209158_124250513300026333SoilMAALSSPNVRRALRLAGYLLLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMVWRIVSGLAYLAVLGVA
Ga0209690_128002523300026524SoilMAALPSPGVRQALRLVGYVFLTLLVYWVNAVTPPAARFGILYTIPVLLVTWTEGLAWGIVFAVSTTVFREAIAWVQMPA
Ga0209807_126603523300026530SoilMPALNFSSPTLRRALRLTGYILLTLVVYWLNVVTPPTARLGILYIIPVLLVTWTEGLVWGIVFAVATTGLREAIAWVQMPADTPMVWRVVTGL
Ga0209157_120363723300026537SoilMAALPSPGVRQALRLVGYVFLTLLVYWVNAVTPPAARFGILYTIPVLLVTWTEGLAWGILLAVATTVFREATAWVQMPADT
Ga0209805_103658813300026542SoilMNPALRRGLRLAGYVLLTLVVYGANVLTPSAARLGILYIVPVLLVTWTEGLTWGIVFAVATTVLREFVAWDQMPPDTPLVWRVANGASYVVVLGIAMAGLQ
Ga0209474_1023866723300026550SoilMNPALRRGLRLAGYVLLTLVVYWANVLTPSAARLGILYIVPVLLVTWTEGLTWGIVFAVATTVLREFVAWDQMPPDT
Ga0208984_110363813300027546Forest SoilMALLNSPVLRRALRLTGYILLTLVVYWVNAITPPAARLGILYIIPVLLVTWTEGLAWGIVFAVVTTGFREATAWVQMPPDTPMVWRIVTGLAYLAVLGLAMAGLQTLRRRE
Ga0209076_108685713300027643Vadose Zone SoilMTLPASPALRRALRLAAYVVLMLLVYWANVFTPSTARLAILYVVPVLLVTWTDGVVWGIVFGVASIGLREMVAW
Ga0209588_125960023300027671Vadose Zone SoilMAALSSPNVRRALRLAGYILLTLVVYWVNAVTPPAARLGILYIIPVLLVTWTEGLAWGILFAVVTTGFREAIAWVQMPADTPMLWRIVSGLAYLAVLGVAMAGLQTL
Ga0209073_1040310013300027765Agricultural SoilMPAFSSPTLRRALRLLGYVVLTLVVYWVNAVTPPDGRFGILYTIPVLLVTWTEGLVWGIVFAAATTVFREAIAWVQMPVDTPMLWRVMNGLAYLAVLGVAMAGLQTLRHSQA
Ga0209177_1046894423300027775Agricultural SoilMPAFSSPTLRRALRLLGYVVLTLVVYWVNAVTPPDGRFGILYTIPVLLVTWTEGLVWGIVFAAATTVFREAIAWVQMPVDTPMLWRVMNGLAYLAVL
Ga0209464_1020694523300027778Wetland SedimentMPRSPASHALRLGVYALLTGLVYVANASTPGAIRLGILYIIPVLLVTWTEGLIWGIVLAVLTIVFREVI
Ga0209382_1189484513300027909Populus RhizosphereMAGAPTAGRVLRLVGYVLLTALVYWANSDTPPTARLGILYVIPVLLVTWTDGLIWGVVFGIASIALRETVALEQMPLDTPLPWRIGNAAAYAAVLAVAI
Ga0307312_1000756013300028828SoilMARALRLTGYVLLTLLVYSINAVTPPAARLGILYTIPVLLVTWTEGLAWGILFAVVTTGFREVIAWEQLPADTPMVWRVITG
Ga0214471_1009102043300033417SoilMAFAISATLRRALRLSGYVLLTLLVYWVNDATPPVARLGVLYIIPVLLVTWTEGLAWGIVFGIASITMREAVAWDQMPAETPLGWL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.