NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F104835

Metagenome / Metatranscriptome Family F104835

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104835
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 100 residues
Representative Sequence MMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELK
Number of Associated Samples 82
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 11.00 %
% of genes from short scaffolds (< 2000 bps) 6.00 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (89.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(29.000 % of family members)
Environment Ontology (ENVO) Unclassified
(52.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(81.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: Yes Secondary Structure distribution: α-helix: 42.86%    β-sheet: 18.37%    Coil/Unstructured: 38.78%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF00486Trans_reg_C 100.00



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A89.00 %
All OrganismsrootAll Organisms11.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10019987All Organisms → cellular organisms → Bacteria2392Open in IMG/M
3300002561|JGI25384J37096_10057021All Organisms → cellular organisms → Bacteria1467Open in IMG/M
3300005171|Ga0066677_10012139All Organisms → cellular organisms → Bacteria3767Open in IMG/M
3300005559|Ga0066700_10292670All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1146Open in IMG/M
3300009089|Ga0099828_10150346All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium2056Open in IMG/M
3300012977|Ga0134087_10075712All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1371Open in IMG/M
3300018431|Ga0066655_10035541All Organisms → cellular organisms → Bacteria2467Open in IMG/M
3300020022|Ga0193733_1033624All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1448Open in IMG/M
3300024330|Ga0137417_1026137All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1407Open in IMG/M
3300026331|Ga0209267_1035935All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium2330Open in IMG/M
3300026548|Ga0209161_10168663All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1249Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil29.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil21.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil15.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil14.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001086Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3EnvironmentalOpen in IMG/M
3300001089Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3EnvironmentalOpen in IMG/M
3300001145Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2EnvironmentalOpen in IMG/M
3300001154Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1EnvironmentalOpen in IMG/M
3300001160Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2EnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015357Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026312Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027388Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027583Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027591Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027603Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027681Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027727Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027738Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12709J13192_100722513300001086Forest SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAVTQALSNVTVTGDTVSVTAQALLTPSVAEVFVRVLDPKGQEVSQPRAELKKLFNNPSLTTPIAGANA
JGI12683J13190_101754113300001089Forest SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAITQAMNNVTVTGDTVSVTAQALLTPSVADNLVRVLDYKGQEVGQPRAELKKLFNDP
JGI12682J13319_100521333300001145Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFVRVLDVKLQEVGQPQPALKKLFNSPSLTDPIEGAKAG
JGI12636J13339_102076123300001154Forest SoilMMFRGALARLAAQYFVLLILILGCFDLIVYVTVSEALQAKADKDLRSAVTQALSNVTVTGDTVSVTAQALLTPSVAEVF
JGI12654J13325_101912023300001160Forest SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAITQAMNNVTVTGDTVSVTAQALLTPSVADN
JGI12635J15846_1035094723300001593Forest SoilMMFRGALVRLAAQYFVLLVLILGLLDVIVYVTVSQSLQAKADNDLRHAVSQAAKGVVITESVTVNAQGLLDPSVADTFVRVLDIKGQEQ
JGI12635J15846_1039802623300001593Forest SoilMMFRGALARLAAQYFVLLILILGCFDLIVYVTVSEALQAKADKDLRSAVTQALSNVTVTGXTVSVTAQALLTPSVAEVFVRVLDPKGQEVSQPRAELKRLFNNPSLTA
JGI12635J15846_1044162913300001593Forest SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAVTQALSNVTVTGDTVSVTAQALLTPSVAEVFVRVLDPKGQEVSQPRAELKKLFNNPSLTTPIAG
JGI12635J15846_1046766423300001593Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFVRVLDVKLQEVGQPQPALKKLFNSPSLTDPIEG
JGI12053J15887_1034798323300001661Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQTKADNDLRHAVSQAAKQVVVTETVSVNAQGLLDPSVADTFVRVLDNKGQEVSQPQPALKKLFNNPSLFGSITAA
JGI12053J15887_1046512113300001661Forest SoilMMFRGALVRLAAQYFVLLVLILGXFDLIVYVTVSQSLQTKADNDLQHAVSQAAKQVVATDTVTVSAQGLLDPSVADTFVRVLD
JGIcombinedJ26739_10059906223300002245Forest SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAITQAMNNVTVTGDTVSVTAQALLTPSVADNLVRVL
JGI25385J37094_1001998753300002558Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQSKADKDLQTAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLDLKGQEVSQPRAELKKLFNNPSLSAPTLAANAGQSGQTQVASGSELYM
JGI25384J37096_1005702113300002561Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQSKADKDLQTAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLDLKGQEVSQPRAELKKLFNNPSLSAPTLAANAGQS
JGI25382J37095_1018917223300002562Grasslands SoilMMFRGALMRLAAQYFVLLVLXLGCFDLIVYVTVSQALQSKADKDLQTAVTQAMSNVTVTGDTVIFNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSAPIG
JGI25382J43887_1025748623300002908Grasslands SoilMIFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADSDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFFRVLNNKDQEVTQPQPA
JGI25386J43895_1010855713300002912Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQSKADKDLQTAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSAPIGAANAGQSGQTQVASGSELYMVDTR
JGI25389J43894_106594613300002916Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQGKADYDLQHAVTQATKEVVVTGDNVSVNAQVLLDPSVADTFVRVLDIKAQ
Ga0066677_1001213963300005171SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNDPSLS
Ga0066677_1073568223300005171SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVTQAMSNVTVSGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSAPIGAANAGQPGQTQVSSGSE
Ga0066683_1008118713300005172SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLQLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSLSAPIAAAKAGRSVDTRISTGTDFYV
Ga0066673_1006943113300005175SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQ
Ga0066685_1017337933300005180SoilMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADNDLRLAVAQAYKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSLSAP
Ga0066685_1055755823300005180SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTF
Ga0070699_10154886213300005518Corn, Switchgrass And Miscanthus RhizosphereMIFRGALVRLAAQYFLLLVLILGFFDLIVYVTVSQSLQAKADSDLKHAVSQAAKEVVVTDTVSVNAQGLLDPSVADTFVRVLDNKGQEVSQPQPALKK
Ga0066697_1014627433300005540SoilMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADNDLRLAVAQAYKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRP
Ga0066697_1026539823300005540SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSL
Ga0066697_1081590523300005540SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVSQAMSNVTVTGETVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKK
Ga0066701_1059613123300005552SoilMIFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADSDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFVRVLDNKGQEVSQPQPALKKLFNSPSLTEPIAAARAGQSADTRV
Ga0066698_1041723613300005558SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPS
Ga0066700_1029267023300005559SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQSKADKDLQTAVTQAMSNVTVTGDTVSVNPQPLVSPSLADVFVRVLDFKGQEVSQPRAELKKLFNNLSLSAPIGAANAGQPGQTQVTSGSELYMVDTRP
Ga0066700_1079088623300005559SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNDPSLSAPIAAAKAGRTVDT
Ga0066670_1050736823300005560SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVTQAMSNVTVSGDTVSVNPQPLLTPSLADVFV
Ga0066705_1008107913300005569SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVSQAAKEVVPSDTVSVNAQGLLDPSVADTFVRVLNNKDQEVSQPQPAL
Ga0066654_1054868313300005587SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQPLLTPSLA
Ga0066665_1096164013300006796SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLRLAVGQANKDFVVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPHPELKKL
Ga0066659_1063548113300006797SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVTQAMSNVTVSGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSAPI
Ga0079219_1122691023300006954Agricultural SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQTKADNDLKHAVTQAAKEVVPSDTVSVNAQGLL
Ga0099793_1059130313300007258Vadose Zone SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAVTQALSNVTVTGDTVSVTAQALLTPSVAEVF
Ga0099829_1037889913300009038Vadose Zone SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVTQATKEVVPSDTVSVNAQGLLDPSVADTFVRVLDIKG
Ga0099828_1015034643300009089Vadose Zone SoilMIFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADSDLKHAVSQAAKEVVATDTVSVNAQGLLD
Ga0099792_1024778623300009143Vadose Zone SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADNDLRHAIAQATRNVTVTGDTVSATAQSLLEPSAADNLVRVLDYKGQEVSQPRAELKKLFNDPSLSAPIGAANAGRPGDTSVAYG
Ga0099792_1118442223300009143Vadose Zone SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFVRVLD
Ga0134109_1014809423300010320Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLQLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQ
Ga0134063_1014116623300010335Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQQLLTPSLADVFVRVLDSKGQEVSQPRAELKKLFNNS
Ga0134063_1054096213300010335Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPSVADT
Ga0150983_1194599723300011120Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLLVYVTVSQSLQAKADNDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFVRVLDVKLQEVGQPQPALKK
Ga0137382_1130230013300012200Vadose Zone SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVSQAAKQLVITDSVSVNAQGLLDPSVADTFMRV
Ga0137381_1166797523300012207Vadose Zone SoilMIFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTITGETISVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELK
Ga0137370_1032154323300012285Vadose Zone SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVSQAMSNVTVTGETVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLGAPIGAAHAGQPGQ
Ga0137370_1095315113300012285Vadose Zone SoilMMFRGALARLAAQYFILLVLILGCFDLIVYVTVSQALQTKADNDLQHAVGQATKEVVVSGDTPSVNAQGMLDPSVADTFLRVLDIKGQEV
Ga0137361_1091657723300012362Vadose Zone SoilMMFRGALMRLAAQYFVLLVLILGCFDLIVYVTVSQALQSKADKDLQTAVTQAMSNVTVTGDTVIFNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSAPIGAANAGHSGQTQVASGSELYMV
Ga0137398_1103824213300012683Vadose Zone SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLQTAVTQAMSNVTVTGDTVSVNPQPLLTPSLADV
Ga0134110_1014623813300012975Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFI
Ga0134087_1007571213300012977Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADNDLRLAVAQAYKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKL
Ga0134081_1002065713300014150Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLQLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSLSAPIAAAKAGRSVDTRISNSTDFYVI
Ga0134079_1040455613300014166Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLRLAVGQANKDFVVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNKPSLSAP
Ga0134072_1014433023300015357Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLQLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDI*
Ga0066655_1003554163300018431Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLQLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSL
Ga0066667_1030168413300018433Grasslands SoilMMFRGALARLAAQYFILLVLILGCFDLIVYVTVSQSLQAKADNDLNHAVSQAAKQVVITDTVSFNAQGLLDPSVADVFVRVLDIKGQEVSQPRAELKKLFNNPNLTAPIAAAK
Ga0066667_1056814323300018433Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELK
Ga0066667_1060575823300018433Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLQLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSLSAPIAAAKAGRSVDTRISNST
Ga0066662_1033589723300018468Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLRLAVGQANKDFVVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSLSAPIAAA
Ga0066662_1055284613300018468Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQQLLTPSLADVFVRVLDSKGQEV
Ga0193733_103362433300020022SoilMMFRGALVRLAAQYFILLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLD
Ga0210399_1151324913300020581SoilMMFRGALVRLAAQYFVLLVLILGFFDLLIYVTVSQSLQAKADNDLKHAVIQAAKMVVATDTVSVNAQGLLDPSIADTFVRVLDTKGQEVTQPQPALKKLFNNPSPIDPIAAAKAGHSAETRVWNGAYLYIV
Ga0210402_1195218223300021478SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQTKADNDLKHAVSQGAKQLVITDSVTVNAQGLLDPSVADTFVRVLDVKGQEVGQPQPALKKLFNSPSLKNPIVGANAGQSTDTRV
Ga0210410_1135077523300021479SoilMMFRGALVRLAAQYFVLLVLILGFFDLIIYVTVSQSLQAKADNDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFV
Ga0137417_102613733300024330Vadose Zone SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQSKADKDLQTAVTQAMSNVTVTGDTVSVNPQPLLTPSLADVFVRVLDLKGQEVSQPRAELKKLFNNP
Ga0209350_113890813300026277Grasslands SoilMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADNDLRLAVAQAYKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELK
Ga0209238_118265323300026301Grasslands SoilMIFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGETISVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSTP
Ga0209238_118905413300026301Grasslands SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQGKADYDLQHAVTQATKEVVVTGDNVSVNAQV
Ga0209469_105815833300026307SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLQLAIGQANKDFAVTGDTVTFNGQGLLDPSV
Ga0209153_122299713300026312SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGDTVSVNPQQLLTPSLADVFVRVLDSKGQEVSQPRAELKKLFNNPSFSTAT
Ga0209687_102133813300026322SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLRLAVGQANKDFVVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSLSAPI
Ga0209152_1010495523300026325SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIGQANKDFAVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNDPSLSAPIAAAKAGRTVDTRVSNAT
Ga0209802_104426013300026328SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVTQAMSNVTVSGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNN
Ga0209267_103593513300026331SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQALQAKADNDLRLAVGQANKDFVVTGDTVTFNGQGLLDPSVADTFIRVLDIKGQEQGQPRPELKKLFNNPSLSAPIAAAKAGRTLDTRISNATDLYVIDT
Ga0209158_128724013300026333SoilMMFRGALARLAAQYFILLVLILGCFDLIVYVTVSQSLQAKADNDLNHAVSQAAKQVVITDTVSFNAQGLLDPSVADVFVRVLDIKGQEVSQPRAEL
Ga0209377_118960323300026334SoilMMFRGALARLAAQYFILLVLILGCFDLIVYVTVSQSLQAKADNDLNHAVSQAAKQVVITDTVSFNAQGLLDPSVADVF
Ga0209377_121583713300026334SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVTQAMSNVTVSGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPS
Ga0257176_104040023300026361SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLNKAVTQAMSMVQITGDTVSVTAQSLISPSAAEVFVRVLDVKGQEVTQPRTELKKLFNNPSLPIGLANAGQPS
Ga0209690_125926823300026524SoilMIFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADSDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFFRVLN
Ga0209160_132604223300026532SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVTQAMSNVTVSGDTVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSAPIGAA
Ga0209056_1030769913300026538SoilMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSEALQTKADNDLRLAIAQANKDFAVTGDTVTFNGQGLLDPSVADTF
Ga0209376_136043223300026540SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLETAVTQAMSNVTVTGETVSVNPQPLLTPSLADVFVRVLDFKGQEVSQPRAELKKLFNNPSLSAPIGAANAGQPGQTQVS
Ga0209161_1016866333300026548SoilMIFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQALQAKADKDLATAVTQAMSNVTVTGETISVNPQPLLTPSLADVFV
Ga0208995_102293813300027388Forest SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKSDKDLQSAVTKAMSNVTVTGDTVQALTPSAEIFVRVLDFKSQEVSQPRAEL
Ga0209527_103058533300027583Forest SoilMMFRGALVRLAAQYFVLLVLILGCFDLIVYVTVSQSLQAKADNDLEHAVSQASKEVVVTDTVSVNAQGLLDPSVADTFVRVLDIKGQEVSQPQTALKKLFNNPSLTAPIAAAKAGQSGDTRV
Ga0209733_102934813300027591Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQTKADNDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFVRVLDVKGEEVG
Ga0209331_108450413300027603Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVSQAAKEVVATDTVSVNAQGLLDPSVADTFVRVLDVKGQEVGQPQPALKKLFNS
Ga0208991_106933313300027681Forest SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKSDKDLQSAVTKAMSNVTVTGDTVQALTPSAEIFVRV
Ga0209328_1013423623300027727Forest SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQTKADNDLTHAVAQATKEVVVTGDTVSVNPQGLLDPSVADTFVRVLDIKGQEVSQ
Ga0209328_1015457913300027727Forest SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAVTQAMSNVTVTGDTVSVAAQALLTPSVAEVFVRVLDPKGQEVSQPRAELKKLFNNP
Ga0208989_1010956323300027738Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQTKADNDLRHAVSQAAKQVVVTETVSVNAQGLLDPSVADTFVRVLDNKGQEVSQPQPALKKLFNNP
Ga0209283_1074914913300027875Vadose Zone SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAVTQAMSNVTVTGDTVSVAAQALLTPSVAEVFVRVLDPKGQEVSQPRAELKKLFNNPSLTAPIAAANAGQLSQ
Ga0209590_1052942813300027882Vadose Zone SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYFTVSQSLQAKADNDLQHAVSQAARQVVATDTVSVNAQGLLDPSVADTFVRVLDIKGQEVSQPQPALKKLFNN
Ga0308194_1013940423300031421SoilMMFRGALARLAAQYFILLVLILGCFDLIVYVTVSQALQSKADNDLEHAVGQATKEVVVAGDTVTVNAQGLLDPSVADTFLRVLD
Ga0307479_1122799823300031962Hardwood Forest SoilMMFRGALARLAAQYFVLLVLILGCFDLIVYVTVSQALQAKADKDLRTAVTQAMSNVTVTGDTVSVTAQTLLTPSVAEVFVRVLDPKGQEVSQPRAELKK
Ga0307471_10388346623300032180Hardwood Forest SoilMMFRGALVRLAAQYFVLLVLILGFFDLIVYVTVSQSLQAKADNDLKHAVTQASKEVNPTDPVQVNAQGLLDPSVADTFFRVLDNKGQEASQPQPALKKLFNSPSLNDPIKAAKAGQS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.