NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F091814

Metagenome / Metatranscriptome Family F091814

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F091814
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 54 residues
Representative Sequence TAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA
Number of Associated Samples 92
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(25.234 % of family members)
Environment Ontology (ENVO) Unclassified
(49.533 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(62.617 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 53.66%    β-sheet: 0.00%    Coil/Unstructured: 46.34%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 107 Family Scaffolds
PF12838Fer4_7 64.49
PF00499Oxidored_q3 12.15
PF00420Oxidored_q2 9.35
PF01059Oxidored_q5_N 1.87
PF00662Proton_antipo_N 0.93
PF00361Proton_antipo_M 0.93
PF13237Fer4_10 0.93
PF00146NADHdh 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 107 Family Scaffolds
COG0839NADH:ubiquinone oxidoreductase subunit 6 (chain J)Energy production and conversion [C] 12.15
COG1009Membrane H+-translocase/NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunitEnergy production and conversion [C] 1.87
COG0650Formate hydrogenlyase subunit HyfCEnergy production and conversion [C] 0.93
COG1005NADH:ubiquinone oxidoreductase subunit 1 (chain H)Energy production and conversion [C] 0.93


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil25.23%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil17.76%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil14.02%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil7.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil3.74%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil3.74%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.74%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.80%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.80%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand2.80%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.87%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil1.87%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.87%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.87%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.93%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.93%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.93%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.93%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.93%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.93%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000953Soil microbial communities from Great Prairies - Kansas Corn soilEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005890Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009812Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60EnvironmentalOpen in IMG/M
3300009813Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20EnvironmentalOpen in IMG/M
3300010095Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010126Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010132Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012224Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300018029Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP06_20_MGEnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300025319Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1EnvironmentalOpen in IMG/M
3300025992Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026308Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033814Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17EnvironmentalOpen in IMG/M
3300033815Sediment microbial communities from East River floodplain, Colorado, United States - 31_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI11615J12901_1051440633300000953SoilYLTVIATAIWFFHDQLGWAYDTRFSLALFGVNLALAVPLVFVLDRGHIVAGSVQRRRA*
JGI25384J37096_1004056033300002561Grasslands SoilVIATAIWFLHDRLGWTYDSRFALALFALNLVLAVPLFFVLDRGHIIAGSVVEQGGRA*
JGI25382J37095_1026415313300002562Grasslands SoilGYVTVIATAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGHLVAGSVAEERA*
Ga0062590_10154114523300004157SoilLGWAYDARFALALFGMNLLIGIPVFFVLDRGRLVAGSVAEERA*
Ga0066672_1018499213300005167SoilWILHAVLGWTYDTRFGLVLFGLNVLLAIPLFFVLDRGHIVAGSMAGERA*
Ga0066677_1012675613300005171SoilIWFLHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGAVAEA*
Ga0066677_1073578823300005171SoilATAIWFLHDGLGWTYDTRFALALFALNLALAVPLFFVLDRGRIVAGSMAEGEA*
Ga0066673_1008386613300005175SoilIWFLHDRLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*
Ga0066690_1000693813300005177SoilAMWILHAVLGWTYDTRFGLVLFGLNVLLAIPLFFVLDRGHIVAGSMAGERA*
Ga0066690_1007613953300005177SoilTAIWFLHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVAEA*
Ga0066690_1058414523300005177SoilHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVAEA*
Ga0066684_1054818113300005179SoilLGWSYDTRFGLVLFALNVLLAVPLFFGLDRGHLIAGAVAEEPA*
Ga0066676_1019851513300005186SoilLGWTYDSRFALALFALNLVLAVALFFVLDRGHLIAGSVTEQGERA*
Ga0066388_10780430813300005332Tropical Forest SoilLTVIATAIWLFHDFLGWAYDTRFSLALFAVNVALAIPLLFVLDRGHIVAGSVERRRA*
Ga0070714_10087324213300005435Agricultural SoilVLHERLGWAYDSRFALALFGVNILLAVPLFFVLDRGHIVSGSAAEERA*
Ga0066686_1020420113300005446SoilAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA*
Ga0066689_1012784513300005447SoilLHDRLAWTYDTRFALTLFALNVLLAIPLFFVLDRGHIVAGSVAEEGRAS*
Ga0070706_10009532853300005467Corn, Switchgrass And Miscanthus RhizosphereVMLPLALGYITVMATAIWLLHARLGWNYDARFALALFSLNVLLAIPLFFALDRGHLISGSEARGET*
Ga0070706_10027312843300005467Corn, Switchgrass And Miscanthus RhizosphereYLTVIATAIWFFHDQLGWAYDTRFNLALFAVNVALAVPLVFVLDRGHVVAGSVERRRA*
Ga0070707_10069402833300005468Corn, Switchgrass And Miscanthus RhizosphereYITVMATAIWLLHARLGWNYDARFALALFSLNVLLAIPLFFALDRGHLISGSEARGET*
Ga0066692_1084567523300005555SoilIWFLHDQLGWMYDSRFALALFGLNVLLAVPLFFGLDRGHIIAGSVVEEGGRA*
Ga0066699_1030468313300005561SoilALWILHAVLGWAYDTRFGLALFGLNVLLAIPLFFVLDRGHIVAGSMAGERA*
Ga0066702_1077560723300005575SoilIWFLHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVVEA*
Ga0066706_1054015333300005598SoilLVYLTVIATAIWYLHERLGWVYDARFALALGAVNVALAVPLFFVLDRGRLVSGSVAREGA
Ga0075285_101852213300005890Rice Paddy SoilATAIWLLHEGLGWTYDTRFALTLFGLNLLLAVPLFFVLDRGHIVSGSVAEERA*
Ga0075023_10063994113300006041WatershedsAIWFFHDQLGWAYDTRFSVAMFAVNLALAVPLFFVLDRGHLVSGSVQRRRA*
Ga0066652_10170532113300006046SoilVIASAIWFLHDRLGWTYDARFALTLFGLNVLLAVSLLFWLDRGHIVAGSVAEEGGRA*
Ga0079222_1115790423300006755Agricultural SoilSYVSVVATAVWFFHDRLGWAYDTRFALALFGVNVLLAVPLFFVLDRGHIVSGSVAEERV*
Ga0066653_1005325313300006791SoilIWILHALLGWTYDTRFGLVLFALNVLLAIPLLFGLDRGHLVAGSVAEERA*
Ga0066665_1009583113300006796SoilTAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGRLVAGSVAEERA*
Ga0066665_1088993823300006796SoilAIWYLHARLGWAYDARFALALAAVNLALAVPLVFVLDRGRLVNGSVARERA*
Ga0066659_1152894213300006797SoilLTYLTVIASAIWFLHDRLGWTYDARFALTLFGLNVLLAVPLLFWLDRGHIVAGSVAEEGGRA*
Ga0079221_1026453913300006804Agricultural SoilAALLPLALTYVTVIASAIWLLHDRMGWTYDSRFALTLFGLNVLLAVPLLFWLDRGHIVAGSMAEQGGRA*
Ga0075424_10258701713300006904Populus RhizosphereLLYVSVIATVVWVLRARLGWAYDSRFALALFGVNLLLAVPLFFVLDRGHIVSGSTAEERA
Ga0075435_10196511213300007076Populus RhizosphereVTVIATAIWFLHDQLGWMYNTQFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*
Ga0099828_1152395223300009089Vadose Zone SoilAAIWFLHAQLGWEYDARFAATLFGVNLILGVFVFFVLDRGRIVSGSVARERG*
Ga0099828_1172571623300009089Vadose Zone SoilLGWGYDRRFALALFGVNVLLAVPLFFVLDRGRVIAGSVAEERV*
Ga0066709_10089992543300009137Grasslands SoilLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGQA*
Ga0066709_10238979113300009137Grasslands SoilTAIWFLHDQLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*
Ga0105067_110115223300009812Groundwater SandLVYLMVIATAIWYLHDQLGWSYDGRFAGVLFGVNLVLAVPLVFVLDRGRLVSGSMEEEEGKA*
Ga0105057_106110513300009813Groundwater SandQLGWSYDGRFAGVLFGVNLVLAVPLVFVLDRGRLVSGSMEEEEGKA*
Ga0127475_105821723300010095Grasslands SoilVIATAIWILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVAGAVAQEPS*
Ga0127482_103612513300010126Grasslands SoilATAIWILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVAGAVAQEPS*
Ga0127482_111782833300010126Grasslands SoilATAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA*
Ga0127455_117086323300010132Grasslands SoilWILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVGRSGSGAELTWPSA*
Ga0134070_1016148933300010301Grasslands SoilTAIWLLHDQLGWSYGTPFALALFGLNVLLAIPLFFVLDRGHLVSGSVAEEAG*
Ga0134070_1026825423300010301Grasslands SoilAIWILHEQLGWTYGTRFALALFALNVLLAIPLFFVLDRGRIVAGSVAEERA*
Ga0134088_1010918413300010304Grasslands SoilVLHDRLGWTYDSRFALALCGLNVLLAIPLFFVLDRGHLIAGSVAEGGA*
Ga0134064_1010925513300010325Grasslands SoilIATAIWILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVAGAVAQEPS*
Ga0134064_1036815523300010325Grasslands SoilLAYVTVIATAIWILHEQLGWTYGTRFALALFGLNVLLAIPLFFVLDRGRIVAGSVAEERA
Ga0134111_1047326223300010329Grasslands SoilAYVTIIATAIWFLHDRLDWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*
Ga0134080_1025854413300010333Grasslands SoilPLALGYVTVIATAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGHLVAGSVAEERA*
Ga0126376_1280555613300010359Tropical Forest SoilIATAIWVFHDSLGWAYDTRFSLALFAVNVALAIPLLFVLDRGHIVAGSVERRRA*
Ga0134066_1013152223300010364Grasslands SoilPLALTYLTVIASAIWFLHDRLGWTYDARFALTLFGLNVLLAVSLLFWLDRGHIVAGSVAEEGGRA*
Ga0137364_1025761113300012198Vadose Zone SoilGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGRRA*
Ga0137382_1089064913300012200Vadose Zone SoilTIIATAIWFLHDRLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*
Ga0137382_1091407413300012200Vadose Zone SoilATAIWFLHDQLGWLYDSRFALALFGLNVLLAVPLFFGLDRGHIIAGSVVEEGGRA*
Ga0137380_1007253253300012206Vadose Zone SoilPLALGYVSVIATAIWLLHDRLAWTYDTRFALTLFALNALLAIPLFFVLDRGHIVAGSVAPEGGRA*
Ga0134028_125307623300012224Grasslands SoilIWFLHDQLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*
Ga0137387_1003418613300012349Vadose Zone SoilVMASAIWLLHARLGWTYDTRFALALFGLNVLLAIPLFFALDRGHLISGSEARGEA*
Ga0137387_1018893813300012349Vadose Zone SoilTAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*
Ga0137387_1061042733300012349Vadose Zone SoilLLHDRLAWTYDTRFALTLFALNALLAIPLFFVLDRGHIVAGSVAPEGGRA*
Ga0137369_1099042923300012355Vadose Zone SoilALGYVTVIATAIWFLHDQLGWTYDTRFALALFGLNLLLAVPLFLVLDRGHIIAGSEAAEGT*
Ga0137404_1174310413300012929Vadose Zone SoilTVIATAIWFLHDQLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGA*
Ga0134110_1032232113300012975Grasslands SoilIATAMWILHAVLGWTYDTRFGLVLFVLNVLLAIPLFFGLDRGHLVAGSVAEERA*
Ga0134076_1014941913300012976Grasslands SoilQLGWTYGTRFALALFGLNVLLAIPLFFVLDRGRIVAGSVAEERA*
Ga0134075_1004953353300014154Grasslands SoilLTVIATAIWFLHNRLGWSYDSRFALALFGLNLLLAVLLFFVLDRGHIIAGSVAAEEGAGGV*
Ga0134078_1019865733300014157Grasslands SoilDGLGWTYDTRFALALFALNLTLAVPLFFVLDRGRIVAGSVAEA*
Ga0134112_1030139213300017656Grasslands SoilVQLGWAYDTRFALALGAVNVVLAIPLLFVLDRGHLVSGSVARGRA
Ga0134083_1046535713300017659Grasslands SoilHDWLGWTYDSRFALALFALNLVLAVPLFFVLDRGHLIAGSVVEQGERA
Ga0187824_1007301233300017927Freshwater SedimentWFFHDFLGWAYDTRFSLALFAVNVALAIPLLFVLDRGHIVAGSVERRRA
Ga0187787_1026630823300018029Tropical PeatlandLALVYVTLLAGAIWVLHERLGWQYDQRFALALLGLNVALAIPLFFVLDRGHLIAGSEARAGGEV
Ga0179592_1034226923300020199Vadose Zone SoilGWNYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGP
Ga0215015_1098091223300021046SoilVLHNRLGWSYDQRFALALFGVNLLLAVPLFFVLDRGRLVAGSVAEERV
Ga0210379_1037305523300021081Groundwater SedimentALGYLAVLATAIWVLHDRLGWAYNTRFALALFGVNLLIAVPLVFVLDRGRLVAGSVAEEH
Ga0209520_1071905613300025319SoilLASAIWLLHDRLGWAYDTRFSLALFGLNVLLAIPLFFVLDRGRIVAGSVAEEGA
Ga0208775_101793723300025992Rice Paddy SoilATAIWLLHEGLGWTYDTRFALTLFGLNLLLAVPLFFVLDRGHIVSGSVAEERA
Ga0209234_123639323300026295Grasslands SoilERLGWVYDARFALALGAVNVALAVPLFFVLDRGRLVSGSVARERA
Ga0209237_128659613300026297Grasslands SoilTIIATAIWFLHDQLGWMYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA
Ga0209265_105025913300026308SoilFLHDQLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA
Ga0209239_115020833300026310Grasslands SoilVLATAIWILHALLGWSYDTRFGLVLFALNVLLAVPLFFGLDRGHLIAGAVAEEPA
Ga0209239_120016123300026310Grasslands SoilTALWILHAVLGWSYDTRFGLVLFVLNVLLAIPLFFGLDRGHLVAGSVAEERA
Ga0209470_110848813300026324SoilHARLGWTYDARFALALFGLNVLLAIPLFFALDRGHLISGSEARGEA
Ga0209375_123521913300026329SoilDRLGWPYDTRFALALFGLNLLLAVLLFFVLDRGHIIAGSVAAEEGAGGV
Ga0209804_118818533300026335SoilLHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVAEA
Ga0209159_118989713300026343SoilDRLGWPYDTRFALALFGLNLLLAVLLFFVLDRGHIIAGSVAAEEGAAGRGA
Ga0209808_131766513300026523SoilTANWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA
Ga0209690_115251433300026524SoilYITVMASAIWLLHARLGWTYDTRFALALFGLNVLLAIPLFFALDRGHLISGSEARGEA
Ga0209058_132662523300026536SoilMLPLALGYVTVIATAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGHLVAGSVAEERA
Ga0179587_1064450713300026557Vadose Zone SoilVIATAIWLLHDQLGWNYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGP
Ga0209689_100709713300027748SoilAVLGWAYDTRFGLALFGLNVLLAIPLFFVLDRGHIVAGSMAGERA
Ga0209177_1008321213300027775Agricultural SoilWFLHARLGWAYDSRFALALFGVNLLLAVPLFFVLDRGHIVSGSAAEERA
Ga0209074_1028863013300027787Agricultural SoilIATAIWFLHDQLGWTYDTRFALALFALNLALAVPLFFVLDRGRLVAGSMAEGEA
Ga0209180_1039118533300027846Vadose Zone SoilLHDRLGWGYDRRFALALFGVNVLLAVPLFFVLDRGRVIAGSVAEERV
Ga0209590_1021428113300027882Vadose Zone SoilLTVIATAIWFLHDQLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGA
Ga0209069_1067872213300027915WatershedsVYLTVIATAIWFFHDQLGWAYDTRFSVAMFAVNLALAVPLFFVLDRGHLVSGSVQRRRA
Ga0209853_109116613300027961Groundwater SandTVVASAIWLLHDRLGWVYDTRFSLALFGLNVLLAIPLFFVLDRGHIIAGSVAEERA
Ga0307282_1042499213300028784SoilDRLGWTYDTRFALVLFAVNLLLAVPLFFVLDRGHIIAGSVAEERA
Ga0308194_1008236033300031421SoilIWLLHDRLGWTYDTRFALVLFAVNLLLAVPLFFVLDRGHIIAGSVAEERA
Ga0307469_1170702223300031720Hardwood Forest SoilLHDRLKWAYDTRFALALFGMNLLIGIPLFFLLDRGRLVAGSVVEEQV
Ga0307471_10001010413300032180Hardwood Forest SoilLSWAYDSRFALALFGVNLLLAVPLFFVLDRGHIVSGSAAEERA
Ga0307471_10148081213300032180Hardwood Forest SoilVTVIATAIWFLHDRLGWMYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVLEEGGSA
Ga0307472_10235675023300032205Hardwood Forest SoilLALGYVTVIATAIWFLHDQLGWIYNTQFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA
Ga0310812_1048818723300032421SoilALIYLTVIATAIWFFHDQLGWAYDTRFSLALFGVNLALAVPLVFVLDRGHIVAGSVQRRR
Ga0316628_10115813433300033513SoilYLTVVATAIWYLHAVLGWAYDMRFSLVMFALNLVLAVPVFLVLDRGHLISGSVQRRSA
Ga0364930_0138561_633_8183300033814SedimentVYLMVIATAIWYLHDQLGWGYDGRFAGVLFAINLALAVPLFFVLDRGRLVSGSMEGEGGK
Ga0364946_014478_3_1703300033815SedimentATAIWYLHDQLGWGYDRRFAGVLFAVNLALAVPLVFVLDRGRLVSGSMEEEGGKA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.