NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103866


Overview

Basic Information
Family ID F103866
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 66 residues
Representative Sequence KELGEYLGDRINNPLAVILASAQLLEMRQRSDATSEAAQRIGAAVSKINEVVREIAIRSGEVPRV
Number of Associated Samples 91
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.68


Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(16.832 % of family members)
Environment Ontology (ENVO) Unclassified
(27.723 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.495 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 55.91%, β-sheet: 0.00%, Coil/Unstructured: 44.09%

Predicted 3D Structure

pTM-score: 0.68

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
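
The cutoff quoted in the note above can be written as a simple check. This is a minimal sketch: the function and constant names are hypothetical, and only the threshold (pTM < 0.7) and the score (0.68) come from this page.

```python
# Threshold taken from the note above: models with pTM < 0.7 are
# labelled low confidence. The constant name is hypothetical.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM score falls below the cutoff."""
    return ptm_score < cutoff


# Family F103866 reports a pTM-score of 0.68.
print(is_low_confidence(0.68))  # True
```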



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF00441 | Acyl-CoA_dh_1 | 45.54
PF00512 | HisKA | 16.83
PF02771 | Acyl-CoA_dh_N | 11.88
PF02770 | Acyl-CoA_dh_M | 4.95
PF13343 | SBP_bac_6 | 2.97
PF06508 | QueC | 1.98
PF12911 | OppC_N | 0.99
PF02900 | LigB | 0.99
PF13701 | DDE_Tnp_1_4 | 0.99
PF02592 | Vut_1 | 0.99
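
The frequencies in the table above are percentages of the 101 family scaffolds, so each can be converted back to an approximate scaffold count. The sketch below assumes every percentage is a whole-number count divided by 101 and rounded to two decimals. That reading is consistent with the values shown but is not stated on the page, and the function name is hypothetical.

```python
TOTAL_SCAFFOLDS = 101  # from the table header


def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Estimate the number of scaffolds behind a rounded percentage."""
    return round(percent * total / 100)


# A few rows from the Pfam table above.
for pfam_id, percent in [("PF00441", 45.54), ("PF00512", 16.83), ("PF12911", 0.99)]:
    print(pfam_id, scaffold_count(percent))
```

Under that assumption, PF00441 neighbors the family on 46 scaffolds, PF00512 on 17, and each 0.99 % entry on a single scaffold.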

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 62.38
COG0037 | tRNA(Ile)-lysidine synthase TilS/MesJ | Translation, ribosomal structure and biogenesis [J] | 1.98
COG0137 | Argininosuccinate synthase | Amino acid transport and metabolism [E] | 1.98
COG0171 | NH3-dependent NAD+ synthetase | Coenzyme transport and metabolism [H] | 1.98
COG0301 | Adenylyl- and sulfurtransferase ThiI (thiamine and tRNA 4-thiouridine biosynthesis) | Translation, ribosomal structure and biogenesis [J] | 1.98
COG0482 | tRNA U34 2-thiouridine synthase MnmA/TrmU, contains the PP-loop ATPase domain | Translation, ribosomal structure and biogenesis [J] | 1.98
COG0519 | GMP synthase, PP-ATPase domain/subunit | Nucleotide transport and metabolism [F] | 1.98
COG0603 | 7-cyano-7-deazaguanine synthase (queuosine biosynthesis) | Translation, ribosomal structure and biogenesis [J] | 1.98
COG0780 | NADPH-dependent 7-cyano-7-deazaguanine reductase QueF, C-terminal domain, T-fold superfamily | Translation, ribosomal structure and biogenesis [J] | 1.98
COG1606 | ATP-utilizing enzyme, PP-loop superfamily | General function prediction only [R] | 1.98
COG1738 | Queuosine precursor transporter YhhQ, DUF165 family | Translation, ribosomal structure and biogenesis [J] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 16.83%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 10.89%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 6.93%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 5.94%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 3.96%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.96%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.96%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.97%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.97%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 2.97%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.97%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.98%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.98%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.99%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.99%
Termite Nest | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Termite Nest | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.99%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.99%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.99%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.99%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.99%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.99%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.99%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.99%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.99%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.99%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.99%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.99%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.99%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002912 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm | Environmental
3300003319 | Sugarcane bulk soil Sample L2 | Environmental
3300004268 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated
3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005564 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG | Host-Associated
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005890 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 | Environmental
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006169 | Termite nest microbial communities from Madurai, India | Environmental
3300006580 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLPC (Metagenome Metatranscriptome) | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009157 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300009823 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_40_50 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014254 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailB_D2 | Environmental
3300014265 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailC_D2 | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental
3300018089 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_20_MG | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300025899 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes) | Host-Associated
3300025918 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes) | Environmental
3300025933 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes) | Host-Associated
3300025960 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes) | Host-Associated
3300026032 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D1 (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026307 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes) | Environmental
3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300027650 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeq | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027775 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes) | Environmental
3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300028592 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30 | Environmental
3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental
3300031562 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031824 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-2 | Host-Associated
3300031858 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2 | Environmental
3300031903 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-1 | Host-Associated
3300031944 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1 | Environmental
3300032002 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3 | Host-Associated
3300032005 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-1 | Host-Associated
3300032179 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D2 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental
3300033550 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4 | Environmental
3300034177 | Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25386J43895_100129984 | 3300002912 | Grasslands Soil | INNPLAVIMASAQLLEMRAPSNATSEAAERITTAVSKINAVVREIASKSGEMA*
soilL2_100850333 | 3300003319 | Sugarcane Root And Bulk Soil | QAAQLAVLRELGEYLGDRINNPLAVILASAQLLEMKAPSDATSEAAERIAGAVKKINAVV
Ga0066398_102046041 | 3300004268 | Tropical Forest Soil | AETQRTVEAAELALLKELGEYLGDRINNPLAVILASAQLLEMKDRNSATSEAAQRIGAAVARINAVVREIAIRSGEAPVS*
Ga0063356_1027626811 | 3300004463 | Arabidopsis Thaliana Rhizosphere | ELGEYLGDRINNPLAVILASAQLLQMKERSSATSEAAARISEAVSRINGVVREIAIRSGEMPRA*
Ga0062591_1007075262 | 3300004643 | Soil | NPLAVILASAQLLEMKAPSDATSEATERIAGAVEKINAVVREIARRSGEAD*
Ga0066672_101390021 | 3300005167 | Soil | ERHAFAQNTERTVETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR*
Ga0066673_107304251 | 3300005175 | Soil | LTERADRKVEAAQLALLKELGEFLGDRINNPLAVIMASAQLLEMRAPSDATSQAAERVTAAVSKITAVVREIAAKSGDELIS*
Ga0066679_108779571 | 3300005176 | Soil | VETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR*
Ga0066679_108924291 | 3300005176 | Soil | VETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIAAKSGERGVS*
Ga0066675_103375692 | 3300005187 | Soil | LALLKELGEYLGDRINNPLAVILASAQLLEMKERSNATTEAAQRIGAAVGKINEVVREIAIRSGETPIV*
Ga0066388_1082566161 | 3300005332 | Tropical Forest Soil | RSMAEAQRTVKSAELQLLKELGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIASRSGVALR*
Ga0070691_108650291 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | LGEYLGDRINNPLAVILASAQLLQMKERSSATSEAAQRIGEAVSRINGVVREIAIRSGELPRA*
Ga0066686_106649832 | 3300005446 | Soil | LAVIMASAQLLEMRAPSSATSEAAERITTAVSKINAVVREIASKSGEMA*
Ga0066686_110647462 | 3300005446 | Soil | GDRINNPLAVIMASAQLLEMRAPGTATTEAAERITAAVSKINAVVREIAEKSGERVV*
Ga0066687_102349041 | 3300005454 | Soil | EAAQLALLKELGEFLGDRINNPLAVIMASAQLLEMRAPGSATTEAAERITAAVSKINAVVREIAEKSGERVV*
Ga0068867_1013080881 | 3300005459 | Miscanthus Rhizosphere | EYLGDRINNPLAVILASAQLLQMKERSSATSEAAQRIGEAVSRINGVVREIAIRSGELPRA*
Ga0073909_101939982 | 3300005526 | Surface Soil | SADAQRTVQAAELALLRELGEYLGDRINNPLAVILASAQLLEMKEHNAATSEAAERIGTAVAKINAVVREIAVRSGEAPVE*
Ga0066661_105929882 | 3300005554 | Soil | GEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR*
Ga0066692_109223452 | 3300005555 | Soil | DRKVEAAQLALLKELGEFLGDRINNPLAVIMASAQLLEMRAPGSATTEAAERITAAVSKINAVVREIAEKSGERVI*
Ga0066698_102410181 | 3300005558 | Soil | RTVETAQLALLKELGEYLGDRINNPLAVILASAQLLEMKERSHATSEAAQRIGAAVSKINDVVREIAIRSGEQPRG*
Ga0066700_105705071 | 3300005559 | Soil | AQNTERTVETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR*
Ga0070664_1023990771 | 3300005564 | Corn Rhizosphere | LLKELGEYLGDRINNPLAVILASAQLLQMKERSSATSEAAQRIGEAVSRINTVVREIAIRSGEIPRA*
Ga0066703_104463101 | 3300005568 | Soil | AQNTERTVETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIAAKSGERGVS*
Ga0066706_113534062 | 3300005598 | Soil | LALLKELGEYLGDRINNPLAVILASAQLLEMKERSNATHEAAQRIGAAVSKINDVVREIAIRSGEQPRG*
Ga0068859_1007877151 | 3300005617 | Switchgrass Rhizosphere | QRTVQAAELALLRELGEYLGDRINNPLAVILASAQLLEMKEHNAATSEAAQRIGSAVAKINAVVREIAVRSGEAPIE*
Ga0066905_1008913581 | 3300005713 | Tropical Forest Soil | QLALLKQLGEFLGDRINNPLAVIMASAQLLEMRAPSDATSEAAERITAAVSKINAVVREIASKSGEV*
Ga0075285_10533522 | 3300005890 | Rice Paddy Soil | QAAELALLKELGEYLGDRINNPLAVILGSAQLLEMKDRSHATSEAAQRIAAAVSKINEVVREIASRSGETPHR*
Ga0075417_100688163 | 3300006049 | Populus Rhizosphere | LKELGEFLGDRINNPLAVIMASAQLLEMRAPGNATSEAAERITAAVSKINAVVREIAVRSGERAS*
Ga0075417_106872031 | 3300006049 | Populus Rhizosphere | RINNPLAVILASAQLLQLKERSTATSEAAERIGEAVSRINQVVREIAIRSGEIPRV*
Ga0082029_11220161 | 3300006169 | Termite Nest | LGDRITNPLAVILASAPLLEMKDRNSATSEAAQRIGAAVAKINAVVREIAIRSGEAPVS*
Ga0074049_126805941 | 3300006580 | Soil | VKSAELQLLKELGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIATRSGVALR*
Ga0079220_120313231 | 3300006806 | Agricultural Soil | VIMASAQLLEMRAPSDATSEAAERITAAVSKINAVVREIASKSGDVVR*
Ga0075425_1018713792 | 3300006854 | Populus Rhizosphere | VEAAQLALLKELGEFLGDRIHNPLAVIMASAQLLEMRAPSNATSEAAERITAAVSKINAVVREIASKSGERV*
Ga0075434_1004904391 | 3300006871 | Populus Rhizosphere | AVILASAQLLEMKERSHATSEAAQRIGAAVSKINDVVREIALRSGETPRT*
Ga0075419_100332285 | 3300006969 | Populus Rhizosphere | EYLGDRINNPLAVILASAQLLEMKAPSDATTEAAERIAGAVKKINTVVREIARRSGEAD*
Ga0066710_1028143211 | 3300009012 | Grasslands Soil | TVETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIAAKSGERGVS
Ga0066710_1043688512 | 3300009012 | Grasslands Soil | LGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR
Ga0111539_111775331 | 3300009094 | Populus Rhizosphere | RINNPLAVILASAQLLEMKEHNAATSEAAERIGTAVAKINAVVREIAVRSGEAPVE*
Ga0111539_118119882 | 3300009094 | Populus Rhizosphere | LASAQLLEMKERSHATSEAAQRIGAAVSKINDVVREIALRSGETPRT*
Ga0066709_1040073631 | 3300009137 | Grasslands Soil | LGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR*
Ga0066709_1043959451 | 3300009137 | Grasslands Soil | TVETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIAAKSGERGVS*
Ga0105092_101496081 | 3300009157 | Freshwater Sediment | SAELALLKELGEYLGDRINNPLAVILASAQLLQIKERSDATSAAAQRIGEAVSRINGVVREIAIRSGEVPRV*
Ga0075423_109658171 | 3300009162 | Populus Rhizosphere | LKELGEYLGDRINNPLAVILASAQLLEMKERSHATSEAAQRIGAAVSKINDVVREIALRSGETPRT*
Ga0075423_110011871 | 3300009162 | Populus Rhizosphere | RINNPLAVILASAQLLEMKERSHATSEAAQRIGAAVSKINDVVREIALRSGETPRT*
Ga0105241_105757981 | 3300009174 | Corn Rhizosphere | EAQRTVKSAELQLLKELGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIASRSGVDLR*
Ga0126374_118088012 | 3300009792 | Tropical Forest Soil | ALLKQLGEFLGDRINNPLAVIMASAQLLEMRAPSDATSEAAERITAAVSKINAVVREIALKSGEV*
Ga0105078_10561711 | 3300009823 | Groundwater Sand | NPLAVILASAQLLEMKAPGDATTEAAERIAGAVKKINAVVREIARRSGEAD*
Ga0126384_108802871 | 3300010046 | Tropical Forest Soil | RKVEAAELALLKQLGEFLGDRINNPLAVIMASAQLLEMRAPSDATSEAAERITAAVSKINAVVREIALKSGEV*
Ga0134071_105919391 | 3300010336 | Grasslands Soil | AQLALLKELGEFLGDRINNPLAIIMASTQLLEMRAPGNATSEAAERITAAVSKINAVVREIAVRSGEAPR*
Ga0126370_100847523 | 3300010358 | Tropical Forest Soil | RINNPLAVIMASAQLLEMRAPSDATSEAAQRITAAVSKINAVVREIALKSGEV*
Ga0126372_117145122 | 3300010360 | Tropical Forest Soil | EAAELALLKQLGEFLGDRINNPLAVIMASAQLLEMRAPSDATSEAAERITAAVSKINAVVREIASKSGEV*
Ga0134125_128791492 | 3300010371 | Terrestrial Soil | LAVILASAQLLEMKEHNAATSEAAERIGTAVAKINAVVREIAVRSGEAPVE*
Ga0134124_131603431 | 3300010397 | Terrestrial Soil | YLGDRINNPLAVILASAQLLQMKERSSATSEAAQRIGEAVSRINTVVREIAIRSGEIPRA
Ga0134123_104725551 | 3300010403 | Terrestrial Soil | ELALLKELGEYLGDRINNPLAVILASAQLLQMKERSSATSEAAQRIGEAVSRINGVVREIAIRSGELPRA*
Ga0137362_106727541 | 3300012205 | Vadose Zone Soil | HERQALAERTDRKVEAAQLALLKELGEFLGDRINNPLAVIMASAQLLEMRAPSNATSEAAERITTAVSKINAVVREIASKSGEMA*
Ga0137358_108472071 | 3300012582 | Vadose Zone Soil | EYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITMAVSKINAVVREIAAKSGERGVS*
Ga0137410_100427435 | 3300012944 | Vadose Zone Soil | DRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIAAKSGERGVS*
Ga0134078_100534432 | 3300014157 | Grasslands Soil | PLAVIMASAQPLEMRAPGSATTEDAERITAAVSKINAVVREIAEKSSERVI*
Ga0075312_10065861 | 3300014254 | Natural And Restored Wetlands | NPLAVILASAQLLEMKAPGDATTEAAERIAGAVKKINAVVREIARRSGEVD*
Ga0075314_10338232 | 3300014265 | Natural And Restored Wetlands | LGEYLGDRINNPLAVILASAQLLEMKAPGDATTEAAERIAGAVKKINAVVREIARRSGEVD*
Ga0137420_12520711 | 3300015054 | Vadose Zone Soil | LGEYLGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIAAKSGERGVS*
Ga0134073_100356632 | 3300015356 | Grasslands Soil | QLALLKELGEFLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITAAVSKINAVVREIAEKSGERVV*
Ga0132255_1022346892 | 3300015374 | Arabidopsis Rhizosphere | LGDRINNPLAVILASAQLLEMKEHNAATSEAAERIGTAVAKINAVVREIAVRSGEAPVE*
Ga0132255_1037737262 | 3300015374 | Arabidopsis Rhizosphere | LGDRINNPLAVIMASAQLLEMRAPSDATSEAAERITAAVSKINAVVREIALKSGEV*
Ga0134112_101777431 | 3300017656 | Grasslands Soil | VEAQRTVQAAELALLRELGEYLGDRINNPLAVILASAQLLEMKERSHATSEAAHRIGAAVSKINEVVREIAIRSGETPRA
Ga0187774_111061871 | 3300018089 | Tropical Peatland | KELGEYLGDRINNPLAVILASAQLLEMRQRSDATSEAAQRIGAAVSKINEVVREIAIRSGEVPRV
Ga0066667_100156956 | 3300018433 | Grasslands Soil | KELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR
Ga0207642_107424882 | 3300025899 | Miscanthus Rhizosphere | LLKELGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINYVVREIATRSGVALR
Ga0207662_102667791 | 3300025918 | Switchgrass Rhizosphere | QAAELALLRELGEYLGDRINNPLAVILASAQLLEMKEHNAATSEAAQRIGSAVAKINAVVREIAVRSGEAPVE
Ga0207706_106489911 | 3300025933 | Corn Rhizosphere | RINNPLAVILASAQLLQMKERSSATSEAAQRIGEAVSRINGVVREIAIRSGELPRA
Ga0207651_116516611 | 3300025960 | Switchgrass Rhizosphere | TEAQRTVKSAELQLLKELGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIATRSGVALR
Ga0208419_10297332 | 3300026032 | Natural And Restored Wetlands | ALSLNVDRDVQAAQLAVLRELGEYLGDRINNPLAVILASAQLLEMKAPGDATTEAAERIAGAVKKINAVVREIARRSGEVD
Ga0209237_10231021 | 3300026297 | Grasslands Soil | GDRINNPLAVIMASAQLLEMRAPSNATSEAAERITTAVSKINAVVREIASKSGEMA
Ga0209469_10866311 | 3300026307 | Soil | KELGEFLGDRINNPLAVIMASAQLLEMRAPGSATTEAAERITAAVSKINAVVREIAEKSGERVI
Ga0209152_104789252 | 3300026325 | Soil | RTVETAQLALLKELGEYLGDRINNPLAVIMASAQLLEMRAPGNATTEAAERITIAVSKINAVVREIASKSGERGVR
Ga0209056_106524781 | 3300026538 | Soil | VQAAELALLKELGEYLGDRINNPLAVILASAQLLEMKERSHATSEAAHRIGAAVSKINEVVREIAIRSGETPRA
Ga0209161_103959742 | 3300026548 | Soil | QRRVQAAELALLKELGEYLGDRINNPLAVILASAQLLEMKEHSHATSEAAQRIGAAVSKINDVVREIAIRSGEQPRG
Ga0256866_11751451 | 3300027650 | Soil | ELGEYLGDRINNPLAVILASAQLLQMKERSNATSEAAQRIGEAVSRINGVVREIAIRSGEIPRV
Ga0209073_105186301 | 3300027765 | Agricultural Soil | VIMASAQLLEMRAPSDATSEAAERITAAVSKINAVVREIASKSGDVVR
Ga0209177_102301491 | 3300027775 | Agricultural Soil | DRINNPLAVILASAQLLQMRERSNATSEAADRIGEAVSRINQVVREIAIRSGEVPRA
Ga0209814_105004352 | 3300027873 | Populus Rhizosphere | YLGDRINNPLAVILASAQLLQLKERSTATSEAAERIGEAVSRINQVVREIAIRSGEIPRV
Ga0209382_113761772 | 3300027909 | Populus Rhizosphere | ELGEYLGDRINNPLAVILASAQLLEMKAPSDATTEAADRIAGAVKKINAVVREIARRSGEVD
Ga0268264_126818212 | 3300028381 | Switchgrass Rhizosphere | LGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIATRSGVALR
Ga0247822_106023152 | 3300028592 | Soil | GDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIASRSGVALR
Ga0247825_109076832 | 3300028812 | Soil | AELALLRELGEYLGDRINNPLAVILASAQLLQMKERSSATSEAAARIGEAVSRINGVVREIAIRSGEMPRA
Ga0310886_103659122 | 3300031562 | Soil | GDRINNPLAVILASAQLLQMKERSSATSEAAQRIGEAVSRINGVVREIAIRSGELPRA
Ga0307468_1014121041 | 3300031740 | Hardwood Forest Soil | LGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAADRIGAAVSKINDVVREIASRSGVALR
Ga0307413_118262162 | 3300031824 | Rhizosphere | DRINNPLAVILASAQLLEMKERSIATSEAAERIGAAVSKINEVVREIAIRSGEQPRP
Ga0310892_104270661 | 3300031858 | Soil | QRTVKSAELQLLKELGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIASRSGVDLR
Ga0307407_104532632 | 3300031903 | Rhizosphere | YLGDRINNPLAVILASAQLLEMKAPSDATSEAAERIAGAVKKINAVVREIARRSGEVD
Ga0310884_101543271 | 3300031944 | Soil | EYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIASRSGVDLR
Ga0307416_1014288751 | 3300032002 | Rhizosphere | VQAAQLAVLRELGEYLGDRINNPLAVILASAQLLEMKAPSDATSEAAERIAGAVKKINAVVREIARRSGEVD
Ga0307411_102344742 | 3300032005 | Rhizosphere | EAQRTVQSAELALLKELGEYLGDRINNPLAVILASAQLLEMKERSIATSEAAERIGAAVSKINEVVREIAIRSGEQPRP
Ga0307411_111122311 | 3300032005 | Rhizosphere | LGDRINNPLAVILASAQLLEMKAPGDATTEAAEHIAGAVKKINAVVREIARRSGEAD
Ga0307411_115351922 | 3300032005 | Rhizosphere | LGEYLGDRINNPLAVILASAQLLQMKERSTATSEAAQRISEAVSRINGVVREIAIRSGEVPRS
Ga0310889_102198222 | 3300032179 | Soil | NPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIASRSGVALR
Ga0307471_1029795921 | 3300032180 | Hardwood Forest Soil | GDRINNPLAVILGSAQLLEMRDHSQAASEAADRIGAAVSKINDVVREIASRSGVALR
Ga0335084_106162212 | 3300033004 | Soil | VKSAELQLLKELGEYLGDRINNPLAVILGSAQLLEMRDHSQAASEAAERIGAAVSKINDVVREIASRSGVELR
Ga0326726_112644562 | 3300033433 | Peat Soil | RINNPLAVILASAQLLQMRERSTATSEAADRIGEAVSRINQVVREIAIRSGEVPRV
Ga0247829_117074931 | 3300033550 | Soil | VQGAELALLKELGEYLGDRINNPLAVILASAQLLQIKERSTATSEAADRIGEAVSRINQVVREIAIRSGEIPRV
Ga0364932_0073974_1035_1280 | 3300034177 | Sediment | MAEAQRTVQGAELALLKELGEYLGDRINNPLAVILASAQLLQMKERSTATNEAADRIGEAVSRINKVVREIAVRSGEAPRI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"