NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101668

Metagenome / Metatranscriptome Family F101668

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101668
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 77 residues
Representative Sequence SHRRNSREIELMTWAKHAATLRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGALARQVIDTFVAKN
Number of Associated Samples 90
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 33.33 %
% of genes near scaffold ends (potentially truncated) 2.94 %
% of genes from short scaffolds (< 2000 bps) 1.96 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score0.69

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.020 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(28.431 % of family members)
Environment Ontology (ENVO) Unclassified
(26.471 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(59.804 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 16.35%    β-sheet: 37.50%    Coil/Unstructured: 46.15%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.69
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF01841Transglut_core 6.86
PF00294PfkB 4.90
PF03466LysR_substrate 4.90
PF10544T5orf172 3.92
PF01145Band_7 2.94
PF03459TOBE 2.94
PF00005ABC_tran 1.96
PF00210Ferritin 1.96
PF02810SEC-C 1.96
PF02585PIG-L 0.98
PF10415FumaraseC_C 0.98
PF03364Polyketide_cyc 0.98
PF00903Glyoxalase 0.98
PF01522Polysacc_deac_1 0.98
PF01494FAD_binding_3 0.98
PF00557Peptidase_M24 0.98
PF02371Transposase_20 0.98
PF13360PQQ_2 0.98
PF05974DUF892 0.98
PF06305LapA_dom 0.98
PF09723Zn-ribbon_8 0.98
PF07085DRTGG 0.98
PF12697Abhydrolase_6 0.98
PF06250YhcG_C 0.98
PF03747ADP_ribosyl_GH 0.98
PF03734YkuD 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG06542-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductasesEnergy production and conversion [C] 1.96
COG0578Glycerol-3-phosphate dehydrogenaseEnergy production and conversion [C] 0.98
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 0.98
COG0665Glycine/D-amino acid oxidase (deaminating)Amino acid transport and metabolism [E] 0.98
COG0726Peptidoglycan/xylan/chitin deacetylase, PgdA/NodB/CDA1 familyCell wall/membrane/envelope biogenesis [M] 0.98
COG1376Lipoprotein-anchoring transpeptidase ErfK/SrfKCell wall/membrane/envelope biogenesis [M] 0.98
COG1397ADP-ribosylglycohydrolasePosttranslational modification, protein turnover, chaperones [O] 0.98
COG2120N-acetylglucosaminyl deacetylase, LmbE familyCarbohydrate transport and metabolism [G] 0.98
COG3034Murein L,D-transpeptidase YafKCell wall/membrane/envelope biogenesis [M] 0.98
COG3547TransposaseMobilome: prophages, transposons [X] 0.98
COG3685Ferritin-like metal-binding protein YciEInorganic ion transport and metabolism [P] 0.98
COG3771Lipopolysaccharide assembly protein YciS/LapA, DUF1049 familyCell wall/membrane/envelope biogenesis [M] 0.98
COG4804Predicted nuclease of restriction endonuclease-like (RecB) superfamily, DUF1016 familyGeneral function prediction only [R] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.02 %
All OrganismsrootAll Organisms0.98 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005518|Ga0070699_100005668All Organisms → cellular organisms → Bacteria10938Open in IMG/M
3300011120|Ga0150983_12891500Not Available559Open in IMG/M
3300030937|Ga0138302_1745172Not Available632Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil28.43%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil17.65%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil10.78%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.86%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil4.90%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil3.92%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa3.92%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.94%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil2.94%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.96%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.96%
Plant RootsHost-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots1.96%
Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Bulk Soil0.98%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.98%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.98%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.98%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.98%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300004080Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300004082Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006162Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012EnvironmentalOpen in IMG/M
3300006174Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014EnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300009101Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300016422Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111EnvironmentalOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021377Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R7Host-AssociatedOpen in IMG/M
3300021384Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9Host-AssociatedOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021439Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R03EnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027266Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027326Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027629Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027812Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029944II_Palsa_E1 coassemblyEnvironmentalOpen in IMG/M
3300030057Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Palsa_E1_1EnvironmentalOpen in IMG/M
3300030738Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VDE Co-assemblyEnvironmentalOpen in IMG/M
3300030740Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada ARE Co-assemblyEnvironmentalOpen in IMG/M
3300030800Metatranscriptome of forest soil microbial communities from Dalarna County, Sweden - Site 2 - Mineral N1 (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030937Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A4_MS_spring Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031233Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_E3_2EnvironmentalOpen in IMG/M
3300031234Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031679Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031793Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f21EnvironmentalOpen in IMG/M
3300031831Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f20EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031893Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f28EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031942Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032756Forest Soil Metatranscriptomics Site 2 Humus Litter Mineral Combined AssemblyEnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J11758_1260139423300000789SoilVLRGRAHYGRVGEIAEYIFRFVRDPKGELLQFGLVAFSKSEDRALARHIVDIFGTEN*
JGI1027J12803_10676792723300000955SoilSEEVDYWRDELKSNRRNTREIELMTWAKHAATLRGRAHYGRMDEVAEYVFQFNRTSEGELLKFGAVAFSNSAHWGLARQVIDIFGTKN*
JGI12627J18819_1014408613300001867Forest SoilLKSHRRNSREIELISWKKHAATLRGRAHYGRIGEVAEYLFQFKRIAEGELLKFGVVALSKPEDRALARQVIDIFGTRN*
Ga0062385_1121880723300004080Bog Forest SoilRNSREIELITWAKHAATLRGRAHYGPFDEFAEYVFQFIRTPEGEVWKFGAVAFSKAEDRPLARQVIDIFGTNN*
Ga0062384_10124185223300004082Bog Forest SoilKPEEVDYWRDELKSNRRNRHEIELMTWGNHTSTLRGRADYGRSDEVAEYLFQFIRTSEGKLLKFGTVAFSKSVDRALARHVIDIVGTKN*
Ga0062387_10014312013300004091Bog Forest SoilQLKSNRRNNHELEVMTWGKHASKLRGRADYGHLDEIAEYVFQFIRSSEGKLLKFGTVAFSKFADRALARQIIDVAGTKN*
Ga0058899_1202054323300004631Forest SoilNRRNSHEIELMTWGKHASTLRGRADYGLMDEVAEYVFQFIRTAEGELLKFGTVAFSKSVDRALARQVIDIVGTKN*
Ga0070709_1166453413300005434Corn, Switchgrass And Miscanthus RhizosphereTWGKHASTLRGRADYGRVDEVAEYLFQFIRTTEGKLLKFGTVAFSKSVDSALARQVIDIVGTKN*
Ga0070699_10000566813300005518Corn, Switchgrass And Miscanthus RhizosphereMTWAKHAATLRGRAHYGPMDEVAEYVFEFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN*
Ga0070697_10009061333300005536Corn, Switchgrass And Miscanthus RhizosphereMTWAKHAATLRGRAHCGRIDEIAEYVFQFIRTSEGELLEFGAVAFSKSEDRALSRQAIDIFGTKN*
Ga0070762_1027069013300005602SoilNAVTLRGRAHYGRMDEFAEYVFQFNRTAEGKLSKCGAIAFSKSEDRALARQVIDRFVAKTDRDSVNR*
Ga0070764_1110253513300005712SoilAANAVTLRGRAHYGRMDEFAEYVFQFNRTAEGKLSKCGAIAFSKSEDRALARQVIDRFVAKTDRDSVNR*
Ga0070717_1195960323300006028Corn, Switchgrass And Miscanthus RhizosphereEEVDYWREQLKSNRRNPHEIELMTWGKHASTLRGRADYGRVDEVAEYLFQFIRTTEGKLLKFGTVVFSKAVDSALARQVIDIAGTQN*
Ga0075030_10149047813300006162WatershedsREIELITWAKHAVTLRGRAHYGPIDEIAEYVFQFIRTSEGELLKFGAVAFSKSEDRALARQVIDIFGNKN*
Ga0075014_10085423813300006174WatershedsPEEVDYWRDELKSHRKNSREIELMTWAKHAATLRGRVHYGRIDEVAEYVFQFNRTSEGELLKFGTVAFSRSEDTALARQVIDIFGTKN*
Ga0070712_10185024413300006175Corn, Switchgrass And Miscanthus RhizosphereAENAVVLRGRAHYGRLNEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGALARQVIDTFVAKNCPAIDR*
Ga0079222_1233148523300006755Agricultural SoilPIELLTWSGPQATLRGRAQYGPLDEVAEYVFRFMRTPEGGLLKFGAVAFCHRADRALARQVIEAFAAQN*
Ga0105247_1129273113300009101Switchgrass RhizosphereLPLAGPEEVDYWRNELKSNRRNSREIELMTWAENAVILRGRAHYGRLDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGALARQVIDTYVAKN*
Ga0126374_1048453813300009792Tropical Forest SoilEEVDYWRNELKSNRRNSREIELMTWAKNAVTLRGRAHYGHMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN*
Ga0126377_1284151413300010362Tropical Forest SoilEEVDYWRNELKSNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN*
Ga0126381_10415649523300010376Tropical Forest SoilTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTSEGRLLKSGAIAFSKSEDGALARQAIDTFVAKN*
Ga0150983_1289150013300011120Forest SoilVDYWRDELKSHRRNSREIELMTLAKHAATLRGRAHYGRIDEVAEYVFQFTRTSGGELLKFGAVAFSKSDDTALARQVIDIFGTKN*
Ga0150983_1307605213300011120Forest SoilEEVDYWRDQLKSNRRNSHEIELMTWGKHASTLRGRADYGRVDEVAEYLFQFIRTNEGKLLKFGTVAFSKSVDSALARQVIDIVGTKN*
Ga0150983_1313816123300011120Forest SoilEEVDYWRDELKSNRRNTREIELMTWAKHAATLRGRAHYGPIDEVAEYVFQFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN*
Ga0150983_1428190913300011120Forest SoilHELEVMTWGNHASTLRGRADYGHLDEIAEYVFQFIRTSEGKLLKFGTVAFSKSVDRALARQVIDVVGTRN*
Ga0137362_1018961523300012205Vadose Zone SoilMTWAKHAATLRGRAHYGPMDEVAEYVFQFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN*
Ga0137360_1083565633300012361Vadose Zone SoilMLRGRANYGCIDEVTEYVFQFNRTSEGKLLKFGAIAFSKSEDGALARQVIDAFAAENCPTINPKP*
Ga0137398_1100139313300012683Vadose Zone SoilRNRHEIELMTWGQHASTLRGRADYGRVDEVAEYLFQFIRTAEGRLLKFGTVAFSKAVDSALARQVIDIAGTKN*
Ga0164303_1028326713300012957SoilRPEEVDYWRNELKSNRRNSREIELMTWAENAVILRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGALARQVIDTFVAKN*
Ga0164305_1107262013300012989SoilEEVDYWRDELKSHRRNSHEIELMTWAKHAATLRGRAHYGRMDEVAEYVFQFYRTSEGELLKFGAIAFSKSDDRALARQVIDIFGTKN*
Ga0182036_1024719313300016270SoilIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0182041_1002916213300016294SoilNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0182041_1170090213300016294SoilHAATLRGRAHYGRIDEFAEYVFQFIRTSEGELWKFGAVAFSKAEDRALARQVIDIFGTKN
Ga0182033_1024983013300016319SoilRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0182033_1126247723300016319SoilELKSNRRNSREIELMTWAKNAVTLRGRANYGRIDQVVEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIENCTPMTGAASD
Ga0182035_1006609923300016341SoilELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0182032_1188973723300016357SoilSEEVDYWRNELKSNRRNSREIELMTWAKNAVTLRGRAHYGHMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0182039_1015020843300016422SoilEEVDYWRDELKSNRRNSREIELMTWAKNAVTLRGRANYGGVDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIQNCTPLTGAVCE
Ga0182038_1168728013300016445SoilELKSNRRNSREIELMTWAKNAVTLRGRANYGGVDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIQNCTPLTGAVCE
Ga0210406_1055233723300021168SoilMTWAKHAATLRGRAHYGPMDEVAEYVFEFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKIERAFLVSH
Ga0210400_1061126313300021170SoilLRGRADYGRVDEVAEYLFQFIRTTEGKLLKFGTVAFSKSVDSALARQVIDIVGTKN
Ga0210408_1013815633300021178SoilMTWGNHASTLRGRADYGHLDEIAEYVFQFIRTSEGKLLKFGTVAFSKSVDRALARQVIDVVGTRN
Ga0210408_1021201633300021178SoilMTWAKHAATLRGRAHYGPMDEVAEYVFEFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN
Ga0210388_1009352643300021181SoilKSNRRNSREIELMTWAANAVTLRGRAHYGRMDEFAEYVFQFNRTAEGKLSKCGAIAFSKSEDRALARQVIDRFVAKTDRDSVNR
Ga0213874_1011946713300021377Plant RootsREIELMTWVEKAVTLRGRAHYGGMNEVAEYVFQFNRSSEGKLLKCGAIAFSKSEDGALARQVIDRYVAHT
Ga0213876_1066524813300021384Plant RootsKTTLRGRAQYGGLDEVAEYVFRFMRTPEGGLLKSGAVAFSKPEDRALARQVIDAFEAQN
Ga0210393_1022951733300021401SoilHASKLRGRADYGHLDEIAEYVFQFIRSSEGKLLKFGTVAFSKTADRALARQIIDVAGTKN
Ga0210397_1064756223300021403SoilTWEQHASKLRGRADYGHLDEIAEYVFQFIRSSEGKLLKFGTVAFSKFADRALARQIIDIAGTKN
Ga0210383_1071660423300021407SoilEEVDYWRYELKSNRRNSREIELMTWAANAVTLRGRAQYGRMDEVAEYVFQFNRTAEGKLLKFGAIAFSKSEDRALARQVIDTFAAKTERGPVNR
Ga0210384_1039619913300021432SoilRNTTEIELMTWGKHASTLRGRADYGRVDEVAEYLFQFIRTTEGKLLKFGTVAFSKSVDSALARQVIDIVGTKN
Ga0213879_1012582413300021439Bulk SoilSNRKNLRPIELLTWSGHQATLRGRAQYGPRDEVAEYVFRFMRTSEGGLLKFGAVAFSHRADGALARQVIEAFGAQN
Ga0210392_1017420733300021475SoilIELMTWAANAVTLRGRAQYGRMDEVAEYVFQFNRTAEGKLLKFGAIAFSKSEDRALARQVIDTFAAKTERGPVNR
Ga0210392_1105699313300021475SoilIELMTWAANAVTLRGRAHYGRMDEFAEYVFQFNRTAEGKLLKFGAIAFSKSEDRALARQVIDRFVAKTDRGSVSR
Ga0210398_1141578623300021477SoilEVMTWEQHASKLRGRADYGHLDEIAEYVFQFIRSSEGKLLKFGTVAFSKSADRALARQIIDVAGTKN
Ga0210402_1005396753300021478SoilHRRNSHELEVMTWGNHASTLRGRADYGHLDEIAEYVFQFIRTSEGKLLKFGTVAFSKSVDRALARQVIDVVGTRN
Ga0210402_1045968133300021478SoilRYELKSNRRNSREIELMTWAANAVTLRGRAHYGRMDEFAEYVFQFNRTAEGKLLKFGAIAFSKSEDRALARQVIDRFVAKTDRGSVSR
Ga0210410_1079441313300021479SoilRRNSREVELMTWAENAVTLRGRAHYGSVDEFAEYVFQFNRTAEGKLLKSGAIAFSKSEDRALAQQVIDRFAAKIDRGPVNH
Ga0242665_1008750123300022724SoilDYWRDELKSHRRNSREIELMTWAKHAATLRGRVHYGRIDEVAEYVFQFNRTSEGELLKFGTVAFSRSEDRALARQVIDIFGTKN
Ga0179589_1044589213300024288Vadose Zone SoilVDYWRDELKSNRRNSHEIELMTWGNYTSTLRGRADYGRSDEVAEYLFQFIRTSEGKLLKFGTVAFSKSVDRALARHVIDIVGTKN
Ga0207646_1024517613300025922Corn, Switchgrass And Miscanthus RhizosphereMTWAKHAATLRGRAHYGPMDEVAEYVFQFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN
Ga0207664_1138278123300025929Agricultural SoilLFTPEEVDYWRDELKSHRRNSREIELMTWAKHAATLRGRAHYGRMDEVAEYVFQFYRTSEGELLKFGAIAFSKSDDRALARQVIDIFGTKN
Ga0207665_1023584023300025939Corn, Switchgrass And Miscanthus RhizosphereWAKHAATLRGRAHYGPIDEVAEYVFQFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN
Ga0179593_104635423300026555Vadose Zone SoilMPPRLRGRADYGRVDEVAEYLFQFIRTAEGRLLKFGTVAFSKAVDSALARQVIDIAGTKN
Ga0209215_104218213300027266Forest SoilMTWAKHAATLRGRAHYGPVDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0209731_102507223300027326Forest SoilKHAATLRGRAHYGRIGEVAEYLFQFKRIAEGELLKFGVVALSKPEDRALARQVIDIFGTR
Ga0209422_106481113300027629Forest SoilSTPEEVDYWRDQLKSNRRNSHEIELMTWGKHASTLRGRADYGRGDEVAEYLFRFVRTSEGHLLKFGTVAFSKSVDSALARQVIDIVGTEN
Ga0209736_110864113300027660Forest SoilIELMTWGKHASTLRGRADYGRVDEVAEYLFQFIRTTEGKLLKFGTVVFSKAVDSALARQVIDIAGTQN
Ga0209656_1022943013300027812Bog Forest SoilNSREIELMTWAKHAATLRGRVHYGRVDEVAEYVFQFNRTSEGELLKFGTVAFSRSEDRALARQVIDIFGTKN
Ga0209526_1068395823300028047Forest SoilEEVDYWRNELKSNRRNSREIELMTWAENAVVLRGRAHYGRLDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGALARQVIDTFVAKN
Ga0308309_1073570423300028906SoilRNSREIELMTWAANAVTLRGRAHYGRMDEFAEYVFQFNRTAEGKLSKCGAIAFSKSEDRALARQVIDTFAAKTDRGPVNR
Ga0311352_1009359213300029944PalsaTPEEVDYWRDQLKVNRRNSLEIELMTWAKHSSTLRGRADYGRSDEVAEYLFQFIRTSDGKLLKFGTVAFSKSVDSALARQVIDIVGTKN
Ga0302176_1022177513300030057PalsaVDYWRDQLKVNRRNSLEIELMTWAKHSSTLRGRADYGRSDEVAEYLFQFIRTSDGKLLKFGTVAFSKSVDSALARQVIDIVGTKN
Ga0265462_1267221413300030738SoilKSNRRNSHEIELMTWGKHASTLRGRAEYGLMDEVAEYIFQFIRTAEGELLKFGTVAFSKSVDRALARQVIDIVGTKN
Ga0265460_1206495623300030740SoilYEIELMTWGKHASTLRGRAEYGLMDEVAEYIFQFIRTAEGELLKFGTVAFSKSVDRALARQVIDIVGTKN
Ga0074032_1109676713300030800SoilAATLRGRVHYGRIDEVAEYVFQFIRTSEGELLKFGTVAFSRSKDTALARQVIDIFGTKN
Ga0138302_174517223300030937SoilSHRRNSREIELMTWAKHAATLRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGALARQVIDTFVAKN
Ga0170824_11574598023300031231Forest SoilWRDELKLHRRNSREIELMTWAKHATTLRGRVHYGRIDEVAEYVFQFNRTSEGELLKFGTVAFSRSEDTALARQVIDIFGTKN
Ga0302307_1033150423300031233PalsaRNSLEIELMTWAKHSSTLRGRADYGRSDEVAEYLFQFIRTSDGKLLKFGTVAFSKSVDSALARQVIDIVGTKN
Ga0302325_1342169223300031234PalsaLRGRADYGRMDEVAEYLFQFVRTSEGKLLKFGTVAFSKSVDSALARQVIDIVGTKN
Ga0310915_1020423233300031573SoilYWHEELKSNRRNSREIELMTWAKNAVTLRGRANYGGVDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIQNCTPLTGAVCE
Ga0318561_1061751113300031679SoilRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0306918_1002051213300031744SoilGSEEVDYWRNELKSNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0306918_1017426113300031744SoilVDYWREELKSNRRDSREIELMTWARNAVTLRGRANYGRVDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIQNCTPLTGAVCE
Ga0306918_1125263613300031744SoilWAKNAVTLRGRAHYGHMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0307475_1103477813300031754Hardwood Forest SoilWRDELKSNRRNTREIELMTWAKHAATLRGRAHYGPMDEVAEYVFQFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN
Ga0318548_1064565523300031793SoilSGVEEVAYWRDELKSNRRNSREIELMTWAKNAVTLRGRANYGRIDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGPQARQVIDTFVIQNCKPLTGAVSE
Ga0318564_1048971713300031831SoilSNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0306925_1098182213300031890SoilVTLRGRANYGRIDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARQVIDTFVIQNCTPLTGAESE
Ga0318536_1056320513300031893SoilGSEEVDYWRNELKSNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0306921_1097575213300031912SoilRMLSLSGSEEVDYWRNELKSNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0310916_1001372113300031942SoilNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0310916_1037390713300031942SoilNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0310913_1051841523300031945SoilGFEEVDYWREELKSNRRNSREIELMTWAKNAVTLRGRANYGGVDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIQNCTPLTGAVCE
Ga0310909_1021353743300031947SoilEVDYWHEELKSNRRNSREIELMTWAKNAVTLRGRANYGGVDEVAEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIQNCTPLTGAVCE
Ga0306926_1092832923300031954SoilYWRTELKSNRRNSREIELMTWAKNAVTLRGRANYGRIDQVVEYVFQFNRTAEGGLLEFGAIAFSKSEDGAQARHVIDTFVIENCTPVTGAASD
Ga0306922_1095447113300032001SoilIELMTWAKNAVTLRGRAHYGHMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0306924_1005764753300032076SoilSGSEEVDYWRNELKSNRRNSREIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTPEGKLLKFGAIAFSKSEDGTLARQVIDKLPARER
Ga0306924_1079298013300032076SoilKSNRRNSHEIELMTWAKNAVTLRGRAHYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDGTLARQVIDKFVAEN
Ga0307471_10007249123300032180Hardwood Forest SoilMTWAKHAATLRGRAHCGRIDEIAEYVFQFIRTSEGELLEFGAVAFSKSEDRALSRQAIDIFGTKN
Ga0307472_10162816723300032205Hardwood Forest SoilELMTWAKHAATLRGRAHYGPMDEVAEYVFQFNRTSEGELLKFGAVAFSKSADWGLARQVIDIFGTKN
Ga0315742_1234965423300032756Forest SoilDYWRDQLKLNRRNSHEIELMTWEKHASTLRGRADYGRGDEVAEYLFRFVRTSEGHLLKFGTVAFSKSVDSALARQVIDVVGTEN
Ga0335079_1228536913300032783SoilEEVDYWRNALKSNRRNSREIELMTWAENAVTLRGRANYGRMDEVAEYVFQFNRTSEGKLLKFGAIAFSKSEDASLARQVIDTFVAKN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.