NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F078339

Metagenome / Metatranscriptome Family F078339

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F078339
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 95 residues
Representative Sequence MLTEENRTDILSLWAGLVSVVAIYFLIGSFWKAVLIGAFVGGSAALGYGTRWLLKGSFAFAVLAIAVALGIPPPDQWLQLLYEAREAVLALRTSG
Number of Associated Samples 79
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 68.10 %
% of genes near scaffold ends (potentially truncated) 25.00 %
% of genes from short scaffolds (< 2000 bps) 81.03 %
Associated GOLD sequencing projects 71
AlphaFold2 3D model prediction Yes
3D model pTM-score0.31

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (76.724 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(41.379 % of family members)
Environment Ontology (ENVO) Unclassified
(42.241 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.862 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 65.85%    β-sheet: 0.00%    Coil/Unstructured: 34.15%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.31
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 116 Family Scaffolds
PF13505OMP_b-brl 3.45
PF14499DUF4437 2.59
PF11159DUF2939 1.72
PF02622DUF179 1.72
PF01068DNA_ligase_A_M 1.72
PF00872Transposase_mut 1.72
PF13414TPR_11 0.86
PF01548DEDD_Tnp_IS110 0.86
PF02223Thymidylate_kin 0.86
PF09586YfhO 0.86
PF06078DUF937 0.86
PF13560HTH_31 0.86
PF03797Autotransporter 0.86
PF04055Radical_SAM 0.86
PF01527HTH_Tnp_1 0.86
PF13751DDE_Tnp_1_6 0.86
PF00034Cytochrom_C 0.86
PF00589Phage_integrase 0.86

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 116 Family Scaffolds
COG1423ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) familyReplication, recombination and repair [L] 1.72
COG1678Putative transcriptional regulator, AlgH/UPF0301 familyTranscription [K] 1.72
COG1793ATP-dependent DNA ligaseReplication, recombination and repair [L] 1.72
COG3328Transposase (or an inactivated derivative)Mobilome: prophages, transposons [X] 1.72
COG0125Thymidylate kinaseNucleotide transport and metabolism [F] 0.86
COG3547TransposaseMobilome: prophages, transposons [X] 0.86
COG3753Uncharacterized conserved protein YidB, DUF937 familyFunction unknown [S] 0.86


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms76.72 %
UnclassifiedrootN/A23.28 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000364|INPhiseqgaiiFebDRAFT_101933915Not Available540Open in IMG/M
3300000579|AP72_2010_repI_A01DRAFT_1035475Not Available720Open in IMG/M
3300000651|AP72_2010_repI_A10DRAFT_1042565Not Available592Open in IMG/M
3300000789|JGI1027J11758_13033895Not Available515Open in IMG/M
3300000956|JGI10216J12902_108482430Not Available652Open in IMG/M
3300002245|JGIcombinedJ26739_101011021All Organisms → cellular organisms → Bacteria → Proteobacteria716Open in IMG/M
3300002245|JGIcombinedJ26739_101255712All Organisms → cellular organisms → Bacteria → Proteobacteria631Open in IMG/M
3300002245|JGIcombinedJ26739_101596458All Organisms → cellular organisms → Bacteria → Proteobacteria549Open in IMG/M
3300002245|JGIcombinedJ26739_101663256All Organisms → cellular organisms → Bacteria → Proteobacteria537Open in IMG/M
3300005167|Ga0066672_10785025All Organisms → cellular organisms → Bacteria → Proteobacteria601Open in IMG/M
3300005186|Ga0066676_10424414All Organisms → cellular organisms → Bacteria → Proteobacteria899Open in IMG/M
3300005332|Ga0066388_100205756All Organisms → cellular organisms → Bacteria → Proteobacteria2591Open in IMG/M
3300005332|Ga0066388_101098468All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1346Open in IMG/M
3300005332|Ga0066388_101125741All Organisms → cellular organisms → Bacteria → Proteobacteria1332Open in IMG/M
3300005332|Ga0066388_105100335All Organisms → cellular organisms → Bacteria → Proteobacteria667Open in IMG/M
3300005363|Ga0008090_14436892All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter asburiae506Open in IMG/M
3300005467|Ga0070706_100030005All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium5008Open in IMG/M
3300005467|Ga0070706_101065317All Organisms → cellular organisms → Bacteria → Proteobacteria745Open in IMG/M
3300005471|Ga0070698_100119879All Organisms → cellular organisms → Bacteria2591Open in IMG/M
3300005471|Ga0070698_100375767Not Available1353Open in IMG/M
3300005471|Ga0070698_100484551All Organisms → cellular organisms → Bacteria → Proteobacteria1174Open in IMG/M
3300005546|Ga0070696_101286338All Organisms → cellular organisms → Bacteria → Proteobacteria620Open in IMG/M
3300005552|Ga0066701_10773747All Organisms → cellular organisms → Bacteria → Proteobacteria573Open in IMG/M
3300005555|Ga0066692_10678145All Organisms → cellular organisms → Bacteria → Proteobacteria641Open in IMG/M
3300005559|Ga0066700_10016305All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4031Open in IMG/M
3300005559|Ga0066700_10619849All Organisms → cellular organisms → Bacteria753Open in IMG/M
3300005566|Ga0066693_10180025All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Rhizobium → Rhizobium leguminosarum808Open in IMG/M
3300005713|Ga0066905_100271950All Organisms → cellular organisms → Bacteria → Proteobacteria1314Open in IMG/M
3300005713|Ga0066905_101288734All Organisms → cellular organisms → Bacteria → Proteobacteria657Open in IMG/M
3300005713|Ga0066905_101387080All Organisms → cellular organisms → Bacteria636Open in IMG/M
3300005764|Ga0066903_100043176All Organisms → cellular organisms → Bacteria → Proteobacteria5359Open in IMG/M
3300005764|Ga0066903_101251580Not Available1382Open in IMG/M
3300005764|Ga0066903_102303585All Organisms → cellular organisms → Bacteria → Proteobacteria1040Open in IMG/M
3300005764|Ga0066903_103949925All Organisms → cellular organisms → Bacteria795Open in IMG/M
3300005937|Ga0081455_10053403All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium3452Open in IMG/M
3300006028|Ga0070717_11182020Not Available696Open in IMG/M
3300006028|Ga0070717_11852772Not Available545Open in IMG/M
3300006031|Ga0066651_10264945All Organisms → cellular organisms → Bacteria → Proteobacteria912Open in IMG/M
3300006046|Ga0066652_100041617Not Available3391Open in IMG/M
3300006046|Ga0066652_100722962Not Available949Open in IMG/M
3300006854|Ga0075425_101340042All Organisms → cellular organisms → Bacteria → Proteobacteria811Open in IMG/M
3300007255|Ga0099791_10007651All Organisms → cellular organisms → Bacteria → Proteobacteria4469Open in IMG/M
3300009012|Ga0066710_102349340All Organisms → cellular organisms → Bacteria → Proteobacteria774Open in IMG/M
3300009038|Ga0099829_10739261All Organisms → cellular organisms → Bacteria → Proteobacteria817Open in IMG/M
3300009038|Ga0099829_11693082All Organisms → cellular organisms → Bacteria → Proteobacteria520Open in IMG/M
3300009089|Ga0099828_10613242All Organisms → cellular organisms → Bacteria → Proteobacteria979Open in IMG/M
3300009090|Ga0099827_10113628All Organisms → cellular organisms → Bacteria → Proteobacteria2170Open in IMG/M
3300009090|Ga0099827_10140135All Organisms → cellular organisms → Bacteria1969Open in IMG/M
3300009143|Ga0099792_10302975All Organisms → cellular organisms → Bacteria → Proteobacteria949Open in IMG/M
3300010046|Ga0126384_10645764All Organisms → cellular organisms → Bacteria → Proteobacteria932Open in IMG/M
3300010047|Ga0126382_12406972Not Available511Open in IMG/M
3300010321|Ga0134067_10168460All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium790Open in IMG/M
3300010362|Ga0126377_12656718All Organisms → cellular organisms → Bacteria → Proteobacteria576Open in IMG/M
3300010366|Ga0126379_12676573All Organisms → cellular organisms → Bacteria → Proteobacteria596Open in IMG/M
3300010376|Ga0126381_101761041All Organisms → cellular organisms → Bacteria → Proteobacteria894Open in IMG/M
3300010398|Ga0126383_13181829All Organisms → cellular organisms → Bacteria → Proteobacteria536Open in IMG/M
3300010398|Ga0126383_13464429Not Available515Open in IMG/M
3300010863|Ga0124850_1122251Not Available666Open in IMG/M
3300010863|Ga0124850_1148772Not Available527Open in IMG/M
3300010863|Ga0124850_1150569Not Available517Open in IMG/M
3300011270|Ga0137391_10947807All Organisms → cellular organisms → Bacteria → Proteobacteria702Open in IMG/M
3300011271|Ga0137393_10435066All Organisms → cellular organisms → Bacteria → Proteobacteria1124Open in IMG/M
3300011271|Ga0137393_11017913All Organisms → cellular organisms → Bacteria → Proteobacteria705Open in IMG/M
3300012096|Ga0137389_10837840All Organisms → cellular organisms → Bacteria → Proteobacteria789Open in IMG/M
3300012189|Ga0137388_10128402All Organisms → cellular organisms → Bacteria2218Open in IMG/M
3300012198|Ga0137364_11312689Not Available538Open in IMG/M
3300012201|Ga0137365_10029859All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium4193Open in IMG/M
3300012201|Ga0137365_10113602All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2036Open in IMG/M
3300012201|Ga0137365_10766524Not Available705Open in IMG/M
3300012202|Ga0137363_10217216Not Available1541Open in IMG/M
3300012202|Ga0137363_10281428All Organisms → cellular organisms → Bacteria → Proteobacteria1360Open in IMG/M
3300012204|Ga0137374_11160485All Organisms → cellular organisms → Bacteria → Proteobacteria544Open in IMG/M
3300012205|Ga0137362_10126817All Organisms → cellular organisms → Bacteria2168Open in IMG/M
3300012205|Ga0137362_10230232Not Available1599Open in IMG/M
3300012205|Ga0137362_10532794Not Available1015Open in IMG/M
3300012206|Ga0137380_10217149All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1730Open in IMG/M
3300012207|Ga0137381_10489664All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1073Open in IMG/M
3300012207|Ga0137381_11198696All Organisms → cellular organisms → Bacteria → Proteobacteria652Open in IMG/M
3300012209|Ga0137379_10169339All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium2096Open in IMG/M
3300012210|Ga0137378_10436279All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales1214Open in IMG/M
3300012210|Ga0137378_10583719All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1027Open in IMG/M
3300012211|Ga0137377_10078732All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3098Open in IMG/M
3300012349|Ga0137387_10061459All Organisms → cellular organisms → Bacteria2535Open in IMG/M
3300012350|Ga0137372_10247340Not Available1404Open in IMG/M
3300012355|Ga0137369_10575769All Organisms → cellular organisms → Bacteria → Proteobacteria786Open in IMG/M
3300012355|Ga0137369_10994668All Organisms → cellular organisms → Bacteria → Proteobacteria556Open in IMG/M
3300012356|Ga0137371_10191182All Organisms → cellular organisms → Bacteria → Proteobacteria1603Open in IMG/M
3300012356|Ga0137371_10281385All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1297Open in IMG/M
3300012357|Ga0137384_10818402All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria753Open in IMG/M
3300012359|Ga0137385_10163660All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1954Open in IMG/M
3300012361|Ga0137360_10344851All Organisms → cellular organisms → Bacteria → Proteobacteria1246Open in IMG/M
3300012361|Ga0137360_10562860All Organisms → cellular organisms → Bacteria → Proteobacteria974Open in IMG/M
3300012362|Ga0137361_10131276All Organisms → cellular organisms → Bacteria2218Open in IMG/M
3300012363|Ga0137390_10270723Not Available1683Open in IMG/M
3300012917|Ga0137395_10497101All Organisms → cellular organisms → Bacteria878Open in IMG/M
3300012930|Ga0137407_12359280All Organisms → cellular organisms → Bacteria → Proteobacteria509Open in IMG/M
3300012948|Ga0126375_11050110All Organisms → cellular organisms → Bacteria → Proteobacteria667Open in IMG/M
3300016387|Ga0182040_10718517All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium818Open in IMG/M
3300018468|Ga0066662_10142986All Organisms → cellular organisms → Bacteria → Proteobacteria1794Open in IMG/M
3300018468|Ga0066662_11917047Not Available620Open in IMG/M
3300021560|Ga0126371_10220261All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium2001Open in IMG/M
3300025910|Ga0207684_11475890All Organisms → cellular organisms → Bacteria → Proteobacteria554Open in IMG/M
3300026334|Ga0209377_1313088All Organisms → cellular organisms → Bacteria → Proteobacteria524Open in IMG/M
3300026547|Ga0209156_10108904All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1369Open in IMG/M
3300027655|Ga0209388_1030227All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1546Open in IMG/M
3300027748|Ga0209689_1272125Not Available673Open in IMG/M
3300027862|Ga0209701_10601813All Organisms → cellular organisms → Bacteria → Proteobacteria583Open in IMG/M
3300027875|Ga0209283_10805539Not Available577Open in IMG/M
3300027882|Ga0209590_10202109All Organisms → cellular organisms → Bacteria → Proteobacteria1256Open in IMG/M
3300027882|Ga0209590_10421668All Organisms → cellular organisms → Bacteria → Proteobacteria862Open in IMG/M
3300028047|Ga0209526_10049053All Organisms → cellular organisms → Bacteria → Proteobacteria2973Open in IMG/M
3300028047|Ga0209526_10053199All Organisms → cellular organisms → Bacteria → Proteobacteria2851Open in IMG/M
3300028705|Ga0307276_10113114Not Available666Open in IMG/M
3300028881|Ga0307277_10000804All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales11998Open in IMG/M
3300028881|Ga0307277_10001378All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium9430Open in IMG/M
3300032180|Ga0307471_101381712All Organisms → cellular organisms → Bacteria → Proteobacteria865Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil41.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil13.79%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil12.07%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil7.76%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere7.76%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil5.17%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.59%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.59%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.72%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.86%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.86%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.86%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere0.86%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.86%
Tropical Rainforest SoilEnvironmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Tropical Rainforest Soil0.86%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000579Forest soil microbial communities from Amazon forest - Pasture72 2010 replicate I A01EnvironmentalOpen in IMG/M
3300000651Forest soil microbial communities from Amazon forest - Pasture72 2010 replicate I A10EnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005363Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome F II A100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010863Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (PacBio error correction)EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028705Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_115EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10193391513300000364SoilMXXEXDRXLXLSXWAGLVSVVAIFFLTGSFWKAALIGAFVGGSSLLGYGTPWVLKGSFAIAVFAVAVALGLPPPDQWLQLIDQVKETAFASRG*
AP72_2010_repI_A01DRAFT_103547513300000579Forest SoilMLTQEDRALVLSGWAGLVSFVAIYFLLGNFWKAALIAAFVGGSSLLGYGARWVLKGSFAIAVLAIAVALGLPPPDQWLQLIDQVKETVFASRG*
AP72_2010_repI_A10DRAFT_104256513300000651Forest SoilVLTPERRTYILPAWAGLVSGIALYFLTASLWKALLIGAFVGGSSLLGYGTRLVLGGSFVIAVIGIAVALGLPPPDHWVQMCKDVREFLVHMAAG*
JGI1027J11758_1303389513300000789SoilMLTQEERSDILSLWAGXVSVVAXYFLXGSFWKAVLIGAFVGGSAALGYGTRWLLKGSFAFAVLAIAVALGIPPPDQWLQLLYEAREAVLALRTSG*
JGI10216J12902_10848243023300000956SoilMLTEEDRALVLSVWAALVSVVAIYFLIGSFWKAALIGAFVGISSLLGYGTRWVLKGSFAIAVLAIAVALGLPMPDQWLQALNEVARDNLSLR*
JGIcombinedJ26739_10101102113300002245Forest SoilMGVINGDFESVGLRMLTQDERADILSLWAGLVSVVALYFLTGSFWKAALIGAFVSGSAALGFGRRWVLKGSFAIVVLAIAVVLGLPPPDQWLQWLYEAREAVLALRTSG*
JGIcombinedJ26739_10125571223300002245Forest SoilMLTQEDRALVLSVWAALVSVIAIYFLTGNFWKAALIGVFVGGSSLLGYGTRWVLKGSFAIAVLAIAVALGLPPPDQWLQLINEVREAVFTARTTG*
JGIcombinedJ26739_10159645813300002245Forest SoilMTEGRRAQVLSLWAALVSVVAIYFLTGNYWKAALIGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHEAQETI
JGIcombinedJ26739_10166325623300002245Forest SoilMGVINGDFESVGLRMLTQDERADILSLWAGLVSVVALYFLTGSFWKAALIGAFVSGSAALGFGRRWVLKGSFAIVVLAIAVVLGLPPPDQWLQWL
Ga0066672_1078502513300005167SoilMLTEKERAEILSLWAGLVSVVAVYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGIPPPDQWLQLLYEAREAVLALRTSG*
Ga0066676_1042441423300005186SoilMLLSVWACLVSVIAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVIAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0066388_10020575633300005332Tropical Forest SoilMLTEQDRALVLSVWAALVSVAAIYFLTGSFWKAALIGAFVGGSSLLGYGTRWVLKGSFAIAVLAIAVALGLPPPDQWPQLIHQVQEAVLAF
Ga0066388_10109846813300005332Tropical Forest SoilMRMLTEEERRDILSLWAGFVSAVAIYFLVGSFWKAVLIGAFVGGSAVLGYGTRWLLNGSFAFAVLAIAVALGLPPPDQWLQLLYGAREAVLPLDRQAEHPQALDLSH*
Ga0066388_10112574123300005332Tropical Forest SoilMLGQPRIAGGNSALYKIVVFQRAGPQMLTQEDRALILSVWAALVSAVAIYFLTGSFWKAALISTFVGVSSLLGYGTRWVLKGSFAIAVLAITVALLPPPDQWLQLFHQVQEAVLASRAGG
Ga0066388_10510033523300005332Tropical Forest SoilMLTEENLRGLLPLWAGLVSGVAIYFLTGSFWKAVLIGAFVGVSAGLGYGARWLLKASFTFAVLAIAVAFGLPPPDQWLQLLTEAREAVLALRTSG*
Ga0008090_1443689213300005363Tropical Rainforest SoilLSLWAGLVSAVAIYFLVGGFWKAVFIGAFVGGSALLQYGTRWLLNGSFVLAVLAMAVALGLPPPDEWLQLLYGARESVLARTVRLNIR*
Ga0070706_10003000553300005467Corn, Switchgrass And Miscanthus RhizosphereMTEERRAQVLPLWAGLASVIAIYLLTENFWKAALIGMFVGGSAVVGYGTQWVLKGSFAIAVVAIAVALGFPPPDQWLQLARDAREAILALRTSG*
Ga0070706_10106531713300005467Corn, Switchgrass And Miscanthus RhizosphereMEMTVFERAGSRMLNEQDRALLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFVIAVVAIAVALGLPPPDQWLQLVNEMRETVFAARAAG*
Ga0070698_10011987923300005471Corn, Switchgrass And Miscanthus RhizosphereMTEGQRARVLSLWAALVSVVAIYFLTGDYWKAAFIGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHEAQETILGFRTSR*
Ga0070698_10037576723300005471Corn, Switchgrass And Miscanthus RhizosphereMWFFERAGPRMLTAEKRAEILPVWAGLVSVVAIYFLTGSFWKAAFIGAFVGVSSSLGYGTRWVLMASFAIAVLAIAVALGLPPPDQWPQVMDGV
Ga0070698_10048455123300005471Corn, Switchgrass And Miscanthus RhizosphereMGVRMLTQEERRDVLSLWAGLVSVVAIYFIIGSFWKAVLIGAFVGGSTLLGYGTRWLLKGSFALAVLAIAVALGLPPPDQWLQLLYDAREAVLALRTSG*
Ga0070696_10128633823300005546Corn, Switchgrass And Miscanthus RhizosphereMLLSVWACLVSVIAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVIAIAVALGLPPPDQWLQLVNEMRETV
Ga0066701_1077374723300005552SoilMLNEQDRALLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFVIAVVAIAVALGLPPPDQWLQLVHDVREAVCVPRTSG*
Ga0066692_1067814513300005555SoilMLNEQDRALLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFVIAVVAIAVALGLPPPDQWLQLVNEMRETVFAARAAG*
Ga0066700_1001630593300005559SoilMLTEENRTDILSLWAGLVSVVAIYFLIGSFWKAVLIGAFVGGSAALGYGTRWLLKGSFAFAVLAIAVALGIPPPDQWLQLLYEAREAVLALRTSG*
Ga0066700_1061984923300005559SoilVWAGLVSLIAIYFLTSDVWKALLIGVFVAGSAILGYGARWVLMGSFAIAVLAIAVALGLPRPDQWLQLAHEAQEAILALRTSR*
Ga0066693_1018002523300005566SoilMRILTEEERRDILSLWAGFVSAVAIYFLIGSFWKAVLIGAFVGGSAVLGYGTRWLLNCSFAFAVLAIAVALGLPPPDQWLHLLYGAREASL
Ga0066905_10027195033300005713Tropical Forest SoilMIVVYCEADSDANGARAVRIGGSQMLTEEDRALALSGWAALVSVIAIFFLTGSFWKAALIGAFVGVSSLLGYGTRWVLKGSFAIAVLAIAVALGLPPPDQWLQLIHQVQEAVLASKAGG*
Ga0066905_10128873413300005713Tropical Forest SoilMLTQEDRALILSVWAALVSAVAIYFLTGSFWKAALISTFVGVSSLLGYGTRWVLKGSFAIAVLAIAVALLPPPDQWLQLFHQVQEAVLASRAGG*
Ga0066905_10138708023300005713Tropical Forest SoilMPMLTDEERRDILSLWAGFVSAVAIYFLIGSFWKAGLIGAFVGGSAVLGYGTRWLLNGSFAFAVLAIAVALGLPPPDQWLQLLY
Ga0066903_10004317643300005764Tropical Forest SoilMLTEEERRAYLSLWAGLVSAVAIYFLVGGFWKAVFIGAFVGGSALLQYGTRWLLNGSFVLAVLAMAVALGLPPPDEWLQLLYGARESVLARTVRLNIR*
Ga0066903_10125158023300005764Tropical Forest SoilMFTAEQRALILPFWAGFVSFVAIYFLFGGFWKAAFIGVFVAASSLLGYGTRRLLQGSFAFAVLAFAVALGLPPPDQWLQLIHEAQEAVLAFRGC*
Ga0066903_10230358533300005764Tropical Forest SoilMRMLTEEERRDILSLWAGFVSAVAIYFLVGTFWKAVLIGAFVGGSALLGYGTRWLLNGSFAFAVLAIAVALGFPSPDQWLQLLYGAREAVLAFR*
Ga0066903_10394992513300005764Tropical Forest SoilMLTEEERRDILSLWAGLVSVVAIYFITGSFWKAVLIGAFVGGSALLGYGTRWLLKGSFALAVLAIAVALGLPPPDQWLQLLYDAREAILALRTSG*
Ga0081455_1005340353300005937Tabebuia Heterophylla RhizosphereMLTEEDRVLVLSVWAALVSVFAIYFLTGNFWKAALIGAFVGGSSLLGYGTRWVLKGSFAIAVLAIAVALGLPPPDQWPQLIHQVQETILASRAGG*
Ga0070717_1118202013300006028Corn, Switchgrass And Miscanthus RhizosphereMLTAEKRAEILPVWAGLVSVVAIYFLTGSFWKAAFIGAFVGVSSSLGYGTRWVLMASFAIAVLAIAVALGLPPPDQWPQVMDGVRGTIFGS
Ga0070717_1185277213300006028Corn, Switchgrass And Miscanthus RhizosphereMLTQEERSDILSLWAGLVSVVAIYFLIGSFWKAVLIGAFVGGSAALGYGTRWLLKGSFAFAVLAIAVALGIPPPDQWLQLLYEAREAVLALRTSG*
Ga0066651_1026494523300006031SoilMLTEEQRRDILSLWAGFVSAVAIYFLIGSFWKAVLIGAFVGGSAVLGYGTRWLLNCSFAVAVLAIAVALGLPPSDQWLQLLYGAREALALGPSG*
Ga0066652_10004161743300006046SoilMRILTEEERRDILSLWAGFVSAVAIYFLIGSFWKAVLIGAFVGGSAVLGYGTRWLLNCSFAFAVLAIAVALGLPPPDQWLHLLYGAREASLPLDRD*
Ga0066652_10072296213300006046SoilMLTEEQRRDILSLWAGFVSAVAIYFLIGSFWKAVLIGAFVGGSAVLGYGTRWLLNCSFAVAVLAIAVALGLPPSDQWLQLLYGARE
Ga0075425_10134004223300006854Populus RhizosphereMLLSAWACLVSVVAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVIAIAVALGLPPPDQWLQLVNEMRETVFAARAAG*
Ga0099791_1000765113300007255Vadose Zone SoilMTEGQRARVLSLWAALVSVVAIYFLTENYWKAALIGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHEAQETILGFRTSR*
Ga0066710_10234934023300009012Grasslands SoilMLNEQDRALLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVIAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG
Ga0099829_1073926113300009038Vadose Zone SoilMFTEEQRAQILSLWAGLVAVFAIYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQWLQLVH
Ga0099829_1169308213300009038Vadose Zone SoilMLTEKERAEILSLWAGLVSVVAVYFLIGRFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQWLQLLYEAREVVLALRTSG*
Ga0099828_1061324213300009089Vadose Zone SoilMFTEEQRAQILSLWAGLVAVVAMYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQWLQLVHDVREAV
Ga0099827_1011362823300009090Vadose Zone SoilMLTEKERAEILSLWAGLVSVVAVYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQWLQLLYEAREVVLALRTSG*
Ga0099827_1014013523300009090Vadose Zone SoilMERDLFERAGQRMFTEEQRAQILSLWAGLVAVFAIYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQWLQLVHDVREAVCAPRTSG*
Ga0099792_1030297523300009143Vadose Zone SoilMLAEKERAEILSLWAGLVSVVAIYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQWLQLLYEAREVVLALRTSG*
Ga0126384_1064576413300010046Tropical Forest SoilMLTEQDRALILSVWAALVSVVAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVLAIAVALGLPPPDQWPQLIHQVQEAVLAFRAGG*
Ga0126382_1240697213300010047Tropical Forest SoilILSLWAGFVSAVAIYFLIGSFWKAGLSGAFVGGSAVLGFGTRWLLNGSFAFAVLAIAVALGLTPPDQWLQLLYEAREVVLALRTSG*
Ga0134067_1016846013300010321Grasslands SoilMRMLTEEEWREILSWWAAFVSAVAIYFLVGSFWKALLIGAFVGGSAVLGYGTRWLLNCSFAVAVLAIAVALGLPPPDQWLHLLYGAREASLPLDRD*
Ga0126377_1265671813300010362Tropical Forest SoilMLTQEDRALILSVWAALVSAVAIYFLTGSFWKAALISTFVGVSSLLGYGTRWVLKGSFAIAVLAIAVALGLPPPDQWPQLIHQVQEAVLVFRAGG*
Ga0126379_1267657313300010366Tropical Forest SoilMLTEEERRDILSLWAGFVSAVAIYFLVGSFWKAVLIGAFVGGSALLGYGTRWLLNGSFAFAVLAIAVALGFPSPDQWLQLLYGAREAVLAFR*
Ga0126381_10176104123300010376Tropical Forest SoilMLTEEERRDILSLWAGFVSAVAIYFLVGSFWKAVLIGAFVGGSALLGYGTRWLLNGSFAFAVLAIAVALGFPSPDQWLQLLYGAREAALAFR*
Ga0126383_1318182913300010398Tropical Forest SoilILSLWAGFVSAVAIYFLVGSFWKAVLIGAFVGGSASLGYGTRWLLNGLFAFAVLAIAVALGFPSPDQWLQLLYGAREAVLAFR*
Ga0126383_1346442923300010398Tropical Forest SoilMLTEADRALVLSVWAALVSFVAIYFLLGNFWKAALIAAFVGGSSLLGYGTRWVMKGSFAIAVLAIAVALGFPPPDQWLQLIDQMKETVFASRG*
Ga0124850_112225113300010863Tropical Forest SoilMLTPEERTDILSLWAGLVSVVAIYFLSGSIWKAGLVGAFVGGSALLGYGTRWVLKGAFAFAVVAIAVAFGLPPPDQWLQLLTEAREAVLALRTSG*
Ga0124850_114877213300010863Tropical Forest SoilMLTPEERTDILSLWAGLVSVVAIYFLSGSIWKAGLVGAFVGGSALLGYGTRWVLKGAFAFAVVAIAVAFGLPPPDQWLQLLTEAREAVLGPCP*
Ga0124850_115056923300010863Tropical Forest SoilMLGHSKVAMGNGPRYRIVVFERAGPQMLTQEDRALVLSVWAALVSAVAIYFLVGSFWKAALISTFVGVSSLLGYGTRWVLKGSFAIAVFAIAVALGLPPPDQWLQLFHQVQQAVLASRAGG*
Ga0137391_1094780723300011270Vadose Zone SoilMWFFERAGPRMLTAEKQAEILPVWAGLVSVVAIYFLTGSFWKAAFIGAFVGVSSSLGYGTRWVLMGSFAIAVLAIAVALGLPPPDQWPQVMDGVRGTIFGSRAGT*
Ga0137393_1043506613300011271Vadose Zone SoilMTEGRRAQVLSLWAALVSVVAIYFLTGNYWKAALIGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHEAQETILGFRTSR*
Ga0137393_1101791333300011271Vadose Zone SoilLWAGLVSVVAVYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQWLQLLYEAREVVLALRTSG*
Ga0137389_1083784013300012096Vadose Zone SoilAALVSVVAIYFLTENYWKAALIGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHEAQETILGFRTSR*
Ga0137388_1012840243300012189Vadose Zone SoilMERDLFERAGQRMFTEEQRAQILSLWAGLVAVVAMYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQWLQLVHDVREAVCAPSTLRVTIVCAKR*
Ga0137364_1131268923300012198Vadose Zone SoilRMLTEEDRALVLSVWAALVSVVAIYFLIGSFWKAALIGAFVGGSSMLGWGTRWVLNGSFVIAVLAIAVALGLPMPDQSLQALNEVRETIFRLARVAM*
Ga0137365_1002985923300012201Vadose Zone SoilMLTEEDRALVLSVWAALVSVVAIYFLIGSFWKAALIGAFVGGSSMLGWGTRWVLNGSFVIAVLAIAVALGLPMPDQSLQALNEVRETIFRLARVAM*
Ga0137365_1011360233300012201Vadose Zone SoilMTEEKRAQGLALWAGLVSVAAIYFLTGDYWRAALIGAFVGGSSVLGYSTSWVLKGSFAIAVLAIAVALGLPPPDQWLQLARDAQQAILAFRTSP*
Ga0137365_1076652423300012201Vadose Zone SoilMLTAEQRTQILPFWAGLVSIVAIYFLTANAWKAVLIGAFVGGSAMLGYGRGVLTGSFAIAVLAIAVTLGFPPPDQWLQLIEEVRGAIFRPRALG*
Ga0137363_1021721623300012202Vadose Zone SoilMLAEEDRALVLSVWAALVSVIAIYFLTENFWQAALIGAFVGVSSLLGYGPKWVLKGSFAIAVLAIAVALGLPPPDQWLQLVNEVREAVFAARTTG*
Ga0137363_1028142813300012202Vadose Zone SoilMLTEKERAEILSLWAGLVSVVAVYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQWLQLLYEAREVVLALRTS
Ga0137374_1116048513300012204Vadose Zone SoilMLNEQDRAMLLSVWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVIAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0137362_1012681723300012205Vadose Zone SoilMLNEQDRALLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGTRWVLKGSFAIAVVAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0137362_1023023223300012205Vadose Zone SoilMERDLFERAGQRMFTEEQRAQILSLWAGLVAVFAIYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQLLQLVHDVREAVCAPRTSG*
Ga0137362_1053279413300012205Vadose Zone SoilMLTQEERSDILSLWAGLVSVVAIYFLIGSFWKAALIGAFVGGSAVLRYGTRWVLKGSFAIAVLAIAVALGLPPPDQWLQLLYEAREAFLGFRTSG*
Ga0137380_1021714923300012206Vadose Zone SoilMTEEKRAQGLALWAGLVSVAAIYFLTGDYWRAALIGAFVGGSSVLGYSTSWVLKGSFALAVLAIAVALGLPPPDQWLQLARDAQQAILAFRTSP*
Ga0137381_1048966423300012207Vadose Zone SoilMTEEKRAQGLALWAGLVSVVAIYFLTGDYWRAALIGAFVGGSSVLGYSTSWVLKGSFALAVLAIAVALGLPPPDQWLQLAHDAQEAILAFRTSR*
Ga0137381_1119869623300012207Vadose Zone SoilFQSVGLRMLTEKERAEILSLWAGLVSVVAVYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQWLQLLYEAREVVLALRTSG*
Ga0137379_1016933913300012209Vadose Zone SoilMLLSVWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVVAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0137378_1043627913300012210Vadose Zone SoilAEQRTQILPFWAGLVSIVAIYFLTANAWKAVLIGAFVGGSAMLGYGRGVLTGSFAIAVLAIAVTLGFPPPDQWLQLIEEVRGAIFRPRALG*
Ga0137378_1058371923300012210Vadose Zone SoilMTEEKRAQGLALWAGLVSVAAIYFLTGDYWRAALIGAFVGGSSVLGYSTSWVLKGSFALAVLAIAVALGLPPPDQWLQLAHDAQEAILAFRTSR*
Ga0137377_1007873233300012211Vadose Zone SoilMLLSVWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVIAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0137387_1006145933300012349Vadose Zone SoilMLLSVWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVVVIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0137372_1024734023300012350Vadose Zone SoilMLTEEDRALVLSVWAALVSVVAIYFLIGSFWKAALIGAFVGGSSMLGWGTRWVLNGSFVIAVLAIAVALGLPMPDQSLQALNEVRETIFRPARAAM*
Ga0137369_1057576923300012355Vadose Zone SoilMLNEQDRAMLLSVWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFAIAVVAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0137369_1099466813300012355Vadose Zone SoilMLTAEQRTQILPFWAGLVSIVAIYFLTANAWKAVLIGAFVGGSAMLGYGRGVLTGSFAIAVLAIAVTLGFPPPDQWLQLIEEVRGAIF
Ga0137371_1019118233300012356Vadose Zone SoilMTEERRAQILPLWAGLVSVLTIYFLTEKFWTAALIGVFVGISAVLGYGTRWVLNGSFAIAVVAIAVALGFPPPDQWLQLARDAREGILALRTSG*
Ga0137371_1028138513300012356Vadose Zone SoilQVLPLWAGLVSVVAIYFLIENFWKAALIGMFVGISAVLGYGTRWVLNGSFAIAVVAIAVALGLPPPDQWLQLAHDAQEAILAFRTSR*
Ga0137384_1081840213300012357Vadose Zone SoilMTEEKRAQGLALWAGLVSVAAIYFLTGDYWRASLIGVFVGVSAVLGYATSWVLKGSFAIAVVAIAVALGLPPPDQWLQLAHDAQEAILAFRTSR*
Ga0137385_1016366033300012359Vadose Zone SoilMTDERCAQVLPLWAGLVSVVAIYFLIENFWKAALIGMFVGISAVLGYGTRWVLNGSFAIAVVAIAVALGLPPPDQWLQLAHDAQEAILAFRTSP*
Ga0137360_1034485133300012361Vadose Zone SoilMLNEQDRALLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFVIAVVAIAVALGLPPPDQWLQLVNEMRETVFAAKAAG*
Ga0137360_1056286023300012361Vadose Zone SoilMGIPSTGSGGCEWTEEKRALLLPLWAALVSVVVIYFLTENYWKALLIATFVAGSAMLQYGPRWVLMGSFAIAVLAIAVALGLPRPDQWLQLAHEAQEAILALRTSR*
Ga0137361_1013127613300012362Vadose Zone SoilMFTEEQRAQILSLWAGLVAVFAIYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQLLQLVHDVREAVCAPRTSG*
Ga0137390_1027072313300012363Vadose Zone SoilMLTAEKQAEILPVWAGLVSVVAIYFLTGSFWKAPFIGAFVGVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQLLQLVHDVREA
Ga0137395_1049710113300012917Vadose Zone SoilMLTEKERAEILSLWAGLVSVVAVYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQ*
Ga0137407_1235928013300012930Vadose Zone SoilMTEEKRALLLPLWAALVSVVAIYFLTENYWKALLIATFVAGSAMLQYGPRWVLMGSFAIAVLAIAVALGLPRPDQWLQLAHEAQEAILALRTSR*
Ga0126375_1105011013300012948Tropical Forest SoilMLTEQDRALILSVWAALVSVVAIYFLTGSFWKAALIGAFVGGSTLLRYGTRWVLKGSFAIAVLAIAVALGLPPPDQWPQLIHQVQEAVLAFRAGG*
Ga0182040_1071851723300016387SoilMLTQEEWADILSLWAGLVSVVAIYFLIGNIWKAALIGAFVGGSVALGYGTQWVLKGSFAIAVLAIAVALGLPPPDQWLQLLYEAREAVLRLKTSG
Ga0066662_1014298613300018468Grasslands SoilMLNEQDRALLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFVIAVVAIAVALGLPPPDQWLQLVNEMRETVFAAKAAGRSPARWLSTVPVCGDPGSAVHR
Ga0066662_1191704713300018468Grasslands SoilAYIPTRNIAQRGMGRILCFLSLWAGLVSVVAIYFLIGSFWKAVLIGAFVGGSAALGYGTRWLLKGSFAFAVLAIAVALGIPPPDQWLQLLYEAREAVLALRTSG
Ga0126371_1022026143300021560Tropical Forest SoilMLTEEERRAYLSLWAGLVSAVAIYFLVGGFWKAVFIGAFVGGSALLQYGTRWLLNGSFVLAVLAMAVALGLPPPDEWLQLLYGARESVLARTVRLNIR
Ga0207684_1147589013300025910Corn, Switchgrass And Miscanthus RhizosphereMTEERRAQVLPLWAGLVSVVAIYFLTGDYWKAALIGMFVGISALLGYGTQWILTGAFAIAVFAIAVALGLPPPDQWLQLARDAQEAILAFSTSH
Ga0209377_131308823300026334SoilLLSAWACLVSVFAIYFLTGSFWKAALIGAFVGGSSLLGYGARWVLKGSFVIAVVAIAVALGLPPPDQWLQLVNEMRETVFAARAAG
Ga0209156_1010890413300026547SoilMRILTEEERRDILSLWAGFVSAVAIYFLIGSFWKAVLIGAFVGGSAVLGYGTRWLLNCSFAFAVLAIAVALGLPPPDQWLHLLYGAREASLPLDRD
Ga0209388_103022723300027655Vadose Zone SoilMTEGQRARVLSLWAALVSVVAIYFLTENYWKAALIGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHEAQETILGFRTSR
Ga0209689_127212523300027748SoilGLVSVVAIYFLIGSFWKAVLIGAFVGGSAALGYGTRWLLKGSFAFAVLAIAVALGIPPPDQWLQLLYEAREAVLALRTSG
Ga0209701_1060181313300027862Vadose Zone SoilLRMTEGQRARVLSLWAALVSVVAIYFLTENYWKAALIGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHEAQETILGFRTSR
Ga0209283_1080553913300027875Vadose Zone SoilMERDLFERAGQRMFTEEQRAQILSLWAGLVAVVAMYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQWLQ
Ga0209590_1020210923300027882Vadose Zone SoilMLTEKERAEILSLWAGLVSVVAVYFLIGSFWKAAFIGAFVGGSAALGYGRRWLLQGSFAFAVLAIAVALGLPPPDQWLQLLYEAREVVLALRTSG
Ga0209590_1042166813300027882Vadose Zone SoilMERDLFERAGQRMFTEEQRAQILSLWAGLVAVFAIYFLTGNFWKAALIGVFVCVSSALGYGTRWVLTGSFAIAVVAIAVALGLPPPDQWLQLVHDVREAVCAPSTLRVTIVCAKR
Ga0209526_1004905333300028047Forest SoilMGVINGDFESVGLRMLTQDERADILSLWAGLVSVVALYFLTGSFWKAALIGAFVSGSAALGFGRRWVLKGSFAIVVLAIAVVLGLPPPDQWLQWLYEAREAVLTLRTSG
Ga0209526_1005319923300028047Forest SoilMTEGRRAQVLSLWAALVSVVAIYFLTGNYWKAALIGVFVGVSSVLGYGTQWVLKGSFAIAIFAIAVALGLPPPDQWLQLAHEAQETILGFRTSR
Ga0307276_1011311423300028705SoilMLTEEERRDILSLWAGFVSAVAIYFLIGSFWKAVLIGAFVGGSAMLGYGTRWLLNGSFAFAVLAIAVALGLPPPDQWLQLLYGAREAVLALRPSG
Ga0307277_1000080493300028881SoilMLTEEQRRDILALWAGFVSTVAIYFLIGSFWKAVLIGAFVGGSAVLRYGTGWPLNGSFAVAVLAIAVALGLPPPDQWLELLYGAREAVLALRPSG
Ga0307277_1000137873300028881SoilMRMLTEEERRDILSLWAGFVSAVAIYFLIGSFWKAVLIGAFVGGSAMLGYGTRWLLNGSFAFAVLAIAVALGLPPPDQWLQLLYGAREAVLALRPSG
Ga0307471_10138171213300032180Hardwood Forest SoilMTEGHRARVLSLWAALVSVVAIYFLTENYWKAALVGVFVGVSSVLGYGTQWVLKGSFAIAVFAIAVALGLPPPDQWLQLAHETQETILGFRTSR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.