NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F085404



Overview

Basic Information
Family ID F085404
Family Type Metagenome / Metatranscriptome
Number of Sequences 111
Average Sequence Length 87 residues
Representative Sequence MKPTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVWHQREMARAGVL
Number of Associated Samples 85
Number of Associated Scaffolds 111

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 72.97 %
% of genes near scaffold ends (potentially truncated) 29.73 %
% of genes from short scaffolds (< 2000 bps) 85.59 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.80
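
The percentages above can be converted back into gene counts as a quick consistency check. The sketch below assumes that each percentage was computed over all 111 family members (one gene per sequence), which this page does not state explicitly; the helper name `percent_to_count` is made up for illustration.

```python
# Family size and percentages as reported in the blocks above.
FAMILY_SIZE = 111

reported_percentages = {
    "valid RBS motif": 72.97,
    "near scaffold end": 29.73,
    "short scaffold (< 2000 bp)": 85.59,
}


def percent_to_count(percent: float, total: int) -> int:
    """Return the whole-number count closest to `percent` of `total`."""
    return round(percent / 100 * total)


# Assumption: every percentage uses the full family (111 genes) as its denominator.
counts = {
    label: percent_to_count(percent, FAMILY_SIZE)
    for label, percent in reported_percentages.items()
}

for label, n in counts.items():
    # Recompute the percentage from the integer count to compare with the page.
    print(f"{label}: {n} of {FAMILY_SIZE} genes ({n / FAMILY_SIZE * 100:.2f} %)")
```

Under that assumption, the three percentages correspond to 81, 33 and 95 genes respectively, and each count reproduces the reported value when rounded to two decimals.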


Most Common Taxonomy
Group Bacteria (58.559 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (27.928 % of family members)
Environment Ontology (ENVO) Unclassified (40.541 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (55.856 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 30.51%, β-sheet: 19.49%, Coil/Unstructured: 50.00%

Predicted 3D Structure

pTM-score: 0.80

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.218.1.0: automated matches | d6ipna3 | 6ipn | 0.65228
d.52.9.1: Cation efflux protein cytoplasmic domain-like | d3bypa1 | 3byp | 0.65143
d.218.1.7: Archaeal tRNA CCA-adding enzyme catalytic domain | d1r89a2 | 1r89 | 0.64464
d.80.1.2: 5-carboxymethyl-2-hydroxymuconate isomerase (CHMI) | d1otga_ | 1otg | 0.64401
d.218.1.5: Catalytic subunit of bi-partite nucleotidyltransferase | d1ylqa1 | 1ylq | 0.63636
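
One way to read the structural matches above is to group them by SCOP fold. The sketch below is illustrative only: it uses the first two fields of each SCOP identifier (class and fold) as the grouping key, and it applies a TM-score cutoff of 0.5, a commonly used rule of thumb for a shared fold that is an assumption here and not something this page specifies. The names `scope_hits`, `fold_of` and `TM_SCORE_CUTOFF` are hypothetical.

```python
from collections import Counter

# (SCOP family identifier, SCOP domain, representative PDB, TM-score),
# copied from the structural matches listed above.
scope_hits = [
    ("d.218.1.0", "d6ipna3", "6ipn", 0.65228),
    ("d.52.9.1", "d3bypa1", "3byp", 0.65143),
    ("d.218.1.7", "d1r89a2", "1r89", 0.64464),
    ("d.80.1.2", "d1otga_", "1otg", 0.64401),
    ("d.218.1.5", "d1ylqa1", "1ylq", 0.63636),
]

# Assumption: 0.5 is a rule-of-thumb cutoff, not a value taken from this page.
TM_SCORE_CUTOFF = 0.5


def fold_of(scop_family: str) -> str:
    """Keep the class and fold fields, e.g. 'd.218.1.0' -> 'd.218'."""
    return ".".join(scop_family.split(".")[:2])


# Keep hits at or above the cutoff, then count hits per fold.
confident = [hit for hit in scope_hits if hit[3] >= TM_SCORE_CUTOFF]
fold_counts = Counter(fold_of(hit[0]) for hit in confident)

for fold, n in fold_counts.most_common():
    print(f"{fold}: {n} hit(s)")
```

All five listed matches pass the assumed cutoff, and three of them share the fold prefix d.218.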



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 111 Family Scaffolds
PF05168 | HEPN | 8.11
PF14076 | DUF4258 | 5.41
PF11013 | DUF2851 | 2.70
PF00589 | Phage_integrase | 2.70
PF12728 | HTH_17 | 1.80
PF01969 | Ni_insertion | 1.80
PF15919 | HicB_lk_antitox | 1.80
PF14659 | Phage_int_SAM_3 | 0.90
PF03060 | NMO | 0.90
PF01381 | HTH_3 | 0.90
PF13698 | DUF4156 | 0.90
PF09250 | Prim-Pol | 0.90
PF07516 | SecA_SW | 0.90
PF00583 | Acetyltransf_1 | 0.90
PF10262 | Rdx | 0.90

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 111 Family Scaffolds
COG1895 | HEPN domain protein, predicted toxin of MNT-HEPN system | Defense mechanisms [V] | 8.11
COG2250 | HEPN domain protein, predicted toxin of MNT-HEPN system | Defense mechanisms [V] | 8.11
COG1641 | CTP-dependent cyclometallase, nickel-pincer nucleotide (NPN) cofactor biosynthesis | Coenzyme transport and metabolism [H] | 1.80
COG0516 | IMP dehydrogenase/GMP reductase | Nucleotide transport and metabolism [F] | 0.90
COG0653 | Preprotein translocase subunit SecA (ATPase, RNA helicase) | Intracellular trafficking, secretion, and vesicular transport [U] | 0.90
COG2070 | NAD(P)H-dependent flavin oxidoreductase YrpB, nitropropane dioxygenase family | General function prediction only [R] | 0.90



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 58.56 %
Unclassified | root | N/A | 41.44 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300003998|Ga0055472_10020091 | Not Available | 1448
3300004013|Ga0055465_10027307 | Not Available | 1378
3300004463|Ga0063356_100164978 | All Organisms → cellular organisms → Bacteria | 2552
3300005177|Ga0066690_10837149 | Not Available | 594
3300005332|Ga0066388_102073925 | All Organisms → cellular organisms → Bacteria | 1022
3300005332|Ga0066388_102331688 | Not Available | 969
3300005332|Ga0066388_104910011 | Not Available | 680
3300005338|Ga0068868_100208776 | All Organisms → cellular organisms → Bacteria | 1631
3300005340|Ga0070689_101327409 | Not Available | 648
3300005345|Ga0070692_11304775 | Not Available | 521
3300005446|Ga0066686_10956088 | All Organisms → cellular organisms → Bacteria | 559
3300005544|Ga0070686_101285217 | Not Available | 611
3300005562|Ga0058697_10126333 | All Organisms → cellular organisms → Bacteria | 1091
3300005576|Ga0066708_10865633 | Not Available | 565
3300005617|Ga0068859_100150033 | All Organisms → cellular organisms → Bacteria | 2407
3300005764|Ga0066903_100851178 | All Organisms → cellular organisms → Bacteria | 1641
3300005836|Ga0074470_10962176 | All Organisms → cellular organisms → Bacteria | 2918
3300005843|Ga0068860_100475779 | All Organisms → cellular organisms → Bacteria | 1244
3300006049|Ga0075417_10211659 | Not Available | 920
3300006800|Ga0066660_11573233 | Not Available | 520
3300006844|Ga0075428_100026340 | All Organisms → cellular organisms → Bacteria | 6436
3300006844|Ga0075428_102539268 | Not Available | 524
3300006845|Ga0075421_102459092 | Not Available | 544
3300006847|Ga0075431_101338430 | Not Available | 677
3300009038|Ga0099829_10037355 | All Organisms → cellular organisms → Bacteria | 3511
3300009089|Ga0099828_10466651 | Not Available | 1139
3300009089|Ga0099828_10578415 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1011
3300009090|Ga0099827_10106860 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 2234
3300009090|Ga0099827_10210115 | All Organisms → cellular organisms → Bacteria | 1622
3300009090|Ga0099827_10496986 | All Organisms → cellular organisms → Bacteria | 1048
3300009090|Ga0099827_10948637 | All Organisms → cellular organisms → Bacteria | 746
3300009094|Ga0111539_10697360 | All Organisms → cellular organisms → Bacteria | 1182
3300009100|Ga0075418_10015823 | All Organisms → cellular organisms → Bacteria | 8341
3300009100|Ga0075418_10313963 | All Organisms → cellular organisms → Bacteria | 1672
3300009100|Ga0075418_10995717 | All Organisms → cellular organisms → Bacteria | 907
3300009100|Ga0075418_11253384 | All Organisms → cellular organisms → Bacteria | 804
3300009101|Ga0105247_11796562 | Not Available | 510
3300009137|Ga0066709_100400614 | All Organisms → cellular organisms → Bacteria | 1903
3300009143|Ga0099792_10266744 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1003
3300009147|Ga0114129_13174874 | Not Available | 535
3300009156|Ga0111538_10676333 | All Organisms → cellular organisms → Bacteria | 1308
3300009156|Ga0111538_12018265 | Not Available | 725
3300009444|Ga0114945_10086275 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 1750
3300009444|Ga0114945_10330404 | Not Available | 901
3300009691|Ga0114944_1069627 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1312
3300009691|Ga0114944_1119718 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1016
3300009691|Ga0114944_1176200 | All Organisms → cellular organisms → Bacteria | 847
3300009691|Ga0114944_1321908 | Not Available | 640
3300010043|Ga0126380_10428811 | All Organisms → cellular organisms → Bacteria | 992
3300010304|Ga0134088_10273123 | Not Available | 815
3300010323|Ga0134086_10488353 | Not Available | 508
3300010397|Ga0134124_12246353 | Not Available | 586
3300011119|Ga0105246_10950395 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 774
3300011270|Ga0137391_10371030 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1227
3300012096|Ga0137389_10479838 | All Organisms → cellular organisms → Bacteria | 1066
3300012189|Ga0137388_10277809 | All Organisms → cellular organisms → Bacteria | 1531
3300012189|Ga0137388_11765444 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 551
3300012202|Ga0137363_11246583 | Not Available | 631
3300012204|Ga0137374_10016299 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 8540
3300012204|Ga0137374_10232704 | Not Available | 1559
3300012206|Ga0137380_10008039 | All Organisms → cellular organisms → Bacteria | 9507
3300012206|Ga0137380_11315008 | All Organisms → cellular organisms → Bacteria | 608
3300012207|Ga0137381_10184792 | All Organisms → cellular organisms → Bacteria | 1804
3300012209|Ga0137379_10181263 | All Organisms → cellular organisms → Bacteria | 2019
3300012211|Ga0137377_11743346 | All Organisms → cellular organisms → Bacteria | 543
3300012349|Ga0137387_10560628 | All Organisms → cellular organisms → Bacteria | 829
3300012349|Ga0137387_10858792 | Not Available | 657
3300012353|Ga0137367_10037870 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 3688
3300012356|Ga0137371_11130381 | Not Available | 588
3300012359|Ga0137385_10956300 | All Organisms → cellular organisms → Bacteria | 707
3300012360|Ga0137375_10119161 | All Organisms → cellular organisms → Bacteria | 2641
3300012361|Ga0137360_11234827 | Not Available | 647
3300012405|Ga0134041_1219041 | Not Available | 579
3300014154|Ga0134075_10409594 | Not Available | 599
3300014308|Ga0075354_1136311 | Not Available | 547
3300014745|Ga0157377_10986209 | All Organisms → cellular organisms → Bacteria | 637
3300015359|Ga0134085_10277511 | All Organisms → cellular organisms → Bacteria | 733
3300015371|Ga0132258_11471278 | All Organisms → cellular organisms → Bacteria | 1720
3300015372|Ga0132256_102178452 | Not Available | 659
3300015374|Ga0132255_101239640 | Not Available | 1124
3300017792|Ga0163161_11217217 | Not Available | 652
3300017997|Ga0184610_1044705 | Not Available | 1292
3300018053|Ga0184626_10018323 | All Organisms → cellular organisms → Bacteria | 2819
3300018053|Ga0184626_10228638 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 784
3300018056|Ga0184623_10108147 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium | 1286
3300018063|Ga0184637_10102950 | Not Available | 1758
3300018063|Ga0184637_10387553 | All Organisms → cellular organisms → Bacteria | 834
3300018433|Ga0066667_11056190 | All Organisms → cellular organisms → Bacteria | 702
3300019208|Ga0180110_1200018 | Not Available | 540
3300019229|Ga0180116_1136793 | All Organisms → cellular organisms → Bacteria | 753
3300019249|Ga0184648_1018994 | All Organisms → cellular organisms → Bacteria | 648
3300019249|Ga0184648_1175781 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 827
3300019249|Ga0184648_1253918 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 2650
3300019259|Ga0184646_1368635 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 2428
3300019259|Ga0184646_1552879 | Not Available | 2090
3300020063|Ga0180118_1240243 | All Organisms → cellular organisms → Bacteria | 1099
3300020065|Ga0180113_1286725 | Not Available | 537
3300022563|Ga0212128_10134752 | All Organisms → cellular organisms → Bacteria | 1594
3300022563|Ga0212128_10587124 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 675
3300025149|Ga0209827_10937627 | Not Available | 677
3300025157|Ga0209399_10402990 | Not Available | 530
3300025901|Ga0207688_10430242 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Betaproteobacteria incertae sedis → Candidatus Accumulibacter → Candidatus Accumulibacter adjunctus | 821
3300026035|Ga0207703_11249134 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Phycisphaerae → environmental samples → uncultured Phycisphaerae bacterium | 714
3300027846|Ga0209180_10096240 | All Organisms → cellular organisms → Bacteria | 1683
3300027875|Ga0209283_10441111 | Not Available | 844
3300027882|Ga0209590_10118913 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1603
3300027882|Ga0209590_10766595 | All Organisms → cellular organisms → Bacteria | 614
3300027909|Ga0209382_12306859 | Not Available | 505
3300028381|Ga0268264_11017496 | Not Available | 835
3300030546|Ga0247646_1124841 | Not Available | 667
3300034147|Ga0364925_0129539 | Not Available | 908
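
Each scaffold identifier above joins a sample Taxon OID and a scaffold name with a `|` character. The following sketch splits a few rows copied from the table and counts how many fall below the 2000 bp threshold used in the Quality Assessment. The `ScaffoldRow` type and `parse_scaffold` helper are hypothetical names introduced for illustration, and the four rows are only a sample, not the full table.

```python
from typing import NamedTuple


class ScaffoldRow(NamedTuple):
    taxon_oid: str    # sample identifier, the part before "|"
    scaffold_id: str  # scaffold name, the part after "|"
    length_bp: int    # scaffold length as listed in the table


def parse_scaffold(identifier: str, length_bp: int) -> ScaffoldRow:
    """Split 'TaxonOID|ScaffoldName' at the first '|'."""
    taxon_oid, scaffold_id = identifier.split("|", 1)
    return ScaffoldRow(taxon_oid, scaffold_id, length_bp)


# Four rows copied from the table above (a sample, not the full list).
rows = [
    parse_scaffold("3300003998|Ga0055472_10020091", 1448),
    parse_scaffold("3300004463|Ga0063356_100164978", 2552),
    parse_scaffold("3300005177|Ga0066690_10837149", 594),
    parse_scaffold("3300006844|Ga0075428_100026340", 6436),
]

# Threshold taken from the "short scaffolds (< 2000 bps)" quality metric.
SHORT_SCAFFOLD_BP = 2000
short = [row for row in rows if row.length_bp < SHORT_SCAFFOLD_BP]

print(f"{len(short)} of {len(rows)} sample rows are shorter than {SHORT_SCAFFOLD_BP} bp")
```

Applying the same filter to all 111 rows would be one way to cross-check the short-scaffold percentage reported in the Quality Assessment.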




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 27.93%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 12.61%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 9.01%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 8.11%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 5.41%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 4.50%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.60%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.60%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 3.60%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.80%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.80%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.80%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.80%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 0.90%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.90%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.90%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.90%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.90%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.90%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.90%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.90%
Corn, Switchgrass And Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.90%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.90%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.90%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.90%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.90%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.90%
Agave | Host-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave | 0.90%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300003998 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleC_D2 | Environmental
3300004013 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005340 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG | Environmental
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005544 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG | Environmental
3300005562 | Agave microbial communities from Guanajuato, Mexico - As.Ma.e | Host-Associated
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005836 | Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBB | Environmental
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009101 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300011119 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaG | Host-Associated
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012405 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014308 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1 | Environmental
3300014745 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaG | Host-Associated
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300017792 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaG | Host-Associated
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental
3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019208 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT231_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019229 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT660_1_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019249 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome) | Environmental
3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental
3300020063 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300020065 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT499_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300022563 | OV2_combined assembly | Environmental
3300025149 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 (SPAdes) | Environmental
3300025157 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 (SPAdes) | Environmental
3300025901 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4 (SPAdes) | Host-Associated
3300026035 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes) | Host-Associated
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300030546 | Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Cnb11 (Eukaryote Community Metatranscriptome) | Environmental
3300034147 | Sediment microbial communities from East River floodplain, Colorado, United States - 44_j17 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0055472_100200911 | 3300003998 | Natural And Restored Wetlands | MKPVPQEFLERLRKVVPPEAEVLPLSWTFEQEDYNIAVVMPDTIDRLEARHLEDSLLDVVMDWDEAHDTFTVCKVWREHEMARPEVR
Ga0055465_100273071 | 3300004013 | Natural And Restored Wetlands | MKPVPQEFLERLRKVVPPEAEVLPLSWTFEQEDYNIAVVMPDTIDRLEARHLEDSLLDVVMDWDEAHDTFTVCKVWREHEMARPEVR*
Ga0063356_1001649783 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MKPAPQDLLELVKKAAPPEAEIVPLAWAFEDEDYNIAVIMPDTTDRLTARQIEDQLIDAVIDWDAAHHTFTLCKVWRQHEMTRAGVL*
Ga0066690_108371491 | 3300005177 | Soil | PQELLELVEKAAPPEAEIVPLTWAFEDEDYNIAVVMPDTIDRPTARRIEDRLIDAVLDWDAVHRTFTLCKVWLQHEMARVGVR*
Ga0066388_1020739253 | 3300005332 | Tropical Forest Soil | MKPTKPVPQDLLELVQKTAPLEAHVVPLAWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVIDWDAAHGTYTLCMVWPKREKPLAGVR*
Ga0066388_1023316882 | 3300005332 | Tropical Forest Soil | MGEAAASWRVREVLHDPLAWAFEDENYNIAVVMPDTIDRLTARQLEDQLIDAVIDWDVVHHTITLCKVWRQHEMACAGVM*
Ga0066388_1049100112 | 3300005332 | Tropical Forest Soil | SMKSVPQELLELVKKAAPPEAEIVPLTWAFEDEDYNIAVVMPDTIDRLTARQIEDHLIDAVIDWDAAHHTFTLCKVWRQHEMARTGVL*
Ga0068868_1002087763 | 3300005338 | Miscanthus Rhizosphere | VPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH*
Ga0070689_1013274091 | 3300005340 | Switchgrass Rhizosphere | KTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH*
Ga0070692_113047751 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | MKPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH*
Ga0066686_109560881 | 3300005446 | Soil | MKPTKPVPQDLFELVQKTAPPEADVVPLSWAYEDEDYNIAIVVPDTMDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVW
Ga0070686_1012852171 | 3300005544 | Switchgrass Rhizosphere | VKHALKPVPQALLELVKKAAPPEAEIVPLAWAYEDEDYNIAIVMPDTVDRLEAREIEDRLIDAVLDYDAAHGTYTLCMVWPQRDKALAGIH*
Ga0058697_101263331 | 3300005562 | Agave | MKHTKPVPQDLLELVQKTAPLEAHVVPLSWAYEDEDYNIAIVMPDTVERLEAREIEDRLIDAVMDYDAAHGTYTLCMVWPQRDKALAGIH*
Ga0066708_108656331 | 3300005576 | Soil | MKPTKPVPQDLFELVQKTAPPEADVVPLSWAYEDEDYNIAIVVPDTMDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVWLQRDKALAGIY*
Ga0068859_1001500332 | 3300005617 | Switchgrass Rhizosphere | MKPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDHLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH*
Ga0066903_1008511784 | 3300005764 | Tropical Forest Soil | MQPVPEDLMTLVKQTAPSEAEIVPLTWAFEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDAAHGTFTLCMVWHRHEMARVGAR*
Ga0074470_109621763 | 3300005836 | Sediment (Intertidal) | MKFVPQELQELVKKTAPPEAEIVPLSWAFEDEDYNIAVVMPDTIDRLSARQIEDRLIDVVIDWDAAHHTFTLCKVWQQHEMARAAVL*
Ga0068860_1004757794 | 3300005843 | Switchgrass Rhizosphere | MCQPSRPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH*
Ga0075417_102116592 | 3300006049 | Populus Rhizosphere | MKSVPQELLELVKKAAPPDAEIVPLTWAFEDEDYNIAIVMPDAIDRLTVRQIEDRLIDAVIDWDAAHHTFTLCKVWQQHEMTRAGVR*
Ga0066660_115732332 | 3300006800 | Soil | MKPVPQELLELVEKAAPPEAEIVPLSWAFEDEDYNIAVVMPDTIDRPTARRIEDRLIDAVLDWDAVHRTFTLCKVWL
Ga0075428_1000263407 | 3300006844 | Populus Rhizosphere | VKHALEPGPQALLELVKQATPPEAEIVSLAWAFEDEDYNIAVVMPDTIDRLTARQIEDCLIDAVLDWDAAHHTFTLCKVWQQHEMTRAGVL*
Ga0075428_1025392681 | 3300006844 | Populus Rhizosphere | MKPTKPVPRDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMGYDAAHGTFTLCMVWPQREKALVGIH*
Ga0075421_1024590921 | 3300006845 | Populus Rhizosphere | VKHALEPGPQALLELVKQATPPEAEIVSLAWAFEDEDYNIAVVMPDTIDRLTARQIEDRLIDAVLDWDAAHHTFTLCKVWQQHEMTRAGVL*
Ga0075431_1013384301 | 3300006847 | Populus Rhizosphere | AAISALREIVKHALEPGPQALLELVKQATPPEAEIVSLAWAFEDEDYNIAVVMPDTIDRLTARQIEDCLIDAVLDWDAAHHTFTLCKVWQQHEMTRAGVL*
Ga0099829_100373554 | 3300009038 | Vadose Zone Soil | LKQTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDHLIDAVMDYDAAHGTYTLCMVWPQRDKALAGIR*
Ga0099828_104666513 | 3300009089 | Vadose Zone Soil | MEPVPQALLELIKKTAPPESAIVPLSWAFEDEEYNIAIVMPDTVECLEARHIEGRVRDVAMDWDAAHGTVTLCKVWRQYEIARPGVL*
Ga0099828_105784153 | 3300009089 | Vadose Zone Soil | MKPVPQDLLDLVKKAAPPEAEIVPLAWAFEDEDYNMAIVMPDTVDRLEARQIENRLIDAVMNYDAAHGTYTLCMVW
Ga0099827_101068602 | 3300009090 | Vadose Zone Soil | MKSVPQELLELVKKTAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRLTARQIEDRLIDAVIDWDAAHSTFTLCKVWRQHEMARAGVL*
Ga0099827_102101151 | 3300009090 | Vadose Zone Soil | LKQTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDHLIDAVMDYDAAHGTYTLCMVWPQRDKALAGI
Ga0099827_104969861 | 3300009090 | Vadose Zone Soil | MKPTKPVHQDLLDLVQKTAPPEADVVPLSWAYEDEDYNIAIVVPDTVDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVWP
Ga0099827_109486372 | 3300009090 | Vadose Zone Soil | LRETVKNTLEPVPQALLALVKKAVPPVAEIVPLTWAFEDEDYNIAVVMPDTIDRLTARQIEDCLIDAVMDYDAAHGTYTLCMVWREREKVHAGVQ*
Ga0111539_106973603 | 3300009094 | Populus Rhizosphere | MKHTKPVPQDLLELVQKTAPLEAHVVPLAWAYEDEDYNIAIVMPDTVDRLEAREIENRLIDAVMDYDATHGTYTLCVVWPQRDKALAGIH*
Ga0075418_1001582312 | 3300009100 | Populus Rhizosphere | LREIVKHALEPGPQALLELVKQATPPEAEIVSLAWAFEDEDYNIAVVMPDTIDRLTARQIEDRLIDAVLDWDAAHHTFTLCKVWQQHEMTRAGVL*
Ga0075418_103139633 | 3300009100 | Populus Rhizosphere | MKHTKPVPQDLLELVQKTAPLEAHVVPLAWAYEDEDYNIAIVMPDTVDRLEAREIEDRLIDAVMDYDAAHGTYTLCMVWPQRDKALVGIH*
Ga0075418_109957173 | 3300009100 | Populus Rhizosphere | MKSVPQELLELVKKAAPPDAEIVPLTWAFEDEDYNIAIVMPDAIDRLTVRQIEDRLIDAVIDWDAAHHTFTLCKVWRQHEMARAGVL*
Ga0075418_112533841 | 3300009100 | Populus Rhizosphere | MKPAPQKLLELVRKAAPPEADVVPLPWAFEQEDYNLAVVMPDTVERLVARQIEDRLLDIILDYDDAHDTFTVCKVWHQSEMARAGVL*
Ga0105247_117965621 | 3300009101 | Switchgrass Rhizosphere | KPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDHLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH*
Ga0066709_1004006142 | 3300009137 | Grasslands Soil | LLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDVVMDYDAAHGTYTLCMVWREREKVHAGVQ*
Ga0099792_102667442 | 3300009143 | Vadose Zone Soil | VKKTAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRLTARQIEDRLIDAVIDWDAAHSTFTLCKVWRQHEMARAGVL*
Ga0114129_131748741 | 3300009147 | Populus Rhizosphere | KPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEAREIEDRLIDAVMDYDAAHGTYTLCMVWPQRDKALVGIH*
Ga0111538_106763331 | 3300009156 | Populus Rhizosphere | MRIVSEGIAVQPVPQDRIALVQTIAPPEAEIVPLTWVFEDEDYNITVVMPGTVDRPTTRQIEDRLIDAVIDWDAAHHTFTLCKVWRQHEMARAGVL*
Ga0111538_120182651 | 3300009156 | Populus Rhizosphere | LREIVKHALEPGPQALLELVKQATPPEAEIVSLAWAFEDEDYNIAVVMPDTIDRLTARQIEDCLIDAVLDWDAAHHTFTLCKVWQQHEMTRAGVL*
Ga0114945_100862752 | 3300009444 | Thermal Springs | MKPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHGTFTLCRVWHQREMARAGVW*
Ga0114945_103304042 | 3300009444 | Thermal Springs | LVSQAAPPEAEIVPLTWAFEDEDHNIAVVMPDTIDRLTARQIEDQLIDAVIDWDATHHTFTLCKVWREHEKAYAGLH*
Ga0114944_10696273 | 3300009691 | Thermal Springs | MKPTKPVPHDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVIDYDAAHGTFSLCMVWPQRDKALAGIH*
Ga0114944_11197181 | 3300009691 | Thermal Springs | VKHTLEPVPQALLALVKKAAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRLTTRQIEDQLIDAVIDWDAAHHTFTLCKVWRQHEMARAGVR*
Ga0114944_11762002 | 3300009691 | Thermal Springs | MQPIPEDLLALVSQAAPPEAEIVPLTWAFEDEDHNIAVVMPDTIDRLTARQIEDQLIDAVIDWDATHHTFTLCKVWREHEKAYAGLH*
Ga0114944_13219081 | 3300009691 | Thermal Springs | MKPTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVWHQREMARAGVL*
Ga0126380_104288112 | 3300010043 | Tropical Forest Soil | MKPTKPVPRDLLELVQKTAPPEANVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDCDAAHGTFTLCMVWPQRDKALAGIH*
Ga0134088_102731233 | 3300010304 | Grasslands Soil | MKPTKPVPQDLFELVQKTAPPEADVVPLSWAYEDEDYNIAIVVPDTMDRLEARQIEDSLIDAVMDYDASHGTFTLCMVWHQRDKALAGIH*
Ga0134086_104883531 | 3300010323 | Grasslands Soil | MKPTKPVPRDLLELVQKTAPPEANVVPLSWAYEDEDYNIAIVVPDTMDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVWLQRDKALAGIY*
Ga0134124_122463531 | 3300010397 | Terrestrial Soil | MKPAKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDEAHGTFTLCMVWHQR
Ga0105246_109503952 | 3300011119 | Miscanthus Rhizosphere | MKPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDHLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH*
Ga0137391_103710301 | 3300011270 | Vadose Zone Soil | ELVKKTAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRLTARQIEDRLIDAVIDWDAAHSTFTLCKVWRQHEMARAGVL*
Ga0137389_104798383 | 3300012096 | Vadose Zone Soil | MKPVPQDLLDLVKKAAPPEAEIVPLAWAFEDEDYNMAIVMPDTVDRLEARQIEDRLIDAVMNYDAAHGTYTLCMVWREREKVHAGVQ*
Ga0137388_102778091 | 3300012189 | Vadose Zone Soil | LKQTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDHLIDAVMDYDAAHGTYTLCMVWPQRDKALAGVG*
Ga0137388_117654441 | 3300012189 | Vadose Zone Soil | PQELLELVKKTAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRLTARQIEDRLIDAVIDWDAAHSTFTLCKVWRQHEMARAGVL*
Ga0137363_112465831 | 3300012202 | Vadose Zone Soil | MKSVPQELLELVKKTAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRRTARQLEDRLIDAVIDWDAAHHTFTLCKVWRHHEMARAGVL*
Ga0137374_100162997 | 3300012204 | Vadose Zone Soil | MKSTKPVPQGLLELVQKTTPPEADVVPLSWAFEDEDCNIAVVMPDTIDRRTARQLEDRLIDAVIDWDAAHHTFTLCKVWRHHEMARAGVL*
Ga0137374_102327042 | 3300012204 | Vadose Zone Soil | MKPTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDAAHGTFTLCMVWHQRDKALAGVR*
Ga0137380_1000803911 | 3300012206 | Vadose Zone Soil | MKHTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDVVMDYDAAHGTYTLCMVWREREKVHAGVQ*
Ga0137380_113150081 | 3300012206 | Vadose Zone Soil | MKSVPQELLELVRKAAPPEAEIVPLTWAFEDEDYNIAVVMPDTIDRLTARQTEDRLIDAVIDWDAAHHTFTLCKVWRQHEM
Ga0137381_101847923 | 3300012207 | Vadose Zone Soil | MKPTKPVPQDLLELVQKTTPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDTAHGTFTLCMVWHQRDKALAGIH*
Ga0137379_101812633 | 3300012209 | Vadose Zone Soil | MKPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHSTFTLCMVWHQRDKALAGIR*
Ga0137377_117433462 | 3300012211 | Vadose Zone Soil | MKPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHSTFTLCMVWH
Ga0137387_105606281 | 3300012349 | Vadose Zone Soil | MKHTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYHIAIVMPDTVERLEAREIEDRLIDVVMDYDAAHGTYTVCMVWPQRDKALA
Ga0137387_108587922 | 3300012349 | Vadose Zone Soil | MKSVPQDLLELVKKAAPPEAEMVRLTWAFEDEDYNIAVVMPDTIDRLTARQMEDRLIDAVIDWDAAYHTFTLCKVWRQHEMARTGVL*
Ga0137367_100378704 | 3300012353 | Vadose Zone Soil | MKSTKPVPQGLLELVQKTAPPEADVVPLSWAFEDEDCNIAVVMPDTIDRRTARQLEDRLIDAVIDWDAAHHTFTLCKVWRHHEMARAGVL*
Ga0137371_111303811 | 3300012356 | Vadose Zone Soil | TAPPEADVVPLSWAYEDEDYNIAIVVPDTMDRLEARQIEDRLIDAFMDYDAAHGTFTLCMVWLQRDKALAGIY*
Ga0137385_109563001 | 3300012359 | Vadose Zone Soil | MKPTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDASHSTFTLCMVWHQRDKALAGIH*
Ga0137375_101191611 | 3300012360 | Vadose Zone Soil | MKPTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVVPDTRDRLEARQVEDRLIDAVMDYDAAHGTFT
Ga0137360_112348272 | 3300012361 | Vadose Zone Soil | MKPTKPVPQDLFELVQKTAPPEADVVPLSWAFEDEDCNIAVVMPDTIDRRTARQLEDRLIDAVIDWDAAHHTFTLCKVWRHHEMARAGVL*
Ga0134041_12190411 | 3300012405 | Grasslands Soil | MKPVPQDLLELVEKAAPPEAEIVPLSWAFEDEDYNIAVVMPDTIDRPTARRIEDRLIDAVLDLDAVHRTFTLCKVWLQHEMARVGVR*
Ga0134075_104095941 | 3300014154 | Grasslands Soil | MKSTKPVLQGLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDVVMDYDAAHGTYTLCMVWREHEKALAGMR*
Ga0075354_11363112 | 3300014308 | Natural And Restored Wetlands | MKSVPQELLELVKKAAPPEAEIVPLAWAFEDEDYNIAVVMPDAVERLEARQIEDRLIDAVIDWDAAHSTFTLCKVWRQHEMARAGVL*
Ga0157377_109862091 | 3300014745 | Miscanthus Rhizosphere | TKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEAREIEDRLIDAVLDYDAAHGTYTLCMVWPQRDKALAGIH*
Ga0134085_102775113 | 3300015359 | Grasslands Soil | MKPTKAVPQDLLELVQKTAPSEAKVVPLPWAYEDEDYNIAIVMPDTIDRLTARQIEDRLIDAVIDWDAAHGTYTLCMV
Ga0132258_114712783 | 3300015371 | Arabidopsis Rhizosphere | MKPIPQELLELVKKAAPPEAEIVPLTWTFEDEDYHIAVVMPDTVDRLTARQIEDRLIDAVIDWDAAHHTFTLCKVWRQHEMTRAGVL*
Ga0132256_1021784522 | 3300015372 | Arabidopsis Rhizosphere | MKPIPQELLELVKKAAPPEAEIVPLTWTFEDEDYHIAVVMPDTVDRLTARQIEDRLIDAVIDWDAAHHTFTLCKVWQQHEMTRAGLL*
Ga0132255_1012396401 | 3300015374 | Arabidopsis Rhizosphere | RRDDPMKPTKAVPQDLLKLVQKTAPSEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHGTYTLCMVWPQRDKALAGIH*
Ga0163161_112172171 | 3300017792 | Switchgrass Rhizosphere | LVKKAAPPEAEIVPLAWAYEDEDYNIAIVMPDTVDRLEAREIEDRLIDAVLDYDAAHGTYTLCMVWPQRDKALAGIH
Ga0184610_10447053 | 3300017997 | Groundwater Sediment | MKPVPQELLEHLRKVAPPEAEVIPLSWAFEQEDYNIAVVMPDAIDRAEARQIEDCLLDVIMDWDDAHDTFTVCKVWREHEMARPGVR
Ga0184626_100183232 | 3300018053 | Groundwater Sediment | MKQTKPVPQDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHGTYTLCMVWRERERVHAEVQ
Ga0184626_102286381 | 3300018053 | Groundwater Sediment | MKAVPQELLELVRKAAPPEADVVPLPWAFEKEDYNLAVVMPDSVDRLAARQIEDRLLDVIMDYDDTHDTFTVCKVWRAQEIARAG
Ga0184623_101081471 | 3300018056 | Groundwater Sediment | MEPVPQALLNLVKKAAPPEAEIMPLPWAFEKEDYNLAVVMPDAVDRLAARQIEDRLLDVIMDYDEAHDTFTVCKVWRAREMARQGCCENCWDSGGKRR
Ga0184637_101029503 | 3300018063 | Groundwater Sediment | MKAVPQELLELVRKAAPPEADVIPLPWAFEKEDYNLAVVMPNSIDRLAARQIEDRLLDVIMDYDEAHDTFTVCKVWRVREMARVGVL
Ga0184637_103875532 | 3300018063 | Groundwater Sediment | VEPVPEDLLALVKQTAPPEAEVVPLTWTFEDEDYNIAIVIPDTVERLEARQIEDRLIDTVMDYDAAHGTFTLCMVWHQREKALAGIH
Ga0066667_110561902 | 3300018433 | Grasslands Soil | MKPTKPVPQDLFELVQKTAPPEADVVPLSWAYEDEDYNIAIVVPDTMDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVWLQRDKALAGIY
Ga0180110_12000182 | 3300019208 | Groundwater Sediment | SLMQSTKPVPDELLTLVHQAAPPEAYIVPLEWAYEAEDYNIALVMPDTLTPEASHQLQDHMIDAVMDWDAAHGTYTLIMVWREREKASAGVR
Ga0180116_11367931 | 3300019229 | Groundwater Sediment | MHSTKPLPHDLLTLVQKAAPPEAHIVPLEWAYEAEDYNIAIVMPDTITPEEAHQVQDRVIDAVMDWDAAHDTYTLAMVWRQHEMDRAGAR
Ga0184648_10189941 | 3300019249 | Groundwater Sediment | QDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHSTFTLCMVWPQRDKALAGMH
Ga0184648_11757812 | 3300019249 | Groundwater Sediment | MQSTKPLPHDLLALVQKTAPPEAHIIPLDWAYEAEDYNIAVVMPDTITPEDAHQLQERLIDAVMDWDAAHGTYTLTMVWQQHEMDRAGAR
Ga0184648_12539183 | 3300019249 | Groundwater Sediment | MKAVPQELLELVRKAAPPEADVVPLPWAFEKEDYNLAVVMPNSIDRLAARQIEDRLLDVIMDYDEAHDTFTVCKVWRVREMARVGVL
Ga0184646_13686353 | 3300019259 | Groundwater Sediment | MKAVPQELLELVRKAAPPEADVVPLPWAFEKEDYNLAVVMPNSIDRLAARQIEDRLLDLIMDYDEAHDTFTVCKVWRVREMARVGVL
Ga0184646_15528793 | 3300019259 | Groundwater Sediment | MKPVPQELLEHVQKVAPPEAEVIPLSWAFEQEDYNIAVVMPDTTDRAEARQIEDGLLDVIMDWDDAHDTFTVCKVWREHEMARPGVR
Ga0180118_12402433 | 3300020063 | Groundwater Sediment | MHSTKPFPHDLLTLVQKAAPPEAHIVPLEWAYEAEDYNIAIVMPDTITPEEAHQVQDRVIDAVMDWDAAHDTYTLAMVWRQHEMDRAGAR
Ga0180113_12867251 | 3300020065 | Groundwater Sediment | MKPVPRDLLELVQKTAPPEAEIVPLTWAFEDEDYNLAVVMPDGIERDVARQIEAQLIDAVIDWDAAHHTFTLCKVWRQHDIARAGVR
Ga0212128_101347522 | 3300022563 | Thermal Springs | MQPIPEDLLALVSQAAPPEAEIVPLTWAFEDEDHNIAVVMPDTIDRLTARQIEDQLIDAVIDWDATHHTFTLCKVWREHEKAYAGLH
Ga0212128_105871241 | 3300022563 | Thermal Springs | VKHTLEPVPQALLALVKKAAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRLTTRQIEDQLIDAVIDWDAAHHTFTLCKVWRQHEMARAGVR
Ga0209827_109376271 | 3300025149 | Thermal Springs | MKPTKPVPHDLLELVQKTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVIDYDAAHGTFSLCMVWPQRDKALAGIH
Ga0209399_104029901 | 3300025157 | Thermal Springs | MKPTKPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDRLIDAVMDYDAAHGTFTLCMVWHQREMARAGVL
Ga0207688_104302421 | 3300025901 | Corn, Switchgrass And Miscanthus Rhizosphere | ELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH
Ga0207703_112491341 | 3300026035 | Switchgrass Rhizosphere | VKHALEPVPQALLELVKKAAPPEAEIVPLAWAYEDEDYNIAIVMPDTVDRLEAREIEDRLIDAVLDYDAAHGTYTLCM
Ga0209180_100962402 | 3300027846 | Vadose Zone Soil | VQPVPQDLIALLKQTAPPEAEVVPLSWAYEDEDYNIAIVMPDTVDRLEARQIEDHLIDAVMDYDAAHGTYTLCMVWPQRDKALAGIR
Ga0209283_104411111 | 3300027875 | Vadose Zone Soil | MEPVPQALLELIKKTAPPESAIVPLSWAFEDEEYNIAIVMPDTVECLEARHIEGRVRDVAMDWDAAHGTVTLCKVWRQYEIARPGVL
Ga0209590_101189132 | 3300027882 | Vadose Zone Soil | MKSVPQELLELVKKTAPPEAEIVPLAWAFEDEDYNIAVVMPDTIDRLTARQIEDRLIDAVIDWDAAHSTFTLCKVWRQHEMARAGVL
Ga0209590_107665952 | 3300027882 | Vadose Zone Soil | VKNTLEPVPQALLALVKKAVPPVAEIVPLTWAFEDEDYNIAVVMPDTIDRLTARQIEDCLIDAVMDYDAAHGTYTLCMVWREREKVHAGVQ
Ga0209382_123068591 | 3300027909 | Populus Rhizosphere | MKSVPQELLELVKKAAPPDAEIVPLTWAFEDEDYNIAIVMPDAIDRLTVRQIEDRLIDAVIDWDAAHHTFTLCKVWQQHEMTRAGVR
Ga0268264_110174963 | 3300028381 | Switchgrass Rhizosphere | MCQPSRPVPQDLLELVQKTAPPEADVVPLSWAYEDEDYNIAIVMPDTVERLEARQIEDRLIDAVMDYDEAHGTFTLCMVWHQRDKALAGIH
Ga0247646_11248412 | 3300030546 | Soil | MKPVPQDLLELVQKTAPPEAEIVPLAWAFEDEDYNIAVVMPDGVDRVTARQIEDRLIDAVIDWDAAHRTFTLCKVWRQHEMARAGVR
Ga0364925_0129539_33_305 | 3300034147 | Sediment | MQSTKPVPDELLTLVHQAAPPEAYIVPLEWAYEAEDYNIALVMPDTLTPEASHQLQDHMIDAVMDWDAAHGTYTLIMVWREREKASAGVR


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice