NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F100337

Metagenome Family F100337

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F100337
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 80 residues
Representative Sequence MTNADDPLPEPSFHWRRWVTIGYVSVTLILLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIVASWKKP
Number of Associated Samples 42
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 54.90 %
% of genes near scaffold ends (potentially truncated) 15.69 %
% of genes from short scaffolds (< 2000 bps) 45.10 %
Associated GOLD sequencing projects 38
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (54.902 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake
(53.922 % of family members)
Environment Ontology (ENVO) Unclassified
(78.431 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(79.412 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 70.37%    β-sheet: 0.00%    Coil/Unstructured: 29.63%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF13539Peptidase_M15_4 27.45
PF01510Amidase_2 7.84
PF07484Collar 3.92
PF13229Beta_helix 2.94
PF00149Metallophos 1.96
PF10073DUF2312 1.96
PF12708Pectate_lyase_3 1.96
PF10926DUF2800 0.98
PF13392HNH_3 0.98
PF16510P22_portal 0.98
PF03237Terminase_6N 0.98
PF03906Phage_T7_tail 0.98
PF09374PG_binding_3 0.98
PF00589Phage_integrase 0.98
PF14354Lar_restr_allev 0.98
PF05838Glyco_hydro_108 0.98
PF03796DnaB_C 0.98
PF04448DUF551 0.98
PF13550Phage-tail_3 0.98
PF11134Phage_stabilise 0.98
PF10124Mu-like_gpT 0.98
PF00126HTH_1 0.98
PF11651P22_CoatProtein 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG0305Replicative DNA helicaseReplication, recombination and repair [L] 0.98
COG1066DNA repair protein RadA/Sms, contains AAA+ ATPase domainReplication, recombination and repair [L] 0.98
COG3926Lysozyme family proteinGeneral function prediction only [R] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A54.90 %
All OrganismsrootAll Organisms45.10 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002161|JGI24766J26685_10076006Not Available728Open in IMG/M
3300002835|B570J40625_100503383All Organisms → cellular organisms → Bacteria → Proteobacteria1137Open in IMG/M
3300004481|Ga0069718_12857856Not Available692Open in IMG/M
3300005525|Ga0068877_10028990All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3814Open in IMG/M
3300005525|Ga0068877_10129187Not Available1561Open in IMG/M
3300005527|Ga0068876_10021833Not Available4044Open in IMG/M
3300005527|Ga0068876_10105039Not Available1682Open in IMG/M
3300005805|Ga0079957_1154532Not Available1163Open in IMG/M
3300008266|Ga0114363_1001090Not Available16430Open in IMG/M
3300008266|Ga0114363_1006336All Organisms → cellular organisms → Bacteria → Proteobacteria5948Open in IMG/M
3300008266|Ga0114363_1014776All Organisms → cellular organisms → Bacteria → Proteobacteria5404Open in IMG/M
3300008266|Ga0114363_1027969Not Available2408Open in IMG/M
3300008266|Ga0114363_1177453Not Available679Open in IMG/M
3300008267|Ga0114364_1009230All Organisms → cellular organisms → Bacteria5808Open in IMG/M
3300008267|Ga0114364_1054318Not Available1419Open in IMG/M
3300008448|Ga0114876_1000281All Organisms → cellular organisms → Bacteria39738Open in IMG/M
3300008448|Ga0114876_1007008Not Available6877Open in IMG/M
3300008448|Ga0114876_1011171All Organisms → cellular organisms → Bacteria → Proteobacteria5088Open in IMG/M
3300008448|Ga0114876_1139438Not Available901Open in IMG/M
3300008450|Ga0114880_1000193All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria36917Open in IMG/M
3300008450|Ga0114880_1003319All Organisms → cellular organisms → Bacteria → Proteobacteria8861Open in IMG/M
3300008450|Ga0114880_1009223All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4928Open in IMG/M
3300008450|Ga0114880_1023811Not Available2814Open in IMG/M
3300008450|Ga0114880_1110330Not Available1048Open in IMG/M
3300009085|Ga0105103_10047343All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales2166Open in IMG/M
3300009169|Ga0105097_10240931Not Available996Open in IMG/M
3300009419|Ga0114982_1002316Not Available8408Open in IMG/M
3300010374|Ga0114986_1004610All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3064Open in IMG/M
3300011334|Ga0153697_1176Not Available21989Open in IMG/M
3300012017|Ga0153801_1019012All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1224Open in IMG/M
3300013372|Ga0177922_11181144Not Available620Open in IMG/M
3300017716|Ga0181350_1000383All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria13044Open in IMG/M
3300017716|Ga0181350_1001347All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria7198Open in IMG/M
3300017716|Ga0181350_1001860All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria6161Open in IMG/M
3300017716|Ga0181350_1002812All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5077Open in IMG/M
3300017716|Ga0181350_1003328All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas4668Open in IMG/M
3300017716|Ga0181350_1005868Not Available3542Open in IMG/M
3300017716|Ga0181350_1007079All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3223Open in IMG/M
3300017716|Ga0181350_1013225All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium2339Open in IMG/M
3300017716|Ga0181350_1014605All Organisms → Viruses → Predicted Viral2225Open in IMG/M
3300017716|Ga0181350_1019885Not Available1886Open in IMG/M
3300017716|Ga0181350_1030521Not Available1486Open in IMG/M
3300017716|Ga0181350_1048945All Organisms → Viruses → Predicted Viral1128Open in IMG/M
3300017716|Ga0181350_1059304Not Available1000Open in IMG/M
3300017716|Ga0181350_1063885Not Available956Open in IMG/M
3300017716|Ga0181350_1147043Not Available551Open in IMG/M
3300017722|Ga0181347_1010026Not Available3076Open in IMG/M
3300017722|Ga0181347_1014919All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2494Open in IMG/M
3300017722|Ga0181347_1056949Not Available1172Open in IMG/M
3300017722|Ga0181347_1075906Not Available986Open in IMG/M
3300017722|Ga0181347_1167135All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Brevundimonas → Brevundimonas aurantiaca593Open in IMG/M
3300017723|Ga0181362_1015883Not Available1620Open in IMG/M
3300017723|Ga0181362_1047451Not Available897Open in IMG/M
3300017761|Ga0181356_1006179Not Available4761Open in IMG/M
3300017766|Ga0181343_1065374All Organisms → cellular organisms → Bacteria → Proteobacteria1056Open in IMG/M
3300017774|Ga0181358_1082069All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage1176Open in IMG/M
3300017778|Ga0181349_1005616All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5290Open in IMG/M
3300017778|Ga0181349_1009850All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3979Open in IMG/M
3300017778|Ga0181349_1027355Not Available2307Open in IMG/M
3300019784|Ga0181359_1000386All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria9563Open in IMG/M
3300019784|Ga0181359_1001827All Organisms → cellular organisms → Bacteria5935Open in IMG/M
3300019784|Ga0181359_1030369All Organisms → cellular organisms → Bacteria2084Open in IMG/M
3300019784|Ga0181359_1058930Not Available1472Open in IMG/M
3300019784|Ga0181359_1100561Not Available1060Open in IMG/M
3300020048|Ga0207193_1380600Not Available950Open in IMG/M
3300022190|Ga0181354_1014362Not Available2414Open in IMG/M
3300022407|Ga0181351_1008972All Organisms → cellular organisms → Bacteria → Proteobacteria4010Open in IMG/M
3300022407|Ga0181351_1015828All Organisms → Viruses → Predicted Viral3145Open in IMG/M
3300022407|Ga0181351_1022847Not Available2649Open in IMG/M
3300022407|Ga0181351_1026235Not Available2474Open in IMG/M
3300022407|Ga0181351_1029286All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2340Open in IMG/M
3300022407|Ga0181351_1048670Not Available1775Open in IMG/M
3300022407|Ga0181351_1069072All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium1434Open in IMG/M
3300022407|Ga0181351_1075531All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1355Open in IMG/M
3300022407|Ga0181351_1096765Not Available1148Open in IMG/M
3300022407|Ga0181351_1125545Not Available956Open in IMG/M
3300022407|Ga0181351_1138544Not Available890Open in IMG/M
3300022407|Ga0181351_1174711Not Available746Open in IMG/M
3300022407|Ga0181351_1246766Not Available559Open in IMG/M
3300022407|Ga0181351_1263520Not Available528Open in IMG/M
3300027710|Ga0209599_10001847Not Available9416Open in IMG/M
3300027793|Ga0209972_10029246All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3211Open in IMG/M
3300027805|Ga0209229_10107852Not Available1255Open in IMG/M
3300027806|Ga0209985_10020208Not Available4183Open in IMG/M
3300027806|Ga0209985_10298336Not Available729Open in IMG/M
3300027972|Ga0209079_10008184All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3525Open in IMG/M
(restricted) 3300028559|Ga0247831_1017250All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Brevundimonas → Brevundimonas viscosa5548Open in IMG/M
(restricted) 3300028569|Ga0247843_1019504All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5609Open in IMG/M
(restricted) 3300028569|Ga0247843_1035154Not Available3221Open in IMG/M
3300031758|Ga0315907_10009459All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria9832Open in IMG/M
3300031758|Ga0315907_10133866All Organisms → cellular organisms → Bacteria → Proteobacteria → Oligoflexia → Bdellovibrionales → Bdellovibrionaceae → Bdellovibrio2108Open in IMG/M
3300031787|Ga0315900_10021256Not Available7516Open in IMG/M
3300031787|Ga0315900_10214078All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae1688Open in IMG/M
3300031787|Ga0315900_10765825Not Available671Open in IMG/M
3300031857|Ga0315909_10105635All Organisms → cellular organisms → Bacteria → Proteobacteria2415Open in IMG/M
3300031857|Ga0315909_10272557Not Available1283Open in IMG/M
3300034012|Ga0334986_0000457Not Available35668Open in IMG/M
3300034012|Ga0334986_0114893Not Available1594Open in IMG/M
3300034012|Ga0334986_0203687Not Available1104Open in IMG/M
3300034062|Ga0334995_0005154All Organisms → Viruses → unclassified viruses → unclassified DNA viruses → Pseudo-nitzschia multiseries DNA virus13077Open in IMG/M
3300034066|Ga0335019_0000223All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria36543Open in IMG/M
3300034272|Ga0335049_0376166Not Available938Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Freshwater LakeEnvironmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake53.92%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater8.82%
Freshwater LakeEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake8.82%
Freshwater, PlanktonEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton6.86%
FreshwaterEnvironmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater6.86%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment2.94%
Deep SubsurfaceEnvironmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface2.94%
Freshwater And SedimentEnvironmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater And Sediment1.96%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater1.96%
SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Sediment0.98%
FreshwaterEnvironmental → Aquatic → Freshwater → Lentic → Epilimnion → Freshwater0.98%
LakeEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Lake0.98%
Freshwater Lake SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Lake Sediment0.98%
FreshwaterEnvironmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater0.98%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002161Freshwater and sediment microbial communities from dead zone in Sandusky Bay, Ohio, USAEnvironmentalOpen in IMG/M
3300002835Freshwater microbial communities from Lake Mendota, WI - (Lake Mendota Combined Ray assembly, ASSEMBLY_DATE=20140605)EnvironmentalOpen in IMG/M
3300004481Combined Assembly of Gp0112041, Gp0112042, Gp0112043EnvironmentalOpen in IMG/M
3300005525Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel6S_1000h metaGEnvironmentalOpen in IMG/M
3300005527Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel5S_2200h metaGEnvironmentalOpen in IMG/M
3300005805Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USAEnvironmentalOpen in IMG/M
3300008266Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample HABS-E2014-0108-C-NAEnvironmentalOpen in IMG/M
3300008267Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, sample HABS-E2014-0024-100-LTREnvironmentalOpen in IMG/M
3300008448Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - August 4, 2014 all contigsEnvironmentalOpen in IMG/M
3300008450Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - Oct 27, 2014 all contigsEnvironmentalOpen in IMG/M
3300009085Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 10-12cm September2015EnvironmentalOpen in IMG/M
3300009169Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm May2015EnvironmentalOpen in IMG/M
3300009419Subsurface microbial communities from deep shales in Ohio, USA - Utica-3 well 1 S input2 FTEnvironmentalOpen in IMG/M
3300010374Subsurface microbial communities from deep shales in Ohio, USA - Utica-3 well 1 S-1-Day17EnvironmentalOpen in IMG/M
3300011334Lotic viral community from Han River, Hwacheon, Gangwon-do, South Korea - DaesungEnvironmentalOpen in IMG/M
3300012017Freshwater microbial communities from Central Basin Lake Erie, Ontario, Canada - Station 1208 - Top - Depth 1mEnvironmentalOpen in IMG/M
3300013372Freshwater microbial communities from Lake Erie, Ontario, Canada. Combined Assembly of 10 SPsEnvironmentalOpen in IMG/M
3300017716Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.DCM.DEnvironmentalOpen in IMG/M
3300017722Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.S.NEnvironmentalOpen in IMG/M
3300017723Freshwater viral communities from Lake Michigan, USA - Su13.ND.MM110.S.NEnvironmentalOpen in IMG/M
3300017761Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.S.NEnvironmentalOpen in IMG/M
3300017766Freshwater viral communities from Lake Michigan, USA - Su13.VD.MLB.S.DEnvironmentalOpen in IMG/M
3300017774Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.S.DEnvironmentalOpen in IMG/M
3300017778Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.S.DEnvironmentalOpen in IMG/M
3300019784Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.DEnvironmentalOpen in IMG/M
3300020048Microbial communities from Manganika and McQuade lakes, Minnesota, USA Combined Assembly of Gp0225457, Gp0225456, Gp0225455, Gp0225454, Gp0225453, Gp0224915EnvironmentalOpen in IMG/M
3300022190Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.NEnvironmentalOpen in IMG/M
3300022407Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM15.S.DEnvironmentalOpen in IMG/M
3300027710Subsurface microbial communities from deep shales in Ohio, USA - Utica-3 well 1 S input2 FT (SPAdes)EnvironmentalOpen in IMG/M
3300027793Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel1S_2200h metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027805Freshwater and sediment microbial communities from dead zone in Sandusky Bay, Ohio, USA (SPAdes)EnvironmentalOpen in IMG/M
3300027806Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel6S_1000h metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027972Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 10-12cm September2015 (SPAdes)EnvironmentalOpen in IMG/M
3300028559 (restricted)Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_1mEnvironmentalOpen in IMG/M
3300028569 (restricted)Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2017_8mEnvironmentalOpen in IMG/M
3300031758Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA123EnvironmentalOpen in IMG/M
3300031787Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA114EnvironmentalOpen in IMG/M
3300031857Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125EnvironmentalOpen in IMG/M
3300034012Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME18Aug2017-rr0027EnvironmentalOpen in IMG/M
3300034062Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME27Jul2012-rr0045EnvironmentalOpen in IMG/M
3300034066Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME11Jul2017-rr0087EnvironmentalOpen in IMG/M
3300034272Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME18Jul2017-rr0156EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI24766J26685_1007600633300002161Freshwater And SedimentPEPSFHWRRWVTIGYVVSTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAXDLARIVASWKKP*
B570J40625_10050338333300002835FreshwaterVTNADDPLPEPSFHWRRWVTIGYVSVTLVLLAVIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASASDLARIAASWKKP*
Ga0069718_1285785613300004481SedimentVTDNQDPLPEPSFQWRRWVTIGYVVVTLGLLVGIVWKLSDGGPLRDVAIALIVSQAFFALLYMGGASAADIARIVASWKRQP*
Ga0068877_1002899013300005525Freshwater LakeDPDNPLPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP*
Ga0068877_1012918743300005525Freshwater LakeMTDNQDPLPEPSFHWRRWVTIGYVVSTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKP*
Ga0068876_1002183333300005527Freshwater LakeMIDADNPAPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP*
Ga0068876_1010503923300005527Freshwater LakeMTDPDNPLPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP*
Ga0079957_115453233300005805LakeMTDQQDPLPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP*
Ga0114363_1001090123300008266Freshwater, PlanktonMTDNQDPLPEPSFQWRRWVTIGYVVATTLLLGFIVFKLIEGGPLRDVALALIGSQAFFALMYMGGASASDIARIVASWKKP*
Ga0114363_100633623300008266Freshwater, PlanktonMTDNMPLSDNPAPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP*
Ga0114363_101477633300008266Freshwater, PlanktonMTDPDNPLPEPSFHWRRWVTIGYVTVTLALLAGIVWKLSDGGPLRDIALALIASQAFFGFCYMGGASASDIARIVASWKKP*
Ga0114363_102796943300008266Freshwater, PlanktonVIDPDNPVPEPSHRWRRWVTIGYLIVTAGLLGFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKRSS*
Ga0114363_117745323300008266Freshwater, PlanktonHRWRRWVTIGYLIVTAALLAFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKKP*
Ga0114364_100923073300008267Freshwater, PlanktonMTDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVWRMTESVPLKDVALALIGSQAFFALLYMAGASASDLARIAASWKRQP*
Ga0114364_105431833300008267Freshwater, PlanktonMTDPDNPLPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFGFCYMGGASAADIARIVSSWRRP*
Ga0114876_1000281663300008448Freshwater LakeMLDEQNNPYGHPMSDNPAPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP*
Ga0114876_100700843300008448Freshwater LakeMIDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFVVWRMTESAPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKRQP*
Ga0114876_101117153300008448Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVTVTLILLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP*
Ga0114876_113943823300008448Freshwater LakeMTDNQDPLPEPSFHWRRWVTIGYVTVTLALLAWVIWKLTDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP*
Ga0114880_1000193263300008450Freshwater LakeMTDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVWRMAEAVPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKKQ*
Ga0114880_100331963300008450Freshwater LakeVIDLDNPVPEPSHRWRRWVTIGYLIVTAGLLVGIVGKLSAGGPLRDVALALIGAQAFFALLYMAGASASDLARIAASWKKP*
Ga0114880_100922323300008450Freshwater LakeMTNADNPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARVIASWKKL*
Ga0114880_102381133300008450Freshwater LakeVTNADDPLPEPSFHWRRWVTIVYVSVTLGILGGIVLKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP*
Ga0114880_111033023300008450Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVSVTLILLAVIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP*
Ga0105103_1004734323300009085Freshwater SedimentVTNADDPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALISSQAFFALMYMGGASASDLARIVASWKKP*
Ga0105097_1024093123300009169Freshwater SedimentMTEPDNPVPELSHRWRRWVTIGYLIVTAGLLGFIVWRMTESAPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKKP*
Ga0114982_1002316123300009419Deep SubsurfaceVTENQDPLPEPSFHWRRWVTIGYVVSTTVLLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIAASWKKP*
Ga0114986_100461013300010374Deep SubsurfaceLPEPSFHWRRWVTIGYVVSTTVLLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIAASWKKP*
Ga0153697_117653300011334FreshwaterVTDNQDPLPEPSFLWRRWVTVGYVVSTTALLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKP*
Ga0153801_101901233300012017FreshwaterMTNPDNPVPEPSHRWRRWVTIGYLIVTAGLLGFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIAASWKRQS*
Ga0177922_1118114423300013372FreshwaterVTDNQDPLPEPSFHWRRWVTIGYVAVTLALLVGIVCKLSDGGPLRDVAIALIVSQAFFALLYMGGASAADIARIAASWKKP*
Ga0181350_100038393300017716Freshwater LakeMTNADNPLPEPSFHWRRWVTIGYVSVTLVLLAAIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKK
Ga0181350_100134773300017716Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVTVTLVLLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181350_100186023300017716Freshwater LakeMTDPDNPVPEPSHRWRRWVTIGYLIVTAGLLGFIVYQMTESAPLRDVALALIGSQAFFGLLYMAGASASDLARIVASWKRSS
Ga0181350_100281243300017716Freshwater LakeMTDNQDPLPEPSFLWRRWVTIGYVAATTILLGLIILRLIEAGPLRDIALALIGSQAFFALLYMGGASASDLARIIASWKKP
Ga0181350_100332823300017716Freshwater LakeVIDLDNPVPEPSHRWRRCVTIGYLIVTAGLLVGIVYKLSAGGPLRDVALALIGSQAFFALLYMAGASASDLARIAASWKKP
Ga0181350_100586823300017716Freshwater LakeVTDNQDPLPEPSFHWRRWVTIGYVVSTTALLVGIILKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKP
Ga0181350_100707933300017716Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVSVTLILLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIVASWKKP
Ga0181350_101322513300017716Freshwater LakeVTNADDPLPEPSFHWRRWVTIGYVSVTMALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181350_101460553300017716Freshwater LakeMTDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVWRMTESIPLKDVALALIGSQAFFALLYMAGASASDLARIVASWKRQP
Ga0181350_101988533300017716Freshwater LakeVTDNQDPLPEPSFHWRRWVTIGYVISTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKT
Ga0181350_103052133300017716Freshwater LakeMIDPDNPVPEPSHRWRRWVTIGYLMVTAGLLAFVVWRMTESAPLRDVALALIGSQAFFCLMYMGGASAADLARIVASWRKPS
Ga0181350_104894533300017716Freshwater LakeVTDLDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLA
Ga0181350_105930433300017716Freshwater LakeVIDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKKA
Ga0181350_106388533300017716Freshwater LakeFHWRRWVTIGYVVSTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181350_114704323300017716Freshwater LakeRWVTIGYVASTTILLGGIVLKLTEGGPLRDVALALIGSQAFFALLYMGGASASDLARIVASWKKP
Ga0181347_101002633300017722Freshwater LakeMTNADNPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKL
Ga0181347_101491923300017722Freshwater LakeMTDLDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKRPS
Ga0181347_105694933300017722Freshwater LakeMTDLDNPVPEPSHRWRRWVTIGYLIVTAGLLAFVVWRMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKRPS
Ga0181347_107590613300017722Freshwater LakeEPSHRWRRWVTIGYLIVTAGLLVGIVYKLSAGGPLRDVALALIGSQAFFALLYMAGASASDLARIAASWKKP
Ga0181347_116713523300017722Freshwater LakeMIDPDNPVPEPSHRWRRWVTIGYLIVTAGLLGFIVYQMTESAPLRDIALALIGSQAFFALLYMAGASAADLARIVASWKRSS
Ga0181362_101588323300017723Freshwater LakeVTDLDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKRHP
Ga0181362_104745113300017723Freshwater LakeMTDLDNPVPEPSHRWRRWVTIGYLIVTAGLLAFVVWRMTESAPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKKQ
Ga0181356_100617933300017761Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVSATLALRAGIVWQLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181343_106537433300017766Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP
Ga0181358_108206913300017774Freshwater LakeWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181349_100561643300017778Freshwater LakeVIDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKKK
Ga0181349_100985033300017778Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVFVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIVASWKKP
Ga0181349_102735553300017778Freshwater LakeVTGNQDPLPEPSFHWRRWVTIGYVAVTLALLVAIVWKLSDGGPLRDVAIALIVSQAFFALLYMGGASAADIARIVASWKKS
Ga0181359_100038613300019784Freshwater LakeVIDLDNPVPEPSHRWRRWVTIGYLIVTAGLLVGIVYKLSAGGPLRDVALALIGSQAFFALLYMAGASASDLARIAASWKKP
Ga0181359_100182763300019784Freshwater LakeMTDNQDPLPEPSFQWRRWVTIGYVVATTLLLGFIVFKLIEGGPLRDVALALIGSQAFFALMYMGGASASDIARIVASWKKP
Ga0181359_103036933300019784Freshwater LakeMTNTDNPLPEPSFHWRRWVTVGYLIVTALLVGFIVYQMTESAPLRDVALALIGSQAFFALMYMGGASAADLARIMASWKKP
Ga0181359_105893023300019784Freshwater LakeMIDPDNPVPEPSHRWRRWVTIGYLIVTAGLLGFIVYQMTESAPLRDVALALIGSQAFFCLMYMGGASAADLARIVASWKRSS
Ga0181359_110056113300019784Freshwater LakeMTNADNPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0207193_138060013300020048Freshwater Lake SedimentVTAAYSQKAKRVTNADDPLPEPSFHWRRWVTIGYVFVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASASDLARIVASWKKP
Ga0181354_101436253300022190Freshwater LakeDDNQDPLPEPSFQWRRWVTIGYVVATTLLLGFIVFKLIEGGPLRDVALALIGSQAFFALMYMGGASASDIARIVASWKKP
Ga0181351_100897273300022407Freshwater LakeMTDPDNPLPEPSFHWRRWVTIGYVSVTLALLAAIVWKVSDGGPLRDIALALIASQAFFGFCYMGGASASDLARIVASWKKP
Ga0181351_101582833300022407Freshwater LakeMTDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFIVWRMAESVPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKKQ
Ga0181351_102284733300022407Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVSVTLVLLAGIIWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181351_102623523300022407Freshwater LakeMIDQQDPLPEPSFHWRRWVTIGYVASTTILLGGIVLKLTEGGPLRDVALALIGSQAFFALLYMGGASASDIARIVASWKKP
Ga0181351_102928643300022407Freshwater LakeVIDLDNPVPEPSHRWRRWVTIGYLIVTAGLLVGIVGKLSAGGPLRDVALALIGAQAFFALLYMAGASASDLARIAASWKKP
Ga0181351_104867033300022407Freshwater LakeVTNADDPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKS
Ga0181351_106907223300022407Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181351_107553133300022407Freshwater LakeVTDNQDPLPEPSFQWRRWVTIGYVVATTLLLGFIVFKLIEGGPLRDVALALIGSQAFFALMYMGGASASDIARIVASWKKP
Ga0181351_109676533300022407Freshwater LakeMTNADDPLPEPSFHWRRWVTIGYVSVTLALLAGIVLKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0181351_112554523300022407Freshwater LakeMTDNQDPLPEPSFHWRRWVTIGYVVSTTALLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKT
Ga0181351_113854423300022407Freshwater LakeMIDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFVVWRMTESAPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKKS
Ga0181351_117471113300022407Freshwater LakeVIDPDNPVPEPSHRWRRWVTIGYLTVTAGLLAFIVSQMTESAPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKKP
Ga0181351_124676613300022407Freshwater LakeVNENQDPLPEPSFHWRRWVTIGYVVSTTVLLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKP
Ga0181351_126352023300022407Freshwater LakeVTDPDNPVPEPSHRWRRWVTIGYLIVTAGLLAFVVWRMTESAPLRDVALALIGSQAFFALLYMAGASASDLARIVASWKKQ
Ga0209599_10001847113300027710Deep SubsurfaceVTENQDPLPEPSFHWRRWVTIGYVVSTTVLLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIAASWKKP
Ga0209972_1002924643300027793Freshwater LakeMTDPDNPLPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP
Ga0209229_1010785223300027805Freshwater And SedimentMTDNQDPLPEPSFHWRRWVTIGYVVSTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASASDLARIVASWKKP
Ga0209985_10020208103300027806Freshwater LakeDPDNPLPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP
Ga0209985_1029833623300027806Freshwater LakeMTDNQDPLPEPSFHWRRWVTIGYVVSTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKP
Ga0209079_1000818463300027972Freshwater SedimentVTNADDPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALISSQAFFALMYMGGASASDLARIVASWKKP
(restricted) Ga0247831_101725033300028559FreshwaterMIDPDNPVPEPSHRWRRWVTIGYLITTAGLLGFIVWRMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKKS
(restricted) Ga0247843_101950443300028569FreshwaterMTDSQDPLPEPSFHWRRWVTIGYVVSTTILLAGIVWKLTEGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
(restricted) Ga0247843_103515433300028569FreshwaterMTNADDPLPEPSFHWRRWVTIGYVSVTLVLLAVIVWKLSDGGPLRDVALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0315907_1000945943300031758FreshwaterMTDPDNPLPEPSFHWRRWVTIGYVTVTLALLAGIVWKLSDGGPLRDIALALIASQAFFGFCYMGGASASDIARIVASWKKP
Ga0315907_1013386633300031758FreshwaterMTDLDNPVPEPSHRWRRWVTIGYLIVTAALLAFIVYQMTESAPLRDVALALIGSQAFFALLYMAGASAADLARIVASWKKP
Ga0315900_10021256133300031787FreshwaterHWRRWVTIGYVVSTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKP
Ga0315900_1021407833300031787FreshwaterMIDADNPAPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP
Ga0315900_1076582523300031787FreshwaterPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP
Ga0315909_1010563543300031857FreshwaterMTDNMPLSDNPAPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP
Ga0315909_1027255723300031857FreshwaterMTNADDPLPEPSFHWRRWVTIGYVSVTLILLAVIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0334986_0000457_18507_187523300034012FreshwaterMTNADDPLPEPSFHWRRWVTIVYVSVTLALLAGIVLKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASWKKP
Ga0334986_0114893_166_4113300034012FreshwaterMTDNQDPLPEPSFHWRRWVTIGYVVSTTALLGAIVWKLTEGGPLRDVALALIGSQAFFALLYMGGASAADLARIVASWKKP
Ga0334986_0203687_879_11033300034012FreshwaterLPEPSFHWRRWVTIGYVVSTTILLVGIIWKLSEGGPLRDVALALIGSQAFFALLYMGGASASDLARIVASWKKP
Ga0334995_0005154_6021_62663300034062FreshwaterMTDLDNPLPEPSFHWRRWVTIGYVVATTILLALIVWKLTEGGPLRDIALALIGSQAFFALMYMGGASASDLARIVASWKKP
Ga0335019_0000223_9570_98153300034066FreshwaterMTDPDNPLPEPSFHWRRWVTIGYVSVTLALLAAIVWKVSDGGPLRDIALALIASQAFFGFCYMGGASASDLARIIASWKKP
Ga0335049_0376166_1_2343300034272FreshwaterVTNADDPLPEPSFHWRRWVTIGYVSVTLALLAGIVWKLSDGGPLRDIALALIGSQAFFALLYMGGASAADLARIIASW


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.