NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101760

Metagenome / Metatranscriptome Family F101760

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101760
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 121 residues
Representative Sequence SSPPPPHRIGILMSPVGRFLQCRDCQLSYTFPDGAKFGAIAKQFESHLCLSPIRIPGWHTDRRFVIVRYEGKVPAMASCAKCQRKFFTPTTLARDAVGAEEYLGRKFDVHDCPAEIEERHK
Number of Associated Samples 77
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 53.92 %
% of genes near scaffold ends (potentially truncated) 42.16 %
% of genes from short scaffolds (< 2000 bps) 68.63 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (52.941 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(19.608 % of family members)
Environment Ontology (ENVO) Unclassified
(17.647 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(44.118 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 20.13%    β-sheet: 24.16%    Coil/Unstructured: 55.70%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF07508Recombinase 2.94
PF08281Sigma70_r4_2 1.96
PF00027cNMP_binding 1.96
PF02371Transposase_20 1.96
PF02405MlaE 1.96
PF13481AAA_25 0.98
PF00072Response_reg 0.98
PF00190Cupin_1 0.98
PF07635PSCyt1 0.98
PF14417MEDS 0.98
PF01569PAP2 0.98
PF00486Trans_reg_C 0.98
PF13620CarboxypepD_reg 0.98
PF00688TGFb_propeptide 0.98
PF13174TPR_6 0.98
PF00924MS_channel 0.98
PF08388GIIM 0.98
PF00589Phage_integrase 0.98
PF00239Resolvase 0.98
PF12587DUF3761 0.98
PF04909Amidohydro_2 0.98
PF00691OmpA 0.98
PF13751DDE_Tnp_1_6 0.98
PF00082Peptidase_S8 0.98
PF12681Glyoxalase_2 0.98
PF00512HisKA 0.98
PF00069Pkinase 0.98
PF02591zf-RING_7 0.98
PF01663Phosphodiest 0.98
PF01875Memo 0.98
PF01464SLT 0.98
PF13185GAF_2 0.98
PF13495Phage_int_SAM_4 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG0515Serine/threonine protein kinaseSignal transduction mechanisms [T] 3.92
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 3.92
COG0767Permease subunit MlaE of the ABC-type intermembrane phospholipid transporter MlaCell wall/membrane/envelope biogenesis [M] 1.96
COG3547TransposaseMobilome: prophages, transposons [X] 1.96
COG0668Small-conductance mechanosensitive channelCell wall/membrane/envelope biogenesis [M] 0.98
COG1355Predicted class III extradiol dioxygenase, MEMO1 familyGeneral function prediction only [R] 0.98
COG1579Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domainGeneral function prediction only [R] 0.98
COG2452Predicted site-specific integrase-resolvaseMobilome: prophages, transposons [X] 0.98
COG3264Small-conductance mechanosensitive channel MscKCell wall/membrane/envelope biogenesis [M] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms52.94 %
UnclassifiedrootN/A47.06 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300004092|Ga0062389_101866109All Organisms → cellular organisms → Bacteria780Open in IMG/M
3300005434|Ga0070709_10009332All Organisms → cellular organisms → Bacteria5407Open in IMG/M
3300005434|Ga0070709_10062986All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2366Open in IMG/M
3300005435|Ga0070714_100134012All Organisms → cellular organisms → Bacteria → Acidobacteria2216Open in IMG/M
3300005436|Ga0070713_100001227All Organisms → cellular organisms → Bacteria16384Open in IMG/M
3300005437|Ga0070710_10039708All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2590Open in IMG/M
3300005526|Ga0073909_10182222Not Available897Open in IMG/M
3300005526|Ga0073909_10247583Not Available791Open in IMG/M
3300005526|Ga0073909_10477742Not Available600Open in IMG/M
3300005526|Ga0073909_10622324Not Available535Open in IMG/M
3300005534|Ga0070735_10196188Not Available1237Open in IMG/M
3300005542|Ga0070732_10906551Not Available538Open in IMG/M
3300006028|Ga0070717_11159535Not Available703Open in IMG/M
3300006057|Ga0075026_100967781Not Available527Open in IMG/M
3300006086|Ga0075019_11158741Not Available503Open in IMG/M
3300006102|Ga0075015_100481253Not Available712Open in IMG/M
3300006173|Ga0070716_100304173All Organisms → cellular organisms → Bacteria → Acidobacteria1110Open in IMG/M
3300006175|Ga0070712_101597182Not Available570Open in IMG/M
3300009137|Ga0066709_101095431Not Available1171Open in IMG/M
3300009518|Ga0116128_1054661All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1250Open in IMG/M
3300009519|Ga0116108_1015385All Organisms → cellular organisms → Bacteria2726Open in IMG/M
3300009640|Ga0116126_1002413All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae10688Open in IMG/M
3300010379|Ga0136449_100014731All Organisms → cellular organisms → Bacteria → Acidobacteria20838Open in IMG/M
3300010379|Ga0136449_102938805Not Available668Open in IMG/M
3300010379|Ga0136449_104144947Not Available539Open in IMG/M
3300010401|Ga0134121_11129878Not Available778Open in IMG/M
3300011120|Ga0150983_12477478Not Available678Open in IMG/M
3300011120|Ga0150983_13577867Not Available805Open in IMG/M
3300011120|Ga0150983_13894982All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium572Open in IMG/M
3300012200|Ga0137382_10591153Not Available792Open in IMG/M
3300012207|Ga0137381_10152002All Organisms → cellular organisms → Bacteria → Acidobacteria1992Open in IMG/M
3300012209|Ga0137379_10804001Not Available845Open in IMG/M
3300012210|Ga0137378_10014095All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6953Open in IMG/M
3300012210|Ga0137378_10282792All Organisms → cellular organisms → Bacteria1543Open in IMG/M
3300012210|Ga0137378_10569686All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1041Open in IMG/M
3300012210|Ga0137378_10633974All Organisms → cellular organisms → Bacteria → Acidobacteria979Open in IMG/M
3300012349|Ga0137387_10786511Not Available688Open in IMG/M
3300012350|Ga0137372_10100915All Organisms → cellular organisms → Bacteria → Acidobacteria2422Open in IMG/M
3300012351|Ga0137386_10181148All Organisms → cellular organisms → Bacteria → Acidobacteria1511Open in IMG/M
3300012357|Ga0137384_10202820All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1659Open in IMG/M
3300012357|Ga0137384_10310482All Organisms → cellular organisms → Bacteria → Acidobacteria1310Open in IMG/M
3300012357|Ga0137384_10580831Not Available916Open in IMG/M
3300012359|Ga0137385_10208868All Organisms → cellular organisms → Bacteria → Acidobacteria1702Open in IMG/M
3300012960|Ga0164301_10198944Not Available1276Open in IMG/M
3300014158|Ga0181521_10271670All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatobacter → Candidatus Sulfotelmatobacter kueseliae884Open in IMG/M
3300014164|Ga0181532_10000064All Organisms → cellular organisms → Bacteria → Acidobacteria102740Open in IMG/M
3300014164|Ga0181532_10017141All Organisms → cellular organisms → Bacteria5367Open in IMG/M
3300014164|Ga0181532_10018314All Organisms → cellular organisms → Bacteria → Acidobacteria5152Open in IMG/M
3300014165|Ga0181523_10004046All Organisms → cellular organisms → Bacteria → Acidobacteria12926Open in IMG/M
3300014165|Ga0181523_10038117All Organisms → cellular organisms → Bacteria3047Open in IMG/M
3300014165|Ga0181523_10377348Not Available792Open in IMG/M
3300015371|Ga0132258_11757301Not Available1563Open in IMG/M
3300016750|Ga0181505_10246206Not Available1534Open in IMG/M
3300017925|Ga0187856_1011397All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatobacter → Candidatus Sulfotelmatobacter kueseliae5065Open in IMG/M
3300017933|Ga0187801_10058608All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1411Open in IMG/M
3300017942|Ga0187808_10288021Not Available739Open in IMG/M
3300017943|Ga0187819_10051207All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis2452Open in IMG/M
3300017955|Ga0187817_10423276All Organisms → cellular organisms → Bacteria → Acidobacteria850Open in IMG/M
3300018008|Ga0187888_1002255All Organisms → cellular organisms → Bacteria → Acidobacteria16799Open in IMG/M
3300018012|Ga0187810_10386727Not Available587Open in IMG/M
3300018019|Ga0187874_10072378All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1545Open in IMG/M
3300018026|Ga0187857_10230442Not Available857Open in IMG/M
3300018057|Ga0187858_10191482Not Available1340Open in IMG/M
3300018057|Ga0187858_10382590Not Available877Open in IMG/M
3300020579|Ga0210407_10001706All Organisms → cellular organisms → Bacteria20003Open in IMG/M
3300020579|Ga0210407_10068372All Organisms → cellular organisms → Bacteria2664Open in IMG/M
3300020580|Ga0210403_10001313All Organisms → cellular organisms → Bacteria23720Open in IMG/M
3300020580|Ga0210403_10968812Not Available667Open in IMG/M
3300020581|Ga0210399_11582310Not Available506Open in IMG/M
3300020582|Ga0210395_10006048All Organisms → cellular organisms → Bacteria9197Open in IMG/M
3300021088|Ga0210404_10025542Not Available2593Open in IMG/M
3300021171|Ga0210405_10432632Not Available1035Open in IMG/M
3300021171|Ga0210405_11155852Not Available576Open in IMG/M
3300021181|Ga0210388_10008401All Organisms → cellular organisms → Bacteria8166Open in IMG/M
3300021181|Ga0210388_10936675Not Available745Open in IMG/M
3300021403|Ga0210397_10240292All Organisms → cellular organisms → Bacteria1310Open in IMG/M
3300021420|Ga0210394_10001774All Organisms → cellular organisms → Bacteria32470Open in IMG/M
3300021420|Ga0210394_10017951All Organisms → cellular organisms → Bacteria → Acidobacteria6640Open in IMG/M
3300021432|Ga0210384_10504132Not Available1089Open in IMG/M
3300021474|Ga0210390_10642404Not Available887Open in IMG/M
3300021474|Ga0210390_10812103Not Available774Open in IMG/M
3300021477|Ga0210398_10023775All Organisms → cellular organisms → Bacteria5196Open in IMG/M
3300022532|Ga0242655_10139257Not Available702Open in IMG/M
3300022721|Ga0242666_1180477All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium531Open in IMG/M
3300025916|Ga0207663_10028959All Organisms → cellular organisms → Bacteria → Acidobacteria3247Open in IMG/M
3300025928|Ga0207700_10094563All Organisms → cellular organisms → Bacteria → Acidobacteria2368Open in IMG/M
3300025929|Ga0207664_10359917All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1289Open in IMG/M
3300027174|Ga0207948_1014371Not Available919Open in IMG/M
3300027590|Ga0209116_1082913Not Available702Open in IMG/M
3300027821|Ga0209811_10028173All Organisms → cellular organisms → Bacteria → Acidobacteria1862Open in IMG/M
3300027842|Ga0209580_10309851All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium786Open in IMG/M
3300027898|Ga0209067_10417179Not Available753Open in IMG/M
3300027905|Ga0209415_10797147Not Available659Open in IMG/M
3300027986|Ga0209168_10137911Not Available1241Open in IMG/M
3300031057|Ga0170834_106259869Not Available928Open in IMG/M
3300031122|Ga0170822_12731404Not Available655Open in IMG/M
3300031128|Ga0170823_12967791Not Available852Open in IMG/M
3300031231|Ga0170824_124122240All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1857Open in IMG/M
3300032160|Ga0311301_10052762All Organisms → cellular organisms → Bacteria → Acidobacteria9216Open in IMG/M
3300032160|Ga0311301_10058138All Organisms → cellular organisms → Bacteria8552Open in IMG/M
3300032515|Ga0348332_13201531All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium562Open in IMG/M
3300032829|Ga0335070_10006802All Organisms → cellular organisms → Bacteria13441Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil19.61%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil13.73%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil8.82%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.82%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland6.86%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog6.86%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil5.88%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment4.90%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil4.90%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds3.92%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil3.92%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland2.94%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil1.96%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil0.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.98%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.98%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.98%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.98%
Plant LitterEnvironmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter0.98%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.98%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006057Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012EnvironmentalOpen in IMG/M
3300006086Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013EnvironmentalOpen in IMG/M
3300006102Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009518Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_16_150EnvironmentalOpen in IMG/M
3300009519Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_150EnvironmentalOpen in IMG/M
3300009640Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_16_40EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300014158Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin02_60_metaGEnvironmentalOpen in IMG/M
3300014164Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin11_30_metaGEnvironmentalOpen in IMG/M
3300014165Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_30_metaGEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300016750Metatranscriptome of peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_30_metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300017925Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_40EnvironmentalOpen in IMG/M
3300017933Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1EnvironmentalOpen in IMG/M
3300017942Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_3EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300017955Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2EnvironmentalOpen in IMG/M
3300018008Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_7_40EnvironmentalOpen in IMG/M
3300018012Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5EnvironmentalOpen in IMG/M
3300018019Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_150EnvironmentalOpen in IMG/M
3300018026Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_100EnvironmentalOpen in IMG/M
3300018057Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_150EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300022532Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022721Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027174Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF040 (SPAdes)EnvironmentalOpen in IMG/M
3300027590Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300027821Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027898Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300027905Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes)EnvironmentalOpen in IMG/M
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031122Oak Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031128Oak Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032515FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0062389_10186610913300004092Bog Forest SoilMSASSSPPPPPHRIAIIMSPEGRFLQCRDCQLTYTFPEGLTFGVVAKQFKAHSCVTPIRRPAWQTEGRFVVIRHEGKVPALATCTRCERKFFTPTTLMRDV
Ga0070709_1000933253300005434Corn, Switchgrass And Miscanthus RhizosphereMSRTGRFLKCRDCQIIFNFPDGAQFGSIAKQFESHLCGSPIGIPGWRTERGFIIMRYEGEVPAMASCAKCQLKFFTPETFARDAVGAEEYLARKFDVHDCPAEIEERNRRRLF*
Ga0070709_1006298643300005434Corn, Switchgrass And Miscanthus RhizosphereMASSSPPPHSLHRICIIMSPVGRFLQCRDCQLSYTFPDGVEYATLARQFESHLCLPPIRGPDWRTDSRFVIVRYEGKIPVMASCARCERKFFTPATFARGAVGAHEYLGQKFDVHICAESDEKQP*
Ga0070714_10013401243300005435Agricultural SoilMPASSSPPSPSLHRIGIVMSPVGRFLQCRDCQLSYTFPDGVEYGTLAKQFETHLCLPSIRSPDWRTDRRFVIVRYEGKIPAMASCARCKRKFFTPATLARDATGAKEYLGQKFDLHECAGIEGETAVGYGAAQVFGLGGET*
Ga0070713_10000122753300005436Corn, Switchgrass And Miscanthus RhizosphereMPASSSPPSPSLHRIGIVMSPVGRFLQCRDCQLSYTFPDGVEYGTLAKQFETHLCLPSIRSPDWRTDRRFVIVRYEGKIPAMASCARCKRKFFTPATLARDATGAKEYLGQKFDLHECAGIEERQP*
Ga0070710_1003970813300005437Corn, Switchgrass And Miscanthus RhizosphereSFTFPDGAQFGPIAKQFGSYRCSSPIRIPAWRTDRHFVIVRYEGRVPAMASCGKCDRKFFTPTTVTRDAIGAEEYLGRKFDVHECEESKRSSDG*
Ga0073909_1018222213300005526Surface SoilMSASSSTPSPHRIANLMSPVGRFLQCRVCQLSFTFPDGAQFGPIAKQFGSYRCSSPIRIPAWRTDRQFVIVRYEGRVLAMASCGKCDRKFFTPTTVTRDAIGAEEYLGRKFDVHECEESKRSSDG*
Ga0073909_1024758323300005526Surface SoilLISPTGRFLECRDCQISFNFPDGAQFDSIVKQFESHLCGSPIGIPGWRTERGFVIIRYEGEVPVMASCAKCQLKFLTPATFARDAIGAEEYLARKFDLHGCPAKIEERSRRRPF*
Ga0073909_1047774213300005526Surface SoilMPVPVAKIREYDGFARGELEHAMASSPPPPHPRHRIGIVMSSVGRFLQCSDCQLSYTFPEGLQYDTLAKQFESHLCLSPIRNPDWCTDSRFVIMRYEGKVPAMAACARCKRKFFTPATFARDAVGAEEYLGHKFDLHVCVEIEERE*
Ga0073909_1062232413300005526Surface SoilMPASSSPPSPSLHRIGIVMSPVGRFLQCRDCQLSYTFPDGVEYGTLAKQFETHLCLPSIRSPDWRTDRRFVIVRYEGKIPAMASCARCKRKFFTPATLARDATGAKEYLGQKFELHECAGIEERQP*
Ga0070735_1019618813300005534Surface SoilMSATSSPPPLHRIAIVISRDGRLLQCIDCQVTFPFPNGIKFGVVSKQFDAHSCVTPTRASSWQIGRRFVILRYEGQVPAMASCAKCARKFFTPTALIRDASGAEMYLGRKFDLHRCDPDPQLVGDT
Ga0070732_1090655113300005542Surface SoilPPHRIGILMSPVGRFLQCRDCQLSYTFPDGAKFGAIAKQFESHLCLSPIRFSAWQTDRRFVIVRYEGKVPAMASCAKCQRKFFTPTTLARDAVGAEEYLGRKFDVHDCPAEIEERHK*
Ga0070717_1115953513300006028Corn, Switchgrass And Miscanthus RhizosphereMNYVLSANSSPPPPPHRIAILTSPEGRFLQCSDCKLSYVFPDGVQFGTIAKQFDAHSCVTPIRKPAWQTDRRLVVLRYEGKVPALASCAKCERKFFTPTTLVRDAVGAEEYLGRKFDVHVCPEEIGARGRRRL*
Ga0075026_10096778123300006057WatershedsMSASSSTPPPHRISILMSPVGRFLQCRDCQLSYTFAGRHTDRRFVIVRYEGKVPAMASCARCQRKFFTPTTLARDAVGAEEYLGRKFDVHD
Ga0075019_1115874113300006086WatershedsFLECSDCQVSFNFPDGAQFDSIAKKFESHLCGSPIGTPGWRTERGFVIMRYEGEVPAMASCAECQLKFFTPATFARDAVGAEEYLARKFDVHVCPAEIEEKNRRKLF*
Ga0075015_10048125313300006102WatershedsSSPPPPHRIGILMSPVGRFLQCRDCQLSYTFPDGAKFGAIAKQFESHLCLSPIRIPGWHTDRRFVIVRYEGKVPAMASCAKCQRKFFTPTTLARDAVGAEEYLGRKFDVHDCPAEIEERHK*
Ga0070716_10030417323300006173Corn, Switchgrass And Miscanthus RhizosphereMSASSSTPSPHRIANLMSPVGRFLQCRVCQLSFTFPDGAQFGPIAKQFGSYRCSSPIRIPAWRTDRHFVIVRYEGRVPAMASCGKCDRKFFTPTTVTRDAIGAEEYLGRKFDVHECEESKRSSDG*
Ga0070712_10159718223300006175Corn, Switchgrass And Miscanthus RhizosphereRIHGELEHAMSASSSTPSPHRIANLMSPVGRFLQCRVCQLSFTFPDGAQFGPIAKQFGSYRCSSPIRIPAWRTDRHFVIVRYEGRVPAMASCGKCDRKFFTPTTVTRDAIGAEEYLGRKFDVHECEESKRSSDG*
Ga0066709_10109543123300009137Grasslands SoilMEFAMPASSSPPSSRHRIAILASPVGRLLQCRDCKLTFVFPDEAYYGRIAKQFEPHLCHPSIRTPGRWRTDSRFVIVRYEGKVPAMASCAKCERKFFTPTTLSRDAIGAEEYLGHKFDLHACAEIAEKHP*
Ga0116128_105466113300009518PeatlandAKFGAIAKQFESHLCLSPIRIPGWHTDRRFVIVRYEGKVPALASCAKCQRKFFTPTTLARDAVGAEEYLGSKFDAHDCPAEIEQRHK*
Ga0116108_101538513300009519PeatlandQIGFKFPDGTQFGAIAKQFESYLCGPPIGIPDWRTERRFVIVRHEGKFPAMASCAKCQLKFFTPATFARDAVGAELYLLDKFDLHECKEGPKK*
Ga0116126_100241393300009640PeatlandMSPAGRFLECADCQIGFKFPDGTQFGAIAKQFESYLCGPPIGIPDWRTERRFVIVRHEGKFPAMASCAKCQLKFFTPATFARDAVGAELYLLDKFDLHECKEGPKK*
Ga0136449_100014731163300010379Peatlands SoilMILCVRGELEYVMSTSSSPPPPPHRIAVIMSPEGRFLQCRDCQLAYTFPDGIKFGDIAKQFGTHSCVTPMHRPAWHNDRRFVILRYEGKVPALASCARCERKFFTPPALIRDASGAEEYLGRKFDVHECDRDPQPEYRAMLGERQ*
Ga0136449_10293880523300010379Peatlands SoilMSPVGRFLQCSDCQLSFTFPHEVKFGDLAKQFELHSCVAPNRIPGGHSDRRFVILRYEGKVPALASCSKCEHKFFTPTTLMRDASGAE
Ga0136449_10414494713300010379Peatlands SoilMFPAGRFLQCTDCKLRFTFPDGVQFGTIAKRFESHLCGLPIRVPGVSIERRFVILKYGGTVPAMAACAKCERKFFTPTTFARDAVGAKE*
Ga0134121_1112987823300010401Terrestrial SoilYDGLARGELGHAMSASSRPPPSLHRIGLLMSPVGRFLQCRDCQLSYTFPDGLEYDTLAKQFETHLCLPSIRSPDWRTDSRFVIVRYEGKIPAMASCARCKRKFFTPATLARDATGAKEYLGQKFDLHECAGIEERQP*
Ga0150983_1247747813300011120Forest SoilAMSGSSSPPLALHRIAILVSPLGRFLQCSDCKLSYVFPDGVQFGTIAKQFESHLCGSPIRIPSWQTDRRFVIVRYAGKVPVMASCAKCERKFFTPTTLARDVVKAEEYLGRKFDVHDCAEPKR*
Ga0150983_1357786713300011120Forest SoilMSASSSPPPHRIAIVISADGRLLQCIDCHLTYPFPNGIKFGVVSKQFDAHSCVTPTRTPAWQTDRRFVILRYEGKVPTMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHLCEPDPQLVGDAV*
Ga0150983_1389498213300011120Forest SoilLEYAMSTSSSSLSSLHHIGILMSPVGRFLQCSKCQLSFTFPDGITFGALAKQFDAHACAIPARRPAWQTDGRFVVLKYEGRIATWGSCERCERKFFTPTALMRDASGAEEYLGNKFDMHECAEPKR*
Ga0137382_1059115323300012200Vadose Zone SoilMSKATSSSLPRHRIGILMFPEGRFLECADCQISFKFPDGAQFGAIAKQFEFHLCGPPIDSLAWRTERRFVIVRHEGKVPAMASCAKCQLKFFTPPTFACGPLGAERYLLDKFDMHECGELPTK*
Ga0137381_1015200233300012207Vadose Zone SoilMPASSSPPSSRHHRIAILVSPVGRLLQCRDCKLTFVFSDEAHYGTIAKQFEPHLCYPPICRPGWRTDSRFVIVRYDGKVPAMASCAKCERKFFTPAIFARDATGAEEYLGHKFELHVCVGIEEKQW*
Ga0137379_1080400113300012209Vadose Zone SoilMALPVERFAMLATSSPSSSRHRIAILASPVGRFLQCRDCKRTFVFPDGVYYGTIAKQFESHSCHPQVRSPGWRTDSRFVIVRYEGKVPAMASCAKCGRKFIAPATLAHDAIGAEEYLGHKFDLHVCAGIEES*
Ga0137378_1001409513300012210Vadose Zone SoilMSPVGRFLQCRYCQLSYTFPDGVQFGTIAKQFESHLCLPRIRSPDWRTDSRFVIMRYEGKVPAMSSCARCERKFFTPATFACNAVAAEEYLGQKFDLHVCVEIEEKHP*
Ga0137378_1028279223300012210Vadose Zone SoilMALPVERFAMLATSSPSSSRHRIAILASPVGRFLQCRDCKRTFVFPDGVYYGTIAKQFESHSCHPQVRSPGWRTDSRFVIVRYEGKVPAMASCAKCGRKFIAPATLAHDAIGAEEYLG
Ga0137378_1056968613300012210Vadose Zone SoilKQFESHSCHPQVRSPGWRTDSRFVIVVYEGKVPAMASCAKCGRKFIAPATLAHDAIGAEEYLGHKFDLHVCAGIEES*
Ga0137378_1063397423300012210Vadose Zone SoilMPACSSPPSSRHHRIATLVSPVGRLLQCRDCKLTFVFSDEAHYGTIAKQFEPHLCYPPICRPGWRTDSRFVIVRYDGKVPAMASCAKCERKFFTPAIFARDATGAEEYLGHKFELHVCVGIEEKQW*
Ga0137387_1078651123300012349Vadose Zone SoilMPASSSPPSSRHRIAILASPVGRLLQCRDCKLTFVFSDEAHYGTIAKQFEPHLCYPPICRPGWRTDSRFVIVRYDGKVPAMASCAKCERKFFTPAIFARDATGAEEYLGHKFELHVCVGIEEKQW*
Ga0137372_1010091533300012350Vadose Zone SoilGLARGELGHTMPASSSPPSSRHHRIAILVSPVGRLLQCRDCKLTFVFSDEAHYGTIAKQFEPHLCYPPICRPGWRTDSRFVIVRYDGKVPAMASCAKCERKFFTPAIFARDATGAEEYLGHKFELHVCVGIEEKQW*
Ga0137386_1018114833300012351Vadose Zone SoilLSSSLRMEILRAATGELSSRRWCGPFSEYDGLARGELGHTMPASSSPPSSRHHRIAILVSPVGRLLQCRDCKLTFVFPDEAHYGTIAKQFEPHLCYPPICRPGWRTDSRFVIVRYDGKVPAMASCAKCERKFFTPAIFARDATGAEEYLGHKFELHVCVGIEEKQW*
Ga0137384_1020282013300012357Vadose Zone SoilMALPVERFAMLATSSPSSSRHRIAILASPVGRFLQCRDCKRTFVFPDGVYYGTIAKQFESHSCHPQVRSPGWRTDSRFVIVRYEGKVPAMASCAKCGRKFIAPATLAHDAIGAEEYLGQKFDLHVCAGIEES*
Ga0137384_1031048233300012357Vadose Zone SoilLRMEILRAATGELSSRRWCGPFSEYDGLARGELGHTMPASSSPPSSRHHRIAILVSPVGRLLQCRDCKLTFVFSDEAHYGTIAKQFEPHLCYPPICRPGWRTDSRFVIVRYDGKVPAMASCAKCERKFFTPAIFARDATGAEEYLGHKFELHVCVGIEEKQW*
Ga0137384_1058083113300012357Vadose Zone SoilSYTFPDGVQFGTIAKQFESHLCLPRIRSPDWRTDSRFVIMRYEGKVPAMSSCARCERKFFTPATFACNAVAAEEYLGQKFDLHVCVEIEEKHP*
Ga0137385_1020886823300012359Vadose Zone SoilMASISPPSARHHRIAILVSPVGRLLQCRDCKLTFVFPDEAHYGTIAKQFEPHLCYPPICRPGWRTDSRFVIVRYDGKVPAMASCAKCEQKFFTPAIFARDATGAEEYLGHKFELHVCVGIEEKQW*
Ga0164301_1019894423300012960SoilSLPRHRIGILMFPEGRFLECAECQISFKFPDGAQFGAIAKQFEFHLCGPPIDSPAWRTDRRFVIVRHEGKVPAMASCAKCQLKFFTPPTFACGPLGAERYLLDKFDLHECWEEPAK*
Ga0181521_1027167023300014158BogMSPAGRFLECADCQIGFKFPDGTQFGAIAKQFESYLCGPPIGIPDWRTERRFVIVRHEGKFPAMASCAKCQLKFFTPATFARDAVGAELYLLD
Ga0181532_1000006463300014164BogLSFTFADGAQFGTVAKLFEPYSCSSPIRIPAWHNDLRFVILRYEGKVPVLASCEKCQRKFFTPPTLARDAVGAEEYLGQKFEVHVCPKEM*
Ga0181532_1001714143300014164BogMDYVMSANSSPPPPPHRIAILMAPEGRFLQCRDCHLTYTFPDGLKFGVVAKQFGAHSCVTPIRKPAGQTDRRFVVLRYEGKVPALASCTKCERKFFTPTTLMRDAIGAEEYLGRKFDVHRCE*
Ga0181532_1001831413300014164BogMSPVGRFLQCRDCQSSFTFPDGAQFGKVAKLFEPWSCSSPTCIPAWHTDRRFVILRYEGKVPVLASCAKCTRKFFTPPTLARDAVGANEYLGQKFDVHVCPK*
Ga0181523_1000404643300014165BogMDYVMSANSSPPPPPHRIAILMSPEGRFLQCRDCHLTYTFPDGLKFGVVAKQFGAHSCVTPIRKPAGQTDRRFVVLRYEGKVPALASCTKCERKFFTPTTLMRDAIGAEEYLGRKFDVHRCE*
Ga0181523_1003811753300014165BogLSFTFADGAQFGTVAKLFEPYSCSSPIRIPAWHNDLRFVILRYEGKVPVLASCEKCQRKFFTPPSLARDAVGAEEYLGQKFEVHVCPKEM*
Ga0181523_1037734813300014165BogMAGNKIYRLRSPAGRTRAYDGLVRGELEDAMSASSSPPSSPHRIGIIMSPVGRFLQCRDCQSSFTFPDGAQFGKVAKLFEPWSCSSPTCIPAWHTDRRFVILRYEGKVPVLASCAKCTRKFFTPPTLARDAVGANEYLGQKFDVHVCPK*
Ga0132258_1175730123300015371Arabidopsis RhizosphereMSPVGRFLQCRDCQLSYTFPDGLEYDTLAKQFETHLCLPSIRSPDWRTDSRFVIVRYEGKIPAMASCARCERKFFTPATFARDAAGAKEYLGQKFDLHECAGIEERQP*
Ga0181505_1024620623300016750PeatlandMDYVMSANSSPPPPPHRIAILMSPEGRFLQCRDCHLTYTFPDGLKFGVVAKQFGAHSCVTPIRKPAGQTDRRFVVLRYEGKVPALASCTKCERKFFTPTTLMRDAIGAEEYLGRKFDVHRCE
Ga0187856_101139713300017925PeatlandMSPAGRFLECADCQIGFKFPDGTQFGAIAKQFESYLCGPPIGIPDWRTERRFVIVRHEGKFPAMASCAKCQLKFFTPATFARDAVGAELYLLDKFDLHECKEGPKK
Ga0187801_1005860823300017933Freshwater SedimentMRSCQRIKAYDGLAHGELEDAVSKATSSPPPRHRIAILMSPAGRFLECRDCRISFSFPDGSHFDAIAKQFEPHLCGSPIGIAGWRTERGFIIVRYEGKVPAIASCAKCQLKFFTPATFARDAVGAEEYLARKFDVH
Ga0187808_1028802113300017942Freshwater SedimentVSKATSSPPPRHRIGILMSPAGRFLECADCQVGFKFPDGAQFGSIAKQFESHFCGSPIGIPGWRTERRFVIVRHEGEVPAMASCAKCQLKFFTPASFARDAVGAELYLLDKFDLHECKEV
Ga0187819_1005120713300017943Freshwater SedimentMSPAGRFLECADCQIGFEFPEGAEFGAIAKQFESHLCGPPIGILGWRTERRLVIVRHEGKVAAMASCAKCQLKFFTPETFARDPVGAELYLLDKFDLHECREEPTK
Ga0187817_1042327623300017955Freshwater SedimentLAHRELEYAVSKATSSPPPRHRIGILMSPAGRFLECADCQVGFKFPDGVQFGAIAKQFESYSCGPPIGIPGWRTERRFVIARHESKVPAMASCAKCQLKFFTPATFARDPVGAELYLLAGCGKMDSAT
Ga0187888_1002255183300018008PeatlandVSKGTSCPPPRHRIGILMSPAGRFLECADCQIGFKFPDGTQFGAIAKQFESYLCGPPIGIPDWRTERRFVIVRHEGKFPAMASCAKCQLKFFTPATFARDAVGAELYLLDKFDLHECKEGPKK
Ga0187810_1038672713300018012Freshwater SedimentVSKATSSPPPRHRIGILMSPAGRFLECRDCQINFKFPDGAQFGSIAKQFESHLCGSPIGIPGWRTERRFVIVRHEGEVPAMASCAKCQLKFFTPASFARDAVGAELYLLDKFDLHECKEV
Ga0187874_1007237823300018019PeatlandVGRFLQCRNCQLSYTFPDGAKFGAIAKQFESHLCLSPIRIPGWHTDRRFVIVRYEGKVPALASCAKCQRKFFTPTTLARDAVGAEEYLGSKFDAHDCPAEIEQRHK
Ga0187857_1023044213300018026PeatlandAMSASSSPSPPHRIAILMSSVGRFLQCRDCQLSFTFPVGAQFGTIAKLFGPYQCNSPIHISGWHTDRRFVIVRYEGKVPAMASCAKCERKFFTPTTLARDAVGAEEYLGSKFDAHDCPAEIEQRHK
Ga0187858_1019148233300018057PeatlandVGFFALSLNAELPRMKAYDGFVRGELEYAMSASSSPSPPHRIAILMSSVGRFLQCRDCQLSFTFPVGAQFGTIAKLFGPYQCNSPIHISGWHTDRRFVIVRYEGKVPAMASCAKCERKFFTPTTLARDAVGAEDYLGQKFDVHQCRREISGE
Ga0187858_1038259013300018057PeatlandVGRFLQCRNCQLSYTFPDGAKFGAIAKQFRSHLCLSPIRIPGWHTDRRFVIVRYEGKVPALASCAKCQRKFFTPTTLARDAVGAEEYLGSKFDAHDCPAEIEQRHK
Ga0210407_10001706133300020579SoilMSASSSPPPHRIAIVISADGRLLQCIDCHLTYPFPNGIKFGVVSKQFDAHSCVTPTRTPAWQTDRRFVILRYEGKVPTMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHLCEPDPQLVGDAV
Ga0210407_1006837233300020579SoilMPHSGASNAFRIGELGMSATSSPPLHRIVIVISRDGRLLQCIDCQLLYPFPNGIKFGVVSKQFDAHACVTPTRTPAWQTGRRFVILRYEGQVPAMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHRCDPDPQLVGDAV
Ga0210403_1000131343300020580SoilMSATSSPPLHRIVIVISRDGRLLQCIDCQLLYPFPNGIKFGVVSKQFDAHACVTPTRTPAWQTGRRFVILRYEGQVPAMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHRCDPDPQLVGDAV
Ga0210403_1096881223300020580SoilMVMSANSSPPPPHRIAILMSPEGRFLQCRDCHLTYTFPDGLKFGVVAEQFGAHSCVTPIRKPAWQTDRRFVVLRYEGKVPALASCARCERKFFTPTTLARDVVKAEEYLGRKFDVHECAGPKR
Ga0210399_1158231013300020581SoilMVMSANSFPPPPHRIAILMSPEGRFLQCRDCHLTYTFPDGLKFGVVAEQFGAHSCVTPIRKPAWQTDRRFVVLRYEGKVPALASCARCERKFFTPTTLARDVVKAEEYL
Ga0210395_1000604863300020582SoilMSTSSSSLSSLHHIGILMSPVGRFLQCSKCQLSFTFPDGITFGALAKQFDAHACAIPARRPAWQTDGHFVVLKYEGRIATWGSCERCERKFFTPTALMRDASGAEEYLGNKFDMHECAEPKR
Ga0210404_1002554243300021088SoilMDYVMSANSSPRPPHRIAILMCPEGRFLQCRDCQLTYTFPDGLKFGVVAKQFDAHTCVTPIRKPGWQTDRRFVVLRYEGKVPALASCAKCECKFFTPNTLLRDASGAVVYLGRKFDAHQCEETE
Ga0210405_1043263223300021171SoilMSASSSPPPPPHRIVIVIGRLLQCIDCQVTYPFPNDIKFGVVSKQFDAHACVAPTRTPAWHTDRRFVMLRYEGKVPTMASCAKCARKFFTPTALMRDASGAEMYPRRKFDLHRCGPDPL
Ga0210405_1115585213300021171SoilMPASSSPLSSLHHIGVLMSPVGRFLQCSDCQLSFTFPDGVRFGDLAKQFELHSCLTPIRRPAWRTDGRFIVLKYEGRVAALASCARCERKFFTPTALARDAVGAEEYLGRKFDVHECGPVPE
Ga0210388_1000840173300021181SoilMPASSSPLSSLHHIGVLMSPVGRFLQCSDCQLSFTFPDGVRFGDLAKQFELHSCLTPIHRPAWRTDGRFIVLKYEGRVAALASCARCERKFFTPTALARDAVGAEEYLGRKFDVHECGPVPE
Ga0210388_1093667523300021181SoilIWHRRGPNICAEGITHMDYVISANSTPPPPHHIAILMSPEGRFLQCRDCHLTYTFPDGLKFGVVAKQFDAHSCVTPIRKPAWQTDRRFVVLRYEGKVPALASCARCERKFFTPTTLARDVVKAEEYLGRKFDVHECAGPKR
Ga0210397_1024029213300021403SoilASSSPPPHRIAIVISADGRLLQCIDCHLTYPFPNGIKFGVVSKQFDAHSCVTPTRTPAWQTDRRFVILRYEGKVPTMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHLCEPDPQLVGDAV
Ga0210394_10001774243300021420SoilMSASRSSPPLPPHRIGIIVSPEGRFLQCRDCQLAYTFPDGIKFDEIAKQFDAHSCVSPIRTPAWQTDRRFVLLRYEGKVPALASCARCERKFFTPTTLMRDARGAEEYLGRKFDVHKCVPHPH
Ga0210394_1001795173300021420SoilMSTSSSPPSPPHRLAVIMSPEGRFLQCRDCQLTRTFPDGIKFGVIAKQFGAHSCVTPMHRPAWHNDRRFVILRYDGKVPALASCARCERKFFTPSALIRDASGAEEYLGSKFDVHECDPDPQREYRAMLSERQEK
Ga0210384_1050413213300021432SoilNAFRIGELGMSATSSPPLHRIVIVISRDGRLLQCIDCQLLYPFPNGIKFGVVSKQFDAHACVTPTRTPAWQTGRRFVILRYEGQVPAMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHRCDPDPQLVGDAV
Ga0210390_1064240413300021474SoilMSTSSSSLSSLHHIGILMSPVGRFLQCSKCQLSFTFPDGITFGALAKQFDAHACAIPARRPAWQTDGHFVVLKYEGRIATWGSCERCERKFFTPTALMRDASGAEEYLG
Ga0210390_1081210313300021474SoilMSASSSPPPHRIAIVISADGRLLQCIDCHLTYPFPNGIKFGVVSKQFDGHSCVTPTRTPAWQTDRRFVILRYEGKVPTMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHLCEPDPQLVGDAV
Ga0210398_1002377543300021477SoilMSTSSSSLSSLHHIGILMSPVGRFLQCSKCQLSFTFPDGITFGALAKQFDAHACAIPARRPAWQTDGRFVVLKYEGRIATWGSCERCERKFFTPTALMRDASGAEEYLGNKFDMHECAEPKR
Ga0242655_1013925713300022532SoilASNAFRIGELGMSATSSPPLHRIVIVISRDGRLLQCIDCQLLYPFPNGIKFGVVSKQFDAHACVTPTRTPAWQTGRRFVILRYEGQVPAMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHRCDPDPQLVGDAV
Ga0242666_118047713300022721SoilLEYAMSTSSSSLSSLHHIGILMSPVGRFLQCSKCQLSFTFPDGITFGALAKQFDAHACAIPARRPAWQTDGRFVVLKYEGRIATWGSCERCERKFFTPTALMRDASGAEEYLGNKFDMHECAEPKR
Ga0207663_1002895913300025916Corn, Switchgrass And Miscanthus RhizosphereMSASSSTPSPHRIANLMSPVGRFLQCRVCQLSFTFPDGAQFGPIAKQFGSYRCSSPIRIPSGRTDRHFVIVRCEGRAPAMASCAKCDRKFFTPTTLANDAIGAKEYLRRKFDVHECRVEEIK
Ga0207700_1009456343300025928Corn, Switchgrass And Miscanthus RhizosphereMASSSPPPHSLHRICIIMSPGGRFLQCRDCQLSYTFPDGVEYATLARQFESHLCLPPIRGPDWRTDSRFVIVRYEGKIPVMASCARCERKFFTPATFARGAVGAHEYLGQKFDVHICAESDEKQP
Ga0207664_1035991743300025929Agricultural SoilMASSSPPPHSLHRICIIMSPVGRFLQCRDCQLSYTFPDGVEYATLARQFESHLCLPPIRGPDWRTDSRFVIVRYEGKIPVMASCARCERKFFTPATFARGAVGAHEYLGQKFDVH
Ga0207948_101437113300027174Forest SoilPPLHRIVIVISRDGRLLQCIDCQLLYPFPNGIKFGVVSKQFDAHACVTPTRTPAWQTGRRFVILRYEGQVPAMASCAKCARKFFTPTALMRDASGAEMYLGRKFDLHRCDPDPQLVGDAV
Ga0209116_108291313300027590Forest SoilMDYVMSANSSPPPPPHRIAILMSPEGRFLQCRDCHLTYTFPDGLKFGVVAKQFYAHSCVTPVRKTAWQADRRFVVLRYEGKVPALASCARCERKFFTPSTFSRDAVGAAEYLGRKFDIHECAEPLGSIPCPPCG
Ga0209811_1002817323300027821Surface SoilMSASSSTPSPHRIANLMSPVGRFLQCRVCQLSFTFPDGAQFGPIAKQFGSYRCSSPIRIPAWRTDRQFVIVRYEGRVLAMASCGKCDRKFFTPTTVTRDAIGAEEYLGRKFDVHECEESKRSSDG
Ga0209580_1030985123300027842Surface SoilPPPHRIGILMSPVGRFLQCRDCQLSYTFPDGAKFGAIAKQFESHLCLSPIRFSAWQTDRRFVIVRYEGKVPAMASCAKCQRKFFTPTTLARDAVGAEEYLGRKFDVHDCPAEIEERHK
Ga0209067_1041717913300027898WatershedsPHRIGILISPVGRFLQCRDCELSYTFPDGAKFGAIAKQFESHLCLSPIRIPGWHTDRRFVIVRYEGKVPAMASCAKCQRKFFTPTPLARDAVGAEEYLGRKFDVHDCPAEIEERHK
Ga0209415_1079714723300027905Peatlands SoilMSASSSPPSSLHHIGIIMSPVGRFLQCSDCQLSFTFPHEVKFGDLAKQFELHSCVAPNRIPGGHSDRRFVILRYEGKVPALASCSKCEHKFFTPTTLMRDASGAEEYLGRKFDVHQCGPDPH
Ga0209168_1013791113300027986Surface SoilMSATSSPPPLHRIAIVISRDGRLLQCIDCQVTFPFPNGIKFGVVSKQFDAHSCVTPTRASSWQIGRRFVILRYEGQVPAMASCAKCARKFFTPTALIRDASGAEMYLGRKFDLHRCDPDPQLVGDTV
Ga0170834_10625986913300031057Forest SoilRIRAYDDLARGELEYAMSASSSPPPPHHIGILMSPVGRFLQCRDCQLTYTFPDGVHFGAIAKQFDAHSCVTPIRKPAWQTDRRFVVLRYEGKVPALASCAKCERKFFTPTTLVRDAVGAEEYLGRKFDVHVCPEEIRARGRRRLQNTAGLDWR
Ga0170822_1273140423300031122Forest SoilRIAILMSPEGRFLQCRDCQLTYTFPDGVHFGAIAKQFDAHSCVTPIRKPAWQTDRRFVVLRYEGKVPALASCAKCERKFFTPTTLVRDAVGSEEYLGRKFDVHVCPEEIRARGRRRLQNTAGLDWR
Ga0170823_1296779123300031128Forest SoilGELEYAMSASSSPPPPHHIGILMSPVGRFLQCRDCQLTYTFPDGVHFGAIAKQFDAHSCVTPIRKPAWQTDRRFVVLRYEGKVPALASCAKCERKFFTPTTLVRDAVGAEEYLGRKFDVHVCPEEIRARGRRRLQNTAGLDWR
Ga0170824_12412224023300031231Forest SoilMDYVMSANSSPPPPPHRIAILMSPEGRFLQCRDCQLTYTFPDGVHFGAIAKQFDAHSCVTPIRKPAWQTDRRFVVLRYEGKVPALASCAKCERKFFTPTTLVRDAVGAEEYLGRKFDVHVCPEEIRARGRRRLQNTAGLDWR
Ga0311301_1005276213300032160Peatlands SoilMSPEGRFLQCRDCQLAYTFPDGIKFGDIAKQFGTHSCVTPMHRPAWHNDRRFVILRYEGKVPALASCARCERKFFTPPALIRDASGAEECP
Ga0311301_10058138123300032160Peatlands SoilMSASNSTPSPHRIAILMSPVGRFLQCRDCQLSFTFPDGAQFGTIAKQFGSYLCSSPIRVSGVLIGRHFVIVRCEGKVPAMASCAKCDRKFFTPTTLANDAIGAEEYLGRKFDVHECEESK
Ga0348332_1320153123300032515Plant LitterMSPEGRFLQCRDCQLTCTFPDGIKFGVIAKQFGAHSCVTPMHRPAWHNDRRFVILRYDGKVPALASCARCERKFFTPSALIRDASGAEEYLGSKFDVHECDPDPQREYRAMLSERQEK
Ga0335070_1000680243300032829SoilMSPAGRFLECADCQTSFKFPDGAEFSEIAKQFESHLCGSAISISGCRTQRRFIIVRHEGEVPAMASCAKCQLKFFTPATFAHDSVGAELYLLDKFDLHECGEEPMK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.