NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F069287


Overview

Basic Information
Family ID F069287
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 80 residues
Representative Sequence MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELANQATAVARLRCIGVESRWENGEIWTETRFETV
Number of Associated Samples 103
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 97.56 %
% of genes near scaffold ends (potentially truncated) 98.39 %
% of genes from short scaffolds (< 2000 bps) 89.52 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.39

Hidden Markov Model
[HMM logo, rendered with Skylign]

Most Common Taxonomy
Group Bacteria (99.194 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (25.000 % of family members)
Environment Ontology (ENVO) Unclassified (28.226 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (53.226 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix 40.40%, β-sheet 21.21%, Coil/Unstructured 38.38%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.39

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
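
The low-confidence rule stated here (pTM < 0.7) and the pLDDT legend bins (0-50, 51-70, 71-90, 91-100) can be written as a small check. The following is a minimal Python sketch only: the names `is_low_confidence` and `plddt_bin` are hypothetical and are not part of NMPFamsDB, and assigning fractional pLDDT values that fall between the integer bins (for example 50.5) to the next bin up is an assumption.

```python
# Cutoff quoted on this page: a model with pTM < 0.7 is treated as low confidence.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when the pTM-score is strictly below the threshold."""
    return ptm < threshold


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value (0-100) to the legend bin shown on the page."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The pTM-score reported for family F069287 is 0.39.
print(is_low_confidence(0.39))
```

With the reported pTM-score of 0.39, the check classifies this family's model as low confidence, which matches the page's own label.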



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF08281 | Sigma70_r4_2 | 0.81
PF04542 | Sigma70_r2 | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 0.81
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 0.81
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.81
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 0.81
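
The frequencies in the neighborhood tables are percentages of the 124 family scaffolds, so 0.81 % corresponds to a single scaffold. A minimal Python sketch of that conversion follows. The helper name `count_from_percent` is hypothetical, and the sketch assumes each reported percentage is a rounded fraction of the 124 scaffolds.

```python
def count_from_percent(percent: float, total: int = 124) -> int:
    """Convert a reported percentage back to the nearest whole item count."""
    return round(percent * total / 100)


# 0.81 % of 124 scaffolds is about 1.004, i.e. one scaffold.
print(count_from_percent(0.81))
```

The same arithmetic applies to other percentages on this page that are fractions of the 124 family members, such as the 25.00 % reported for Vadose Zone Soil.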



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.19 %
Unclassified | root | N/A | 0.81 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002558|JGI25385J37094_10041062 | All Organisms → cellular organisms → Bacteria | 1591
3300002561|JGI25384J37096_10043873 | All Organisms → cellular organisms → Bacteria | 1721
3300002914|JGI25617J43924_10092702 | All Organisms → cellular organisms → Bacteria | 1086
3300002914|JGI25617J43924_10196092 | All Organisms → cellular organisms → Bacteria | 671
3300003350|JGI26347J50199_1016722 | All Organisms → cellular organisms → Bacteria | 775
3300004080|Ga0062385_11288881 | All Organisms → cellular organisms → Bacteria | 503
3300004091|Ga0062387_100055417 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1925
3300004091|Ga0062387_100745096 | All Organisms → cellular organisms → Bacteria | 723
3300004092|Ga0062389_104667739 | All Organisms → cellular organisms → Bacteria | 516
3300005172|Ga0066683_10404177 | All Organisms → cellular organisms → Bacteria | 844
3300005176|Ga0066679_10671994 | All Organisms → cellular organisms → Bacteria | 673
3300005186|Ga0066676_10569070 | All Organisms → cellular organisms → Bacteria | 768
3300005406|Ga0070703_10016950 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2096
3300005439|Ga0070711_101210772 | All Organisms → cellular organisms → Bacteria | 653
3300005451|Ga0066681_10807669 | All Organisms → cellular organisms → Bacteria | 566
3300005471|Ga0070698_100883143 | All Organisms → cellular organisms → Bacteria | 839
3300005541|Ga0070733_10402761 | All Organisms → cellular organisms → Bacteria | 911
3300005542|Ga0070732_10673851 | All Organisms → cellular organisms → Bacteria | 629
3300005557|Ga0066704_10245794 | All Organisms → cellular organisms → Bacteria | 1214
3300005557|Ga0066704_10317083 | All Organisms → cellular organisms → Bacteria | 1051
3300005598|Ga0066706_10632502 | All Organisms → cellular organisms → Bacteria | 849
3300005610|Ga0070763_10729274 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 582
3300006172|Ga0075018_10593562 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 588
3300006176|Ga0070765_100134407 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2187
3300006804|Ga0079221_10184188 | All Organisms → cellular organisms → Bacteria | 1128
3300006954|Ga0079219_11845593 | All Organisms → cellular organisms → Bacteria | 568
3300007258|Ga0099793_10575302 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 563
3300007265|Ga0099794_10556432 | All Organisms → cellular organisms → Bacteria | 606
3300009038|Ga0099829_10172283 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1739
3300009038|Ga0099829_11224752 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 621
3300009088|Ga0099830_10768017 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 795
3300009088|Ga0099830_10952889 | All Organisms → cellular organisms → Bacteria | 710
3300009088|Ga0099830_11754023 | All Organisms → cellular organisms → Bacteria | 518
3300009090|Ga0099827_10008726 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6396
3300009698|Ga0116216_10049722 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2589
3300010301|Ga0134070_10117914 | All Organisms → cellular organisms → Bacteria | 935
3300010359|Ga0126376_11805982 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 649
3300010396|Ga0134126_10119198 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3226
3300010398|Ga0126383_11690132 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 722
3300011269|Ga0137392_10183517 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1704
3300011269|Ga0137392_10456087 | All Organisms → cellular organisms → Bacteria | 1062
3300011269|Ga0137392_10664630 | All Organisms → cellular organisms → Bacteria | 863
3300012201|Ga0137365_10906174 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 643
3300012203|Ga0137399_10432843 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1099
3300012205|Ga0137362_10203953 | All Organisms → cellular organisms → Bacteria | 1702
3300012206|Ga0137380_10381483 | All Organisms → cellular organisms → Bacteria | 1254
3300012209|Ga0137379_11357191 | All Organisms → cellular organisms → Bacteria | 615
3300012349|Ga0137387_10449811 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 934
3300012362|Ga0137361_11420755 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 617
3300012363|Ga0137390_10195696 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2009
3300012363|Ga0137390_10647130 | All Organisms → cellular organisms → Bacteria | 1022
3300012363|Ga0137390_11369515 | All Organisms → cellular organisms → Bacteria | 652
3300012923|Ga0137359_10923415 | All Organisms → cellular organisms → Bacteria | 752
3300012924|Ga0137413_11274063 | All Organisms → cellular organisms → Bacteria | 589
3300012944|Ga0137410_10546840 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 951
3300012972|Ga0134077_10236123 | All Organisms → cellular organisms → Bacteria | 753
3300014154|Ga0134075_10395432 | All Organisms → cellular organisms → Bacteria | 610
3300015052|Ga0137411_1050660 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1124
3300017822|Ga0187802_10044549 | All Organisms → cellular organisms → Bacteria | 1611
3300018086|Ga0187769_10226158 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1388
3300018431|Ga0066655_10167242 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1325
3300018431|Ga0066655_10363171 | All Organisms → cellular organisms → Bacteria | 952
3300018431|Ga0066655_11104210 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 555
3300018468|Ga0066662_11340753 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 736
3300019789|Ga0137408_1049816 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2607
3300020170|Ga0179594_10090134 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1091
3300020199|Ga0179592_10443887 | All Organisms → cellular organisms → Bacteria | 561
3300020579|Ga0210407_10654678 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 816
3300020579|Ga0210407_10857760 | All Organisms → cellular organisms → Bacteria | 698
3300020580|Ga0210403_10342209 | All Organisms → cellular organisms → Bacteria | 1224
3300020580|Ga0210403_10975115 | All Organisms → cellular organisms → Bacteria | 664
3300020580|Ga0210403_11142281 | All Organisms → cellular organisms → Bacteria | 603
3300020581|Ga0210399_10551392 | All Organisms → cellular organisms → Bacteria | 956
3300020581|Ga0210399_10579525 | All Organisms → cellular organisms → Bacteria | 929
3300020581|Ga0210399_11593921 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 504
3300020583|Ga0210401_10167217 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2052
3300020583|Ga0210401_10207867 | All Organisms → cellular organisms → Bacteria | 1815
3300021168|Ga0210406_10446463 | All Organisms → cellular organisms → Bacteria | 1030
3300021170|Ga0210400_10645353 | All Organisms → cellular organisms → Bacteria | 872
3300021171|Ga0210405_10152409 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1830
3300021178|Ga0210408_10131971 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1973
3300021403|Ga0210397_10691801 | All Organisms → cellular organisms → Bacteria | 783
3300021404|Ga0210389_11059724 | All Organisms → cellular organisms → Bacteria | 627
3300021405|Ga0210387_11789696 | All Organisms → cellular organisms → Bacteria | 518
3300021433|Ga0210391_10391972 | All Organisms → cellular organisms → Bacteria | 1090
3300021476|Ga0187846_10099524 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1252
3300021477|Ga0210398_10137963 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1983
3300021477|Ga0210398_11427142 | All Organisms → cellular organisms → Bacteria | 540
3300021479|Ga0210410_10531030 | All Organisms → cellular organisms → Bacteria | 1049
3300021861|Ga0213853_11411351 | All Organisms → cellular organisms → Bacteria | 677
3300024290|Ga0247667_1057622 | All Organisms → cellular organisms → Bacteria | 720
3300025885|Ga0207653_10046718 | All Organisms → cellular organisms → Bacteria | 1432
3300025906|Ga0207699_10045214 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2569
3300025922|Ga0207646_10783635 | All Organisms → cellular organisms → Bacteria | 850
3300026304|Ga0209240_1147249 | All Organisms → cellular organisms → Bacteria | 743
3300026356|Ga0257150_1075300 | All Organisms → cellular organisms → Bacteria | 513
3300026361|Ga0257176_1051671 | All Organisms → cellular organisms → Bacteria | 649
3300026514|Ga0257168_1131842 | All Organisms → cellular organisms → Bacteria | 557
3300026542|Ga0209805_1455281 | All Organisms → cellular organisms → Bacteria | 500
3300027456|Ga0207482_102716 | All Organisms → cellular organisms → Bacteria | 705
3300027651|Ga0209217_1218503 | All Organisms → cellular organisms → Bacteria | 507
3300027655|Ga0209388_1232675 | All Organisms → cellular organisms → Bacteria | 504
3300027725|Ga0209178_1069489 | All Organisms → cellular organisms → Bacteria | 1147
3300027783|Ga0209448_10180687 | All Organisms → cellular organisms → Bacteria | 702
3300027842|Ga0209580_10417080 | All Organisms → cellular organisms → Bacteria | 669
3300027862|Ga0209701_10072859 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2172
3300027882|Ga0209590_11011328 | All Organisms → cellular organisms → Bacteria | 518
3300027894|Ga0209068_10142404 | All Organisms → cellular organisms → Bacteria | 1292
3300027915|Ga0209069_10127686 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1247
3300030764|Ga0265720_1010965 | All Organisms → cellular organisms → Bacteria | 627
3300030878|Ga0265770_1005128 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1798
3300031718|Ga0307474_10297395 | All Organisms → cellular organisms → Bacteria | 1243
3300031718|Ga0307474_11020874 | All Organisms → cellular organisms → Bacteria | 655
3300031720|Ga0307469_10508752 | All Organisms → cellular organisms → Bacteria | 1057
3300031740|Ga0307468_101614253 | All Organisms → cellular organisms → Bacteria | 606
3300031754|Ga0307475_10559230 | All Organisms → cellular organisms → Bacteria | 917
3300031820|Ga0307473_10666512 | All Organisms → cellular organisms → Bacteria | 726
3300031823|Ga0307478_11509899 | All Organisms → cellular organisms → Bacteria | 556
3300031962|Ga0307479_10260538 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1713
3300031962|Ga0307479_10656509 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1029
3300032180|Ga0307471_100152946 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2223
3300032783|Ga0335079_12148633 | All Organisms → cellular organisms → Bacteria | 534
3300032805|Ga0335078_10186102 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2900
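
Each scaffold identifier in the table above appears to join a numeric taxon OID (the same values listed under Associated Samples) to a scaffold name with a `|` character. The Python sketch below splits an identifier on that basis so scaffolds can be matched to their samples. The names `ScaffoldId` and `split_scaffold_id` are hypothetical, and the two-part layout is an assumption inferred from the rows on this page rather than a documented format.

```python
from typing import NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str       # assumed to match the Taxon OID of the source sample
    scaffold_name: str   # scaffold name within that sample


def split_scaffold_id(identifier: str) -> ScaffoldId:
    """Split 'TAXON_OID|SCAFFOLD_NAME' at the first '|' character."""
    taxon_oid, separator, scaffold_name = identifier.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldId(taxon_oid, scaffold_name)


parsed = split_scaffold_id("3300002558|JGI25385J37094_10041062")
print(parsed.taxon_oid, parsed.scaffold_name)
```

Grouping the parsed `taxon_oid` values would show which samples contribute more than one scaffold to the family, for example 3300009088, which appears three times in the table.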




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 25.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 20.16%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.06%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 7.26%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 6.45%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 4.84%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.84%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.23%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.42%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.42%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.61%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.61%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.61%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.61%
Watersheds | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds | 0.81%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.81%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.81%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.81%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.81%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.81%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.81%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300003350 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM3 | Environmental
3300004080 | Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300004091 | Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300006172 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009698 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_3_AS metaG | Environmental
3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300015052 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction) | Environmental
3300017822 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2 | Environmental
3300018086 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_10_MG | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021476 | Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2) | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021861 | Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - ABR_2016 (Metagenome Metatranscriptome) | Environmental
3300024290 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08 | Environmental
3300025885 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026356 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-A | Environmental
3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300027456 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-ROWE17-E (SPAdes) | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental
3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental
3300027783 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM2 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental
3300030764 | Metatranscriptome of plant litter microbial communities from Maridalen valley, Oslo, Norway - NLI1 (Metagenome Metatranscriptome) | Environmental
3300030878 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE1 (Metagenome Metatranscriptome) | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032783 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3 | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_100410621 | 3300002558 | Grasslands Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVDSRWESGEIWTETR
JGI25384J37096_100438731 | 3300002561 | Grasslands Soil | MSFADRRRFLWILFLAGLVLIALVAQATTFSRLRFEELARQASGVARLRCLSAESHWENGEIWTETRFAVEETEKGLLGALTMVR
JGI25617J43924_100927021 | 3300002914 | Grasslands Soil | MSFVQRRRFLWILFLGGLVLLAVAANATTLSRLRFEGLVHQATAVARLRCIGVESRWENGEIWTETRFETVEVNKGLLP
JGI25617J43924_101960921 | 3300002914 | Grasslands Soil | MSFVQRRRFLWILFLTGLALLAVAASATTLSRLRFEELVNRATAVARLRCIGAESFLENGEIWTETLFETVELNKGLLPGVIRVRMIGGRVGN
JGI26347J50199_10167222 | 3300003350 | Bog Forest Soil | MSYVERRRFLWILFLGGLALLVLAVAASATTLSRLKFEDLALESTAVARLRCLGATSQWEQGEIWTETRFEVVQREKGTLPGIVTVRLLGGQVGHL
Ga0062385_112888812 | 3300004080 | Bog Forest Soil | MSYVQRRRFLWILFLGGLALLAVAANATTLSRLKLEDLAQESTAVARLRCLGATSQWEQGEIWTETRFEVLQRE
Ga0062387_1000554171 | 3300004091 | Bog Forest Soil | MSYIERRKFLWILFLAAILLTAIVANATTLARMKFDELTQQATAVARLRCLGAESRWEQGEIWT
Ga0062387_1007450961 | 3300004091 | Bog Forest Soil | MNYVQRRRFLWILFLGGLALLAVAASATTLSRLKLEDLAQGSTAVARLRCLGAMSQWEQGEIWTETRFEVLQREKGALQGIITVRL
Ga0062389_1046677391 | 3300004092 | Bog Forest Soil | MSYVERRRFLWILFLGGIALLAVAAVASATTLSRLKLEDMAQESTAVARMRCLGATSQWEQGEIWTETRFEVLEREKGALPGVVTVRLLGGSVGH
Ga0066683_104041772 | 3300005172 | Soil | MSFADRRRFLWILFLAGLVLIALVAQATTFSRLRFEELARQASGVARLRCISAESHWENGEIWTATRFEVEETEKGLLGALTTVRLP
Ga0066679_106719941 | 3300005176 | Soil | MSFIQRRRFLWVLFLAGLALLVVAANATTLSRLRFEELANQATAVARLRCLGAESRTQGGEIWT
Ga0066676_105690701 | 3300005186 | Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVDSRWESGEIWTETRFETVEVNK
Ga0070703_100169501 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MSYVQRRRFLWILFLGGLALLAIAASATTLSPLKLEDLAQESTAVARLRCLGTTSQWEQGEIWTETRFEVVQRDKG
Ga0070711_1012107722 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MSYVQRRRFLWILFLGGLALLAIAASATTLSPLKLEDLAQESTAVARLRCLGTTSQWEQGEIWTETRFEVVHQEKGALPGVVTVRLLGGNVGHLHS
Ga0066681_108076692 | 3300005451 | Soil | MSYVERRRFLWILFLMALTLLAIAANATTLARMRIEELAQQATAVARLRCLDTKSFWHNGEIWTDTQFEVIEQAKGALQATVIVR
Ga0070698_1008831433 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MSFVQRRRFLWILFLAALALLAVVASSTTLSRLRFEELAQQATAVARLRCLSSEARWEHGEIWTDTRFEVV
Ga0070733_104027611 | 3300005541 | Surface Soil | MSYVQRRRFLWILFLCALALLAVAASATTLSRLKLEDLAQESTAVARMRCLGASSQWDKGEIWTETRFEVLQTEKGA
Ga0070732_106738511 | 3300005542 | Surface Soil | MSYKQRRRFLWILFLVGLALIAITASATTLSRLKLEDLALESTAVARLRCLDATSFWNQGEIWTETRFA
Ga0066704_102457942 | 3300005557 | Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVIQATAVARLRCIGVDSRWESGEIWTETRFETVEVNKGLLPGVV
Ga0066704_103170832 | 3300005557 | Soil | MSFVQRLRFLWILFLAGLALLAIAANATTLSRLRFEELANQATAIARLRCIGAESHWQGGEIWTETRFEVVELNKGLLPGVISI
Ga0066706_106325021 | 3300005598 | Soil | MSFVQRRRFLWILFLAGLALLAIAANATTLSRVRFEELANQATAIARLRCIGVESHWQGGEIWTETRFEVVELNKGMLPGVISIRMLGGSIGSL
Ga0070763_107292741 | 3300005610 | Soil | MSFEQRRRFLWILFLAGLVLLAVMANATTLARMRFEDLAKQATAVARVRCLGTESRWENGEIWTETRFDVVEQNKGQL
Ga0075017_1002784371 | 3300006059 | Watersheds | MSFVQRRRFLWILFLAGLALLAVVANATTLSRLRFEELVNQATAVARLRCIGVESRWQEGEIWTETRFETVEVNKGLLPGVVKVRMLGGIVGHLHSRV
Ga0075018_105935621 | 3300006172 | Watersheds | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVSQATAVARLRCISVESRWQEGEIWTETRFETVEVDKGLLPGLVSVRMLGGSI
Ga0070765_1001344071 | 3300006176 | Soil | MNYVQRRRFLWILFLCGLALLAVAASATTLSRLKLEDLAQESTAVARLRCLGATSQWEQGEIWTETRFEVLQREKGA
Ga0079221_101841882 | 3300006804 | Agricultural Soil | MSYVQRRQFLWILFLVGLALLAVVASATTMARLRLPDLTEQSTAIARLRCVGTKSLWDQGEIWTETKFEVV
Ga0079219_118455931 | 3300006954 | Agricultural Soil | MSYVQRRQFLWILFLVGLALLAVVASATTMARLRLPDLTEQSTAIARLRCVGTKSLWDQGEIWTETKFEVVQREKGVLSGIITVRMLGGDVGHLHSRVDEVPHFRRG
Ga0099793_105753021 | 3300007258 | Vadose Zone Soil | MSFMQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVESFLENGEIWTETRFEIVELNKGLLPAVVSVRMLGGRVGN
Ga0099794_105564322 | 3300007265 | Vadose Zone Soil | MSFVQRRRFLWILFLTGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGAESRWENREIWTE
Ga0099829_101722832 | 3300009038 | Vadose Zone Soil | MSYVERRRFLWILLLAALALLAVVANATTLARMRFEELARQATAVARVRCMAATSFWENGEIWTDTNFTVVVQAK
Ga0099829_112247522 | 3300009038 | Vadose Zone Soil | MSFVQRRRFLWILFLAALALLAVVASSTTLSRLRFEELAQHATSVARLRCISAEARWENGEIWTDT
Ga0099830_107680172 | 3300009088 | Vadose Zone Soil | MSYVERRRFLWILLLAALALLAVVANATTLARMRFEELARQATAVARVRCLAATSFWENGEIWTDTNFAV
Ga0099830_109528891 | 3300009088 | Vadose Zone Soil | MSFVQRRRFLWILFLAGLVLLTVAANATTLSRLRFEELANQATAVARLRCIGVESRWENGEIWTETLFEIVEVNKGLLPGLVSVRM
Ga0099830_117540232 | 3300009088 | Vadose Zone Soil | MSFVQRRRFLWILFLAGLALLAAVANATTLSRLRFEDLANQATAVARLRCLGVETRWENGEIWTETRFEIVEVNKGLLPGVVSVHMLGGSIG
Ga0099827_100087269 | 3300009090 | Vadose Zone Soil | MSFVQRRRFLWILFLTGLALLAIAATATTLSRLRFEELANQATAVARVRCIGVESRWQGAEIWTETQFEIVEVNKGLLPGVIS
Ga0116216_100497221 | 3300009698 | Peatlands Soil | MSYVQRRRFLWILFLAGLALLAGVANATTLARMRFEDLVQQATAVARVRCLGAESRWQNGEIWTETHFEVVEQNKGS
Ga0134070_101179141 | 3300010301 | Grasslands Soil | MSFADRRRFLRILFLAGLALIALAARATTFSRLRFEELARQASGVARLRCISAESHWENG
Ga0126376_118059822 | 3300010359 | Tropical Forest Soil | MSFVQRRRFLWALLLAALALLAIVANATTLARMRFEDLARQATAVARLRCLGASSFWKNGEIWTD
Ga0134126_101191983 | 3300010396 | Terrestrial Soil | MSYVQRRRFLWILFLGGLALLAIAASATTLSPLKLEDLAQESTAVARLRCLGTTSQWEQGEIWTET
Ga0126383_116901321 | 3300010398 | Tropical Forest Soil | MSYVERRRFLWILLLATLTLLAIAANATTLARMRFEELAQQATAVARLRCLRTKSLWLNGEIWTDTDFEVIEQAKGALPA
Ga0137392_101835171 | 3300011269 | Vadose Zone Soil | MSFVQRRRFLWILFLGGLVLLAVAANATTLSRLRFEELVNQATAVARLRCIGVESRWENGEIWTETRFETVEVNKGLLPGMVSVRMLG
Ga0137392_104560871 | 3300011269 | Vadose Zone Soil | LWILFLTGLALLAIAATATTLSRLRFEELAHQATAVARVRCIGVESHWQGGEIWTQTH
Ga0137392_106646301 | 3300011269 | Vadose Zone Soil | MSFVQRRRFLWILFLAALALLAVVASSTTLSRLRFEELAQQATAVARLRCLSAEARWENGEIWTDT
Ga0137365_109061742 | 3300012201 | Vadose Zone Soil | MSYVERRRFLWILFFVALTLLAIAANATTLVRMRFEELAQQATAVARLRCLSTRSFWHNGEIWTDTQFEVVELAKGALPAT
Ga0137399_104328432 | 3300012203 | Vadose Zone Soil | MSFVQRRRFLWILFLAALALLAVVASSTTLSRLRFEELAQQATAVARLRCLSAETRWENGEIWTDTRFE
Ga0137362_102039532 | 3300012205 | Vadose Zone Soil | MSYVQRRRFLWILFLAGLALLAIAANATTLARLRFEDLALESTAVARLRCLGAESRWEQGEIWTETRFEV
Ga0137380_103814831 | 3300012206 | Vadose Zone Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVIQATAMARLRCIGVDSRWESGEIWTETRFETVEVNKGLLPGVVTVRMLGGS
Ga0137379_113571911 | 3300012209 | Vadose Zone Soil | MSFADRRRFLWILFLAGLVLIALAARATTFSRLRFEELARQASGVARLRCISAESHWENGEIWTETRFAVEETEKGLLGALTRV
Ga0137387_104498112 | 3300012349 | Vadose Zone Soil | MSYVERRRFLWILLLAALTLLAVVANATTLARMRFEELARQATAVARLRCLGAKSFWENGEIWTDTSFEVVE
Ga0137361_114207551 | 3300012362 | Vadose Zone Soil | MSFVQRRRFLWILFLAALTLLAVVASATTLSRLRFEELAQQATDVARLRCLSAEARWENGEIWTDTRFEVVEQNKGLLPGLVTIRT
Ga0137390_101956963 | 3300012363 | Vadose Zone Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELANQATAVARLRCIGVESRWENGEIWTETRFETV
Ga0137390_106471301 | 3300012363 | Vadose Zone Soil | MSFVQRRRFLWILFLTGLALLAVAASATTLSRLRFEELVNRATAVARLRCIGAESFLENGEIWTETLFETVELNK
Ga0137390_113695152 | 3300012363 | Vadose Zone Soil | LWILFLTGLALLAIAANATTLSRLRFEELANQATAVARVRCIGVESHWQGGEIWT
Ga0137359_109234151 | 3300012923 | Vadose Zone Soil | MSFVQRRRFLWILFLVGLGLLAVAANSTTLSRLRFEELVNQATAVARLRCIGVQSLWQDGEI
Ga0137413_112740632 | 3300012924 | Vadose Zone Soil | MSFVQRRRFLWILFLAGLALLAAANATTLSRLRFEELVNQATAVARLRCIGVETFLENGEIWTETRF
Ga0137410_105468402 | 3300012944 | Vadose Zone Soil | MSFVQRRRFLWILFLAALALLAVAASATTLTRLRFEELAKQATAVARLRCLGAEARWENGEIWTDTRFEVVEENKG
Ga0134077_102361232 | 3300012972 | Grasslands Soil | MSFADRRRFLWILFLAGLVLIALAAQATTFSRLRFEELARQASGVARLRCISAESHWENGEIWTATRFEVEETEKGLL
Ga0134075_103954322 | 3300014154 | Grasslands Soil | MSFADRRRFLWILFLAGLVLIALVAQATTFSRLRFEELARQASGVARLRCLSAESHWENGEIWTETRFAVEETEKGLLGALTMVRLPGGRVGHIEAHVDG
Ga0137411_10506601 | 3300015052 | Vadose Zone Soil | MSYVQRRRFLWILFLAGLALLAVAASATTLTRLRFEELAKQATAVARLRCLGAEARWENGEIWTDTRVRSRRREQGPAPG
Ga0187802_100445492 | 3300017822 | Freshwater Sediment | MSYAQRRRFLWILFLMGLALLAAVVNATSLARMSFQDLARQATAVARVRCAGVQSKWENGEIWTETRFEVLDQSKGTLPAMVTVRMIGGIV
Ga0187769_102261582 | 3300018086 | Tropical Peatland | MSYVERRRFLWALFLLGLALLAVVANATTLARMRFEELARQATAVARLRCLGTQSFWEVGEIWTETRF
Ga0066655_101672422 | 3300018431 | Grasslands Soil | MSYVERRRFLWILFLMALTLLAIAANATTLARMRFEELAQQATAVARLRCLDTKGFWHNGEIWTDTQFEV
Ga0066655_103631712 | 3300018431 | Grasslands Soil | MSFADRRRFLWILFLAGLVLIALVAQATTFSRLRFEELARQASGVARLRCLSAESNWENGEIWTETRFAVEETEKGLLGALTRVRLPGGRVGHIEAHV
Ga0066655_111042102 | 3300018431 | Grasslands Soil | MSFVQRRRFLWILFLAALALLAVVARSTTLSRLRFEELALQATAVARLRWLGVDGRWRDAEIWTVIR
Ga0066662_113407532 | 3300018468 | Grasslands Soil | MSFAQRRRFLWILFLVGLALLAVAANATTLARMRFDELARQATAVARLRCLGAESFLERGEIWTETQF
Ga0137408_10498162 | 3300019789 | Vadose Zone Soil | MSFVQRRRFLWILFLAALALLAVVASSTTLSRLRFEELAQQATAVARLRCLSAETRWENGEIWTDT
Ga0179594_100901342 | 3300020170 | Vadose Zone Soil | MSFVQRRRFLWILFLAALALLAVVAGATTLSRLRFEELAQQATAVARLRCLGAEARWENGEIWTDTRFEVVEQNKGLLPGLVTIRT
Ga0179592_104438872 | 3300020199 | Vadose Zone Soil | MSFVQRRRFLWILFLAGLALLAAANATTLSRLRFEELVNQATAVARLRCIGVETFLENGEIWTETRFETVELNKGLLPGVVSVRM
Ga0210407_106546781 | 3300020579 | Soil | MSFVQRRRFLWILFLAGLTLIAVVASATTLARLRFEELAGQATAVARVRCLGAQSRWDGGEIWTETLFEVVASHKGLLPGLV
Ga0210407_108577602 | 3300020579 | Soil | MSYIERRKFLWILFLAGLALTAIVANATTLARIGFDELTQQATAVARLRCLGAESRWENGEIW
Ga0210403_103422091 | 3300020580 | Soil | MSYIQRRRFLWILFLGGLALLAVAASATTLSRLRLEDLAQESTAVARMRCLGATSQWEQGEIWTETKFEVLEREKGALPGIVTIRLLGGNVGHLHSHVDEVPAFRTGEEVYLFL
Ga0210403_109751152 | 3300020580 | Soil | MSFVERRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVESRWENGEIWTETRFEMVELNKG
Ga0210403_111422812 | 3300020580 | Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVESRWQDGEIWTETRFETVEANKGMLSGVVS
Ga0210399_105513921 | 3300020581 | Soil | MSFVQRRRFLWILFLAGLALLAIAANATTLSRLRFEELVKQATAVARLRCLGVESRWQDGEIWTETRFEVVELNKGLLPGVVSVRMLGGRVGSLHSR
Ga0210399_105795252 | 3300020581 | Soil | MSFVQRRRFLWILFLMGLALIAVVASATTLAGLSFEELAKQAKAVARLRCIGAENQWENGEIWTETRFEVVEQSKGRLPGLVTVRSIRNLP
Ga0210399_115939212 | 3300020581 | Soil | MSFVQRRRFLWILFLAGLGLIAVVASATTLARLRFEDLAGQATAVARLRCLGAQSRWEGGEIWTETLFEVVASHKG
Ga0210401_101672171 | 3300020583 | Soil | MSYIERRKFLWILFLAGLALTAIVANATTLARMKFDELTQQATAVARLRCLGAESRWENGEIWTETRFEVLERNKGALPGIVTVRMMG
Ga0210401_102078671 | 3300020583 | Soil | MSFVQRRRFLWILFLMGLALIAVVASATTLAGLSFEELAKQAKAVARLRCIGAENQWENGEIWTETRFEVVEQSKGRLPGLVTVRSIRNLPIAIFQLKLRKALEERIP
Ga0210406_104464631 | 3300021168 | Soil | MNYAQRRRFLWILFLGGLALLAVAASATTLSRLKLEDLAQESTAVARLRCLSATSQWEQGEIWTETRFEVLQREKGVL
Ga0210400_106453532 | 3300021170 | Soil | MKMNYVQRRRFLWILFLGGLALLAVAANATTLSRLKLEELAQESTAVARLRCLGATSQWEQGEIWTETRFEVLQREKGALPGIVTVRLMGGHVGH
Ga0210405_101524091 | 3300021171 | Soil | MSYVQRRRFLWILFLCALALLAVAASATTLSRLKLEDLAQESTAIARMRCLGVSSQWDKAEIWTE
Ga0210408_101319712 | 3300021178 | Soil | MSFVQRRRFLWILFLTALALLAIAANATTLSRLRFEELVKQATAVARLRCLSVESRWQDGEIWTETRFETVEVNKGLLPGVVSV
Ga0210397_106918011 | 3300021403 | Soil | MSYIQRRRFLWILFLGGLALLAVAASATTLSRLRLEDLAQESTAVARMRCLGATSQWEQGEIWTETKFEVLER
Ga0210389_110597242 | 3300021404 | Soil | MSYIQRRRFLWILFLGGLALLAVAASATTLSRLRLEDLAQESTAVARMRCLGATSQWEQGEIWTETKFEVLEREKGALPGIVTIRLLGGNVGHLHSHVDEVPAFRT
Ga0210387_117896962 | 3300021405 | Soil | MSYIERRKFLWILFLAGLALTAIAANATTLARIGFDELTQQATAVARLRCLGAESRWEKGEIWTETRFEVL
Ga0210391_103919722 | 3300021433 | Soil | MSYVQRRRFLWILFLCALALLAVAASATTLSRLKLEDLAQESTAVARLRCLGATSQWDKGEIWTETRFEVLQTEKGALPGIV
Ga0187846_100995242 | 3300021476 | Biofilm | MSYVERRRFLWMLFLAALALLAAVANATTLKRLRFEELAQQATAVARVRCLGAKSLWENGEIWTDTSFAVLEHVKGSL
Ga0210398_101379632 | 3300021477 | Soil | MNYVQRRRFLWILFLGGLALLAVAANATTLSRLTFEDLAQESTAVARLRCLGATSQWEQGEIWTETRFEVLQREKGALPGIVTVRLMGGHVGHLHS
Ga0210398_114271421 | 3300021477 | Soil | MSYVQRRRFLWILFLCALALLAVAASATTLSRLKLEDLAQESTAVARLRCLGATSQWDKGEIWTETRFEVL
Ga0210410_105310302 | 3300021479 | Soil | MSFVQRRRFLWILFLAGLALLAIAANATTLSRLRFEELVKQATAVARLRCLSVESRWQDGEIWTETRF
Ga0213853_114113512 | 3300021861 | Watersheds | MSFVQRRKFLWILFLAGLALLAVAASATTLARLRFEELAGQATAVARLRCLGAQSRWEGGEIWTETLFEVVASHKGLLPGLVTIRTIGG
Ga0247667_10576222 | 3300024290 | Soil | MSYVQRRRFLWILFLGGLVLIAVVASATTLSRLNLDDLAQESTAVARLRCLGTRSLWDQGEIWTETKFEVLEREKGDLPGIVTVRLIGGRLGHLHSRVDEVPAFRAG
Ga0207653_100467182 | 3300025885 | Corn, Switchgrass And Miscanthus Rhizosphere | MSYVQRRRFLWILFLGGLALLAIAASATTLSPLKLEDLAQESTAVARLRCLGTTSQWEQGEIWTETRFEVVQRDKGVLPGVVTVRLLGGNVGHL
Ga0207699_100452142 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MSYVQRRRFLWILFLGGLALLAIAASATTLSPLKLEDLAQESTAVARLRCLGTTSQWEQGEIWTETRFEVVQRDKGVL
Ga0207646_107836353 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MSFVQRRRFLWILFLAALALLAVVASSTTLSRLRFEELAQQATAVARLRCLSSEARWEHGEIWTDTRFEVVEQNKGLLPGLV
Ga0209240_11472491 | 3300026304 | Grasslands Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVESRWQGGEIWTETRFETVEVNK
Ga0257150_10753002 | 3300026356 | Soil | MSFVQRRRFLWILFLTGLALLAIAANATTLSRLRFEELANQATAVARVRCIGVESHWQGGEIWTETHFEVVELNKGLLPGVVSVRMLGGQI
Ga0257176_10516711 | 3300026361 | Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVETFLENGEIWTETLFETVELNKGLLPGVIRVR
Ga0257168_11318422 | 3300026514 | Soil | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELVNQATAVARLRCIGVETFLENGEIWTETRFETVELNKGLLPGVVSVRML
Ga0209805_14552811 | 3300026542 | Soil | MSFVQRRRFLWILFLAGLALLAIAANATTLSRVRFEELANQATAIARLRCIGVESHWQGGEIWTETRFEVVELNKGMLPGVI
Ga0207482_1027161 | 3300027456 | Soil | MSYVQRRRFLWILFLGGLALLAIAASATTLSPLKLEDLAQESTAVARLRCLGTTSQWEQGEIWTE
Ga0209217_12185032 | 3300027651 | Forest Soil | MMNFVQRRRFLWKVFLVGLGLIAVVANATTFARVSFRELAQQSTAVGRFRCLSTESKWYNGEIWTETRF
Ga0209388_12326751 | 3300027655 | Vadose Zone Soil | MSFVQRRRFLWILFLTGLALLAVAASATTLSRLRFEELVNRATAVARLRCIAVDSFLENGEIWTETLFETVEMNK
Ga0209178_10694892 | 3300027725 | Agricultural Soil | MSYVQRRQFLWILFLVGLALLAVVASATTMARLRLPDLTEQSTAIARLRCVGTKSLWDQGEIWTETKFEVVQREKGALPGIILVRMLGGDVGHLHSRVDEV
Ga0209448_101806872 | 3300027783 | Bog Forest Soil | MNYVQRRRFLWILFLGGLALLAVAANATTLSRLKLEDLAQESTAVARLRCLGATSQWEQGEIWTETRFEVLQREKGALPGIVTVRLM
Ga0209580_104170801 | 3300027842 | Surface Soil | MSYKQRRRFLWILFLVGLALIAITASATTLSRLKLEDLALESTAVARLRCLDATSFWNQGEIWTETRFAVLEREKGTLPGIV
Ga0209701_100728592 | 3300027862 | Vadose Zone Soil | MSFVQRRRFLWILFLGGLVLLAVAANATTLSRLRFEELVNQATAVARLRCIGVESRWENGEIWTETRFETVEVNKGLLP
Ga0209590_110113281 | 3300027882 | Vadose Zone Soil | MSFVQRRRFLWILFLVGLALLAIAANATTLSRLRFEELVNQATAVARLRCIGVESHWQNGEIWTETRFEIVEV
Ga0209068_101424042 | 3300027894 | Watersheds | MSFVRRRRFLWILFLAGLALLAVVANATTLSRLRFEELANQATAVARLRCLGVESVWENGEIWTETRFETVEL
Ga0209069_101276861 | 3300027915 | Watersheds | MSFVQRRRFLWILFLAGLALLAVAANATTLSRLRFEELANQATAVARLRCIGVESRWEDGEIWTETRFEIVELNKGLLPDAVSVRMLGGLLIGTIVTGLFL
Ga0265720_10109651 | 3300030764 | Soil | MNYVQRRRFLWILFLGGLALLAVAANATTLSRLKLEDLAQESTAVARLRCLGATSQWEQGEIWTETRFEVLQREKGALPGIVTVRLMGGHVGHLHSHVDEVPAFR
Ga0265770_10051281 | 3300030878 | Soil | MNYVQRRRFLWILFLGGLALLAVAANATTLSRLKLEDLAQESTAVARLRCLGATSQWEQGEIWTETRFEVLQREKGALPGIVTVR
Ga0307474_102973952 | 3300031718 | Hardwood Forest Soil | MSYVERRRFLWILFIAALALLAVAASATTLARLKLEDLAQESTAVARMRCLGTTSQWEQGEIWTETRFEVLQ
Ga0307474_110208741 | 3300031718 | Hardwood Forest Soil | MGYARRRQILWILFLAGLGLLVLVAAASATTLAGMRFKEMAKQATAIARVRCLGVESRWEDGEIWTETRFQVVELNKGDLS
Ga0307469_105087521 | 3300031720 | Hardwood Forest Soil | MSYVQRRRFLWILFLGGIVLLALAASATTLSRLKLEDLAQESTAVARLRCLSATSRWEQGEIWTETRFEVVQREKGSLPGIVTVRLLGGSVGH
Ga0307468_1016142531 | 3300031740 | Hardwood Forest Soil | MKYTMNYPSRTIMSHKERRRFFWTLFLGALAILALTVALSATTLSRLRFTELAQESTAVARLRCLSAVSRWEKEEIWTETRFEVLEAEKGLLPRLVTVRMLG
Ga0307475_105592301 | 3300031754 | Hardwood Forest Soil | MSFVQRRRFLWILFLAGLALLAIVANATTLSRLRFEELVNQATAVARLRCLGVESRWQDGEIWTETQFEVVELNKGLLPGVV
Ga0307473_106665121 | 3300031820 | Hardwood Forest Soil | MSFVQRRRFLWILFLTGLALLAIAASATTLSRLRFEELANQATAVARVRCIGVESHWQSGEIWTETHFEVVELNKGLLPGVISIRML
Ga0307478_115098992 | 3300031823 | Hardwood Forest Soil | MSFVQRRRFLWILFLAGLALLAIVANATTLSRLRFEELVNQATAVARLRCLGVESRWQDGEIWTETRFETVEVSKGLLPGVVSVRMLGGSVGS
Ga0307479_102605381 | 3300031962 | Hardwood Forest Soil | MSFVQRRRFLWILFLAGLALLAIAANATMLSRLRFEELVKQATAVARLRCLGVESRWQDGEIWTETQ
Ga0307479_106565092 | 3300031962 | Hardwood Forest Soil | MSFVQRRRFLWVLFLAGLALLAVAANATTLSRLRFEELVNQATAVGRLRCVGVESRWENGEIWTETRFEIVELNKGQLAGVVSVRMLGGSVG
Ga0307471_1001529461 | 3300032180 | Hardwood Forest Soil | MSFVQRRRFLWILFLVGLGLLAIAANATTLSRLRFEELARQATAVARVRCIGVEIHWQGGEIWTETQFEVVELSKGLLPGVI
Ga0335079_121486332 | 3300032783 | Soil | MSYAQRRRFLWILFLIGIALIAVTAKATTLARLKLEDLAQESTAVARLRCVGATSVWDHGEIW
Ga0335078_101861023 | 3300032805 | Soil | MSYVERRRFLWVLFLIGIVLIALTASATTLAPLSLADLAQESTAVARLRCLGSRSFWEQGEIWTETHFEVVQRE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice