NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027178

3300027178: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05.2A5-11 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027178
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0091566 | Ga0207606
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05.2A5-11 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 30042109
Sequencing Scaffolds: 14
Novel Protein Genes: 16
Associated Families: 16

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1
All Organisms → cellular organisms → Bacteria | 4
Not Available | 5
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 13_2_20CM_2_64_7 | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1
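The lineage strings above can be handled programmatically by splitting on the arrow separator. The following is a minimal sketch, not part of NMPFamsDB: the names PHYLOGENY, deepest_taxon and summarize are hypothetical, and the rows are copied by hand from the Dataset Phylogeny table. It reports the most specific taxon for each row and checks that the per-group counts add up to the 14 scaffolds listed under Dataset Contents.

```python
# Rows copied by hand from the Dataset Phylogeny table: (lineage, number of scaffolds).
# PHYLOGENY, deepest_taxon and summarize are illustrative names, not an NMPFamsDB API.
PHYLOGENY = [
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → "
     "Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae", 1),
    ("All Organisms → cellular organisms → Bacteria", 4),
    ("Not Available", 5),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → "
     "Alphaproteobacteria → unclassified Alphaproteobacteria → "
     "Alphaproteobacteria bacterium 13_2_20CM_2_64_7", 1),
    ("All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → "
     "Bacteria candidate phyla → Candidatus Rokubacteria", 1),
    ("All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia", 1),
    ("All Organisms → cellular organisms → Bacteria → Terrabacteria group → "
     "Actinobacteria → Actinomycetia → unclassified Actinomycetia → "
     "Actinomycetia bacterium", 1),
]


def deepest_taxon(lineage):
    """Return the last (most specific) name in an arrow-separated lineage."""
    return lineage.split("→")[-1].strip()


def summarize(rows):
    """Return (total scaffolds, {deepest taxon: scaffold count})."""
    by_taxon = {}
    for lineage, count in rows:
        name = deepest_taxon(lineage)
        by_taxon[name] = by_taxon.get(name, 0) + count
    return sum(by_taxon.values()), by_taxon


total, by_taxon = summarize(PHYLOGENY)
for name, count in sorted(by_taxon.items(), key=lambda item: -item[1]):
    print(f"{count:2d}  {name}")
print("total scaffolds:", total)
```

Grouping by the last lineage element keeps "Bacteria" (classified only to domain) separate from the more specific groups, which mirrors how the table itself reports them.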

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome, agricultural field, agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F006629 | Metagenome / Metatranscriptome | 368 | Y
F016050 | Metagenome / Metatranscriptome | 250 | Y
F017166 | Metagenome / Metatranscriptome | 242 | Y
F028301 | Metagenome | 192 | Y
F029216 | Metagenome / Metatranscriptome | 189 | Y
F034688 | Metagenome / Metatranscriptome | 174 | Y
F038225 | Metagenome / Metatranscriptome | 166 | Y
F041750 | Metagenome / Metatranscriptome | 159 | Y
F044695 | Metagenome / Metatranscriptome | 154 | Y
F045732 | Metagenome / Metatranscriptome | 152 | N
F059558 | Metagenome / Metatranscriptome | 133 | Y
F060880 | Metagenome / Metatranscriptome | 132 | N
F082749 | Metagenome / Metatranscriptome | 113 | Y
F085779 | Metagenome / Metatranscriptome | 111 | N
F087420 | Metagenome / Metatranscriptome | 110 | Y
F102800 | Metagenome / Metatranscriptome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207606_100068 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1669
Ga0207606_100070 | All Organisms → cellular organisms → Bacteria | 1664
Ga0207606_100103 | Not Available | 1569
Ga0207606_100346 | Not Available | 1209
Ga0207606_100966 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 13_2_20CM_2_64_7 | 889
Ga0207606_101548 | Not Available | 774
Ga0207606_101552 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 773
Ga0207606_101604 | All Organisms → cellular organisms → Bacteria | 765
Ga0207606_102181 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 696
Ga0207606_102968 | All Organisms → cellular organisms → Bacteria | 634
Ga0207606_103398 | Not Available | 606
Ga0207606_103738 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 589
Ga0207606_104950 | All Organisms → cellular organisms → Bacteria | 541
Ga0207606_105179 | Not Available | 533

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207606_100068 | Ga0207606_1000682 | F029216 | MYVLIVVIGVLSQGASVLPVGVTSQVVGKFKNLDECKAAAKQPHAAGPIADITVVTTWGATWYCTYSGTN
Ga0207606_100070 | Ga0207606_1000701 | F059558 | EKRGKLLICSLKGEVKNKHNRNGWIFSCFAAHLFEQDLF
Ga0207606_100103 | Ga0207606_1001031 | F045732 | HVQKLRGPRAESLVSKMRHRRRKVFIDTHKGRVSAGFTVAAEADAADVSMRLRERGWIAYRLRLEAEQYAWVATVIDWTRRAA
Ga0207606_100346 | Ga0207606_1003461 | F087420 | GVASQAFACGSSRQMGPKLGTRAKRSATGVAIALALSALPHAPASAAKPPYGSCVAVTKQEYDSAKKQHMLQTRYTQYVRTGLPGRRQYWYCR
Ga0207606_100452 | Ga0207606_1004523 | F082749 | RKRLINHAMTLDAGLSLKGVRHDIDPVVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGYKQVLKASPVKAHNVRS
Ga0207606_100966 | Ga0207606_1009663 | F041750 | AWTGYDRNMLKAPTDADIVRLRQFLRRLESDAKEAVEDAELCQHEIDRLKSEIAYLEAARSRAFLAQIGIDFGAAASGRRNYSAERGKPS
Ga0207606_101548 | Ga0207606_1015482 | F017166 | RGSGVNKIMFIVALGAMLFIGWLYLGEMSDVRSIKAAVTAAADSTAVALSQSANPERNTDADADGIFIKHIQTSSTLEDVSVKQSVEPISAGRLRQSVKVTARARTTLSEFFNMQGAEIDITATHDFDRKK
Ga0207606_101552 | Ga0207606_1015522 | F102800 | LSEIFEVSRTGELAASVRRPLGRGGLRAFVERESVY
Ga0207606_101604 | Ga0207606_1016042 | F034688 | VLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFTWGK
Ga0207606_102181 | Ga0207606_1021811 | F006629 | MVKVMGMLTLETVPGLNRHCRNALVAALSRIGLPVLDETEQPPTLPLPESTVQTITPLPVTC
Ga0207606_102968 | Ga0207606_1029682 | F060880 | MRTSADSRKMQRRREELRSHSEAAVATIWLVFYVLGLVVAVSSPIVSRALEFAAH
Ga0207606_103398 | Ga0207606_1033981 | F085779 | RFLGASGVAAAVLLYSLSSAVFAQDEPKRPINPGLVPNTGQINTGAAPQPWSKSAEVDNIPAPAEAWAALVQPISRQPSAGGGATTTGTGGQQPPASGTINASSEPPPSGPIGSFGQTIPAKFSKRNDILDHLPTMAIPLPLTQEQRKQIYDAVMAEKSQPVVGADALELASELSPNQALNGMRPLPESVRSIDGVSRLYYI
Ga0207606_103738 | Ga0207606_1037381 | F028301 | RRRCIGFRIAFIFGILLAVINRGDWTLLSIALAGMAILVVAYWRDCRRPRHSP
Ga0207606_104950 | Ga0207606_1049501 | F016050 | MDIQPDANVEDQDRAKCGKNETGGMKSFACRVRKYVGNRAADDRSDDAEHDCPENGHVHMHHRFRNNSGDQPNNNVPD
Ga0207606_105179 | Ga0207606_1051791 | F038225 | MKRFSLALLGTVGAFFILTPAQAADYRVVQYNDTKICQVVDMAGPFKPISSKYTVLTKKSLPSFADAMKARADVGAKAKCIL
Ga0207606_105347 | Ga0207606_1053472 | F044695 | MKMLAVRCLVALTLLVAAAVAFANPFKSKIISSDDSVLEITVPGDHFMKITNFTQDGGLVTDRAVVEVTLP
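
For downstream tools, the rows of the Sequences table can be written out in FASTA format. The snippet below is a minimal sketch under stated assumptions: ROWS holds two rows copied by hand from the table, to_fasta is a hypothetical helper, and the "scaffold=" and "family=" fields in the header line are an illustrative layout, not a format that NMPFamsDB itself defines.

```python
# Two rows copied by hand from the Sequences table:
# (scaffold ID, protein ID, family, amino-acid sequence).
ROWS = [
    ("Ga0207606_100068", "Ga0207606_1000682", "F029216",
     "MYVLIVVIGVLSQGASVLPVGVTSQVVGKFKNLDECKAAAKQPHAAGPIADITVVTTWGATWYCTYSGTN"),
    ("Ga0207606_101552", "Ga0207606_1015522", "F102800",
     "LSEIFEVSRTGELAASVRRPLGRGGLRAFVERESVY"),
]


def to_fasta(rows, width=60):
    """Format rows as FASTA text, wrapping each sequence at `width` residues.

    The header layout (protein ID, then scaffold= and family= fields) is an
    assumption made for this example.
    """
    lines = []
    for scaffold_id, protein_id, family, sequence in rows:
        lines.append(f">{protein_id} scaffold={scaffold_id} family={family}")
        # Slice the sequence into fixed-width chunks, one per output line.
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


fasta = to_fasta(ROWS)
print(fasta, end="")
```

Using the protein ID as the record identifier keeps each record unique, since in this table every protein ID is its scaffold ID followed by one extra digit.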




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice