Basic Information | |
---|---|
IMG/M Taxon OID | 3300002079 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095583 | Gp0060320 | Ga0016756 |
Sample Name | CSTRmetagenomics |
Sequencing Status | Permanent Draft |
Sequencing Center | |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 89693915 |
Sequencing Scaffolds | 15 |
Novel Protein Genes | 17 |
Associated Families | 14 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 2 |
All Organisms → cellular organisms → Bacteria | 8 |
All Organisms → Viruses → Predicted Viral | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → candidate division TA06 → candidate division TA06 bacterium 34_109 | 1 |
All Organisms → cellular organisms → Bacteria → Spirochaetes → Spirochaetia → Spirochaetales → unclassified Spirochaetales → Spirochaetales bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Wastewater Treatment Plant Microbial Communities From Luxembourg |
Type | Engineered |
Taxonomy | Engineered → Wastewater → Unclassified → Unclassified → Unclassified → Wastewater Treatment Plant → Wastewater Treatment Plant Microbial Communities From Luxembourg |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Water (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | Belvaux, Luxembourg | |||||||
Coordinates | Lat. (o) | 49.507864 | Long. (o) | 5.943557 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F011593 | Metagenome / Metatranscriptome | 289 | Y |
F021489 | Metagenome / Metatranscriptome | 218 | N |
F023356 | Metagenome / Metatranscriptome | 210 | Y |
F029455 | Metagenome / Metatranscriptome | 188 | Y |
F030486 | Metagenome / Metatranscriptome | 185 | Y |
F045118 | Metagenome / Metatranscriptome | 153 | N |
F047964 | Metagenome / Metatranscriptome | 149 | N |
F048287 | Metagenome / Metatranscriptome | 148 | N |
F049316 | Metagenome / Metatranscriptome | 147 | N |
F064732 | Metagenome / Metatranscriptome | 128 | Y |
F066555 | Metagenome / Metatranscriptome | 126 | Y |
F073245 | Metagenome / Metatranscriptome | 120 | Y |
F078878 | Metagenome / Metatranscriptome | 116 | Y |
F104682 | Metagenome / Metatranscriptome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
CSTR_1009099 | Not Available | 1357 | Open in IMG/M |
CSTR_1009267 | All Organisms → cellular organisms → Bacteria | 1075 | Open in IMG/M |
CSTR_1019028 | All Organisms → Viruses → Predicted Viral | 1156 | Open in IMG/M |
CSTR_1019064 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 6454 | Open in IMG/M |
CSTR_1022109 | All Organisms → cellular organisms → Bacteria | 2470 | Open in IMG/M |
CSTR_1022359 | All Organisms → cellular organisms → Bacteria | 1044 | Open in IMG/M |
CSTR_1022360 | All Organisms → cellular organisms → Bacteria | 1119 | Open in IMG/M |
CSTR_1022430 | All Organisms → cellular organisms → Bacteria | 2967 | Open in IMG/M |
CSTR_1022636 | All Organisms → cellular organisms → Bacteria | 8854 | Open in IMG/M |
CSTR_1024451 | All Organisms → cellular organisms → Bacteria | 1352 | Open in IMG/M |
CSTR_1025975 | All Organisms → cellular organisms → Bacteria | 1309 | Open in IMG/M |
CSTR_1026402 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → candidate division TA06 → candidate division TA06 bacterium 34_109 | 1574 | Open in IMG/M |
CSTR_1028711 | All Organisms → cellular organisms → Bacteria → Spirochaetes → Spirochaetia → Spirochaetales → unclassified Spirochaetales → Spirochaetales bacterium | 7400 | Open in IMG/M |
CSTR_1028770 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium | 1270 | Open in IMG/M |
CSTR_1030708 | Not Available | 1035 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
CSTR_1008147 | CSTR_10081472 | F078878 | LSSIVLKIPYESQSFHAALTDATLRDRVQEWVAAAVTLDDAVYIPLRTDWMPNGPRRTADGVEVLLAGRWPPEVDAETGPRVSAAIFGRVWDTLCNATPADELEDIATCGLSGVNEEQTGNEVLEEAPAVIHIHATPVNHVDGW |
CSTR_1009099 | CSTR_10090992 | F064732 | MPKVSEYKWKDTEAEAEAATAAARERCTMPLVQSAERHVRFPSSPMAPGQYIAATATRSTGQLDHPEDTELGNHPFLGLL* |
CSTR_1009267 | CSTR_10092672 | F011593 | MTSSLAAPPWCRGCPDPIYDNRRGEWDWYCRQHSPTGIRCSNVAVLRERCYRVREYFAKKEASK* |
CSTR_1019028 | CSTR_10190282 | F066555 | MKGKLRKEFIERMEAVLYGDIVEEEVEDLIHDFCDEIENRLLDVCAELKRIKGLREIDAIRNAVEELVTDLW* |
CSTR_1019064 | CSTR_10190648 | F030486 | RYVALAQSRGGACFCTRGAVQDYAVITLFSRWQKP* |
CSTR_1022109 | CSTR_10221092 | F023356 | MSENSLKKAVLESRVSAAEAEVMSTELEKSKVITLRVEEPLFKAIEAQAELWNVKPAETIRRVLRFYFLPVALELELRGESEKFWKGEFTPEAVGEYMVFLLETTEKIRSSALFLSREAVKLSEATEGKLSEALEEEREGAET* |
CSTR_1022109 | CSTR_10221093 | F048287 | MYNKAVLLQTLQQVRLKVIDTLKLMLSDYEVSPESSLRVQPASYEVATGEKVEYPLFRNEAGTFYGSKAYLNTENWNLTLKPLPAGTRGVGAFLQFSVPKNYYGSNYFSVGEAGTRAALKKVEGELVQNGVFTNLEEAELSRVDTFKNIEPEEPFSSYFSLFSLLKARRAQQRGYGTTFLVSNSSQEFCVYDKLQEMREHGLETSELPETMRFEHRALKKEKVRSLYGFTSVSELFHGGYEVVKEKQVESWKNSLFSFSAEEVVVLGSKQLEQEMKVFQERFGSNWFQYFLRSYGAYSLAKFAGKEVVRAALENLNSERTKVWRTMKLLEEAERELLVLKQEEGSSKTLGELYEELRGKVCELAGRN* |
CSTR_1022359 | CSTR_10223593 | F045118 | AELSRVDTFKNIEPEEPFSSYFSLFSLLRARRAQQRGYGTTFLLSNSSQEFCIYDKLQEMREHGEETSELPETMRFEHRALKKEKVSSLYGFTKVKELFHGGYEVVKEKQVESWKNSLFSFSAEEVLVLGSKQLEQEMKVFQEKFGSNWFQYFLRSYGAYSLAKFAGKEVVRAALENLNSERTKVWRTMKLLEEAERELLVLKQEEGSSKTLGELYEELREKVCGTAERN* |
CSTR_1022360 | CSTR_10223601 | F045118 | AGTRAALKKVEGELDQSGVFTNLEEAELSRVDTFKNIEPEEPFSSYFSLFSLLKARKAQQRGYGTTFLVSNSSQEFCIYDKLQEMREHDLETSNLPETMRFEHRALKKEKVRSLYGFTRVSELFHGGYEVVKEKQVESWKNSLFSFSTEEVVVLGSKQLEQEMKVFQEKFGSNWFQYFLRSYGAYSLAKFAGKEVVRAALENLNSERTKVWRTMKLLEEAERELLVLKQEEGSSKTLGELYEELRGKVCELAGRN* |
CSTR_1022430 | CSTR_10224307 | F048287 | KVEFPLFRNEAGTFYGSKAYLNTENWNLTLKPLPAGNRGTGAFLQFSVPKNYYGSNYFSVGEAGTRASLKKVEGELAENGVLTNLEEAELSRVDTFKNIEPEEPFSSYFSLFSLLKARKAQQRGYGTTFLVSNSSQEFCIYDKLQEMREHGEETSELPETMRFEHRALTKRKVSSLYGFTRAGELFHGGYEVVREKQLESWKSSLFSFSAEEVVVLGSKQLEQEMRFFQEKFGANWFQYFLKSYGAYSLAKFAGKEVVRNALENLNSERTKVWRTMKLLEEAERELLVLKQEEGSSKTLGELYEELRGKVCELAGRN* |
CSTR_1022636 | CSTR_10226361 | F029455 | ARGSMADRVIDLDLWGEGAERLPAITITGQGLRIEGSQAEFDRLILIIGGWMK* |
CSTR_1024451 | CSTR_10244512 | F073245 | MIAEPITLLDLALGLTLLIALVAIVVNVIALVKEYIKSQNVRA* |
CSTR_1025975 | CSTR_10259753 | F045118 | FSLFSLLKARRAQQRGYGTTFLVANSSQEFCVYDKLQEMREHGLETSELPETMRFEHRALKKEKVRTLYGFTRVNELFHGGYEVVKEKQVESWKNSLFSFSAEEVVVLGSKQLEQEMKAFQEKFGSNWFQYFLRSYGAYSLAKFAGKEVVRSALENLNSERTKVWRTMKLLEEAERELLVLKQEEGSSKTLGELYEELRGKVCELAGRN* |
CSTR_1026402 | CSTR_10264021 | F021489 | MTIEKKIESVQEYNLLKLLKANSSLKRFGVEWIKLDHTLCRAFATNEFAMIVADLEPNQWHDIFDGLPEVIFIDKLKRDEVHYFEAKELVYYNYLSVFEAVGKEPNYQPVMHFDLELLRNLTDKFDNIYFVKQSGKALFMKLEGDKYPAGSYYGALMPKTISQEETAKIIEALSSLATDGEIRRQWVADLREFAQE* |
CSTR_1028711 | CSTR_10287116 | F104682 | MREIGIDRPRFDVKVSTEIFTLYFIPNLARKKYIDFWHRVEQMIEALKIADAKERKQAVDSLSTDDESDLLKEIIEVTLEANGIAYNETWWEEHTDAEMQLEFVRSVIEGSDNGKKKVLQVLKA* |
CSTR_1028770 | CSTR_10287702 | F047964 | MVSKKEMLVILNKVAAVFGKEIDEKLVSVYHLALKQYSRMALADAAMLCVQENSFMPRPKDLVEKIDKYDLERKWQVVEPSTERTYWVLFREEYETTDDINEIECKEIFADEYVGAPKLKKTSETNNMSDEERLIQFRKLY |
CSTR_1030708 | CSTR_10307081 | F049316 | MDKDKRITTIVLDKEADRKSKIIAEFTGKSLSAIYREALDMYFSHFVNNDLLSEVNEYLDKLYWQYKIDNPHIFAAVQFGNLSNAAVMIAEAIKQLPQEPEAQKLIDEGMRLISVLSK* |
⦗Top⦘ |