WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00016798 Gene Name  ets-8
Sequence Name  ? C50A2.4 Organism  Caenorhabditis elegans
Automated Description  Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific. Predicted to be involved in regulation of transcription by RNA polymerase II. Predicted to be located in nucleus. Is an ortholog of human ETV1 (ETS variant transcription factor 1); ETV4 (ETS variant transcription factor 4); and ETV5 (ETS variant transcription factor 5). Biotype  SO:0001217
Genetic Position  IV :-21.5293± Length (nt)  ? 1126
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00016798

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C50A2.4.1 C50A2.4.1 759   IV: 1159361-1160486
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C50A2.4 C50A2.4 426   IV: 1159690-1159815

1 RNAi Result

WormBase ID
WBRNAi00042845

44 Allele

Public Name
gk963722
gk964482
gk963025
tm11413
gk193566
gk193565
gk193567
tm433
tm440
WBVar01821150
WBVar01724668
WBVar01724670
WBVar01724669
WBVar01667935
WBVar01667936
WBVar01510454
WBVar01510453
WBVar01510460
WBVar01510458
WBVar01510457
WBVar01510456
WBVar01510455
WBVar01510459
WBVar01647665
WBVar01686992
WBVar02097876
WBVar01686991
WBVar01686993
WBVar01686990
WBVar01450464

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00016798 1159361 1160486 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_1158029..1159360   1332 IV: 1158029-1159360 Caenorhabditis elegans

37 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Transcripts that showed significantly increased expression in whole animal day 1 N2 adults comparing to in whole animal day 8 N2 adults. DESeq2, FDR < 0.05, fold change > 2. WBPaper00066978:Day1Adult_vs_Day8Adult_upregulated_neuron
  Transcripts that showed significantly decreased expression after animals were treated with 100uM Rapamycin and 50uM Rifampicin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Rifampicin_downregulated
  Transcripts that showed significantly decreased expression after animals were treated with 50uM Rifampicin and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Allantoin_downregulated
  Genes with significantly increased expression in eat-2(ad465) treated with 2% DMSO for 72 hours, comparing to in N2 treated with 2% DMSO for 72 hours. Analysis of gene expression data was carried out with the Affymetrix Transcriptome Analysis Console. Data preprocessing (using RMA normalization) and QC metrics were performed using Affymetrix Expression Console TM and manually inspected afterwards. Expression analysis was carried out for each two pairwise conditions. FDR statistical correction for multiple testing resulted in a slightly lower number of DEGs in most cases. P-value < 0.05 and fold change > 2.0 were used to determine differentially expressed genes. WBPaper00048989:eat-2(ad465)_upregulated_in-DMSO
Bacteria infection: Serratia marcescens Genes with increased expression after 24 hours of infection by S.marcescens Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:S.marcescens_24hr_upregulated_TilingArray
  Genes uniquely expressed in endoderm, according to RNAseq studies on blastomere (with isolated AB, MS, E, C, D founder cells dividing in vitro) time course and whole embryo time course. Germ layers were assigned by correlating the average expression with germ-layer-specific patterns with a cutoff of 0.6 correlation with the following idealized vectors: endoderm = [00100]; ectoderm = [10000]; mesoderm = [01011], where the order is AB, MS, E, C and P3. Germ-layer genes were defined according to the sum of the genes identified by the clusters and are indicated in Fig. 2b. Authors further filtered the germ-layer gene sets by keeping only those genes whose expression was partitioned across the germ layers such that at least two-thirds of the expression was in that germ layer. WBPaper00046121:endoderm_unique
  Embryonic class (E): genes that significantly increase in abundance at some point during embryogenesis. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_E
  Strictly embryonic class (SE): genes that are the subset of embryonic genes that are not also classified as maternal. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SE
Temprature shift to 28C for 24 hours. Transcripts that showed significantly increased expression after animals were exposed to 28C temperature for 24 hours. Differentially expressed genes wereidentified using DESeq (v.1.18.0) by normalizing readsbased on the negative binomial distribution method andcomparing each HS timepoint to the 0-h control. WBPaper00061341:28C_24h_upregulated
  Genes that showed significant differential expressed between control and 150 mg\/L Atrazine treatment. t-test, p < 0.05. WBPaper00036123:Atrazine_regulated
  Transcripts that showed significantly decreased expression after animals were treated with 100uM Rapamycin, 50uM Rifampicin and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Rifampicin-Allantoin_downregulated
  Transcripts that showed significantly decreased expression after animals were treated with 50uM Rifampicin and 100uM Psora from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Psora_downregulated
  Embryonic transient class (ET): genes that are the subset of embryonic genes in which the latest significant increase is earlier than their latest significant decrease. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_ET
  Transcripts that showed significantly decreased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_downregulated
  Transcripts that showed significantly decreased expression in drh-3(rrr2) comparing to in N2. edgeR, log2 fold change > 2 or < -2. WBPaper00053888:drh-3(rrr2)_downregulated
  Genes in the bottom 10% of expression level across the triplicate L3 samples. To generate the top10 and bottom10 gene sets, authors ranked all genes by mean expression array signal intensity across the three replicates, then took the top and bottom deciles (1,841 genes each) to represent genes with high and low expression. To generate the top10 and bottom10 gene sets, authors ranked all genes by mean expression array signal intensity across the three replicates, then took the top and bottom deciles (1,841 genes each) to represent genes with high and low expression. WBPaper00032528:L3_depleted
  Genes that showed significantly increased expression level in rsr-2(RNAi) animals comparing to in gfp(RNAi) control. Fold change > 1.2 or < 0.8. WBPaper00042477:rsr-2(RNAi)_upregulated_TilingArray
  Embryonic (E) subclasses are based on the earliest significant increase(abbreviated pi for primary increase). A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_E_pi(66_min)
  Strictly embryonic transient class (SET): genes that are the subset of embryonic transient genes that are not also classified as maternal. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SET
  Strictly embryonic (SE) subclasses are based on the earliest significant increase(abbreviated pi for primary increase). A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SE_pi(66_min)
  Transcripts that showed significantly decreased expression in daf-2(e1370) comparing to in N2. Student's t-test, fold change > 2, p-value < 0.05. WBPaper00055386:daf-2(e1370)_downregulated
  Transcripts that showed significantly decreased expression in eat-2(ad465) comparing to in N2. Student's t-test, fold change > 2, p-value < 0.05. WBPaper00055386:eat-2(ad465)_downregulated
Bacteria diet: Comamonas sp. 12022 MYb131 Transcripts that showed significantly increased expression after animals were fed by Comamonas sp. 12022 MYb131, comparing to animals fed by OP50. edgeR FDR <= 0.05, fold change >= 4. WBPaper00061424:Diet_MYb131_upregulated
  Transcripts that showed significantly decreased expression after animals were treated with 100uM Rapamycin, 100uM Psora and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Psora-Allantoin_downregulated
  Transcripts that showed significantly decreased expression in mex-3(eu149) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. RPKM fold change > 2. WBPaper00058598:mex-3(eu149)_downregulated
  Coexpression clique No. 211, srj-42-srw-113, on the genome-wide coexpression clique map for the nematode GPL200 platform. All available microarray datasets for the GPL200 platform (Affymetrix C. elegans Genome Array) were obtained from the GEO repository. This included 2243 individual microarray experiments. These were normalized against each other with the software RMAexpress (Bolstad, 2014). Based on these normalized values, Pearsons correlation coefficients were obtained for each probe-probe pair of the 22,620 probes represented on this array type. The resulting list of correlation coefficients was then ranked to generate the ranked coexpression database with information on each probe represented on the GPL200 platform. WBPaper00061527:srj-42-srw-113
  Genes from N2 animals with significantly increased expression after 72 hours of treatment on growth media with 250uM allantoin in 2% DMSO. Analysis of gene expression data was carried out with the Affymetrix Transcriptome Analysis Console. Data preprocessing (using RMA normalization) and QC metrics were performed using Affymetrix Expression Console TM and manually inspected afterwards. Expression analysis was carried out for each two pairwise conditions. FDR statistical correction for multiple testing resulted in a slightly lower number of DEGs in most cases. P-value < 0.05 and fold change > 2.0 were used to determine differentially expressed genes. WBPaper00048989:N2_allantoin_upregulated
  Transcripts that showed significantly increased expression in DA116[eat-2(ad1116)] comparing to in N2. The DESeq2 package (v1.24.0) was used to identify differentially expressed genes (DEGs). Fold change > 2, FDR < 0.05. WBPaper00061040:eat-2(ad1116)_upregulated
  Transcripts that showed significantly decreased expression in spn-4(tm291) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. RPKM fold change > 2. WBPaper00058598:spn-4(tm291)_downregulated

4 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1019975 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2011384 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr2029620 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1146823 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

10 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in

13 Homologues

Type
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00016798 1159361 1160486 -1

10 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
1126

1 Sequence Ontology Term

Identifier Name Description
gene  

1 Strains

WormBase ID
WBStrain00029510

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_1160487..1163425   2939 IV: 1160487-1163425 Caenorhabditis elegans