WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000454 Gene Name  ceh-33
Sequence Name  ? C10G8.7 Brief Description  ceh-33 encodes a Six/sine oculis class homeodomain transcription factor; preliminary RNAi experiments suggest that CEH-33 activity may be required for gonad development; a ceh-33::gfp reporter shows very weak pharyngeal expression in late embryos and early larvae.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Predicted to be involved in regulation of transcription by RNA polymerase II. Predicted to be located in nucleus. Predicted to be part of transcription regulator complex. Expressed in head and head muscle. Human ortholog(s) of this gene implicated in several diseases, including autosomal dominant nonsyndromic deafness 23; carcinoma (multiple); and optic disc anomalies with retinal and/or macular dystrophy. Is an ortholog of human SIX1 (SIX homeobox 1); SIX2 (SIX homeobox 2); and SIX6 (SIX homeobox 6).
Biotype  SO:0001217 Genetic Position  V :-2.22071 ±0.017614
Length (nt)  ? 1473
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000454

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C10G8.7.1 C10G8.7.1 880   V: 5323194-5324666
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C10G8.7 C10G8.7 786   V: 5323238-5323436

5 RNAi Result

WormBase ID
WBRNAi00040327
WBRNAi00114714
WBRNAi00010612
WBRNAi00028795
WBRNAi00101782

31 Allele

Public Name
gk963301
gk963591
gk963553
gk964259
gk964351
gk963850
tm244
WBVar01651494
WBVar01651493
gk954417
gk948315
WBVar00272019
WBVar00272020
gk234923
ok1362
gk791917
WBVar02043314
WBVar01459852
gk533619
h9979
gk397357
WBVar02054703
gk509105
gk486036
WBVar01459853
gk885518
gk742286
gk678639
gk818805
gk444954

1 Chromosome

WormBase ID Organism Length (nt)
V Caenorhabditis elegans 20924180  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000454 5323194 5324666 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrV_5319578..5323193   3616 V: 5319578-5323193 Caenorhabditis elegans

61 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L1-larva_expressed
Bacteria: E.faecalis strain OG1RF Transcripts that showed significantly increased expression after infection by E. faecalis OG1RF. Ballgown was used to calculate differential expression of genes using FPKM data and to generate tables with fold change and P values. Genes were shortlisted with a cutoff of 2-fold change and P values of less than 0.05. WBPaper00059754:E.faecalis_OG1RF_upregulated
  Transcripts that showed significantly increased expression glp-1(e2141); TU3401 animals comparing to in TU3401 animals. Fold change > 2, FDR < 0.01. WBPaper00065993:glp-1(e2141)_upregulated
  Expression Pattern Group C, enriched for genes involved in metabolic processes. The significance (P 0.0001) of the relative age (time) was used to determine if a gene was differentially expressed between the three age (time) groups. The effect of this factor explaining gene expression differences was used to determine if the expression went up or down during the two age/time periods (t1 - t2 and t2 -t3). Authors used a permutation approach to determine the thresholds for the different mapping strategies. For each of the used models for eQTL mapping, authors used 23,000 permutations. For each permutation, authors randomly picked a spot; each spot could only be picked once. The gene expression and relative lifespan values were than randomly distributed over the RILs (and time points) and used for mapping. In this way, authors obtained a threshold for each of the explaining factors. For the single time points, authors used a FDR of 0.01 to adjust for multiple testing. The genome-wide threshold for this FDR is -log10 P = 3.8 for each of the three time points. For the combined models (t1 to t2 and t2 to t3), authors used a genome-wide threshold of -log10 P = 4, which resembles an FDR of 0.006, 0.001, and 0.006 for marker, age, and the interaction between marker and age, respectively. To determine the threshold for the single gene examples, authors used 1000 permutations as in the genome-wide threshold. The difference is that they use the gene under study in all of the permutations. The P-values for the gene specific thresholds were determined at FDR = 0.05. WBPaper00036286:Pattern_C
  Genes down-regulated following nhr-25(RNAi). Pair-wise significance testing (mutant/RNAi vs. wild-type/vector) was performed using the Bioconductor package limma and p-values were initially corrected for multiple testing using the false discovery rate (FDR) method of Benjamini and Hochberg. Authors defined differential expression as log2(ratio) >= 0.848 with the FDR set to 5%, and p-value <= 0.001. WBPaper00045015:nhr-25(RNAi)_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L3-L4-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 0hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:germline-precursors_blastula-embryo_expressed
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated
  Transcripts that showed altered expression from P0 to F2 generation animals after N2 parental generation were treated with antimycin, but not in damt-1(gk961032) P0 to F2 animals after the parenal generaton were treated with antimycin. N.A. WBPaper00055862:antimycin_damt-1(gk961032)_regulated
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:hypodermis_L1-larva_expressed
  Transcripts of coding genes that showed significantly increased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_enriched_coding-RNA
heat-shock hlh-1 Genes enriched in HLH-1 heat shock dataset. A two-class unpaired analysis was performed to identify genes that are elevated 1.7-fold or greater when compared with the reference for each dataset, at a false discovery rate of 1.8% or less for M0 and 1.2% or less for the M24 datasets. WBPaper00031003:hlh_1_enriched
  Transcripts depleted in RIS neurons comparing to in all cells. edgeR 3.24.3, FDR < 0.01 WBPaper00058969:RIS_depleted
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L2-larva_expressed
  Transcripts that showed differential expression in dauer mir-34(gk437) vs dauer mir-34(OverExpression) animals at 20C. N.A. WBPaper00050488:mir-34(gk437)_vs_mir-34(OverExpression)_regulated_dauer_20C
  Genome-wide analysis of developmental and sex-regulated gene expression profile. self-organizing map cgc4489_group_11
control(maintained under normal lab light (mostly dark, in incubators).) vs UVC-EtBr-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total) and exposed to EtBr (5ug/mL in agar).) at just prior to the third UVC dose (48h). Genes differentially expressed in control vs after UVC exposure and EtBr treatment at the -1h timepoint (just prior to the third UVC dose (48h)). Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:control_vs_UVC-EtBr-exposed_48h
  Significantly upregulated genes from cyc-1(RNAi) microarrays using SAM algorithm with an FDR < 0.1 from adult-only chips. SAM algorithm with an FDR < 0.1. WBPaper00033065:cyc-1(RNAi)_upregulated
UVC-EtBr-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total) and exposed to EtBr (5ug/mL in agar).) vs UVC-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total).) at just prior to the third UVC dose (48h). Genes differentially expressed under EtBr treatment and UVC exposure vs under UVC exposure but without EtBr treatment at the -1h timepoint (just prior to the third UVC dose (48h)). Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:UVC-EtBr-exposed_vs_UVC-exposed_48h
  Genes up-regulated in wdr-23(tm1817) mutants comparing to in N2. Differentially expressed genes at false discovery rate (FDR) of 0.05 were identified using the Cuffdiff module of the Cufflinks package. WBPaper00042215:wdr-23(tm1817)_upregulated
  Genes significantly enriched (> 2x, FDR < 5%) in a particular cell-type versus a reference sample of all cells at the same stage. A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_embryo_enriched
  Genes that show selective expression in a subset of cell types vs broadly expressed in many cell types. Correspond to 20% - 57% of enriched_genes for a given cell type. A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_embryo_SelectivelyEnriched
  Transcripts that showed differential expression in dauer N2 vs dauer mir-34(gk437) animals at 20C. N.A. WBPaper00050488:N2_vs_mir-34(gk437)_regulated_dauer_20C
  Transcripts that showed significantly decreased expression in pry-1(mu38) animals comparing to in N2 at L1 larva stage. DESeq, FDR < 0.05 WBPaper00055626:pry-1(mu38)_downregulated
  Genes with differential expression under 0.5mg/l Chlorpyrifos (CPF) and 0.5mg/l Diazinon (DZN) treatment at 24 centigrade. To identify the differentially expressed genes in each treatment authors used linear models per toxicant and temperature (gene expression = Toxicant (effect) + error). The lm function in R stats package was used to implement the linear models analysis with recommended default options. For threshold determination authors used a permutation approach. For each of the 23,232 permutations used authors randomly picked a transcript (array spot), which could only be picked once. Authors combined all the expression values of this transcript and randomly distributed them over the replicates and used them in the linear model. In this way authors obtained a threshold for each of the toxicants. Authors used a -log10 p-value 2 as common threshold for the analysis, which resembles to the following FDR per toxicant: 0.0155 for CPF at 24 centigrade, 0.0148 for DZN at 24 centigrade, 0.0168 for CPF+DZN at 24 centigrade, 0.0142 for CPF at 16 centigrade, 0.0151 for DZN at 16 centigrade, and 0.0148 for CPF+DZN, at 16 centigrade. WBPaper00040210:Chlorpyrifos_Diazinon_24C_regulated
Bacteria infection: Pseudomonas aeruginosa PA14. 24 hours of exposure. Small RNAs (21-26nt) that showed significantly increased expression after L4 animals were exposed to P .aeruginosa strain PA14 for 24 hours. DESeq2, FDR < 0.05 WBPaper00056868:P.aeruginosa_upregulated_smallRNA
  Transcripts that showed significantly decreased expression in mter-4(syb3662 syb3403) comparing to in N2. DESeq2, fold change > 2, FDR < 0.05. WBPaper00061995:mter-4(syb3662syb3403)_downregulated

12 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
Also expressed in (comments from author) : Note: primers are not unique and produce 2 products.Unidentified cells in head, possibly neural. Strain: BC11180 [ceh-33::gfp] transcriptional fusion. PCR products were amplified using primer A: 5' [CCTCTCAAGTCGCCTAATCC] 3' and primer B 5' [CGGTTGGATATTGCTCGGGTTG] 3'. Expr5233 Adult Expression: unidentified cells in head; Larval Expression: unidentified cells in head;  
    Expr2009878 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
Clone: pUL#JRH/AH11   Expr7431 From late embryo to adult can see expression in anterior head muscles with processes to the nerve ring.  
Original chronogram file: chronogram.1077.xml [C10G8.7:gfp] transcriptional fusion. Chronogram65    
    Expr2028118 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
Original chronogram file: chronogram.1262.xml [C10G8.7:gfp] transcriptional fusion. Chronogram235    
Original chronogram file: chronogram.123.xml [C10G8.7:gfp] transcriptional fusion. Chronogram201    
Original chronogram file: chronogram.1749.xml [C10G8.7:gfp] transcriptional fusion. Chronogram718    
    Expr15582    
    Expr1018373 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1144398 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr16085 Using a fosmid-based reporter in which the ceh-33 locus is tagged with gfp, we found that the CEH-33 protein shows no expression in the nervous system within or outside the pharynx at any developmental stage. The only observed expression was in a subset of head muscle cells.  

11 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  part_of
  located_in
  located_in
  involved_in
  located_in
  enables
  involved_in
  enables
  enables

13 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000454 5323194 5324666 -1

11 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  part_of
  located_in
  located_in
  involved_in
  located_in
  enables
  involved_in
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
1473

1 Sequence Ontology Term