WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000453 Gene Name  ceh-32
Sequence Name  ? W05E10.3 Brief Description  ceh-32 encodes a Six/sine oculis-type homeodomain protein most closely related to the Six3/6 subfamily that contains Drosophila OPTIX and human SIX3 (OMIM:603714, which when mutated leads to holoprosencephaly 2); CEH-32 appears to be essential for development and required for proper head morphogenesis; during embryogenesis, CEH-32 is expressed in hypodermal and neuronal precursors, and at later stages, in the descendants of these cells and in gonadal sheath cells; in some hypodermal cells, ceh-32 is a direct transcriptional target of VAB-3, a PAX-6 ortholog sufficient to induce ceh-32 expression in some cell types and able to bind the ceh-32 promoter in vitro.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Involved in neuron differentiation and post-embryonic animal morphogenesis. Located in nucleus. Expressed in several structures, including anterior hypodermis; gonadal sheath cell; head ganglion; pharyngeal neurons; and somatic nervous system. Human ortholog(s) of this gene implicated in several diseases, including holoprosencephaly 2; optic disc anomalies with retinal and/or macular dystrophy; and renal Wilms' tumor. Is an ortholog of human SIX3 (SIX homeobox 3) and SIX6 (SIX homeobox 6).
Biotype  SO:0001217 Genetic Position  V :3.26969 ±0.005764
Length (nt)  ? 3637
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000453

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:W05E10.3.1 W05E10.3.1 1862   V: 11701764-11705400
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:W05E10.3 W05E10.3 1320   V: 11702279-11702855

29 RNAi Result

WormBase ID
WBRNAi00064704
WBRNAi00064795
WBRNAi00064819
WBRNAi00054821
WBRNAi00009254
WBRNAi00026496
WBRNAi00114715
WBRNAi00036302
WBRNAi00101783
WBRNAi00068322
WBRNAi00068323
WBRNAi00068324
WBRNAi00068325
WBRNAi00068832
WBRNAi00068833
WBRNAi00066132
WBRNAi00073487
WBRNAi00073486
WBRNAi00073490
WBRNAi00001702
WBRNAi00072691
WBRNAi00072690
WBRNAi00072692
WBRNAi00073489
WBRNAi00073488
WBRNAi00064577
WBRNAi00064676
WBRNAi00106608
WBRNAi00093989

62 Allele

Public Name
gk963271
gk963301
gk964458
gk964459
gk964451
gk964452
gk963618
WBVar02061135
gk964054
gk964055
gk827037
WBVar01866200
gk915303
gk248022
tm245
WBVar01866197
WBVar01866196
WBVar01866199
WBVar01866198
gk804929
gk463066
gk660090
gk511308
gk931134
gk451563
gk655040
gk342440
gk330279
gk631256
gk334767

1 Chromosome

WormBase ID Organism Length (nt)
V Caenorhabditis elegans 20924180  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000453 11701764 11705400 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrV_11701143..11701763   621 V: 11701143-11701763 Caenorhabditis elegans

218 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
  Neuronally enriched transcripts according to a comparison of neuronal nuclei IP samples to total nuclei using isolation of nuclei from tagged specific cell types (INTACT) technology. DESEQ2, fold change > 2 and FDR < 0.01. WBPaper00062103:neuron_enriched
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L1-larva_expressed
  Transcripts that showed significantly increased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_upregulated
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva daf-16(mu86);glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_daf-16(mu86);glp-1(e2141)
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_fem-3(q20)
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_glp-1(e2141)
  Transcripts that showed significantly increased expression in day 3 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_upregulated_fem-3(q20)
  Transcripts that showed significantly increased expression after four-day-old young adult worms were placed on NGM plates seeded with OP50 in the presence 5% Agaro-oligosaccharides(AGO) for 24 h, comparing to animals grown in the absence of AGO. Fold change > 2. WBPaper00064306:Agaro-oligosaccharides_upregulated
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
  Single-cell RNA-Seq cell group 21_1 with unidentified tissue expression pattern. scVI 0.6.0 WBPaper00065841:21_1
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
  Genes that showed significantly increased expression in daf-2(e1370);hel-1(gk148684) comparing to in hel-1(gk148684) To identify DEGs, Students t test and the log2 median ratio test were performed to compute t values and median ratios for all the annotated genes. The adjusted P values from each test were computed using an empirical distribution of the null hypothesis, which was obtained from random permutations of the samples. Finally, the adjusted P values from the individual tests were combined to compute the overall P values using Stouffers method , and genes with overall P < 0.05 and fold change > 1.5 were selected as DEGs. WBPaper00047131:daf-2(e1370)_upregulated_hel-1(gk148684)-background
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Transcripts depleted in purified oocyte P bodies comparing to in the whole animal. DESeq2, FDR < 0.05, fold change > 2. WBPaper00065975:P-body_vs_WholeAnimal_depleted
  Transcripts that showed significantly increased expression after exposure to 75uM paraquat(PQ) from L1 to day 2 adult stage in skn-1(lax188) animals fold change > 2 WBPaper00058711:paraquat_upregulated
  Transcripts that showed significantly increased expression in 10-days post L4 adult hermaphrodite N2 grown at 20C, comparing to in 1-day post L4 adult hermaphrodite N2 animals grown at 20C. CuffDiff, fold change > 2. WBPaper00065096:Day10_vs_Day1_upregulated
  Transcripts that showed significantly decreased expression in 10-days post L4 adult hermaphrodite npr-8(ok1439) animals grown at 20C, comparing to in N2 animals. CuffDiff, fold change > 2. WBPaper00065096:npr-8(ok1439)_downregulated_Day10_20C
Gamma irradiation 100 mGY per hour for 72 hours since L1 larva. Transcripts that showed significantly increased expression after exposure to 100mGy per hour gamma irradiation from L1 to day 1 adult hermaphrodite stage. DESeq2, FDR <= 0.05, log2 fold change >= 0.3 or <= -0.3. WBPaper00058958:100mGy-irradiation-72h_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L2-larva_expressed
  Genes which expression is changed in isp-1;ctb-1 mutant and is not affected by developmental klf-1 RNAi, but is brought to wild type levels by klf-1 RNAi in adulthood. N.A. WBPaper00059194:klf-1(RNAi)_regulated
  Transcripts that showed significantly increased expression in hda-1(ne4752[3xFLAG-Degron-HDA-1]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4752)_upregulated

14 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2009877 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1030259 Tiling arrays expression graphs  
Clone: pUL#IAH10A11   Expr7699 Expression was observed in the embryo from early to late stages, with complex and dynamic distribution. Expression was strong in 1 or 2 nerve cells in the head nerve ring, with processes running round the nerve ring and down the length of the nerve cord from L1 to adult. There was also weak background expression in the anterior and posterior intestine. All five independent lines examined gave the same expression apart from one which also showed some additional nerve cell expression. This line may be an integrated line and so this could be spurious.  
    Expr15581    
    Expr1632 Although the antibody and reporter patterns coincide in a number of key aspects, a few differences are evident. Expression in the head hypodermal cells upon hatching is only faintly detected with the gfp reporter construct. Moreover, only a few neurons in the head express the ceh-32::gfp construct, compared with the immunostaining results. These differences could reflect that positive control elements are missing or inactive in the reporter transgenes. The expression of ceh-32 begins during the embryonic development, during the gastrulation stage (in 100-min-old embryos), and persists until adults. Embryonic expression of ceh-32 is detected in the anterior part of the embryos in head hypodermal and neuronal precursor cells. Upon hatching, ceh-32 is expressed in the hypodermal and neuronal cells of the head as well as in the somatic gonad. The head hypodermal nuclei expressing CEH-32 were identified as the nuclei of hyp3, hyp4, hyp5, and the first ventral nuclei of hyp6. In the head neurons, ceh-32 is found expressed in 12 cells of the anterior ganglion, in the sensory neurons ADL, in a pair of neurons in the dorsal side of lateral and ventral ganglion, and in 8 neurons in the ventral side of the lateral and ventral ganglion. Some weakly expressing cells were not identified. In the somatic gonad, the expression of ceh-32 begins at the L1/L2 stage in the sheath/spermatheca (SS) precursor cells, and this expression is maintained during the development of the somatic gonad in the gonadal sheath cells. In the adult worm, all the gonadal sheath cells express ceh-32. The CEH-32 protein is detected exclusively in the nuclei.
    Expr10237 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr10238 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr2028117 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1028991 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr14289 A Pceh-32::GFP reporter was not expressed in BAG neurons.  
    Expr1158362 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr10239 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr1170050 Time-lapse fluorescence microscopy was performed, including DIC for morphology. Gene expression patterns were summarized in 4 manners: Average over time, Average over time and at different positions along the anterior-posterior (AP) axis, a voxelized representation over time, and on individual cells overlaid from a reference coordinate dataset (https://doi.org/10.1016/j.ydbio.2009.06.014). The analysis was done with a pipeline based on the multi-purpose image analysis software Endrov (https://doi.org/10.1038/nmeth.2478), which further is needed to browse the raw recording data. Thumbnail movies were also generated, using maximum Z projection for the 3D fluorescence channel. Raw recordings available in the Endrov OST-file format are available at https://www.ebi.ac.uk/biostudies/studies/S-BIAD191?query=S-BIAD191  
    Expr10240 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  

15 GO Annotation

Annotation Extension Qualifier
  located_in
  part_of
  located_in
  located_in
occurs_in(WBbt:0006833) involved_in
  located_in
  enables
  involved_in
  involved_in
  enables
  involved_in
  enables
  enables
  enables
  enables

12 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000453 11701764 11705400 -1

15 Ontology Annotations

Annotation Extension Qualifier
  located_in
  part_of
  located_in
  located_in
occurs_in(WBbt:0006833) involved_in
  located_in
  enables
  involved_in
  involved_in
  enables
  involved_in
  enables
  enables
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
3637

1 Sequence Ontology Term

Identifier Name Description
gene  

4 Strains

WormBase ID
WBStrain00027400
WBStrain00037713
WBStrain00049274
WBStrain00054992

0 Upstream Intergenic Region