WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000429 Gene Name  ceh-2
Sequence Name  ? C27A12.5 Brief Description  ceh-2 encodes a homeodomain protein homologous to Drosophila empty spiracles (EMS) and the vertebrate EMX1 and EMX2 proteins (OMIM:600034, 600035); although the exact biological role of CEH-2 in C. elegans development and/or behavior is not yet known, expression in pharyngeal neurons and muscle, as well as vulval cells (vulB1, vulB2, and vulC) at late stages of development suggests that CEH-2 may play a role in terminal differentiation events in diverse cell types.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Predicted to be involved in brain development; neuron differentiation; and regulation of transcription by RNA polymerase II. Located in nucleus. Expressed in neurons; pharynx; somatic nervous system; and vulva. Human ortholog(s) of this gene implicated in carcinoma (multiple) and stomach cancer. Is an ortholog of human EMX1 (empty spiracles homeobox 1).
Biotype  SO:0001217 Genetic Position  I :0.702718 ±0.001508
Length (nt)  ? 3664
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000429

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C27A12.5.1 C27A12.5.1 856   I: 6046788-6050451
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C27A12.5 C27A12.5 630   I: 6046950-6047011

3 RNAi Result

WormBase ID
WBRNAi00041295
WBRNAi00003026
WBRNAi00029244

36 Allele

Public Name
gk962858
gk962706
gk963902
gk964505
ch4
gk112261
gk434916
gk594160
WBVar01431800
WBVar01329472
WBVar01329583
tm270
gk112263
gk112264
gk112265
gk112259
gk112260
h7535
gk112262
WBVar00153941
WBVar00153940
WBVar02016331
gk365245
gk684266
gk613541
WBVar02011901
gk441965
gk931326
gk393902
gk935908

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000429 6046788 6050451 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_6046019..6046787   769 I: 6046019-6046787 Caenorhabditis elegans

95 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when no food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_NoFood
  Genes that showed increased expression in wdr-5(ok1417) comparing with in N2. Statistical analysis for misexpression was performed using a moderated t test from the package limma. All genes with a false discovery rate (FDR) of <= 5% (p <= 0.05) were selected as differentially regulated. WBPaper00045861:wdr-5(ok1417)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L1-larva_expressed
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin_upregulated
  Transcripts that showed significantly higher expression in somatic gonad precursor cells (SGP) vs. head mesodermal cells (hmc). DESeq2, fold change >= 2, FDR <= 0.01. WBPaper00056826:SGP_biased
  Transcripts that showed significantly increased expression after animals were treated with 100uM Psora and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Psora-Allantoin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_upregulated
  Genes significantly enriched in NSM neurons (isolated by FACS) versus the reference, according to RNAseq analysis towards total RNA. Gene expression quantification and differential expression was analyzed using cufflinks v2.2.1. Enriched contains only genes significantly enriched (differentially expressed >= 2.4 fold in total RNA or >= 3.2 fold in DSN treated total RNA) in the NSM neurons versus the reference. WBPaper00045974:NSM_enriched_totalRNA_RNAseq
  Transcripts that showed significantly increased expression in rrf-3(pk1426) comparing to in N2 at embryo stage. DESeq2v 1.18.1, fold change > 1.5, adjusted p-value < 0.01. WBPaper00056169:rrf-3(pk1426)_upregulated_embryo
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Transcripts that showed significantly increased expression in wdr-5(ok1417);skn-1(lax188) comparing to in skn-1(lax188) at day 2 adult stage. fold change > 2 WBPaper00058711:wdr-5(ok1417)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_L2-larva_expressed
  Transcripts that showed significantly increased expression in hda-1(ne4752[3xFLAG-Degron-HDA-1]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4752)_upregulated
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated
  Transcripts that showed significantly increased expression in srbc-48(ac23);kyIs262;fer-1(b232ts) comparing to in kyIs262;fer-1(b232ts), 24h after infection with P.aeruginosa. DESeq2, FDR <0.05, fold change > 2. WBPaper00059664:srbc-48(ac23)_upregulated
Bacteria: B.thuringiensis Transcripts in elt-2(RNAi) animals that were significantly differentially expressed at least for one time point and one pathogenic strain Bt247 and Bt679 compared to the non pathogenic strain Bt407. Cuffdiff WBPaper00060358:B.thuringiensis_pathogen_regulated_elt-2(RNAi)
Starvation Transcripts that showed significantly altered expression by starvation with 100 mM salt (NaCl) DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:starvation_regulated_LowSalt
  Genes significantly enriched in NSM neurons (isolated by FACS) versus the reference, according to tiling array analysis towards total RNA. A linear model and moderated t-statistic were used to determine differentially expressed genes as implemented by the limma package (v3.21.4). Enriched list contains only genes significantly enriched in the NSM neurons versus the reference <=1.5X and <= 5% FDR. WBPaper00045974:NSM_enriched_totalRNA_tiling
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
heat-shock hlh-1 Genes enriched in HLH-1 heat shock dataset. A two-class unpaired analysis was performed to identify genes that are elevated 1.7-fold or greater when compared with the reference for each dataset, at a false discovery rate of 1.8% or less for M0 and 1.2% or less for the M24 datasets. WBPaper00031003:hlh_1_enriched
  Transcripts that showed significantly increased expression in spr-1(ok2144) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:spr-1(ok2144)_upregulated
  Transcripts depleted in RIS neurons comparing to in all cells. edgeR 3.24.3, FDR < 0.01 WBPaper00058969:RIS_depleted
Temprature shift to 28C for 48 hours. Transcripts that showed significantly increased expression after animals were exposed to 28C temperature for 48 hours. Differentially expressed genes wereidentified using DESeq (v.1.18.0) by normalizing readsbased on the negative binomial distribution method andcomparing each HS timepoint to the 0-h control. WBPaper00061341:28C_48h_upregulated
  Transcripts that showed significantly increased expression in dpy-7(e88) animals comparing to N2 animals. Authors considered genes differentially expressed if they had a q-value <= 0.05 and a b-value >= 1 or <= -1. WBPaper00053771:up_at_dpy-7(e88)

14 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2009874 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1019828 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
Reporter gene fusion type not specified.   Expr2608 A 10 kb genomic promoter sequence is also expressed in vulval cells, which may not reflect the natural expression pattern of the ceh-2 gene, as that promoter extends into the next gene upstream. Reporter gene expression under the control of the ceh-2 promoter is detected shortly after formation of the pharynx primordium, and is strongest in elongated embryos and early larvae. 1.6 kb sequence upstream from the start codon drives expression only in the neurons (pTRB201); a fourth exon gfp fusion that includes 4.5 kb of upstream sequence and the large intron within the homeodomain is expressed also in e2 and m2 cells (pTRB202).  
    Expr2607 Earliest expression with the antibody was seen in a small cluster in late-gastrulation embryos. From their position, these cells may be the precursors of the pharyngeal cells that express GFP in the L1 larva. ceh-2 expression was found restricted to eleven cells (fourteen nuclei) of five types in the anterior pharynx (corpus) of larvae and adults: the I3 neuron that lies embedded in the dorsal sector of the pharynx muscle; the pairs of NSM and M3 motoneurons in the left and right subventral sectors; the three m2 muscle cells, each possessing two nuclei resulting from cell fusion during development; and the three e2 epithelial cells with the anterior-most pharynx nuclei. Antibody staining is nuclear as expected for a transcription factor.
Clone: pUL#JRH6E3   Expr7448 Expression is very specific to nerve cells in the nerve ring, in the pharynx and, sometimes, the nerve cord, late embryo to adult.  
    Expr1200102 Data from the TransgeneOme project  
    Expr15569    
    Expr15323    
    Expr2028114 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
Original chronogram file: chronogram.1741.xml [C27A12.5:gfp] transcriptional fusion. Chronogram710    
Compelete expression pattern in vulva is described. No detail on expression pattern in other life stages or tissues. Reporter gene fusion type not specified. The ceh-2 promoter fragment that expresses in the vulva also contains the predicted upstream gene C27A12.6. Therefore, the ceh-2 gene may not be the functionally relevant target of the regulatory element responsible for vulval expression, although the gfp fusion point is in ceh-2.   Expr2356 During the L4 stage, ceh-2::gfp was expressed in vulB1 and vulB2. vulB1 expressed GFP consistently in all animals, whereas vulB2 expressed GFP in some animals only. From the late L4 to the adult, the expression was observed in vulC. The expression in vulB cells was variable in the adult.  
    Expr1145336 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1170046 Time-lapse fluorescence microscopy was performed, including DIC for morphology. Gene expression patterns were summarized in 4 manners: Average over time, Average over time and at different positions along the anterior-posterior (AP) axis, a voxelized representation over time, and on individual cells overlaid from a reference coordinate dataset (https://doi.org/10.1016/j.ydbio.2009.06.014). The analysis was done with a pipeline based on the multi-purpose image analysis software Endrov (https://doi.org/10.1038/nmeth.2478), which further is needed to browse the raw recording data. Thumbnail movies were also generated, using maximum Z projection for the 3D fluorescence channel. Raw recordings available in the Endrov OST-file format are available at https://www.ebi.ac.uk/biostudies/studies/S-BIAD191?query=S-BIAD191  
    Expr12300 Variable expression in all substages; low level of expression in B2, C from L4.1 to L4.7.  

12 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  enables
  located_in
  located_in

13 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000429 6046788 6050451 -1

12 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  enables
  located_in
  located_in

0 Regulates Expr Cluster

1 Sequence

Length
3664

1 Sequence Ontology Term