WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000444 Gene Name  ceh-21
Sequence Name  ? T26C11.6 Brief Description  ceh-21 encodes a a ONECUT class CUT homeobox protein with a single N-terminal cut domain and an OCAM domain; the cut domain may be a compact DNA-binding domain composed of alpha helices; the OCAM domain is a nematode-specific motif conserved between CEH-21, CEH-41, and T02B5.2; ceh-21 is one of three nematode-specific ONECUT genes in a cluster with ceh-39 and ceh-41; CEH-21 may be required for muscle formation and differentiation, and is expressed in muscle precursor cells and differentiated gut cells; ceh-21 has no obvious function in mass RNAi assays.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Predicted to be involved in regulation of transcription by RNA polymerase II. Predicted to be located in nucleus. Expressed widely.
Biotype  SO:0001217 Genetic Position  X :-17.4835 ±0.002838
Length (nt)  ? 3958
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000444

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:T26C11.6.1 T26C11.6.1 2586   X: 1848779-1852736
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:T26C11.6 T26C11.6 1488   X: 1850051-1850369

4 RNAi Result

WormBase ID
WBRNAi00054185
WBRNAi00019249
WBRNAi00019250
WBRNAi00092894

101 Allele

Public Name
gk963725
gk963864
gk964444
gk964445
tm10733
gk963977
WBVar02120739
WBVar01757481
WBVar01600815
WBVar01600816
WBVar01600814
WBVar01600819
WBVar01600817
WBVar01600818
WBVar01600820
WBVar01600821
WBVar00075531
WBVar00075532
WBVar00075533
WBVar00075534
WBVar00075530
tm261
tm266
tm453
tm787
WBVar01979497
WBVar01979495
WBVar01824727
WBVar01824728
WBVar01824729

1 Chromosome

WormBase ID Organism Length (nt)
X Caenorhabditis elegans 17718942  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000444 1848779 1852736 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrX_1848676..1848778   103 X: 1848676-1848778 Caenorhabditis elegans

161 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L1-larva_expressed
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in N2 after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_N2
  Genes significantly enriched in NSM neurons (isolated by FACS) versus the reference, according to RNAseq analysis towards total RNA. Gene expression quantification and differential expression was analyzed using cufflinks v2.2.1. Enriched contains only genes significantly enriched (differentially expressed >= 2.4 fold in total RNA or >= 3.2 fold in DSN treated total RNA) in the NSM neurons versus the reference. WBPaper00045974:NSM_enriched_totalRNA_RNAseq
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
Bacteria diet: Escherichia coli HB101. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria E. coli HB101 for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:HB101_downregulated
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_6h
  Transcripts that showed significantly increased expression in mrg-1(qa6200) comparing to in control animals in primordial germ cells (PGCs) at L1 larva stage. DESeq2(v1.32.0), FDR < 0.05. WBPaper00064315:mrg-1(qa6200)_upregulated_PGCs
  Transcripts that showed significantly increased expression in hrde-1(tm1200) animals, comparing to in N2, after growing at 25C for five generations (late generation). CuffDiff2 WBPaper00051265:F4_hrde-1(tm1200)_upregulated
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
  Transcripts that showed significantly increased expression in animals exposed to 400uM tamoxifen from L1 to L4 larva stage. DEseq2, fold change > 2 WBPaper00064505:tamoxifen_upregulated
  Transcripts that showed significantly increased expression after exposure to 75uM paraquat(PQ) from L1 to day 2 adult stage in skn-1(lax188) animals fold change > 2 WBPaper00058711:paraquat_upregulated
25C vs. 20C Transcripts that showed significantly increased expression in 1-day post L4 adult hermaphrodite N2 grown at 25C, comparing to in N2 animals grown at 20C. CuffDiff, fold change > 2. WBPaper00065096:25C_vs_20C_upregulated
  Transcripts that showed significantly increased expression in 10-days post L4 adult hermaphrodite N2 grown at 20C, comparing to in 1-day post L4 adult hermaphrodite N2 animals grown at 20C. CuffDiff, fold change > 2. WBPaper00065096:Day10_vs_Day1_upregulated
Gamma irradiation 100 mGY per hour for 72 hours since L1 larva. Transcripts that showed significantly increased expression after exposure to 100mGy per hour gamma irradiation from L1 to day 1 adult hermaphrodite stage. DESeq2, FDR <= 0.05, log2 fold change >= 0.3 or <= -0.3. WBPaper00058958:100mGy-irradiation-72h_upregulated
  Transcripts that showed significantly decreased expression in tetraploid N2 comparing to diploid N2 animals at L4 larva stage. DESeq2 R package (1.20.0), fold change > 2, and FDR < 0.05. WBPaper00066110:tetraploid_vs_diploid_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_L2-larva_expressed
  Transcripts that were regulated by both set-6(ok2195) and baz-2(tm0235) at 2-day post L4 adult hermaphrodite stage. N.A. WBPaper00059356:set-6(ok2195)_baz-2(tm0235)_regulated
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 0hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:germline-precursors_blastula-embryo_expressed

10 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2009868 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1019569 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1030252 Tiling arrays expression graphs  
    Expr15571    
Clone: pUL#JRH10A7   Expr7684 Lines all very mosaic but appears to be expression in all cells except germline from early embryo to adult. Some tissues express stronger: spermatheca, excretory cell, pharynx, intestine.  
    Expr10229 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr10230 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr2028108 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1157727 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1200003 Data from the TransgeneOme project  

9 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  enables
  involved_in
  enables
  enables
  enables
  located_in
  located_in

11 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000444 1848779 1852736 -1

9 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  enables
  involved_in
  enables
  enables
  enables
  located_in
  located_in

0 Regulates Expr Cluster

1 Sequence

Length
3958

1 Sequence Ontology Term