WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000428 Gene Name  ceh-1
Sequence Name  ? F16H11.4 Brief Description  ceh-1 encodes a protein that contains a homeobox domain.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Predicted to be involved in cell differentiation and regulation of transcription by RNA polymerase II. Predicted to be located in nucleus. Expressed in DVC; lateral ganglion; and seam cell. Is an ortholog of human NKX1-1 (NK1 homeobox 1) and NKX1-2 (NK1 homeobox 2).
Biotype  SO:0001217 Genetic Position  X :-7.05299 ±0.059955
Length (nt)  ? 991
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000428

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:F16H11.4.1 F16H11.4.1 843   X: 4653167-4654157
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:F16H11.4 F16H11.4 516   X: 4653492-4653608

1 RNAi Result

WormBase ID
WBRNAi00044827

23 Allele

Public Name
gk964260
gk962667
WBVar02063535
WBVar02063534
tm455
WBVar01654822
gkDf38
WBVar01878203
gk646941
gk643033
gk921910
gk639524
gk597961
gk834617
gk687762
gk580290
gk659430
gk278618
gk932316
WBVar01468625
WBVar02050507
WBVar01878204
gk278619

1 Chromosome

WormBase ID Organism Length (nt)
X Caenorhabditis elegans 17718942  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000428 4653167 4654157 -1

3 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrX_4651744..4653166   1423 X: 4651744-4653166 Caenorhabditis elegans

47 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Transcripts that showed significantly increased expression glp-1(e2141); TU3401 animals comparing to in TU3401 animals. Fold change > 2, FDR < 0.01. WBPaper00065993:glp-1(e2141)_upregulated
  Significantly upregulated genes from clk-1(qm30) microarrays using SAM algorithm with an FDR < 0.1 from adult-only chips. SAM algorithm with an FDR < 0.1. WBPaper00033065:clk-1(qm30)_upregulated
  Genes with increased RNA expression after 24 hours rotenone treatment EdgeR provides statistical routines for determining differential expression in digital gene expression data using a model based on the negative binomial distribution. The resulting p-values were adjusted using the Benjamini and Hochbergs approach for controlling the false discovery rate (FDR). Transcripts with an adjusted p-value smaller 0.05 were assigned as differentially expressed. WBPaper00044426:rotenone_24h_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_L2-larva_expressed
  Transcripts that showed significantly increased expression in ilc-17.1(syb5296) comparing to in N2 animals at L4 larva stage. DESeq2, fold change > 2, FDR < 0.05. WBPaper00066594:ilc-17.1(syb5296)_upregulated
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated
  Transcripts that showed significantly decreased expression in eat-2(ad1116) comparing to in N2 at 3-days post L4 adult hermaphrodite animals. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:eat-2(ad1116)_downregulated
Bacteria: B.thuringiensis Transcripts in elt-2(RNAi) animals that were significantly differentially expressed at least for one time point and one pathogenic strain Bt247 and Bt679 compared to the non pathogenic strain Bt407. Cuffdiff WBPaper00060358:B.thuringiensis_pathogen_regulated_elt-2(RNAi)
  Transcripts that showed significantly increased expression in animals lacking P granules by RNAi experiments targeting pgl-1, pgl-3, glh-1 and glh-4, and unc-119-GFP(+), comparing to in control animals, at 2-day post L4 adult hermaphrodite stage. DESeq2, Benjamini-Hochberg multiple hypothesis corrected p-value < 0.05 and fold change > 2. WBPaper00050859:upregulated_P-granule(-)GFP(+)_vs_control_day2-adult
  Transcripts that showed significantly increased expression in animals lacking P granules by RNAi experiments targeting pgl-1, pgl-3, glh-1 and glh-4, and unc-119-GFP(-), comparing to in control animals, at 2-day post L4 adult hermaphrodite stage. DESeq2, Benjamini-Hochberg multiple hypothesis corrected p-value < 0.05 and fold change > 2. WBPaper00050859:upregulated_P-granule(-)GFP(-)_vs_control_day2-adult
  Transcripts of coding genes that showed significantly increased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_enriched_coding-RNA
  Transcripts that showed significantly increased expression in set-2(zr2012) animals at embryo stage, comparing to in N2 animals. DESeq2 (v2.1.8.3) was used to determine DE genes and to generate principal component and scatter plots. DE genes with FDR < 0.05 were analysed using g:Profiler with Bonferroni correction. WBPaper00060014:set-2(zr2012)_upregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
heat-shock hlh-1 Genes enriched in HLH-1 heat shock dataset. A two-class unpaired analysis was performed to identify genes that are elevated 1.7-fold or greater when compared with the reference for each dataset, at a false discovery rate of 1.8% or less for M0 and 1.2% or less for the M24 datasets. WBPaper00031003:hlh_1_enriched
  WT-Pico Pan-neural Depleted Genes, with genes found multiple times in a single dataset removed (without dups). To identify differentially expressed transcripts, normalized intensity values from the pan-neural data sets were compared to a reference (from all larval cells) using Significance Analysis of Microarray software (SAM). A two class unpaired analysis of the data was performed to identify neuron-enriched genes. Pan-neural enriched transcripts in the IVT and WT-Pico-derived data set were defined as 1.5X elevated vs the reference at a False Discovery Rate (FDR) = 3%. WBPaper00031532:Larva_Pan_Neuronal_Depleted
  Expression Pattern Group D, enriched for genes involved in catabolic processes. The significance (P 0.0001) of the relative age (time) was used to determine if a gene was differentially expressed between the three age (time) groups. The effect of this factor explaining gene expression differences was used to determine if the expression went up or down during the two age/time periods (t1 - t2 and t2 -t3). Authors used a permutation approach to determine the thresholds for the different mapping strategies. For each of the used models for eQTL mapping, authors used 23,000 permutations. For each permutation, authors randomly picked a spot; each spot could only be picked once. The gene expression and relative lifespan values were than randomly distributed over the RILs (and time points) and used for mapping. In this way, authors obtained a threshold for each of the explaining factors. For the single time points, authors used a FDR of 0.01 to adjust for multiple testing. The genome-wide threshold for this FDR is -log10 P = 3.8 for each of the three time points. For the combined models (t1 to t2 and t2 to t3), authors used a genome-wide threshold of -log10 P = 4, which resembles an FDR of 0.006, 0.001, and 0.006 for marker, age, and the interaction between marker and age, respectively. To determine the threshold for the single gene examples, authors used 1000 permutations as in the genome-wide threshold. The difference is that they use the gene under study in all of the permutations. The P-values for the gene specific thresholds were determined at FDR = 0.05. WBPaper00036286:Pattern_D
  Transcripts that showed significantly increased expression in jmjd-3.1p::jmjd-3.1 comparing to in N2. DESeq2 Benjamini-Hochberg adjusted p-value < 0.05. WBPaper00049545:jmjd-3.1(+)_upregulated
  Transcripts that showed significantly increased expression in rgef-1p::jmjd-1.2 comparing to in N2. DESeq2 Benjamini-Hochberg adjusted p-value < 0.05. WBPaper00049545:rgef-1p-jmjd-1.2(+)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L2-larva_expressed
  Significantly downregulated genes from cyc-1(RNAi) microarrays using SAM algorithm with an FDR < 0.1 from adult-only chips. SAM algorithm with an FDR < 0.1. WBPaper00033065:cyc-1(RNAi)_downregulated
  Genes from N2 animals with significantly increased expression after 72 hours of treatment on growth media with 10uM rapamycin in 2% DMSO. Analysis of gene expression data was carried out with the Affymetrix Transcriptome Analysis Console. Data preprocessing (using RMA normalization) and QC metrics were performed using Affymetrix Expression Console TM and manually inspected afterwards. Expression analysis was carried out for each two pairwise conditions. FDR statistical correction for multiple testing resulted in a slightly lower number of DEGs in most cases. P-value < 0.05 and fold change > 2.0 were used to determine differentially expressed genes. WBPaper00048989:N2_rapamycin_upregulated
  Genes that showed significant differential expressed between control and 150 mg\/L Atrazine treatment. t-test, p < 0.05. WBPaper00036123:Atrazine_regulated
  Genes expressed in embryonic motor neurons (identified by unc-4::GFP expressing cells). Genes called Present by MAS 5.0 in 2 out of 3 unc-4::GFP hybridizations. WBPaper00025141:unc-4::GFP_Expressed_Genes
  Transcripts that showed significantly decreased expression after animals were treated with 100uM Rapamycin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin_downregulated
  Genes that showed flat mRNA expression level throughout the 16 hour time courses from L3 larva to young adult. The following three lines of R code were used to perform the classification: increasing <-2*amplitude-PC1 < -1.7; oscillating <-!increasing & (amplitude > 0.55); flat <-!increasing & !oscillating; Note that the amplitude of a sinusoidal wave corresponds to only half the fold change between trough and peak. WBPaper00044736:flat_dev_expression
  Down-regulated genes (fold change > 1.5) in two CoQ-deficient clk-1 mutant strains (e2519, qm30) compared to wild types N2. Fold-changes of intensities were calculated from the arithmetic mean of gene expression values between experimental and corresponding control group. Fold change >= 1.5 was used as cut-off. WBPaper00045774:clk-1_downregulated
  Transcripts that showed significantly decreased expression in ash-2(RNAi) comparing to in wild type animals injected with vector. Differential mRNA expression using DESeq2. mRNAs with a FDR < 0.05 and fold change > 2 considered differentially expressed. WBPaper00066232:ash-2(RNAi)_downregulated_WT
  Transcripts that showed significantly increased expression in hlh-26(ok1453) animals exposed to E. faecium for 8 hours, comparing to N2 animals exposed to E. faecium for 8 hours. Fold change > 2. WBPaper00062585:hlh-26(ok1453)_upregulated_E.faecium
  Genes that showed differential expression in Day 1, 10 and 20 adult stages. Those genes showing statistically significant differences (FDR adjusted P < 0.05) by DESeq and edgeR at least between two time points or by the baySeq test over the three time points were regarded as DEGs (n = 3,608). WBPaper00048923:Aging_regulated

9 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2009866 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
Clone: pUL#IAH10E5   Expr7508 Quite strong, seam cell specific expression from mid-larval stages (strongest) to adult (weaker) in 7 of 8 lines examined. (No expression seen in eighth line.)  
    Expr15561    
    Expr15321    
    Expr2028106 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1148792 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1200023 Data from the TransgeneOme project  
    Expr1015000 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1170087 Time-lapse fluorescence microscopy was performed, including DIC for morphology. Gene expression patterns were summarized in 4 manners: Average over time, Average over time and at different positions along the anterior-posterior (AP) axis, a voxelized representation over time, and on individual cells overlaid from a reference coordinate dataset (https://doi.org/10.1016/j.ydbio.2009.06.014). The analysis was done with a pipeline based on the multi-purpose image analysis software Endrov (https://doi.org/10.1038/nmeth.2478), which further is needed to browse the raw recording data. Thumbnail movies were also generated, using maximum Z projection for the 3D fluorescence channel. Raw recordings available in the Endrov OST-file format are available at https://www.ebi.ac.uk/biostudies/studies/S-BIAD191?query=S-BIAD191  

10 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  located_in
  located_in
  involved_in
  located_in
  involved_in
  involved_in

0 Homologues

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000428 4653167 4654157 -1

10 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  located_in
  located_in
  involved_in
  located_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
991

1 Sequence Ontology Term