WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID: WBGene00016881
Gene Name: C52E2.2
Sequence Name: C52E2.2
Organism: Caenorhabditis elegans
Automated Description: Enriched in MC neuron, MCL, MCR, cephalic sheath cell, and dopaminergic neurons based on tiling array and single-cell RNA-seq studies. Is affected by several genes including mut-2, rsr-2, and rrf-3 based on tiling array, microarray, and RNA-seq studies. Is affected by Tunicamycin, Diazinon, and Sirolimus based on microarray studies.
Biotype: SO:0001217
Genetic Position:
Length (nt): 290
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00016881

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C52E2.2.1 C52E2.2.1 236   II: 1852803-1853092
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C52E2.2 C52E2.2 213   II: 1852808-1852885

3 RNAi Result

WormBase ID
WBRNAi00043037
WBRNAi00012320
WBRNAi00030085

37 Allele

Public Name
WBVar02124326
WBVar02123182
WBVar02121537
gk964317
gk963801
WBVar02122524
WBVar02120604
WBVar02121124
WBVar02124543
WBVar02120776
WBVar02121946
WBVar02122773
WBVar02123007
WBVar02122097
WBVar02122433
WBVar02079806
WBVar00091181
WBVar00091057
WBVar02079843
WBVar02077026
WBVar02079710
WBVar02079709
WBVar01412761
WBVar02079707
WBVar02075086
gk962727
gk134756
WBVar01763220
gk637125
gk867827

1 Chromosome

WormBase ID Organism Length (nt)
II Caenorhabditis elegans 15279421  

1 Chromosome Location


Feature . Primary Identifier Start End Strand
WBGene00016881 1852803 1853092 1

2 Data Sets

Name URL
WormBaseAcedbConverter  
C. elegans genomic annotations (GFF3 Gene)  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrII_1853093..1854393   1301 II: 1853093-1854393 Caenorhabditis elegans
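
The lengths listed above follow from the start and end coordinates if the intervals are read as 1-based and inclusive. That reading is an assumption, but it is consistent with both the gene (290 nt) and the downstream intergenic region (1301 nt). A minimal sketch of the arithmetic, using a hypothetical helper `span_length`:

```python
def span_length(start: int, end: int) -> int:
    """Length in nt of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the tables above (chromosome II).
gene_length = span_length(1852803, 1853092)        # WBGene00016881
intergenic_length = span_length(1853093, 1854393)  # downstream intergenic region

print(gene_length, intergenic_length)
```

The intergenic region starts at 1853093, one base after the gene ends at 1853092, so the two intervals are adjacent and do not overlap.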

38 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer larva in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L1-larva_expressed
  Transcripts that showed significantly increased expression in FACS-isolated neuronal cells from ogt-1(ok1474) compared to FACS-isolated neuronal cells from wild type. DESeq2, fold change > 2, FDR < 0.05. WBPaper00066485:ogt-1(ok1474)_upregulated_neuron
  Coexpression clique No. 60, 176662_at-Y53F4B.16, on the genome-wide coexpression clique map for the nematode GPL200 platform. All available microarray datasets for the GPL200 platform (Affymetrix C. elegans Genome Array) were obtained from the GEO repository. This included 2243 individual microarray experiments. These were normalized against each other with the software RMAexpress (Bolstad, 2014). Based on these normalized values, Pearson's correlation coefficients were obtained for each probe-probe pair of the 22,620 probes represented on this array type. The resulting list of correlation coefficients was then ranked to generate the ranked coexpression database with information on each probe represented on the GPL200 platform. WBPaper00061527:176662_at-Y53F4B.16
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L3-L4-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:glr-1(+)-neurons_L2-larva_expressed
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  Transcripts downregulated in hpl-2(tm1489) embryos compared to N2 in tiling array analysis. Oligos from the tiling array were mapped to chromosome coordinates of the exons from WormBase WS180. Any oligo that mapped to a gene on both the Watson and Crick strands was excluded. The remaining oligos were then grouped together (perfect match and mismatch) into probe sets and written out into an Affymetrix CDF file. The CDF file was converted into an R-package and loaded into R. The expression values were calculated using the justRMA function from Bioconductor. This used a Benjamini and Hochberg false discovery rate correction. WBPaper00040560:hpl-2_embryo_downregulated
  Genes uniquely expressed in endoderm, according to RNAseq studies on blastomere (with isolated AB, MS, E, C, D founder cells dividing in vitro) time course and whole embryo time course. Germ layers were assigned by correlating the average expression with germ-layer-specific patterns with a cutoff of 0.6 correlation with the following idealized vectors: endoderm = [00100]; ectoderm = [10000]; mesoderm = [01011], where the order is AB, MS, E, C and P3. Germ-layer genes were defined according to the sum of the genes identified by the clusters and are indicated in Fig. 2b. Authors further filtered the germ-layer gene sets by keeping only those genes whose expression was partitioned across the germ layers such that at least two-thirds of the expression was in that germ layer. WBPaper00046121:endoderm_unique
  Embryonic class (E): genes that significantly increase in abundance at some point during embryogenesis. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_E
hypergravitational 15xg force for 4 days via centrifugation. Transcripts that showed significantly decreased expression in animals grown under hypergravitational 15xg force for 4 days via centrifugation. N.A. WBPaper00061274:hypergravity_15xg_downregulated
  Strictly embryonic class (SE): genes that are the subset of embryonic genes that are not also classified as maternal. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SE
  Transcripts that showed differential expression between 24 and 26 hours post hatching L2d and dauer committed larvae of daf-9(dh6), triggered by the dafachronic acid (DA) growth hormone. Cluster 2 genes' expression gradually increased into dauer. Benjamini Hochberg corrected q-value < 0.01. WBPaper00053388:dauer_regulated_Cluster2
  Genes that showed expression levels higher than the corresponding reference sample (Young adult all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:CEP-sheath-cells_Day1-adult_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:intestine_L1-larva_expressed
  Transcripts that showed differential expression in dauer mir-34(gk437) vs dauer mir-34(OverExpression) animals at 20C. N.A. WBPaper00050488:mir-34(gk437)_vs_mir-34(OverExpression)_regulated_dauer_20C
  Genes with expression level regulated by genotype (N2 vs CB4856) at the old adult stage (214 hours at 24 centigrade). Authors permuted transcript values and used a genome-wide threshold of log10 P-value = 2, which resembles a false discovery rate (FDR) of 0.0136. WBPaper00040858:eQTL_regulated_old
  Genes from eat-2(ad465) animals with significantly increased expression after 72 hours of treatment on growth media with 10uM rapamycin in 2% DMSO. Analysis of gene expression data was carried out with the Affymetrix Transcriptome Analysis Console. Data preprocessing (using RMA normalization) and QC metrics were performed using Affymetrix Expression Console TM and manually inspected afterwards. Expression analysis was carried out for each two pairwise conditions. FDR statistical correction for multiple testing resulted in a slightly lower number of DEGs in most cases. P-value < 0.05 and fold change > 2.0 were used to determine differentially expressed genes. WBPaper00048989:eat-2(ad465)_rapamycin_upregulated
  Transcripts that showed significantly decreased expression in mex-1(or286) compared to N2 in early embryos, when there were only 3-5 eggs in the adult. RPKM fold change > 2. WBPaper00058598:mex-1(or286)_downregulated
  Top 300 transcripts enriched in MC neuron, MCL, MCR according to single cell RNAseq. Top 300 enriched transcripts were determined by log2.ratio of the tpm in the cell type vs the tpm in the other cells * the log2 of the cell.type tpm. WBPaper00061340:MC
  Genes in the bottom 10% of expression level across the triplicate L3 samples. To generate the top10 and bottom10 gene sets, authors ranked all genes by mean expression array signal intensity across the three replicates, then took the top and bottom deciles (1,841 genes each) to represent genes with high and low expression. WBPaper00032528:L3_depleted
  Genes that showed significantly increased expression level in rsr-2(RNAi) animals compared to gfp(RNAi) control. Fold change > 1.2 or < 0.8. WBPaper00042477:rsr-2(RNAi)_upregulated_TilingArray
  Genes predicted to be downregulated more than 2.0 fold in rde-3(ne298) mutant worms as compared to wild-type animals (t-test P-value < 0.05). A t-test (5% confidence) was applied to the triplicate sample data for each transcript in each mutant to identify genes significantly elevated or decreased compared with the wild type. WBPaper00027111:rde-3(ne298)_downregulated
  Transcripts that showed significantly increased expression in animals treated with 25ug per mL tunicamycin for 4 hours, compared to control animals. Fold-change > 1.5, ANOVA P values < 0.05. WBPaper00055482:Tunicamycin_upregulated
  Transcripts that showed significantly decreased expression in mex-3(eu149) compared to N2 in early embryos, when there were only 3-5 eggs in the adult. RPKM fold change > 2. WBPaper00058598:mex-3(eu149)_downregulated
  Genome-wide analysis of developmental and sex-regulated gene expression profile. self-organizing map cgc4489_group_5
  Transcripts that showed significantly increased expression in adr-1(tm668) compared to N2. DESeq2, p-value < 0.05 and a fold enrichment log2fold > 0.5. WBPaper00055226:adr-1(tm668)_upregulated
  Down-regulated genes under 1 mg/l DZN treatment at 16 centigrade. The Rank Product package was used to identify the differentially expressed genes between controls and treatment in each experiment. Briefly, genes were ranked based on up- or downregulation by the treatment in each experiment. Then, for each gene a combined probability was calculated as a rank product (RP). The RP values were used to rank the genes based on how likely it was to observe them by chance at that particular position on the list of differentially expressed genes. The RP can be interpreted as a p-value. To determine significance levels, the RP method uses a permutation-based estimation procedure to transform the p-value into an e-value that addresses the multiple testing problem derived from testing many genes simultaneously. Genes with a percentage of false-positives (PFP) < 0.05 were considered differentially expressed between treatments and control in each experiment. This method has the advantage of identifying genes with a response to the toxicants even when the absolute effect of the response was low. Because authors used sub-lethal concentrations of the toxicants, methods that use thresholds based on absolute fold change would not identify small changes in gene expression. Moreover, RP has proved to be a robust method for comparing microarray data from different sources and experiments. WBPaper00037113:DZN_16C_down-regulated
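
Several cluster descriptions above call a gene enriched when its Benjamini-Hochberg FDR is <= 0.05 and its fold change (FC) is >= 2.0. The sketch below illustrates only that thresholding step. The function names and input values are hypothetical, and the authors' upstream statistics (the Mann-Whitney U test with an empirical background model, and the linear models) are not reproduced here.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def call_enriched(pvalues, fold_changes, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Flag genes passing both the FDR and fold-change thresholds."""
    qvalues = benjamini_hochberg(pvalues)
    return [q <= fdr_cutoff and fc >= fc_cutoff
            for q, fc in zip(qvalues, fold_changes)]


# Hypothetical p-values and fold changes for four genes (not from the source).
pvals = [0.001, 0.02, 0.03, 0.5]
fcs = [3.0, 2.5, 1.2, 4.0]
print(call_enriched(pvals, fcs))
```

In this toy input the third gene fails on fold change and the fourth fails on FDR, which shows why the descriptions state both criteria.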

4 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2020090 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the number of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr2001864 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the number of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1146999 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1010815 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  

0 GO Annotation

0 Homologues

1 Locations


Feature . Primary Identifier Start End Strand
WBGene00016881 1852803 1853092 1

0 Ontology Annotations

0 Regulates Expr Cluster

1 Sequence

Length
290

1 Sequence Ontology Term