WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000774 Gene Name  cpf-2
Sequence Name  ? F56A8.6 Organism  Caenorhabditis elegans
Automated Description  Predicted to enable mRNA binding activity. Predicted to be involved in mRNA 3'-end processing. Located in nuclear speck. Human ortholog(s) of this gene implicated in non-syndromic X-linked intellectual disability. Is an ortholog of human CSTF2 (cleavage stimulation factor subunit 2) and CSTF2T (cleavage stimulation factor subunit 2 tau variant). Biotype  SO:0001217
Genetic Position  III :21.1667 ±0.013483 Length (nt)  ? 1952
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000774

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:F56A8.6.1 F56A8.6.1 1101   III: 13266932-13268883
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:F56A8.6 F56A8.6 1011   III: 13267014-13267637

23 RNAi Result

WormBase ID
WBRNAi00048623
WBRNAi00099809
WBRNAi00032924
WBRNAi00081955
WBRNAi00113270
WBRNAi00100805
WBRNAi00071381
WBRNAi00099203
WBRNAi00099607
WBRNAi00099405
WBRNAi00113041
WBRNAi00081959
WBRNAi00002610
WBRNAi00008873
WBRNAi00025695
WBRNAi00114517
WBRNAi00100244
WBRNAi00100431
WBRNAi00100618
WBRNAi00100992
WBRNAi00069366
WBRNAi00069367
WBRNAi00110860

28 Allele

Public Name
gk963887
gk963904
gk963552
gk190117
gk190118
gk190121
gk190119
gk190120
WBVar01339222
gk371203
gk459022
gk655870
gk427458
ve619
gk773600
gk334221
dr99
gk534164
gk572808
gk355291
gk371204
gk408391
gk406670
gk782405
gk622974
gk697615
gk329520
gk2920

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000774 13266932 13268883 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

0 Downstream Intergenic Region

81 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  oocyte proteins identified by two or more unique peptides during proteomics study. In the pooled data set, 1453 C. elegans proteins were identified with a probability >= 0.9 according to ProteinProphet, of which 1165 proteins were identified by more than one unique peptide. WBPaper00038289:oocyte_protein
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
Bacteria diet: Escherichia coli HB101. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria E. coli HB101 for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:HB101_downregulated
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:excretory-cell_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:glr-1(+)-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:intestine_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:PVD-OLL-neurons_L3-L4-larva_expressed
  Transcriptions that showed significantly increased expression in skn-1(RNAi) comparing to empty vector injection into rrf-3(pk1426);daf-2(e1368) animals. Genes with an absolute fold changeof at least 2 and standard p-values below 0.05 were considered as differentially expressed. WBPaper00062193:skn-1(RNAi)_upregulated
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed significantly decreased expression in nhl-2(ok818) comparing to in N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in hpx-2(dg047) after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_hpx-2(dg047)
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Germline-intrinsic transcripts. Comparisons were made between genotypes by subtracting the mean log value of one ratio from another, and the significance of the difference was evaluated using Student t-test for two populations. For the fem-3(gf) versus fem-1(lf) direct comparison, authors performed the same analysis, except they used a Students t-test for one population. Author chose a combination of a twofold difference with a t value exceeding 99% confidence (P < 0.01), because these criteria allowed the inclusion of essentially all genes that had previously been identified as germline-enriched in a wt/glp-4 hermaphrodite comparison. Additionally, requiring a twofold difference reduced false positives, as the number of genes with two-fold difference and a P<0.01 only included ~100 genes more than with P < 0.001, and almost all genes showed germline expression by in situ hybridization. [cgc6390]:intrinsic
  Transcripts that showed significantly increased expression in set-2(tm1630) animals at embryo stage, comparing to in N2 animals. DESeq2 (v2.1.8.3) was used to determine DE genes and to generate principal component and scatter plots. DE genes with FDR < 0.05 were analysed using g:Profiler with Bonferroni correction. WBPaper00060014:set-2(tm1630)_upregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:pharyngeal-muscle_L1-larva_expressed
  Transcripts that showed differential expression in dauer mir-34(gk437) vs dauer mir-34(OverExpression) animals at 20C. N.A. WBPaper00050488:mir-34(gk437)_vs_mir-34(OverExpression)_regulated_dauer_20C
  Transcripts enriched in germline by comparing dissected germline tissue with dissected intestine tissue, both injected with empty RNAi vector. Genes were determined germline-enriched if the lowest expression value (log2(FPKM+1)) observed in the germline empty vector samples was at least 2-fold higher than the highest expression value observed in the intestine empty vector samples. WBPaper00051039:germline_enriched
Bacteria infection: Xenorhabdus nematophila Caenorhabditis elegans Genes with expression levels changed significantly after treatment of Xenorhabdus nematophila. Differential expression were calculated by empirical eBayes method using eBayes function. P_value <= 0.01 and log2 fold change > 1 were used to call differentially expressed genes in all datasets. WBPaper00041606:CE_X.nematophila_regulated
UVC-EtBr-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total) and exposed to EtBr (5ug/mL in agar).) vs EtBr-exposed(maintained under normal lab light (mostly dark, in incubators) and exposed to EtBr (5ug/mL in agar).) at 3 h after the third UVC dose (51h), which is also 3 h after being placed on food. Genes differentially expressed under UVC exposure and EtBr treatment vs under EtBr treatment but without UVC exposure at the -3h timepoint (3 h after the third UVC dose (51h), which is also 3 h after being placed on food). Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:UVC-EtBr-exposed_vs_EtBr-exposed_51h
  Proteins differently expressed in old Day 10 atm-1 (gk186) worms compared to all other conditions. N.A. WBPaper00051555:atm-1(gk186)_regulated_protein

5 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1030478 Tiling arrays expression graphs  
    Expr2010481 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1152391 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1026988 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2028721 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

10 GO Annotation

Annotation Extension Qualifier
  enables
  part_of
  located_in
  located_in
  involved_in
  located_in
  enables
  enables
  enables
  enables

9 Homologues

Type
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000774 13266932 13268883 -1

10 Ontology Annotations

Annotation Extension Qualifier
  enables
  part_of
  located_in
  located_in
  involved_in
  located_in
  enables
  enables
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
1952

1 Sequence Ontology Term