WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000781 Gene Name  cpr-1
Sequence Name  ? C52E4.1 Brief Description  cpr-1 encodes a cysteine protease of the cathepsin B-like cysteine protease family; cpr-1 appears to be required for embryogenesis; cpr-1 is specifically expressed in the gut of all stages except the embryo and around developing embryos in the gonad; expression of cpr-1 is regulated by three promoter GATA motifs.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable cysteine-type endopeptidase activity. Predicted to be involved in proteolysis involved in protein catabolic process. Predicted to be located in extracellular space. Human ortholog(s) of this gene implicated in several diseases, including autoimmune disease of the nervous system (multiple); carcinoma (multiple); and type 2 diabetes mellitus. Is an ortholog of human CTSB (cathepsin B).
Biotype  SO:0001217 Genetic Position  V :3.54654 ±0.001507
Length (nt)  ? 1206
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000781

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C52E4.1.1 C52E4.1.1 1078   V: 11975551-11976756
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C52E4.1 C52E4.1 990   V: 11975580-11975582

11 RNAi Result

WormBase ID
WBRNAi00086768
WBRNAi00007457
WBRNAi00043046
WBRNAi00012322
WBRNAi00076235
WBRNAi00027728
WBRNAi00063182
WBRNAi00085292
WBRNAi00089016
WBRNAi00115168
WBRNAi00090666

24 Allele

Public Name
gk963271
gk963301
gk964458
gk964459
gk963618
WBVar01866535
WBVar00006115
gk396000
gk672082
gk695553
gk248569
gk325333
gk248570
gk644425
gk928648
gk248568
gk708415
gk470492
gk768772
gk667274
gk248573
gk248571
gk248572
ok1344

1 Chromosome

WormBase ID Organism Length (nt)
V Caenorhabditis elegans 20924180  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000781 11975551 11976756 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrV_11976757..11976776   20 V: 11976757-11976776 Caenorhabditis elegans

432 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
  Transcripts that showed significantly decreased expression in AGP22 [nhr-49(nr2041)I;glp-1(e2141)III] comparing to in CF1903 [glp-1(e2144)III] at Day 2 adults. Fold change > 2, p Value of < 0.05 and a false discovery rate (FDR) of < 0.05. WBPaper00061530:nhr-49(e2144)_downregulated
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when no food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_NoFood
  Transcripts that showed significantly increased expression in csr-1a(tor159) comparing to in N2 at 25C. DESeq2, fold change > 2, p-value < 0.05. WBPaper00061753:csr-1(tor159)_upregulated_25C
  Transcripts that showed significantly increased expression glp-1(e2141); TU3401 animals comparing to in TU3401 animals. Fold change > 2, FDR < 0.01. WBPaper00065993:glp-1(e2141)_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_upregulated
  Genes that showed significantly increased expression in wrn-1(gk99) comparing to in N2, according to RNAseq. DESeq was used to calculate the fold changes, log fold changes, and significance of the changes for each comparison. WBPaper00045934:wrn-1(gk99)_upregulated
Fungi infection: Myzocytiopsis humicola Transcripts that showed significantly altered expression 12 hours after animals were infected by M. humicola. Differentially expressed genes as determined by Kallisto and Sleuth (pval<0.01, qval<0.1). WBPaper00060871:M.humicola-infection_12h_regulated
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts that showed significantly decreased expression at 11-days-post L4 adult N2 hermaphrodites comparing to 1-day-post L4 adult N2 hermaphrodites. DESeq2, fold change > 2, FDR < 0.05 WBPaper00065835:Day11_vs_Day1_downregulated
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
mitochondrial sulfide delivery molecule (mtH2S) AP39 Transcripts that showed significantly increased expression in N2 animals treated with mitochondrial sulfide delivery molecule (mtH2S) AP39 starting from 1-day-post L4 until 11 days post L4. DESeq2, fold change > 2, FDR < 0.05 WBPaper00065835:mtH2S-AP39-D0-treatment_upregulated_Day11
  Genes up regulated in alg-1(gk214) comparing to in N2. Differential expression was assessed using an empirical Bayes statistics using the eBayes function. WBPaper00040823:alg-1(gk214)_upregulated
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva daf-16(mu86);glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_daf-16(mu86);glp-1(e2141)
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_fem-3(q20)
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_glp-1(e2141)

8 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1030484 Tiling arrays expression graphs  
This information was extracted from published material (Archana Sharma-Oates, Andrew Mounsey and Ian A. Hope).   Expr681 Staining more intense in larvae than adults in intestinal cells. Anterior gut stains darker than posterior. Staining is restricted to the intestine.  
This information was extracted from published material (Archana Sharma-Oates, Andrew Mounsey and Ian A. Hope).   Expr621 High level of gut cell staining in all developmental stages except embryos. There was variability in intensity and the number of gut cells staining between individuals of the same transgenic lines. A small number of transformants within each line showed expression in all gut cells but the majority showed staining restricted to the most anterior (Int1) the mid anterior (Int2 and Int3) and the mid gut cells (Int4, 5 and 6). Both intensity and frequency of expression decreased from the most anterior to the mid-gut region. A few worms stained only in posterior gut cells.  
    Expr2010502 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
This information was extracted from published material (Archana Sharma-Oates, Andrew Mounsey and Ian A. Hope).   Expr682 1.1 kb transcript present in approx. the same amounts in L1-L4 larvae, dauer larvae and adults (hermaphrodite and male) but not detected in embryos.  
    Expr2028742 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1147006 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1011903 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  

7 GO Annotation

Annotation Extension Qualifier
  involved_in
  located_in
  involved_in
  involved_in
  enables
  enables
  enables

5 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000781 11975551 11976756 1

7 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  located_in
  involved_in
  involved_in
  enables
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
1206

1 Sequence Ontology Term