WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  WBGene00000785
Gene Name  cpr-5
Sequence Name  W07B8.5
Brief Description  cpr-5 encodes a cysteine protease.
Organism  Caenorhabditis elegans
Automated Description  Predicted to enable cysteine-type endopeptidase activity. Predicted to be involved in proteolysis involved in protein catabolic process. Predicted to be located in extracellular space. Human ortholog(s) of this gene implicated in several diseases, including autoimmune disease of the nervous system (multiple); carcinoma (multiple); and type 2 diabetes mellitus. Is an ortholog of human CTSB (cathepsin B).
Biotype  SO:0001217
Genetic Position  V: -19.8522 ± 0.000529
Length (nt)  1439

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000785

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:W07B8.5.1 W07B8.5.1 1205   V: 1132598-1134036
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:W07B8.5 W07B8.5 1035   V: 1132686-1133020

2 RNAi Result

WormBase ID
WBRNAi00054943
WBRNAi00115172

36 Allele

Public Name
gk963591
gk963553
gk964259
gk963850
gk963899
gk963027
gk963889
gk962552
gk962551
gk443701
gk521156
gk365739
gk750783
gk364479
gk756931
gk407892
WBVar01610223
gk407893
gk502838
gk704837
gk810930
gk858087
gk887321
gk839604
gk672027
WBVar01818942
gk226082
gk875311
gk914286
gk804360

1 Chromosome

WormBase ID Organism Length (nt)
V Caenorhabditis elegans 20924180  

1 Chromosome Location


Feature . Primary Identifier Start End Strand
WBGene00000785 1132598 1134036 -1
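
The Start and End values above appear to be 1-based, inclusive coordinates: under that assumption, end - start + 1 reproduces the 1439 nt gene length reported in this record, as well as the 667 nt and 565 nt lengths of the downstream and upstream intergenic regions. A minimal sketch of the arithmetic follows; the function name feature_length is hypothetical and is not part of WormMine or InterMine.

```python
def feature_length(start: int, end: int) -> int:
    """Length in nt of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be greater than or equal to start")
    return end - start + 1


# Coordinates copied from this record (chromosome V).
gene_len = feature_length(1132598, 1134036)        # WBGene00000785
downstream_len = feature_length(1131931, 1132597)  # downstream intergenic region
upstream_len = feature_length(1134037, 1134601)    # upstream intergenic region

print(gene_len, downstream_len, upstream_len)
```

The gene lies on the -1 strand, so the region with the lower coordinates (ending at 1132597) is the downstream one, and the region starting at 1134037 is upstream.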

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrV_1131931..1132597   667 V: 1131931-1132597 Caenorhabditis elegans

354 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzing fluorescence-activated cell sorted (FACS) neurons. DESeq. False discovery rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer larva in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  mRNAs that showed decreased expression in 1-cell embryo compared to oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present. DESeq (version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when no food was present. DESeq (version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_NoFood
  Genes that were downregulated in lin-15B(n744). For each gene in each microarray hybridization experiment, the ratio of RNA levels from the two samples was transformed into a log2 value and the mean log2 ratio was calculated. The log2 ratios were normalized by print-tip Loess normalization (Dudoit and Yang, 2002). All genes with a false discovery rate of <= 5% (q <= 0.05) (Storey and Tibshirani, 2003) and a mean fold-change ratio of >= 1.5 were selected for further analysis. WBPaper00038168:lin-15B(n744)_downregulated
Bacteria infection: Enterococcus faecalis Genes with increased expression after 24 hours of infection by E. faecalis. Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:E.faecalis_24hr_upregulated_TilingArray
  Genes that showed significantly increased expression in wrn-1(gk99) compared to N2, according to RNAseq. DESeq was used to calculate the fold changes, log fold changes, and significance of the changes for each comparison. WBPaper00045934:wrn-1(gk99)_upregulated
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts that showed significantly decreased expression in 5-days-post-L4 adult N2 hermaphrodites compared to 1-day-post-L4 adult N2 hermaphrodites. DESeq2, fold change > 2, FDR < 0.05. WBPaper00065835:Day5_vs_Day1_downregulated
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts that showed significantly decreased expression in 11-days-post-L4 adult N2 hermaphrodites compared to 1-day-post-L4 adult N2 hermaphrodites. DESeq2, fold change > 2, FDR < 0.05. WBPaper00065835:Day11_vs_Day1_downregulated
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts that showed significantly decreased expression in atfs-1(cmh15) (null allele) animals compared to N2 animals at L4 larva stage. edgeR, fold change > 2, FDR < 0.05. WBPaper00060909:atfs-1(cmh15)_downregulated
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
mitochondrial sulfide delivery molecule (mtH2S) AP39 Transcripts that showed significantly increased expression in N2 animals treated with mitochondrial sulfide delivery molecule (mtH2S) AP39 starting from 1-day-post L4 until 11 days post L4. DESeq2, fold change > 2, FDR < 0.05 WBPaper00065835:mtH2S-AP39-D0-treatment_upregulated_Day11
  Genes upregulated in alg-1(gk214) compared to N2. Differential expression was assessed with empirical Bayes statistics using the eBayes function. WBPaper00040823:alg-1(gk214)_upregulated
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at old adults stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_aging
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
  Transcripts that showed significantly decreased expression in day 3 adult hermaphrodite compared to L4 larva glp-1(e2141) animals. Fold change > 2, FDR < 0.05. WBPaper00064088:Day-3-adult_vs_L4_downregulated_glp-1(e2141)
  Transcripts that showed significantly increased expression in rrf-3(pk1426) compared to N2 at embryo stage. DESeq2 v1.18.1, fold change > 1.5, adjusted p-value < 0.01. WBPaper00056169:rrf-3(pk1426)_upregulated_embryo
  Transcripts expressed in vulva. FPKM >= 1. WBPaper00064122:vulva_transcriptome
  Transcripts that showed significantly increased expression in rrf-3(pk1426) compared to N2 at L3 larva stage. DESeq2 v1.18.1, fold change > 1.5, adjusted p-value < 0.01. WBPaper00056169:rrf-3(pk1426)_upregulated_L3
  Transcripts that showed significantly increased expression after 24 hours of induction of human beta Amyloid at young adult stage. A 2-fold change in expression level and a false discovery rate analog of p < 0.05. WBPaper00064130:Beta-Amyloid_24h_upregulated_mRNA
Bacteria infection: Bacillus thuringiensis Transcripts that showed significantly increased expression in N2 animals infected by bacteria BMB171/Cry5Ba, an acrystalliferous Bt mutant BMB171 transformed with toxin gene cry5Ba on the shuttle vector pHT304, compared to N2 animals infected by BMB171/pHT304. N.A. WBPaper00064229:B.thuringiensis-Cry5Ba_upregulated

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1030486 Tiling arrays expression graphs  
    Expr2010506 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the number of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
This information was extracted from published material (Archana Sharma-Oates, Andrew Mounsey and Ian A. Hope).   Expr742 Expressed in all life stages (embryo, L1-L4 and adult). Most abundant in L2 followed by L4 and then L3. Low level expression in embryo.  
    Expr1028465 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1158478 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2028746 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the number of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

7 GO Annotation

Annotation Extension Qualifier
  involved_in
  located_in
  involved_in
  involved_in
  enables
  enables
  enables

5 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier Start End Strand
WBGene00000785 1132598 1134036 -1

7 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  located_in
  involved_in
  involved_in
  enables
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
1439

1 Sequence Ontology Term

Identifier Name Description
gene  

1 Strains

WormBase ID
WBStrain00032501

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrV_1134037..1134601   565 V: 1134037-1134601 Caenorhabditis elegans