WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000465 Gene Name  cpg-1
Sequence Name  ? C07G2.1 Brief Description  cpg-1 encodes, by alternative splicing, two isoforms of a chondroitin proteoglycan; CPG-1 contains three chitin-binding peritrophin-A domains and two mucin-like regions; CPG-1 is individually dispensable for normal embryonic development; however, CPG-1 and CPG-2 are jointly required for osmotic integrity of early embryos, error-free chromosomal segregation during meiosis, polar body extrusion, association of the sperm pronucleus/centrosome complex (SPCC) with the early embryonic cortex, localization of PAR-2 in early embryos, posterior localization of P granules or PIE-1, and pseudocleavage; CPG-1, like CPG-2, is covalently linked to chondroitin, which itself is required for vulval morphogenesis, polar-body extrusion, and separation of the eggshell from the embryonic plasma membrane; cpg-1 mRNA, like that of cpg-2, is expressed specifically in the adult hermaphrodite germ line and is bound by GLD-1; CPG-1 has five potential chondroitin attachment sites, one of them verified by mass spectrometry, and transgenic CPG-1 synthesized in mammalian cells carries chondroitin sulfate chains; CPG-1's multiple peritrophin-A domains may enable mechanical cross-linking of chitin.
Organism  Caenorhabditis elegans Automated Description  Enables chitin binding activity. Involved in several processes, including eggshell formation; mitotic cytokinesis; and regulation of cytokinesis. Located in external encapsulating structure. Expressed in eggshell; germ line; and oocyte.
Biotype  SO:0001217 Genetic Position  III :-3.15507 ±0.001827
Length (nt)  ? 2034
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000465

Genomics

2 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C07G2.1a.1 C07G2.1a.1 1889   III: 4489359-4491392
Transcript:C07G2.1b.1 C07G2.1b.1 483   III: 4489363-4491262
 

Other

2 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C07G2.1a C07G2.1a 1755   III: 4489363-4489612
CDS:C07G2.1b C07G2.1b 483   III: 4489363-4489612

10 RNAi Result

WormBase ID
WBRNAi00040019
WBRNAi00092494
WBRNAi00000152
WBRNAi00022636
WBRNAi00028645
WBRNAi00092482
WBRNAi00010402
WBRNAi00007842
WBRNAi00078684
WBRNAi00005076

40 Allele

Public Name
WBVar01263074
WBVar00099204
WBVar01837458
gk549128
gk713711
WBVar00563112
tn1728
WBVar01893255
tm2534
WBVar01644143
WBVar01644144
gk578244
gk172856
gk846073
gk330534
gk172858
gk319529
gk172857
gk582283
gk604911
WBVar02111726
gk316298
WBVar02111725
gk436476
ok1914
gk607038
ok1913
gk563984
gk532765
gk566024

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000465 4489359 4491392 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_4491393..4491626   234 III: 4491393-4491626 Caenorhabditis elegans

274 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
  Genes that were upregulated in lin-15B(n744). For each gene in each microarray hybridization experiment, the ratio of RNA levels from the two samples was transformed into a log2 value and the mean log2 ratio was calculated. The log2 ratios were normalized by print-tip Loess normalization (Dudoit and Yang, 2002). All genes with a false discovery rate of <= 5% (q <= 0.05) (Storey and Tibshirani, 2003) and a mean fold-change ratio of >= 1.5 were selected for further analysis. WBPaper00038168:lin-15B(n744)_upregulated
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_fem-3(q20)
  Transcripts that showed significantly increased expression in day 3 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_upregulated_fem-3(q20)
Bacteria diet: Escherichia coli HB101. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria E. coli HB101 for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:HB101_downregulated
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
  Transcripts that showed significantly decreased expression in hsp-6(mg585) comparing to in N2 at L4 larva stage. EdgeR, fold change > 2, FDR < 0.001. WBPaper00056290:hsp-6(mg585)_downregulated
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
Bacteria infection: Staphylococcus aureus MW2. 4 hours of exposure. Transcripts that showed significantly increased expression after N2 animals had 4 hours of infection by Staphylococcus aureus (MW2). DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:S.aureus-4h_upregulated_N2
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Transcripts that showed significantly increased expression in animals exposed to 400uM tamoxifen from L1 to L4 larva stage. DEseq2, fold change > 2 WBPaper00064505:tamoxifen_upregulated

9 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1030269 Tiling arrays expression graphs  
Curated by authors based on online in-situ hybridization database (Y. Kohara, personal communication; http://nematode.lab.nig.ac.jp).   Expr4104 Expressed in medial germline.  
    Expr13388 mNG::3xFLAG::CPG-1 is expressed in the cytoplasm of oocytes and in the eggshell of embryos.  
    Expr13068   CPG-1 localizes to the inner layer of the trilaminar eggshell.
    Expr2010482 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1023887 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2028722 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr2109 Expression is seen in larvae and adults in two cells, one close to the metacorpus of the pharynx, and one less frequently observed cell close to the terminal bulb.  
    Expr1144095 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

11 GO Annotation

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  involved_in
  enables
  enables
part_of(WBbt:0005843) located_in
  involved_in
  located_in

36 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000465 4489359 4491392 1

11 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  involved_in
  enables
  enables
part_of(WBbt:0005843) located_in
  involved_in
  located_in

0 Regulates Expr Cluster

1 Sequence

Length
2034

1 Sequence Ontology Term

Identifier Name Description
gene  

3 Strains

WormBase ID
WBStrain00032264
WBStrain00032265
WBStrain00005743

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_4477573..4489358   11786 III: 4477573-4489358 Caenorhabditis elegans