WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00006342 Gene Name  sup-37
Sequence Name  ? C01B7.1 Organism  Caenorhabditis elegans
Automated Description  Involved in embryonic digestive tract morphogenesis. Located in nucleus. Expressed in neurons; pharynx; and spermatheca. Biotype  SO:0001217
Genetic Position  V :1.60922± Length (nt)  ? 6237
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00006342

Genomics

7 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C01B7.1b.1 C01B7.1b.1 3394   V: 8786545-8791465
Transcript:C01B7.1c.1 C01B7.1c.1 3315   V: 8786545-8792776
Transcript:C01B7.1a.1 C01B7.1a.1 3290   V: 8786551-8792772
Transcript:C01B7.1b.2 C01B7.1b.2 3326   V: 8786836-8791465
Transcript:C01B7.1a.2 C01B7.1a.2 3231   V: 8786836-8792775
Transcript:C01B7.1c.2 C01B7.1c.2 3252   V: 8786836-8792781
Transcript:C01B7.1d.1 C01B7.1d.1 2742   V: 8786848-8792364
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C01B7.1a C01B7.1a 2808   V: 8786848-8786968
CDS:C01B7.1d C01B7.1d 2742   V: 8786848-8786968
CDS:C01B7.1b C01B7.1b 1653   V: 8786848-8786968
CDS:C01B7.1c C01B7.1c 2823   V: 8786848-8786968

11 RNAi Result

WormBase ID
WBRNAi00039278
WBRNAi00009901
WBRNAi00097216
WBRNAi00097365
WBRNAi00024428
WBRNAi00008317
WBRNAi00097261
WBRNAi00097306
WBRNAi00097351
WBRNAi00103155
WBRNAi00107451

103 Allele

Public Name
gk963301
WBVar02060384
WBVar02060385
gk964351
gk962860
gk964052
gk963442
gk963441
WBVar01863131
tm356
tm481
gk242125
gk242124
gk242126
gk242128
gk242127
gk242129
gk242116
gk242115
gk242117
gk242119
gk242118
gk242121
gk242120
gk242123
gk242122
gk948701
h7845
h16003
h11736

1 Chromosome

WormBase ID Organism Length (nt)
V Caenorhabditis elegans 20924180  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00006342 8786545 8792781 1

3 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  

0 Downstream Intergenic Region

128 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when no food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_NoFood
  Coexpression clique No. 203, sre-33-ZK1025.1_8337, on the genome-wide coexpression clique map for the nematode GPL200 platform. All available microarray datasets for the GPL200 platform (Affymetrix C. elegans Genome Array) were obtained from the GEO repository. This included 2243 individual microarray experiments. These were normalized against each other with the software RMAexpress (Bolstad, 2014). Based on these normalized values, Pearsons correlation coefficients were obtained for each probe-probe pair of the 22,620 probes represented on this array type. The resulting list of correlation coefficients was then ranked to generate the ranked coexpression database with information on each probe represented on the GPL200 platform. WBPaper00061527:sre-33-ZK1025.1_8337
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts that showed significantly decreased expression in atfs-1(cmh15) (null allele) animals comparing to in N2 animals at L4 larva stage. edgeR, fold change > 2, FDR < 0.05 WBPaper00060909:atfs-1(cmh15)_downregulated
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) at Late reproduction stage (96 hours at 24 centigrade). Authors permuted transcript values and used a genome-wide threshold of log10 P-value = 2, which resembles a false discovery rate (FDR) of 0.0118. WBPaper00040858:eQTL_regulated_reproductive
  Transcripts that showed significantly increased expression after four-day-old young adult worms were placed on NGM plates seeded with OP50 in the presence 5% Agaro-oligosaccharides(AGO) for 24 h, comparing to animals grown in the absence of AGO. Fold change > 2. WBPaper00064306:Agaro-oligosaccharides_upregulated
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Transcripts that showed significantly increased expression in sftb-1(cer6) deletion homozygous comparing to to in N2 animals at L4 larva stage. DESeq2, fold change > 2 WBPaper00058725:sftb-1(cer6)_downregulated
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_L2-larva_expressed
  Transcripts that showed significantly increased expression in hda-1[KKRR]-smo-1 in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4748)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L3-L4-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:GABAergic-motor-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:PVD-OLL-neurons_L3-L4-larva_expressed
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated
  Transcripts that showed significantly increased expression in srbc-48(ac23);kyIs262;fer-1(b232ts) comparing to in kyIs262;fer-1(b232ts), 24h after infection with P.aeruginosa. DESeq2, FDR <0.05, fold change > 2. WBPaper00059664:srbc-48(ac23)_upregulated
  Transcripts that showed significantly decreased expression after animals were treated with 100uM Rapamycin and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Allantoin_downregulated

8 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr4699 Expressed in pharyngeal muscle and neuron.  
    Expr2035268 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr10043 All tested strains showed strong enrichment of sup-37 reporter expression within the pharynx beginning at the ~500-cell stage of embryogenesis and continuing throughout larval development and adulthood. In addition, weak-to-modest levels of sup-37 reporter expression were observed in all other cell types. In larval and adult stages, the full-length SUP-37::mCherry was expressed in the pharyngeal muscle groups pm3, pm4, and pm6, but not consistently in other cells of the pharynx. In addition, SUP-37::mCherry expression was faint but detectable in the spermatheca of adult hermaphrodites. Expression of SUP-37::mCherry was not detected in the somatic gonad sheath cells, which also play a role in ovulation. Full-length (isoform-a) SUP-37::GFP and SUP- 37::mcherry fusion proteins localize predominantly to nuclei during late stages of embryogenesis. Expression of these constructs was, however, relatively dim and quite variable as compared with the sup-37 transcriptional reporters. Expression of the SUP-37 translational reporters was also occasionally detected in both the cytoplasm and nuclei of early-stage embryos at time points preceding morphogenesis.
    Expr1032556 Tiling arrays expression graphs  
    Expr10471 Inferred Expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr10472 Inferred Expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr1143400 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2017132 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

6 GO Annotation

Annotation Extension Qualifier
  involved_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in

0 Homologues

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00006342 8786545 8792781 1

6 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
6237

1 Sequence Ontology Term