WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004946 Gene Name  sop-3
Sequence Name  ? Y71F9B.10 Brief Description  The sop-3 gene encodes a novel protein that regulates Hox gene expression by modulating Wnt signaling.
Organism  Caenorhabditis elegans Automated Description  Predicted to be located in nucleus. Expressed in several structures, including P3.p hermaphrodite; P4.p hermaphrodite; P5.p hermaphrodite; P8.p hermaphrodite; and neurons.
Biotype  SO:0001217 Genetic Position  I :-7.20635 ±0.083641
Length (nt)  ? 14947
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004946

Genomics

7 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:Y71F9B.10a.1 Y71F9B.10a.1 4603   I: 2761158-2776097
Transcript:Y71F9B.10g.1 Y71F9B.10g.1 4334   I: 2761158-2776104
Transcript:Y71F9B.10c.1 Y71F9B.10c.1 4625   I: 2761158-2776101
Transcript:Y71F9B.10b.1 Y71F9B.10b.1 4302   I: 2761168-2776100
Transcript:Y71F9B.10f.1 Y71F9B.10f.1 423   I: 2761318-2762373
Transcript:Y71F9B.10d.1 Y71F9B.10d.1 3480   I: 2761318-2771140
Transcript:Y71F9B.10e.1 Y71F9B.10e.1 3498   I: 2761318-2771140
 

Other

7 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y71F9B.10a Y71F9B.10a 4428   I: 2761318-2761513
CDS:Y71F9B.10b Y71F9B.10b 4134   I: 2761318-2761513
CDS:Y71F9B.10c Y71F9B.10c 4446   I: 2761318-2761513
CDS:Y71F9B.10g Y71F9B.10g 4152   I: 2761318-2761513
CDS:Y71F9B.10d Y71F9B.10d 3480   I: 2761318-2761513
CDS:Y71F9B.10e Y71F9B.10e 3498   I: 2761318-2761513
CDS:Y71F9B.10f Y71F9B.10f 423   I: 2761318-2761513

28 RNAi Result

WormBase ID
WBRNAi00064589
WBRNAi00058174
WBRNAi00064759
WBRNAi00004794
WBRNAi00004803
WBRNAi00004806
WBRNAi00004807
WBRNAi00004812
WBRNAi00087231
WBRNAi00037777
WBRNAi00086353
WBRNAi00086357
WBRNAi00086350
WBRNAi00086354
WBRNAi00070902
WBRNAi00075710
WBRNAi00086352
WBRNAi00086356
WBRNAi00075711
WBRNAi00075714
WBRNAi00075715
WBRNAi00084380
WBRNAi00084402
WBRNAi00075713
WBRNAi00075712
WBRNAi00070903
WBRNAi00085234
WBRNAi00117642

300 Allele

Public Name
gk963902
gk964159
gk962616
gk964303
WBVar01691715
WBVar01691714
WBVar01691717
WBVar01691716
WBVar01691719
WBVar01691718
WBVar02066327
WBVar01694682
WBVar01694681
WBVar01694686
WBVar01694685
WBVar01694684
WBVar01694683
WBVar01694689
WBVar01694688
WBVar01694687
WBVar01694693
WBVar01694692
WBVar01694691
WBVar01694690
WBVar01712873
WBVar01712871
WBVar01712872
otn1135
WBVar01931775
WBVar01500400

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004946 2761158 2776104 -1

3 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  

0 Downstream Intergenic Region

97 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Transcripts that showed significantly increased expression after animals were treated with 100uM Psora and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Psora-Allantoin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_upregulated
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts expressed in vulva. FPKM >= 1. WBPaper00064122:vulva_transcriptome
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
  Significantly upregulated genes from clk-1(qm30) microarrays using SAM algorithm with an FDR < 0.1 from adult-only chips. SAM algorithm with an FDR < 0.1. WBPaper00033065:clk-1(qm30)_upregulated
  Transcripts that showed significantly changed expression in 6-day post-L4 adult hermaphrodite comparing to in 1-day post L4 adult hermaphrodite animals. Sleuth WBPaper00051558:aging_regulated
  Transcripts that showed significantly decreased expression in tetraploid N2 comparing to diploid N2 animals at L4 larva stage. DESeq2 R package (1.20.0), fold change > 2, and FDR < 0.05. WBPaper00066110:tetraploid_vs_diploid_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:hypodermis_L3-L4-larva_expressed
Fungi infection: Haptoglossa zoospora. Transcripts that showed significantly altered expression after L4 N2 animals were exposed to omycete Haptoglossa zoospora for 6 hours. Kalisto abundance files were converted and analysed using Sleuth in a R pipeline. Standard Sleuth protocols were used to calculate differential expression. P value < 0.01 and FDR < 0.01. WBPaper00062354:H.zoospora_6h_regulated
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed significantly decreased expression in the neurons of bcat-1(RNAi) animals at 5-days post L4 adult hermaphrodite stage, comparing to animals injected with empty vector. DESeq2. FDR < 0.05. WBPaper00060459:bcat-1(RNAi)_downregulated
  Transcripts that showed significantly decreased expression in nhl-2(ok818) comparing to in N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
  Transcripts that showed decreased expression in hlh-11(ko1) knockout strain comparing to in wild type background. DESeq2, FDR < 0.05 WBPaper00060683:hlh-11(ko1)_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_L1-larva_expressed
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  Transcripts that showed significantly altered expression after 24 hour exposure to nitroguanidine (NQ). Multivariate permutation tests with random variance model implemented in BRB-Array Tools version 4.5 were performed to infer differentially expressed genes (DEGs). One thousand random permutations were computed per chemical class (i.e., a group of 16 arrays or samples). The confidence level of false discovery rate assessment was set at 80%, and the maximum allowed portion of false-positive genes was 10%. WBPaper00055899:nitroguanidine_regulated
EtBr-exposed(maintained under normal lab light (mostly dark, in incubators) and exposed to EtBr (5ug/mL in agar).) vs UVC-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total).) at 3 h after the third UVC dose (51h), which is also 3 h after being placed on food. Genes differentially expressed under EtBr treatment without UVC exposure vs after UVC exposure but without EtBr treatment at the -3h timepoint (3 h after the third UVC dose (51h), which is also 3 h after being placed on food). Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:EtBr-exposed_vs_UVC-exposed_51h
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L1-larva_expressed

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034241 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr861 During embryogenesis they are widely expressed in many or all cells. During larval stages, they are expressed in the seam cells, head neurons, ventral cord, male ray cells and other tail neurons. The reporter genes are also strongly expressed in proliferating cells for example, they are expressed in the vulval precursor cells. In adult animals, the reporters are expressed mainly in neurons. For both reporters, GFP fluorescence was nuclear.  
    Expr1032461 Tiling arrays expression graphs  
    Expr1022762 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2016006 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1161593 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

4 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  enables
  enables

0 Homologues

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004946 2761158 2776104 -1

4 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
14947

1 Sequence Ontology Term