WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  WBGene00010868
Gene Name  somi-1
Sequence Name  M04G12.4
Organism  Caenorhabditis elegans
Automated Description  Predicted to enable DNA binding activity and metal ion binding activity. Predicted to be involved in cell differentiation. Located in nucleus. Expressed in several structures, including gonad; hypodermal cell; intestine; neurons; and vulval precursor cell.
Biotype  SO:0001217
Genetic Position  V: 5.24135 ±0.007619
Length (nt)  5338

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00010868

Genomics

4 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:M04G12.4a.1 M04G12.4a.1 2339   V: 13386058-13391393
Transcript:M04G12.4c.1 M04G12.4c.1 2346   V: 13386062-13391395
Transcript:M04G12.4b.1 M04G12.4b.1 2241   V: 13386067-13389490
Transcript:M04G12.4d.1 M04G12.4d.1 1662   V: 13386649-13389484
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:M04G12.4a M04G12.4a 1740   V: 13386649-13386795
CDS:M04G12.4c M04G12.4c 1749   V: 13386649-13386795
CDS:M04G12.4b M04G12.4b 1653   V: 13386649-13386795
CDS:M04G12.4d M04G12.4d 1662   V: 13386649-13386795

16 RNAi Result

WormBase ID
WBRNAi00093858
WBRNAi00050919
WBRNAi00093866
WBRNAi00017186
WBRNAi00093860
WBRNAi00093861
WBRNAi00093862
WBRNAi00093867
WBRNAi00093868
WBRNAi00093863
WBRNAi00093864
WBRNAi00093865
WBRNAi00093859
WBRNAi00093825
WBRNAi00107207
WBRNAi00110880

89 Allele

Public Name
gk963271
gk963706
gk963301
gk964458
gk964459
WBVar02061803
WBVar01869528
WBVar01869529
WBVar01869530
WBVar01869531
WBVar01869532
WBVar01869533
WBVar01869534
WBVar01869524
WBVar01869525
WBVar01869526
WBVar01869527
WBVar01869523
WBVar01710414
tm562
gk964294
h14029
gk953423
otn20095
gk251528
gk251539
gk251538
gk251537
gk251536
gk251535

1 Chromosome

WormBase ID Organism Length (nt)
V Caenorhabditis elegans 20924180  

1 Chromosome Location


Feature . Primary Identifier Start End Strand
WBGene00010868 13386058 13391395 -1
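
The reported Length (nt) of 5338 can be cross-checked against the Start and End coordinates, assuming they are 1-based and inclusive. The page does not state this convention; it is an assumption that happens to reproduce the listed lengths. The helper name `span_length` below is hypothetical and used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from this page.
gene_length = span_length(13386058, 13391395)        # WBGene00010868 on chromosome V
intergenic_length = span_length(13385899, 13386057)  # downstream intergenic region

print(gene_length, intergenic_length)
```

Under that assumption, both values agree with the lengths listed for the gene and for its downstream intergenic region.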

3 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrV_13385899..13386057   159 V: 13385899-13386057 Caenorhabditis elegans

166 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts that showed significantly increased expression in L1 neural cells compared to adult neural cells. DESeq2 (v1.18.1), fold change > 2, P-adj < 0.05, using Benjamini-Hochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzing fluorescence-activated cell sorted (FACS) neurons. DESeq. False discovery rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer larva in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L1-larva_expressed
Bacteria infection: Enterococcus faecalis Genes with increased expression after 24 hours of infection by E.faecalis Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:E.faecalis_24hr_upregulated_TilingArray
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin from day 1 to day 3 adult hermaphrodite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin_upregulated
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts that showed significantly decreased expression at 5-days-post L4 adult N2 hermaphrodites compared to 1-day-post L4 adult N2 hermaphrodites. DESeq2, fold change > 2, FDR < 0.05 WBPaper00065835:Day5_vs_Day1_downregulated
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
  Transcripts that showed significantly increased expression after four-day-old young adult worms were placed on NGM plates seeded with OP50 in the presence of 5% Agaro-oligosaccharides (AGO) for 24 h, compared to animals grown in the absence of AGO. Fold change > 2. WBPaper00064306:Agaro-oligosaccharides_upregulated
  Transcripts that showed significantly increased expression in sin-3(tm1276) compared to N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
  Transcripts that showed significantly increased expression in mrg-1(qa6200) compared to control animals in primordial germ cells (PGCs) at L1 larva stage. DESeq2(v1.32.0), FDR < 0.05. WBPaper00064315:mrg-1(qa6200)_upregulated_PGCs
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals compared to N2. DESeq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
  Transcripts that showed significantly increased expression in alg-1(gk214) compared to N2. DESeq2, Fold change > 1.5. WBPaper00051404:alg-1(gk214)_upregulated
  Significantly upregulated genes from clk-1(qm30) microarrays using SAM algorithm with an FDR < 0.1 from adult-only chips. SAM algorithm with an FDR < 0.1. WBPaper00033065:clk-1(qm30)_upregulated
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_L2-larva_expressed
Reduced humidity (98% relative humidity). Genes that were down-regulated after one day exposure to reduced humidity (98% relative humidity) according to microarray analysis. Multiple hypothesis testing with the Benjamini-Hochberg correction was applied on calculated p-values. A change in the expression level was considered to be significant if the adjusted p-value was less than 0.001. WBPaper00044578:reduced-humidity_downregulated_microarray
  Proteins that showed significantly decreased expression after 1-day-old wild type adults were exposed to cisplatin (300ug per mL) for 6 hours. The differential expression analysis was performed in R. Differentially expressed proteins were identified by using a two-sided t-test on log-transformed data. WBPaper00065373:Cisplatin_downregulated_WT
  Transcripts that showed significantly increased expression in hpk-1(pk1393) compared to N2 at adult day 2. DESeq2, fold change > 2, FDR < 0.05. WBPaper00065581:hpk-1(pk1393)_upregulated
  Transcripts that showed significantly increased expression in hda-1[KKRR]-smo-1 in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4748)_upregulated
  Transcripts that showed significantly increased expression in hda-1(ne4752[3xFLAG-Degron-HDA-1]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4752)_upregulated
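
Several of the cluster descriptions above cite Benjamini-Hochberg correction or FDR thresholds. As a rough illustration of what that adjustment does, here is a minimal pure-Python sketch of the standard Benjamini-Hochberg step-up procedure. It is not the code used by any of the cited papers, and the p-values are made-up illustration data, not values from WormMine.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values, in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values for four genes (illustration only).
example_pvalues = [0.01, 0.04, 0.03, 0.005]
example_adjusted = benjamini_hochberg(example_pvalues)

# Genes passing an FDR threshold of 0.05, as used by several clusters above.
passing = [i for i, q in enumerate(example_adjusted) if q < 0.05]
print(example_adjusted, passing)
```

Each adjusted value is the smallest FDR level at which that gene would still be called significant, which is why the thresholds above are stated as "FDR < 0.05" rather than as raw p-value cutoffs.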

8 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034239 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1154617 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr9654 The somi-1p::gfp transcriptional fusion gene was expressed in the hypodermal seam cells and the somatic gonad and VPCs. Seam cell expression of somi-1p::gfp was most common in L4 larvae and adults, suggesting that somi-1 is upregulated upon differentiation of this tissue (data not shown). The earliest somi-1p::gfp expression was in comma stage embryos. somi-1p::gfp was also expressed in body wall muscle and certain neurons in the head and tail (data not shown).  
    Expr2016004 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr9655 Immunofluorescence using antisera raised against a full-length His-SOMI-1A fusion protein confirmed the expression pattern revealed by somi-1p::gfp, except that hypodermal expression of SOMI-1 was more evident by immunostaining. SOMI-1 was detected in the nuclei of embryos beginning at the comma stage and in larvae of all stages and adults in hypodermal, body wall muscle, and most other cells in wild type but not in a somi-1(mg415) mutant. SOMI-1 was also detected in the somatic gonad, including the distal tip and sheath cells (data not shown). SOMI-1 immunofluorescence concentrated in nuclear foci and colocalized extensively with DNA.
    Expr9656 SOMI-1::GFP was expressed in the same tissues as the transcriptional somi-1p::gfp fusion gene as well as the hypodermal syncytium and gut (data not shown). Fusion of the somi-1 promoter and genomic protein-coding sequence to the N terminus of GFP yielded a rescuing SOMI-1::GFP reporter that was typically localized to the nucleus but excluded from the nucleolus. SOMI-1::GFP was often concentrated in nuclear foci, which were most apparent in embryos, but could also be detected in larvae.
    Expr1034762 Tiling arrays expression graphs  
    Expr1024811 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  

6 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  enables
  enables
  located_in

0 Homologues

1 Locations


Feature . Primary Identifier Start End Strand
WBGene00010868 13386058 13391395 -1

6 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  enables
  enables
  located_in

0 Regulates Expr Cluster

1 Sequence

Length
5338

1 Sequence Ontology Term