WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004896 Gene Name  smu-2
Sequence Name  ? Y49F6B.4 Brief Description  smu-2 encodes a protein with homology to the mammalian nuclear protein RED that affects the accumulation of alternatively spliced unc-52 transcripts; SMU-2 activity is required, along with that of SMU-1, for regulation of splice site choice in the pre-mRNAs of unc-52 and other genes; SMU-2 physically interacts with SMU-1 both in vitro and in vivo, and stabilizes SMU-1; SMU-2 is ubiquitously expressed and localizes to both the nucleus and the nucleolus.
Organism  Caenorhabditis elegans Automated Description  Involved in several processes, including mechanosensory behavior; nematode larval development; and regulation of alternative mRNA splicing, via spliceosome. Located in nucleolus. Expressed widely. Is an ortholog of human IK (IK cytokine).
Biotype  SO:0001217 Genetic Position  II :-6.254 ±0.004598
Length (nt)  ? 7744
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004896

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:Y49F6B.4.1 Y49F6B.4.1 1823   II: 3482408-3490151
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y49F6B.4 Y49F6B.4 1644   II: 3482513-3482529

13 RNAi Result

WormBase ID
WBRNAi00057039
WBRNAi00020798
WBRNAi00091018
WBRNAi00069434
WBRNAi00024233
WBRNAi00037303
WBRNAi00083646
WBRNAi00083747
WBRNAi00062443
WBRNAi00062875
WBRNAi00023358
WBRNAi00024158
WBRNAi00023871

258 Allele

Public Name
gk964317
gk963801
gk963053
WBVar01693909
WBVar01692014
WBVar01695368
WBVar01695367
WBVar01695366
WBVar01695365
WBVar01695369
WBVar01695371
WBVar01695370
WBVar01695372
WBVar01603674
WBVar02069247
WBVar01437009
WBVar01437010
WBVar01437015
WBVar01437016
WBVar01437013
WBVar01437014
WBVar01437011
WBVar01437012
WBVar01719239
WBVar01719238
WBVar01719235
WBVar01719237
WBVar01719236
WBVar01719240
WBVar01823125

1 Chromosome

WormBase ID Organism Length (nt)
II Caenorhabditis elegans 15279421  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004896 3482408 3490151 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrII_3490152..3491096   945 II: 3490152-3491096 Caenorhabditis elegans

127 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
Bacteria diet: Escherichia coli HB101. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria E. coli HB101 for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:HB101_downregulated
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts that showed significantly increased expression in animals exposed to 400uM tamoxifen from L1 to L4 larva stage. DEseq2, fold change > 2 WBPaper00064505:tamoxifen_upregulated
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Genes down regulated by mir-243(n4759). RNAs that changed at least 2-fold with a probability of p > 0.05 in three biological replicates were considered differentially regulated between wild-type and mir-243. WBPaper00036130:mir-243_down_regulated
  Genes with increased RNA expression after 24 hours rotenone treatment EdgeR provides statistical routines for determining differential expression in digital gene expression data using a model based on the negative binomial distribution. The resulting p-values were adjusted using the Benjamini and Hochbergs approach for controlling the false discovery rate (FDR). Transcripts with an adjusted p-value smaller 0.05 were assigned as differentially expressed. WBPaper00044426:rotenone_24h_upregulated
Temprature shift to 28C for 24 hours. Transcripts that showed significantly decreased expression after animals were exposed to 28C temperature for 24 hours. Differentially expressed genes wereidentified using DESeq (v.1.18.0) by normalizing readsbased on the negative binomial distribution method andcomparing each HS timepoint to the 0-h control. WBPaper00061341:28C_24h_downregulated
  Transcripts that showed significantly decreased expression in tetraploid N2 comparing to diploid N2 animals at L4 larva stage. DESeq2 R package (1.20.0), fold change > 2, and FDR < 0.05. WBPaper00066110:tetraploid_vs_diploid_downregulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Proteins interacting with HA-PPM-1.D. N.A. WBPaper00062498:PPM-1.D_interacting
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed altered expression from P0 to F2 generation animals after N2 parental generation were treated with antimycin, but not in damt-1(gk961032) P0 to F2 animals after the parenal generaton were treated with antimycin. N.A. WBPaper00055862:antimycin_damt-1(gk961032)_regulated
  Transcripts that showed significantly decreased expression in the neurons of bcat-1(RNAi) animals at 5-days post L4 adult hermaphrodite stage, comparing to animals injected with empty vector. DESeq2. FDR < 0.05. WBPaper00060459:bcat-1(RNAi)_downregulated
  Transcripts that showed significantly decreased expression in nhl-2(ok818) comparing to in N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
  Transcripts that showed decreased expression in hlh-11(ko1) knockout strain comparing to in wild type background. DESeq2, FDR < 0.05 WBPaper00060683:hlh-11(ko1)_downregulated
Starvation 48 hours at L1 arrest Transcripts that showed significantly increased expression in starved N2 animals (48 hours at L1 arrest) Fold change > 2. WBPaper00064005:starvation_upregulated_N2_mRNA
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in hpx-2(dg047) after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_hpx-2(dg047)
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  Transcripts of coding genes that showed significantly increased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_enriched_coding-RNA
  Germline-intrinsic transcripts. Comparisons were made between genotypes by subtracting the mean log value of one ratio from another, and the significance of the difference was evaluated using Student t-test for two populations. For the fem-3(gf) versus fem-1(lf) direct comparison, authors performed the same analysis, except they used a Students t-test for one population. Author chose a combination of a twofold difference with a t value exceeding 99% confidence (P < 0.01), because these criteria allowed the inclusion of essentially all genes that had previously been identified as germline-enriched in a wt/glp-4 hermaphrodite comparison. Additionally, requiring a twofold difference reduced false positives, as the number of genes with two-fold difference and a P<0.01 only included ~100 genes more than with P < 0.001, and almost all genes showed germline expression by in situ hybridization. [cgc6390]:intrinsic
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034163 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1032435 Tiling arrays expression graphs  
    Expr3427 SMU-2::GFP is a nuclear protein. SMU-2::GFP was observed at all stages of development including early embryogenesis and in oocytes. To show that SMU-2 is ubiquitously expressed, embryos and larvae carrying the integrated smu-2::gfp array were costained with the DNA stain DAPI and SMU-2::GFP was observed in essentially every cell that was stained by DAPI. Unlike most transgenes in C. elegans, the smu-2::gfp transgene was expressed in the germ line. To confirm this result with another SMU-2-expressing transgene, a rescuing transgene that placed a HA tag at the very end of the protein were constructed. This construct also led to nuclear expression in all cells, including the germ line. SMU-2::GFP was occasionally observed localized to a subnuclear compartment that probably is the nucleolus because of its morphology and its increased size in ncl-1 mutants, which have enlarged nucleoli. However, nucleolar localization was dependent on the health of the animal. Animals suffering from prolonged exposure to sodium azide showed a dramatic increase in nucleolar localization. Nucleolar localization of SMU-2::HA was rarely observed. SMU-2 probably is not normally localized to the nucleolus. No other consistent pattern of subnuclear localization was observed.
    Expr2015930 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1025656 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1160483 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

21 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  part_of

5 Homologues

Type
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004896 3482408 3490151 1

21 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  part_of

0 Regulates Expr Cluster

1 Sequence

Length
7744

1 Sequence Ontology Term

Identifier Name Description
gene  

5 Strains

WormBase ID
WBStrain00034413
WBStrain00034411
WBStrain00034415
WBStrain00034394
WBStrain00054680

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrII_3481685..3482407   723 II: 3481685-3482407 Caenorhabditis elegans