WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004895 Gene Name  smu-1
Sequence Name  ? CC4.3 Brief Description  smu-1 encodes a highly conserved WD40 repeat-containing protein that is orthologous to the vertebrate and plant SMU proteins and S. cerevisiae Pfs2p; SMU-1 activity is required, along with that of SMU-2, for regulation of splice site choice in the pre-mRNAs of unc-52 and other genes; SMU-1 physically interacts with SMU-2 both in vitro and in vivo, and is stabilized by SMU-2; SMU-1 is ubiquitously expressed and localizes to the nucleus.
Organism  Caenorhabditis elegans Automated Description  Enables protein homodimerization activity. Involved in several processes, including mechanosensory behavior; nematode larval development; and regulation of alternative mRNA splicing, via spliceosome. Located in nucleus. Expressed in several structures, including germ line. Is an ortholog of human SMU1 (SMU1 DNA replication regulator and spliceosomal factor).
Biotype  SO:0001217 Genetic Position  I :16.8484 ±0.056728
Length (nt)  ? 17057
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004895

Genomics

2 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:CC4.3.2 CC4.3.2 1830   I: 12995803-13012859
Transcript:CC4.3.1 CC4.3.1 1815   I: 12995803-13004334
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:CC4.3 CC4.3 1533   I: 12995889-12995917

6 RNAi Result

WormBase ID
WBRNAi00043298
WBRNAi00003250
WBRNAi00083609
WBRNAi00083757
WBRNAi00083847
WBRNAi00116873

257 Allele

Public Name
gk963849
gk964175
h7311
h7151
h16100
h13890
WBVar01434123
WBVar01434124
WBVar01434125
WBVar01434126
WBVar01434127
WBVar01434128
WBVar01434129
WBVar01434130
WBVar01714450
WBVar01825412
WBVar02028529
WBVar02028532
WBVar02028531
WBVar02028530
gk962760
WBVar01762432
WBVar01500563
WBVar01500562
WBVar01500565
WBVar01500564
WBVar01500561
WBVar01500567
WBVar01500566
gk357691

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004895 12995803 13012859 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

0 Downstream Intergenic Region

103 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when no food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_NoFood
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Top 300 transcripts enriched in ABalppppppa, ABpraaapppa according to single cell RNAseq. Top 300 enriched transcripts were determined by log2.ratio of the tpm in the cell type vs the tpm in the other cells * the log2 of the cell.type tpm. WBPaper00061340:ASE_parent
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Transcripts that showed significantly decreased expression in tetraploid N2 comparing to diploid N2 animals at L4 larva stage. DESeq2 R package (1.20.0), fold change > 2, and FDR < 0.05. WBPaper00066110:tetraploid_vs_diploid_downregulated
  Transcripts that showed significantly increased expression in ilc-17.1(syb5296) comparing to in N2 animals at L4 larva stage. DESeq2, fold change > 2, FDR < 0.05. WBPaper00066594:ilc-17.1(syb5296)_upregulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Proteins interacting with HA-PPM-1.D. N.A. WBPaper00062498:PPM-1.D_interacting
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed significantly decreased expression in the neurons of bcat-1(RNAi) animals at 5-days post L4 adult hermaphrodite stage, comparing to animals injected with empty vector. DESeq2. FDR < 0.05. WBPaper00060459:bcat-1(RNAi)_downregulated
  Transcripts that showed significantly decreased expression in nhl-2(ok818) comparing to in N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
  Transcripts that showed decreased expression in hlh-11(ko1) knockout strain comparing to in wild type background. DESeq2, FDR < 0.05 WBPaper00060683:hlh-11(ko1)_downregulated
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in hpx-2(dg047) after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_hpx-2(dg047)
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
Bacteria infection: Serratia marcescens Genes with increased expression after 24 hours of infection by S.marcescens Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:S.marcescens_24hr_upregulated_TilingArray
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Transcripts that showed significantly decreased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_downregulated
  Transcripts that showed significantly depleted expression in sensory neuron (labeled by iaIs25[Pgcy-37::GFP + unc-119(+)]) comparing to in whole worm. Fold change > 2, p-value < 0.05. WBPaper00060661:sensory-neuron_depleted
  Transcripts that showed significantly decreased expression in pals-22(jy3) comparing to in N2 at L4 larva stage. Differential expression analysis was performed in RStudio (v1.1.453) using R (v3.50) and Bioconductor (v3.7) packages. As outlined in the RNAseq123 vignette, data was imported, filtered and normalized using edgeR, and linear modeling and differential expression analysis was performed using limma. An FDR cutoff of <0.01 was used to define differentially expressed genes; no fold-change criteria was used. WBPaper00056034:pals-22(jy3)_downregulated
  Transcripts enriched in germline by comparing dissected germline tissue with dissected intestine tissue, both injected with empty RNAi vector. Genes were determined germline-enriched if the lowest expression value (log2(FPKM+1)) observed in the germline empty vector samples was at least 2-fold higher than the highest expression value observed in the intestine empty vector samples. WBPaper00051039:germline_enriched

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034162 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1032434 Tiling arrays expression graphs  
Germ line expression was surprising because most transgenic arrays seem to be subject to germ line silencing. [smu-1::gfp] translational fusion and [smu-1::HA] translational fusion. smu-1 is full-length in the constructs. smu-1 was fused to either the gene for GFP or sequence encoding three tandem HA tags. The expression patterns of these two full-length rescuing transgenes were analyzed by direct visualization of GFP or by antibody staining with anti-GFP or anti-HA antibodies. Expr1498 The transgenes were ubiquitously expressed throughout development. In hermaphrodite larvae, expression of the rescuing transgenes was seen in all types of somatic nuclei and in the nuclei of the female germ line. Occasionally neuronal nuclei failed to stain with anti-GFP antibodies in integrated lines containing the rescuing smu-1::gfp transgene. This was a very small proportion of the neurons, and lack of staining in these cases could have been due either to technical aspects of the antibody staining protocol or to incomplete expression of the integrated transgene. Nuclei of the other major tissue types (hypodermis, intestine, and muscle) always stained under the same conditions. In mid- to late-stage embryos, autofluorescing GFP was visible in all or most nuclei. In embryos of less than about 200 cells, it was unable to detect GFP directly, but fixation and staining with anti-GFP antibodies indicated that GFP was present in all of the nuclei of early embryos that were not in the process of dividing. Tiny foci of anti-GFP staining within these nuclei were often seen, but it is not clear whether they represent a pattern of subnuclear localization or are due to fixation or staining artifacts. Obvious foci were not seen in the cells of other developmental stages, although in some cases the fluorescence had a rough or uneven appearance when studied closely. The protein products were found localized to nuclei.
    Expr2015929 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1027341 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1147276 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

26 GO Annotation

Annotation Extension Qualifier
  part_of
  part_of
  part_of
  located_in
  enables
  enables
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

5 Homologues

Type
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004895 12995803 13012859 1

26 Ontology Annotations

Annotation Extension Qualifier
  part_of
  part_of
  part_of
  located_in
  enables
  enables
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
17057

1 Sequence Ontology Term

Identifier Name Description
gene  

6 Strains

WormBase ID
WBStrain00034408
WBStrain00034402
WBStrain00034414
WBStrain00034373
WBStrain00034393
WBStrain00051046

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_12995476..12995802   327 I: 12995476-12995802 Caenorhabditis elegans