WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004886 Gene Name  smi-1
Sequence Name  ? Y39A3CR.1 Brief Description  smi-1 encodes the C. elegans ortholog of human Gemin2, a novel protein that interacts with the product of the survival motor neuron (SMN) gene, mutations in which are associated with spinal muscular atrophy; in C. elegans, smi-1 is an essential gene required for embryonic development past the mid-proliferation stage; in vitro, SMI-1 physically interacts with C. elegans SMN-1, indicating that the interaction between these two proteins is conserved; SMI-1 is expressed throughout development in multiple tissue types including the gut, neurons, and body wall muscles; SMI-1 localizes primarily to nuclei, with some protein also detected in the cytoplasm and in some neuronal processes.
Organism  Caenorhabditis elegans Automated Description  Involved in embryo development. Located in cytoplasm. Expressed in body wall musculature; intestine; and ventral cord neurons. Used to study spinal muscular atrophy. Is an ortholog of human GEMIN2 (gem nuclear organelle associated protein 2).
Biotype  SO:0001217 Genetic Position  III :-15.8236 ±0.017184
Length (nt)  ? 2502
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004886

Genomics

4 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:Y39A3CR.1b.1 Y39A3CR.1b.1 868   III: 1878365-1880866
Transcript:Y39A3CR.1a.1 Y39A3CR.1a.1 892   III: 1878365-1880866
Transcript:Y39A3CR.1c.1 Y39A3CR.1c.1 432   III: 1879319-1880747
Transcript:Y39A3CR.1d.1 Y39A3CR.1d.1 456   III: 1879319-1880747
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y39A3CR.1b Y39A3CR.1b 741   III: 1878373-1878606
CDS:Y39A3CR.1c Y39A3CR.1c 432   III: 1879319-1879372
CDS:Y39A3CR.1a Y39A3CR.1a 765   III: 1878373-1878606
CDS:Y39A3CR.1d Y39A3CR.1d 456   III: 1879319-1879372

3 RNAi Result

WormBase ID
WBRNAi00103033
WBRNAi00020360
WBRNAi00103032

94 Allele

Public Name
gk962532
gk964281
otn12077
WBVar01326258
WBVar01326260
WBVar01694014
WBVar01606874
WBVar01540717
WBVar01540715
WBVar01540716
gk422624
gk832133
gk813962
gk844797
gk669152
gk635221
gk730089
gk321150
gk571899
gk884827
gk382029
gk727249
gk508960
gk753017
gk757462
gk757463
gk728983
gk466625
gk466626
otn371

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004886 1878365 1880866 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_1880867..1881155   289 III: 1880867-1881155 Caenorhabditis elegans

57 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when no food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_NoFood
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Allantoin_upregulated
  Transcripts that showed significantly higher expression in somatic gonad precursor cells (SGP) vs. head mesodermal cells (hmc). DESeq2, fold change >= 2, FDR <= 0.01. WBPaper00056826:SGP_biased
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
Bacteria diet: Escherichia coli HB101. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria E. coli HB101 for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:HB101_downregulated
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Transcripts that showed significantly increased expression in oocyte germline cells comparing to in mitosis germline cells. Log2 Fold change > 2 or <-1, p-value < 0.05. WBPaper00053599:oocyte_vs_mitosis_upregulated
Temprature shift to 28C for 24 hours. Transcripts that showed significantly decreased expression after animals were exposed to 28C temperature for 24 hours. Differentially expressed genes wereidentified using DESeq (v.1.18.0) by normalizing readsbased on the negative binomial distribution method andcomparing each HS timepoint to the 0-h control. WBPaper00061341:28C_24h_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:PVD-OLL-neurons_L3-L4-larva_expressed
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed significantly decreased expression in the neurons of bcat-1(RNAi) animals at 5-days post L4 adult hermaphrodite stage, comparing to animals injected with empty vector. DESeq2. FDR < 0.05. WBPaper00060459:bcat-1(RNAi)_downregulated
  Transcripts that showed significantly decreased expression in nhl-2(ok818) comparing to in N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
Starvation 48 hours at L1 arrest Transcripts that showed significantly increased expression in starved N2 animals (48 hours at L1 arrest) Fold change > 2. WBPaper00064005:starvation_upregulated_N2_mRNA
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in hpx-2(dg047) after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_hpx-2(dg047)
  Transcripts of coding genes that showed significantly increased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_enriched_coding-RNA
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Transcripts that showed differential expression between 24 and 26 hours post hatching L2d and dauer committed larvae of daf-9(dh, triggered by the dafachronic acid (DA) growth hormone6). Cluster 2 genes' expression gradually increased into dauer. Benjamini Hochberg corrected q-value < 0.01. WBPaper00053388:dauer_regulated_Cluster2
  Transcripts enriched in germline by comparing dissected germline tissue with dissected intestine tissue, both injected with empty RNAi vector. Genes were determined germline-enriched if the lowest expression value (log2(FPKM+1)) observed in the germline empty vector samples was at least 2-fold higher than the highest expression value observed in the intestine empty vector samples. WBPaper00051039:germline_enriched
Bacteria infection: Xenorhabdus nematophila Caenorhabditis elegans Genes with expression levels changed significantly after treatment of Xenorhabdus nematophila. Differential expression were calculated by empirical eBayes method using eBayes function. P_value <= 0.01 and log2 fold change > 1 were used to call differentially expressed genes in all datasets. WBPaper00041606:CE_X.nematophila_regulated
  Genes identified as down-regulated at a 5% false discovery rate through RNAseq experiments with three tatn-1(qd182) and three N2 RNA samples. ANOVA with FDR <= 0.05. WBPaper00044656:tatn-1(qd182)_downregulated
  Genes expressed in embryonic motor neurons (identified by unc-4::GFP expressing cells). Genes called Present by MAS 5.0 in 2 out of 3 unc-4::GFP hybridizations. WBPaper00025141:unc-4::GFP_Expressed_Genes
Bacteria diet: complex environmental microbiotas Transcripts that showed significantly decreased expression after L1 larva grew on soil microbiota at 25C for 3 days. N.A. WBPaper00056139:soil-microbiota_downregulated
  Transcripts that showed significantly decreased expression in animals with germline-specific inx-14(RNAi) comparing to in aniamls fed with control vector, both exposed to PA14 infection. DESeq2. Differentially-expressed genes (DEG) were identified based on two criteria: FDR (False discovery rateusing Benjamini-Hochberg adjusted p-values) < 0.01 and absolute value of log2(Fold Change) > 1. WBPaper00066146:germline-inx-14(RNAi)_downregulated_PA14

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr4282 Transgenic animals expressing SMI-1::GFP demonstrated strong expression throughout development, up to and including adulthood. SMI-1 was expressed in the gut at all stages through development. In addition, fluorescence was also detected in a subset of nerve cells in the head, in ventral nerve cord cell bodies and also in body-wall muscle cells. A high level of fluorescence was observed in vulval abnormalities, moulting defects and sterility. Expressed in ventral nerve cord cell bodies
    Expr2034151 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1032425 Tiling arrays expression graphs  
    Expr2015918 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1159661 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1027897 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  

17 GO Annotation

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  part_of
  part_of
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  part_of
  part_of

5 Homologues

Type
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004886 1878365 1880866 1

17 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  part_of
  part_of
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  part_of
  part_of

0 Regulates Expr Cluster

1 Sequence

Length
2502

1 Sequence Ontology Term

Identifier Name Description
gene  

0 Strains

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_1878316..1878364   49 III: 1878316-1878364 Caenorhabditis elegans