WormMine

WS294

InterMine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  WBGene00004858
Gene Name  sma-4
Sequence Name  R12B2.1
Brief Description  sma-4 encodes a Smad protein and a homolog of human DPC4, which when mutated leads to pancreatic carcinoma (OMIM:260350); SMA-4 is similar to members of the vertebrate protein family of Dwarfins; during development, sma-4 functions as part of a DBL-1/SMA-6 TGF-beta-related signaling pathway that controls body size and male tail sensory ray and spicule formation; sma-4, through this pathway, also regulates reproductive aging; studies have shown that a reduction of TGF-beta pathway genes extends reproductive span by maintaining oocyte and germline quality; SMA-4 can physically interact with the RNT-1/RUNX transcription factor, which interacts and functions with the TGF-beta signaling pathway, in regulating body size and male tail development.
Organism  Caenorhabditis elegans
Automated Description  Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific; I-SMAD binding activity; and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Involved in several processes, including determination of adult lifespan; nematode male tail tip morphogenesis; and positive regulation of growth. Predicted to be located in cytoplasm and nucleus. Predicted to be part of heteromeric SMAD protein complex. Expressed in head neurons and pseudocoelom. Human ortholog(s) of this gene implicated in several diseases, including carcinoma (multiple); colorectal adenoma; and female reproductive organ cancer (multiple). Is an ortholog of human SMAD4 (SMAD family member 4).
Biotype  SO:0001217
Genetic Position  III: -1.43987 ± 0.00065
Length (nt)  3973

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004858

Genomics

3 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:R12B2.1a.2 R12B2.1a.2 1882   III: 5816232-5820204
Transcript:R12B2.1a.1 R12B2.1a.1 1832   III: 5816275-5820197
Transcript:R12B2.1b.1 R12B2.1b.1 570   III: 5819047-5820065
 

Other

2 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:R12B2.1a R12B2.1a 1698   III: 5816277-5816348
CDS:R12B2.1b R12B2.1b 570   III: 5819047-5819132

17 RNAi Result

WormBase ID
WBRNAi00009064
WBRNAi00026094
WBRNAi00051828
WBRNAi00086336
WBRNAi00086337
WBRNAi00113601
WBRNAi00005142
WBRNAi00086330
WBRNAi00086331
WBRNAi00027660
WBRNAi00086338
WBRNAi00086339
WBRNAi00097939
WBRNAi00097940
WBRNAi00086324
WBRNAi00086325
WBRNAi00075970

61 Allele

Public Name
gk964518
gk175463
gk175464
gk175465
gk175466
gk175467
gk175468
gk175469
gk964338
gk964339
WBVar01263952
WBVar01263954
WBVar01263953
tm4731
ok3140
WBVar01656716
e805
e729
e792
gk870293
gk483855
gk793759
gk655811
gk358236
gk555582
gk539383
gk394372
gk683105
jj278
gk361825

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature . Primary Identifier Start End Strand
WBGene00004858 5816232 5820204 1
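
The Start and End columns can be related to the Length (nt) field with a small calculation. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 3973 nt length reported for this gene; the names `Location` and `parse_location` are hypothetical helpers and not part of WormMine. The same arithmetic does not apply to the transcript lengths listed above, which are shorter than their genomic spans.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as shown in the Chromosome Location column."""

    chromosome: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the length.
        return self.end - self.start + 1


# Matches strings such as "III: 5816232-5820204".
_LOCATION_RE = re.compile(r"^\s*(\w+)\s*:\s*(\d+)\s*-\s*(\d+)\s*$")


def parse_location(text: str) -> Location:
    """Parse a 'chromosome: start-end' string into a Location."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chromosome, start, end = match.groups()
    return Location(chromosome, int(start), int(end))


gene = parse_location("III: 5816232-5820204")
print(gene.chromosome, gene.span)
```

Applied to the gene interval, the span equals the reported gene length; applied to the downstream intergenic region (III: 5820205-5823023), it equals the 2819 nt listed for that region.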

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_5820205..5823023   2819 III: 5820205-5823023 Caenorhabditis elegans

146 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzing fluorescence-activated cell sorted (FACS) neurons. DESeq. False discovery rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer larva in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  TGF-beta Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_upregulated
  Genes that showed increased expression in wdr-5(ok1417) compared with N2. Statistical analysis for misexpression was performed using a moderated t test from the package limma. All genes with a false discovery rate (FDR) of <= 5% (p <= 0.05) were selected as differentially regulated. WBPaper00045861:wdr-5(ok1417)_upregulated
  Transcripts that showed significantly increased expression in glp-1(e2141); TU3401 animals compared to TU3401 animals. Fold change > 2, FDR < 0.01. WBPaper00065993:glp-1(e2141)_upregulated
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at old adult stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_aging
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) at old adult stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_aging
  Genes with expression level regulated by genotype (N2 vs CB4856) at Late reproduction stage (96 hours at 24 centigrade). Authors permuted transcript values and used a genome-wide threshold of -log10 P-value = 2, which resembles a false discovery rate (FDR) of 0.0118. WBPaper00040858:eQTL_regulated_reproductive
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals compared to N2. DESeq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
  Transcripts that showed significantly decreased expression in N2 animals exposed to 0.1mM Paraquat from hatching to reaching adult stage. DESeq2 version 1.22.2, p < 0.05 WBPaper00064716:paraquat_downregulated
  Transcripts that showed significantly increased expression in daf-2(e1370) compared to control animals. NOIseq(v2.34.0), fold change >= 1.5, Differentially expressed genes (DEGs) were defined as having a probability of differential expression > 95%. WBPaper00064727:daf-2(e1370)_upregulated
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 larva stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
  Transcripts that showed significantly increased expression in xrep-4(lax137). DESeq2. Genes were selected if their p value < 0.01. WBPaper00066062:xrep-4(lax137)_upregulated
  Genes with increased RNA expression after 24 hours rotenone treatment. EdgeR provides statistical routines for determining differential expression in digital gene expression data using a model based on the negative binomial distribution. The resulting p-values were adjusted using the Benjamini and Hochberg approach for controlling the false discovery rate (FDR). Transcripts with an adjusted p-value smaller than 0.05 were assigned as differentially expressed. WBPaper00044426:rotenone_24h_upregulated
  Transcripts that showed significantly increased expression in nuo-6(qm200) compared to N2. Differential gene expression analysis was performed using the quasi-likelihood framework in edgeR package v. 3.20.1 in R v. 3.4.1. WBPaper00053810:nuo-6(qm200)_upregulated
  Transcripts that showed significantly increased expression in pie-1(ne4443[PIE-1-Degron-GFP]) DESeq2. Differentially expressed genes were defined as twofold change and adjusted p-value less than 0.05. WBPaper00061478:pie-1(ne4433)_upregulated
  Transcripts that showed significantly increased expression in hda-1(ne4752[3xFLAG-Degron-HDA-1]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4752)_upregulated
  Transcripts that showed significantly increased expression in mep-1(ne4629[MEP-1-GFP-Degron]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:mep-1(ne4629)_upregulated
  Transcripts that showed significantly increased expression in ubc-9(ne4833[ubc-9(G56R)]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:ubc-9(ne4833)_upregulated
  Genes significantly enriched (> 2x, FDR < 5%) in a particular cell-type versus a reference sample of all cells at the same stage. A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:hypodermis_larva_enriched
  Genes that show selective expression in a subset of cell types vs broadly expressed in many cell types. Correspond to 20% - 57% of enriched_genes for a given cell type. A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:hypodermis_larva_SelectivelyEnriched
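
Several Algorithm entries above select transcripts by a false-discovery-rate adjusted p-value, sometimes combined with a fold-change cutoff. The following is a minimal sketch of the Benjamini-Hochberg adjustment followed by such a filter. It is not the code used by any of the cited papers or packages (DESeq2, edgeR, limma and others have their own implementations), and the function names, default thresholds, and example values are illustrative assumptions.

```python
from typing import List, Sequence


def benjamini_hochberg(pvalues: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_upregulated(
    names: Sequence[str],
    pvalues: Sequence[float],
    fold_changes: Sequence[float],
    alpha: float = 0.05,      # assumed FDR cutoff
    min_fold: float = 2.0,    # assumed fold-change cutoff
) -> List[str]:
    """Keep names with adjusted p < alpha and fold change >= min_fold."""
    adjusted = benjamini_hochberg(pvalues)
    return [
        name
        for name, q, fc in zip(names, adjusted, fold_changes)
        if q < alpha and fc >= min_fold
    ]


# Made-up example values, for illustration only.
names = ["a", "b", "c", "d"]
pvalues = [0.01, 0.04, 0.03, 0.005]
fold_changes = [3.0, 2.5, 1.2, 4.0]
print(select_upregulated(names, pvalues, fold_changes))
```

In this made-up example every adjusted p-value falls below 0.05, so the fold-change cutoff alone removes the third entry.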

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034127 Single cell embryonic expression. Only cell types with an expression fraction of greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
Clone: pUL#JS9B4   Expr7640 Expression is seen in nerves of the head and also faintly in the pseudocoelom. There is also some expression in the posterior gut. Expression is seen from the late embryo onwards.  
    Expr1032410 Tiling arrays expression graphs  
    Expr1155487 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2015894 Single cell embryonic expression. Only cell types with an expression fraction of greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1013575 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  

39 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
occurs_in(WBbt:0005733) involved_in
occurs_in(WBbt:0005733) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

5 Homologues

Type
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier Start End Strand
WBGene00004858 5816232 5820204 1

39 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
occurs_in(WBbt:0005733) involved_in
occurs_in(WBbt:0005733) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

4 Regulates Expr Cluster

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in sma-4(rax3) compared to N2 at 1-day post-L4 adult hermaphrodite. HTSeq-count was used to count reads mapped to each gene and counting data was imported to edgeR for statistical analysis. Statistical significance was defined by adjusted P value (false discovery rate, FDR) of <0.05. WBPaper00053184:sma-4(rax3)_upregulated
  Genes upregulated in sma-2 L4 (3 arrays) or sma-4 L4 (1 array) vs. N2 L4. FDR = 0%, SAM. WBPaper00037682:sma-2_sma-4_upregulated
  Transcripts that showed significantly decreased expression in sma-4(rax3) compared to N2 at 1-day post-L4 adult hermaphrodite. HTSeq-count was used to count reads mapped to each gene and counting data was imported to edgeR for statistical analysis. Statistical significance was defined by adjusted P value (false discovery rate, FDR) of <0.05. WBPaper00053184:sma-4(rax3)_downregulated
  Genes downregulated in sma-2 L4 (3 arrays) or sma-4 L4 (1 array) vs. N2 L4. FDR = 0%, SAM. WBPaper00037682:sma-2_sma-4_downregulated

1 Sequence

Length
3973

1 Sequence Ontology Term