WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004859 Gene Name  sma-5
Sequence Name  ? W06B3.2 Brief Description  sma-5 encodes a serine/threonine kinase homologous to the mammalian MAP kinase MAPK7/ERK5 (OMIM:602521, required for development of extraembryonic vasculature and embryonic cardiovasculature); SMA-5 is required for normal body size morphogenesis, growth rates, and intestinal granule distribution, and for regulating the size of the intestine, body wall muscle, and hypodermis, as well as the number of intestinal nuclei; SMA-5 is expressed in the intestine and in hypodermal seam cells.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable protein serine/threonine kinase activity. Involved in several processes, including determination of adult lifespan; parturition; and positive regulation of cell size. Predicted to be located in cytoplasm and nucleus. Expressed in excretory cell; hypodermis; intestine; and pharynx.
Biotype  SO:0001217 Genetic Position  X :6.99341 ±0.016149
Length (nt)  ? 8035
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004859

Genomics

7 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:W06B3.2a.2 W06B3.2a.2 2618   X: 11997515-12005549
Transcript:W06B3.2d.1 W06B3.2d.1 1853   X: 11997524-12000632
Transcript:W06B3.2f.1 W06B3.2f.1 1754   X: 11997526-12000632
Transcript:W06B3.2b.1 W06B3.2b.1 1940   X: 11997526-12001389
Transcript:W06B3.2a.1 W06B3.2a.1 2033   X: 11997528-12001387
Transcript:W06B3.2c.1 W06B3.2c.1 1530   X: 11997961-12005004
Transcript:W06B3.2e.1 W06B3.2e.1 1566   X: 11998022-12005004
 

Other

6 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:W06B3.2b W06B3.2b 1461   X: 11997961-11998091
CDS:W06B3.2f W06B3.2f 1314   X: 11997961-11998091
CDS:W06B3.2c W06B3.2c 1530   X: 11997961-11998091
CDS:W06B3.2a W06B3.2a 1497   X: 11998022-11998091
CDS:W06B3.2d W06B3.2d 1350   X: 11998022-11998091
CDS:W06B3.2e W06B3.2e 1566   X: 11998022-11998091

5 RNAi Result

WormBase ID
WBRNAi00054867
WBRNAi00113184
WBRNAi00087939
WBRNAi00061290
WBRNAi00112954

130 Allele

Public Name
gk964260
gk964029
gk962707
gk964028
gk963810
WBVar01690677
WBVar01690678
tm11477
tm448
WBVar01825958
WBVar01621228
WBVar01621229
WBVar01621230
WBVar01621231
WBVar01571181
WBVar01954586
WBVar01954585
WBVar01571172
WBVar01571171
WBVar01571170
WBVar01571169
WBVar01571168
WBVar01571167
WBVar01571166
WBVar01571165
WBVar01571180
WBVar01571179
WBVar01571178
WBVar01571177
WBVar01571176

1 Chromosome

WormBase ID Organism Length (nt)
X Caenorhabditis elegans 17718942  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004859 11997515 12005549 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrX_11995840..11997514   1675 X: 11995840-11997514 Caenorhabditis elegans

201 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_upregulated
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when no food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_NoFood
  Genes that were upregulated in lin-15B(n744). For each gene in each microarray hybridization experiment, the ratio of RNA levels from the two samples was transformed into a log2 value and the mean log2 ratio was calculated. The log2 ratios were normalized by print-tip Loess normalization (Dudoit and Yang, 2002). All genes with a false discovery rate of <= 5% (q <= 0.05) (Storey and Tibshirani, 2003) and a mean fold-change ratio of >= 1.5 were selected for further analysis. WBPaper00038168:lin-15B(n744)_upregulated
Bacteria infection: Enterococcus faecalis Genes with increased expression after 24 hours of infection by E.faecalis Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:E.faecalis_24hr_upregulated_TilingArray
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin_upregulated
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at old adults stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_aging
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
  Transcripts that showed significantly increased expression in nhr-114(gk849) comparing to wild type animals at L4 larva. DESeq2 1.26.0, fold change > 2, FDR < 0.05. WBPaper00064539:nhr-114(gk849)_upregulated
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Transcripts depleted in purified oocyte P bodies comparing to in the whole animal. DESeq2, FDR < 0.05, fold change > 2. WBPaper00065975:P-body_vs_WholeAnimal_depleted
  Transcripts that showed significantly increased expression in xrep-4(lax137). DESeq2. Genes were selected if their p value < 0.01. WBPaper00066062:xrep-4(lax137)_upregulated
Starvation Starvation-induced transcripts that showed significantly increased expression in post dauer animals comparing to wild type control. edgeR WBPaper00053713:Starvation-induced_postdauer_vs_control_upregulated
  Genes that showed oscillating mRNA expression level throughout the 16 hour time courses from L3 larva to young adult. The following three lines of R code were used to perform the classification: increasing <-2*amplitude-PC1 < -1.7; oscillating <-!increasing & (amplitude > 0.55); flat <-!increasing & !oscillating; Note that the amplitude of a sinusoidal wave corresponds to only half the fold change between trough and peak. WBPaper00044736:oscillating_dev_expression
  Transcripts that showed significantly increased expression in hda-1(ne4752[3xFLAG-Degron-HDA-1]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4752)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L3-L4-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:excretory-cell_L2-larva_expressed
  Transcripts that showed significantly altered expression at URX, AQR, and PQR neurons in camt-1(ok515) animals comparing to in wild type AX1888-1 strain. RNA-seq data were mapped using PRAGUI - a Python 3-based pipeline for RNA-seq data analysis. WBPaper00061902:camt-1(ok515)_regulated_URX-AQR-PQR
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 0hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:germline-precursors_blastula-embryo_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:hypodermis_L3-L4-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:PVD-OLL-neurons_L3-L4-larva_expressed
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated

7 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034128 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr3269 A second GFP fusion construct pPDW06-9, which contains the same promoter for W06B3.2c and the entire genomic region of W06B3.2a/c except for the termination codon, rescued the phenotypes of the sma-5 mutant, and looks to be expressed in the same regions as was pPDW06-1a. Fusion protein is localized throughout the intestine. The third construct pPDW06-c carries the 4.9 kb sequence upstream of exon 1 and a GFP gene. GFP in this promoter fusion is expressed in hypodermis and pharynx. pPDW06-1a expresses a GFP fusion with N-terminal five amino acids of W06B3.2a/c under a control of promoter for W06B3.2c of 1.6 kb. GFP expression was found in intestine and excretory cell, in all stages of a transgenic line carrying extrachromosomal arrays of pPDW06-1a. In the intestine, the four most anterior cells show stronger GFP expression, although entire intestine is fluorescent. The expression in the excretory cell was confirmed by comparison with the expression pattern of vha-1p::gfp that is predominantly expressed in the excretory cell. GFP in the excretory cell is seen continuously along the lateral lines for most of the length of the worm. Weak expression in hypodermis is also seen.  
    Expr1032411 Tiling arrays expression graphs  
    Expr13166 the SMA-5::EGFP fluorescence was exclusively detected in the cytoplasm of intestinal cells in strain BJ269. This was somewhat in contrast to the previous observations of Watanabe et al. (Watanabe et al., 2005), who found expression of two sma-5 reporter constructs containing 1.6 kb of the upstream region and almost none or all of the gene-coding region not only in the intestine but also in the excretory cell and to some degree in the hypodermis. In addition, our observations were in disagreement with the distribution of another reporter containing 4.9 kb of the region upstream of exon 1 described by the same group whose expression was noted only in the pharynx and hypodermis.  
    Expr1024285 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1158412 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2015895 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

28 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
occurs_in(WBbt:0005792) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  involved_in
  involved_in
occurs_in(WBbt:0005733) involved_in
occurs_in(WBbt:0005733) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in
occurs_in(WBbt:0005772),has_input(GO:0044840) involved_in
occurs_in(WBbt:0005812) involved_in
  involved_in
  involved_in
  involved_in

7 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004859 11997515 12005549 -1

28 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
occurs_in(WBbt:0005792) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  involved_in
  involved_in
occurs_in(WBbt:0005733) involved_in
occurs_in(WBbt:0005733) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in
occurs_in(WBbt:0005772),has_input(GO:0044840) involved_in
occurs_in(WBbt:0005812) involved_in
  involved_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
8035

1 Sequence Ontology Term

Identifier Name Description
gene  

6 Strains

WormBase ID
WBStrain00026840
WBStrain00027059
WBStrain00007516
WBStrain00007517
WBStrain00007519
WBStrain00007518

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrX_12005550..12007896   2347 X: 12005550-12007896 Caenorhabditis elegans