WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004771 Gene Name  sem-2
Sequence Name  ? C32E12.5 Brief Description  sem-2 encodes an HMG-box transcription factor that affects the migration, division, and cell-fate of the 2 sex myoblasts, and also affects embryonic viability, elongation of embryos, egg laying, anterior body-wall muscle development, and organization of the anterior hypodermis.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable DNA-binding transcription activator activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Involved in cell fate specification; positive regulation of developmental process; and positive regulation of egg-laying behavior. Located in nucleus. Expressed in several structures, including E lineage cell; M.v nucleus; RMH; body wall muscle cell from M lineage; and neuroblasts. Human ortholog(s) of this gene implicated in several diseases, including Coffin-Siris syndrome (multiple); hypotrichosis-lymphedema-telangiectasia syndrome; and hypotrichosis-lymphedema-telangiectasia-renal defect syndrome. Is an ortholog of several human genes including SOX11 (SRY-box transcription factor 11); SOX18 (SRY-box transcription factor 18); and SOX4 (SRY-box transcription factor 4).
Biotype  SO:0001217 Genetic Position  I :-0.266303 ±0.002475
Length (nt)  ? 15286
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004771

Genomics

3 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C32E12.5.2 C32E12.5.2 2102   I: 5204530-5219815
Transcript:C32E12.5.1 C32E12.5.1 2001   I: 5204530-5208128
Transcript:C32E12.5.3 C32E12.5.3 2059   I: 5204530-5215218
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C32E12.5 C32E12.5 1215   I: 5205309-5205536

4 RNAi Result

WormBase ID
WBRNAi00041681
WBRNAi00108024
WBRNAi00107238
WBRNAi00116510

183 Allele

Public Name
gk962706
gk963902
WBVar01431569
WBVar01431568
gk855212
gk835365
gk514374
gk807946
gk780713
gk869474
gk884693
gk877849
gk807947
gk904055
gk377315
gk814522
gk493002
gk597734
gk791514
gk412376
gk344915
gk638500
gk862348
gk513021
gk913370
gk584258
gk680164
gk395060
gk878950
gk451101

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004771 5204530 5219815 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_5203505..5204529   1025 I: 5203505-5204529 Caenorhabditis elegans

169 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Transcripts that showed significantly decreased expression in AGP22 [nhr-49(nr2041)I;glp-1(e2141)III] comparing to in CF1903 [glp-1(e2144)III] at Day 2 adults. Fold change > 2, p Value of < 0.05 and a false discovery rate (FDR) of < 0.05. WBPaper00061530:nhr-49(e2144)_downregulated
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Allantoin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Psora and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Psora-Allantoin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_upregulated
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts that showed significantly decreased expression in day 3 adult hermaphrodite comparing to in L4 larva daf-16(mu86);glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_downregulated_daf-16(mu86);glp-1(e2141)
  Transcripts that showed significantly decreased expression in day 3 adult hermaphrodite comparing to in L4 larva glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_downregulated_glp-1(e2141)
Bacteria infection: Bacillus thuringiensis Transcripts that showed significantly increased expression in N2 animals infected by bacteria BMB171/Cry5Ba, an acrystalliferous Bt mutant BMB171 transformed with toxin gene cry5Ba on the shuttle vector pHT304, comparing to N2 animals infected by BMB171/pHT304. N.A. WBPaper00064229:B.thuringiensis-Cry5Ba_upregulated
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
Bacteria infection: Staphylococcus aureus MW2. 4 hours of exposure. Transcripts that showed significantly increased expression after N2 animals had 4 hours of infection by Staphylococcus aureus (MW2). DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:S.aureus-4h_upregulated_N2
  Transcripts that showed significantly increased expression in alg-1(gk214), comparing to in N2. DESeq2, Fold change > 1.5. WBPaper00051404:alg-1(gk214)_upregulated
  Transcripts that showed significantly increased expression at the intestine cells of daf-2(e1370) comparing to the intestine cells of N2 animals at L2 larva stage. DESeq2 (version 1.24.0), fold change >= 2, FDR < 0.05 WBPaper00064632:daf-2(e1370)_upregulated_intestine
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Genes that showed oscillating mRNA expression level throughout the 16 hour time courses from L3 larva to young adult. The following three lines of R code were used to perform the classification: increasing <-2*amplitude-PC1 < -1.7; oscillating <-!increasing & (amplitude > 0.55); flat <-!increasing & !oscillating; Note that the amplitude of a sinusoidal wave corresponds to only half the fold change between trough and peak. WBPaper00044736:oscillating_dev_expression
  Transcripts that showed significantly altered expression at URX, AQR, and PQR neurons in camt-1(ok515) animals comparing to in wild type AX1888-1 strain. RNA-seq data were mapped using PRAGUI - a Python 3-based pipeline for RNA-seq data analysis. WBPaper00061902:camt-1(ok515)_regulated_URX-AQR-PQR
  Transcripts that showed significantly increased expression in ilc-17.1(syb5296) comparing to in N2 animals at L4 larva stage. DESeq2, fold change > 2, FDR < 0.05. WBPaper00066594:ilc-17.1(syb5296)_upregulated
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated
Skin Wounding: skin wounding using femtosecond or Micropoint UV laser or with single stabs of a microinjection needle to the anterior or posterior body 24 h after the L4 stage. Transcripts that showed significantly decreased expression after skin wounding using femtosecond or Micropoint UV laser or with single stabs of a microinjection needle to the anterior or posterior body 24 h after the L4 stage of N2 animals. The cutoff for differential expressed genes (DEGs) were: Benjamini-Hochberg adjusted p-value less than 0.05 and fold change larger than 1.5. WBPaper00059895:wounding_downregulated
  Transcripts that showed significantly decreased expression in set-2(tm1630) animals at embryo stage, comparing to in N2 animals. DESeq2 (v2.1.8.3) was used to determine DE genes and to generate principal component and scatter plots. DE genes with FDR < 0.05 were analysed using g:Profiler with Bonferroni correction. WBPaper00060014:set-2(tm1630)_downregulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed

8 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr9973 Expression of gfp::sem-2 was first detectable in a subset of cells of the E and MS lineages in early gastrulating-stage embryos. The gfp::sem-2 expression persisted through embryonic and larval development in many cell types, including vulval, hypodermal and intestinal cells. gfp::sem-2 expression in the M lineage was first detectable at the 16-M stage in the SM mother cells, M.v(l/r)pa, and remained in both of their daughter cells, M.v(l/r)paa and M.v(l/r)pap. The presence of GFP::SEM-2 in M.v(l/r)pap was transient: GFP::SEM-2 was not detectable after M.v(l/r)pap differentiated into BWMs. However, GFP::SEM-2 persisted in the nuclei of the SM cells and all their descendants until the 8-SM stage, and became undetectable at the 16-SM stage. The GFP::SEM-2 protein was nuclear localized, consistent with its predicted role as a transcription factor.
    Expr1032364 Tiling arrays expression graphs  
    Expr12603 sem-2 expression was identified in several early blastomeres and in some neuronal and non-neuronal progenitors. Its expression colocalized with that of sox-2 in a few progenitors. As in vertebrates, sem-2 expression was observed in postmitotic neurons, but in contrast to the broad expression of SoxC genes in vertebrate postmitotic neurons, only a single class of postembryonic neurons expresses sem-2, the RMH class. sem-2 expression in these neurons is observed throughout larval and adult stages. We did not detect any sem-2 expression in the RME neurons themselves, but sem-2 is instead expressed in the progenitor of RMEL/R.  
    Expr3492 Expressed in: early MS and E lineages.  
    Expr2015741 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1014812 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2033974 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1145704 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

23 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in
  involved_in
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

63 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004771 5204530 5219815 -1

23 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in
  involved_in
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
15286

1 Sequence Ontology Term

Identifier Name Description
gene  

3 Strains

WormBase ID
WBStrain00027046
WBStrain00028919
WBStrain00036910

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_5219816..5230548   10733 I: 5219816-5230548 Caenorhabditis elegans