WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00012802 Gene Name  set-25
Sequence Name  ? Y43F4B.3 Brief Description  set-25 encodes a putative histone H3 lysine-9 methyltransferase, with a C-terminal SET domain but no obvious non-nematode orthologs; SET-25 is required both for normal axonal guidance during development, and for a normally low somatic mutation rate; set-25(RNAi) animals have defective DD/VD motoneuron commissures, and also display an elevated somatic mutation rate, but are otherwise normal.
Organism  Caenorhabditis elegans Automated Description  Enables histone H3K9 methyltransferase activity. Involved in negative regulation of gene expression, epigenetic. Part of heterochromatin. Expressed in hypodermis; intestine; muscle cell; and neurons.
Biotype  SO:0001217 Genetic Position  III :21.203 ±0.001426
Length (nt)  ? 9267
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00012802

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:Y43F4B.3.1 Y43F4B.3.1 2800   III: 13283824-13293090
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y43F4B.3 Y43F4B.3 2145   III: 13284466-13284508

14 RNAi Result

WormBase ID
WBRNAi00066113
WBRNAi00056486
WBRNAi00056489
WBRNAi00020541
WBRNAi00020542
WBRNAi00094013
WBRNAi00094227
WBRNAi00113458
WBRNAi00006230
WBRNAi00006437
WBRNAi00037043
WBRNAi00061925
WBRNAi00064127
WBRNAi00091520

134 Allele

Public Name
gk963887
gk963904
gk963552
gk190151
gk190154
gk190155
gk190152
gk190153
gk190156
gk190162
gk190163
gk190160
gk190161
gk190166
gk190164
gk190165
gk190158
gk190159
gk190157
WBVar00182609
WBVar00182610
WBVar00182611
gk945198
gk945197
WBVar01827989
h17777
h14914
WBVar01339241
WBVar01339240
WBVar01339243

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00012802 13283824 13293090 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

0 Downstream Intergenic Region

131 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in N2 after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_N2
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Genes with expression level regulated by genotype (N2 vs CB4856) at Late reproduction stage (96 hours at 24 centigrade). Authors permuted transcript values and used a genome-wide threshold of log10 P-value = 2, which resembles a false discovery rate (FDR) of 0.0118. WBPaper00040858:eQTL_regulated_reproductive
Bacteria diet: Escherichia coli HB101. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria E. coli HB101 for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:HB101_downregulated
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
Heat Shock: 35C 4 hours at L4 larva stage. Transcripts that showed significantly decreased expression after L4 larva N2 animals were heat stressed at 35C for 4 hours DESeq2 WBPaper00057154:HeatShock_downregulated_mRNA
  Transcripts that showed significantly increased expression after exposure to 75uM paraquat(PQ) from L1 to day 2 adult stage in skn-1(lax188) animals fold change > 2 WBPaper00058711:paraquat_upregulated
Temprature shift to 28C for 24 hours. Transcripts that showed significantly decreased expression after animals were exposed to 28C temperature for 24 hours. Differentially expressed genes wereidentified using DESeq (v.1.18.0) by normalizing readsbased on the negative binomial distribution method andcomparing each HS timepoint to the 0-h control. WBPaper00061341:28C_24h_downregulated
  Transcripts that showed significantly decreased expression in tetraploid N2 comparing to diploid N2 animals at L4 larva stage. DESeq2 R package (1.20.0), fold change > 2, and FDR < 0.05. WBPaper00066110:tetraploid_vs_diploid_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 0hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:germline-precursors_blastula-embryo_expressed
  Transcripts that showed significantly increased expression in ilc-17.1(syb5296) comparing to in N2 animals at L4 larva stage. DESeq2, fold change > 2, FDR < 0.05. WBPaper00066594:ilc-17.1(syb5296)_upregulated
  Transcripts that showed altered expression in cat-1(RNAi) animals comparing to control animals injected with empty vector. p-value <= 0.05 WBPaper00066902:cat-1(RNAi)_regulated
Fungi infection: Haptoglossa zoospora. Transcripts that showed significantly altered expression after L4 N2 animals were exposed to omycete Haptoglossa zoospora for 6 hours. Kalisto abundance files were converted and analysed using Sleuth in a R pipeline. Standard Sleuth protocols were used to calculate differential expression. P value < 0.01 and FDR < 0.01. WBPaper00062354:H.zoospora_6h_regulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
Bacteria: B.thuringiensis Transcripts in N2 animals that were significantly differentially expressed at least for one time point and one pathogenic strain Bt247 and Bt679 compared to the non pathogenic strain Bt407. Cuffdiff WBPaper00060358:B.thuringiensis_pathogen_regulated_N2
  Transcripts that showed significantly decreased expression in pfd-6(gk493446); daf-2(e1370) comparing to in daf-2(e1370). Limma version 3.24.15. Fold change < 0.67 (p < 0.05). WBPaper00055827:pfd-6(gk493446)_downregulated
  Transcripts that showed significantly decreased expression in the neurons of bcat-1(RNAi) animals at 5-days post L4 adult hermaphrodite stage, comparing to animals injected with empty vector. DESeq2. FDR < 0.05. WBPaper00060459:bcat-1(RNAi)_downregulated
  Transcripts that showed significantly decreased expression in nhl-2(ok818) comparing to in N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in hpx-2(dg047) after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_hpx-2(dg047)
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  Transcripts that showed significantly altered expression after 24 hour exposure to nitroguanidine (NQ). Multivariate permutation tests with random variance model implemented in BRB-Array Tools version 4.5 were performed to infer differentially expressed genes (DEGs). One thousand random permutations were computed per chemical class (i.e., a group of 16 arrays or samples). The confidence level of false discovery rate assessment was set at 80%, and the maximum allowed portion of false-positive genes was 10%. WBPaper00055899:nitroguanidine_regulated
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:hypodermis_L1-larva_expressed
  Transcripts of coding genes that showed significantly increased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_enriched_coding-RNA
  Germline-intrinsic transcripts. Comparisons were made between genotypes by subtracting the mean log value of one ratio from another, and the significance of the difference was evaluated using Student t-test for two populations. For the fem-3(gf) versus fem-1(lf) direct comparison, authors performed the same analysis, except they used a Students t-test for one population. Author chose a combination of a twofold difference with a t value exceeding 99% confidence (P < 0.01), because these criteria allowed the inclusion of essentially all genes that had previously been identified as germline-enriched in a wt/glp-4 hermaphrodite comparison. Additionally, requiring a twofold difference reduced false positives, as the number of genes with two-fold difference and a P<0.01 only included ~100 genes more than with P < 0.001, and almost all genes showed germline expression by in situ hybridization. [cgc6390]:intrinsic

9 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034003 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr14353    
    Expr16248   SET-13 and SET-25 were enriched in the nucleus. SET-13::GFP was expressed through eight-cell stage to late-embryonic stage, while SET-25::GFP was expressed in the whole embryonic stage and the germline in young adults.
    Expr14249 For mCherry::SET-25 we saw expression in the nuclei of the mitotic zone of the germline. This expression was detected from larval stage 2 (L2) onward. Faint expression was also detected in all nuclei of embryos from about the 8- to 16-cell stage. mCherry::SET-25 was seen to be associated with the chromatin throughout mitotic cell divisions in the embryos.  
    Expr1035663 Tiling arrays expression graphs  
    Expr1159958 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1015985 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2015770 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr10578   SET-25 Localizes to Heterochromatic Perinuclear Foci.

14 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in

3 Homologues

Type
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00012802 13283824 13293090 -1

14 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in

2 Regulates Expr Cluster

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly decreased expression (fold change > 2, adjusted p-value < 0.05) in EKM99[met-2(n4256) set-25(n5021)] injected with vector RNAi, comparing to in control animals (N2 injected with vector RNAi). Differential expression analysis was performed using DESeq2 v1.6.3 in R version 3.2.3. All analyses were performed with genes that had average expression level above 1 RPKM (fragments per kilobase per million, as calculated by Cufflinks). WBPaper00050193:met-2(n4256)set-25(n5021)_downregulated_L1
  Transcripts that showed significantly increased expression (fold change > 2, adjusted p-value < 0.05) in EKM99[met-2(n4256) set-25(n5021)] injected with vector RNAi, comparing to in control animals (N2 injected with vector RNAi). Differential expression analysis was performed using DESeq2 v1.6.3 in R version 3.2.3. All analyses were performed with genes that had average expression level above 1 RPKM (fragments per kilobase per million, as calculated by Cufflinks). WBPaper00050193:met-2(n4256)set-25(n5021)_upregulated_L1

1 Sequence

Length
9267

1 Sequence Ontology Term

Identifier Name Description
gene  

9 Strains

WormBase ID
WBStrain00027579
WBStrain00050680
WBStrain00050682
WBStrain00050692
WBStrain00050691
WBStrain00051708
WBStrain00051707
WBStrain00051705
WBStrain00054548

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_13293091..13293680   590 III: 13293091-13293680 Caenorhabditis elegans