WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00003078 Gene Name  lsm-4
Sequence Name  ? F32A5.7 Brief Description  lsm-4 encodes a protein with similarity to the Sm-like LSM4 small nuclear ribonucleoproteins that are core components of the spliceosomal U6 small nuclear ribonucleoprotein complex; RNAi experiments indicate that lsm-4 is an essential gene required for development, growth, locomotion, body morphogenesis, and normal lifespan; about 5% to 20% of lsm-4 transcripts are RNA edited to change A to G resulting in a change to residue 13 from an N to a D.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable U6 snRNA binding activity. Predicted to be involved in P-body assembly and spliceosomal snRNP assembly. Located in cytoplasm and nucleus. Expressed in germ line and somatic cell. Human ortholog(s) of this gene implicated in atherosclerosis. Is an ortholog of human LSM4 (LSM4 homolog, U6 small nuclear RNA and mRNA degradation associated).
Biotype  SO:0001217 Genetic Position  II :0.499911 ±4.5e-05
Length (nt)  ? 932
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00003078

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:F32A5.7.1 F32A5.7.1 609   II: 7252232-7253163
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:F32A5.7 F32A5.7 372   II: 7252407-7252553

31 RNAi Result

WormBase ID
WBRNAi00070567
WBRNAi00025344
WBRNAi00046073
WBRNAi00046079
WBRNAi00075249
WBRNAi00117294
WBRNAi00014196
WBRNAi00075363
WBRNAi00075362
WBRNAi00075365
WBRNAi00075364
WBRNAi00075359
WBRNAi00075361
WBRNAi00075360
WBRNAi00075393
WBRNAi00075394
WBRNAi00083737
WBRNAi00083825
WBRNAi00071162
WBRNAi00025345
WBRNAi00082968
WBRNAi00082970
WBRNAi00075248
WBRNAi00117284
WBRNAi00070564
WBRNAi00070566
WBRNAi00070565
WBRNAi00027312
WBRNAi00117305
WBRNAi00083641

9 Allele

Public Name
gk963801
gk963053
gk962682
gk427604
gk882452
WBVar00173221
WBVar00173222
ok3151
tm8176

1 Chromosome

WormBase ID Organism Length (nt)
II Caenorhabditis elegans 15279421  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00003078 7252232 7253163 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrII_7252016..7252231   216 II: 7252016-7252231 Caenorhabditis elegans

72 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
Heat shock: 35C for 1 hour. Transcripts that showed significantly increased expression immediately after 1-day post L4 adult hermaphrodite endu-2(tm4977) animals were exposed to 35C for 1 hour. The DESeq2 (GalaxyVersion 2.11.40.6 + galaxy1) was used to determine differentiallyexpressed features from count tables of differential transcript abundances. log2FC > 1, FDR < 0.01. WBPaper00065749:Heat-Shock_upregulated_endu-2(tm4977)
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
Pheromone Pheromone-induced transcripts that showed significantly decreased expression in post dauer animals comparing to wild type control. edgeR WBPaper00053713:Pheromone-induced_postdauer_vs_control_downregulated
Temprature shift to 28C for 24 hours. Transcripts that showed significantly decreased expression after animals were exposed to 28C temperature for 24 hours. Differentially expressed genes wereidentified using DESeq (v.1.18.0) by normalizing readsbased on the negative binomial distribution method andcomparing each HS timepoint to the 0-h control. WBPaper00061341:28C_24h_downregulated
  Transcripts that showed significantly altered expression at URX, AQR, and PQR neurons in camt-1(ok515) animals comparing to in wild type AX1888-1 strain. RNA-seq data were mapped using PRAGUI - a Python 3-based pipeline for RNA-seq data analysis. WBPaper00061902:camt-1(ok515)_regulated_URX-AQR-PQR
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Proteins identified in extracellular vesicle. N.A. WBPaper00062669:extracellular-vesicle_protein
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Germline-intrinsic transcripts. Comparisons were made between genotypes by subtracting the mean log value of one ratio from another, and the significance of the difference was evaluated using Student t-test for two populations. For the fem-3(gf) versus fem-1(lf) direct comparison, authors performed the same analysis, except they used a Students t-test for one population. Author chose a combination of a twofold difference with a t value exceeding 99% confidence (P < 0.01), because these criteria allowed the inclusion of essentially all genes that had previously been identified as germline-enriched in a wt/glp-4 hermaphrodite comparison. Additionally, requiring a twofold difference reduced false positives, as the number of genes with two-fold difference and a P<0.01 only included ~100 genes more than with P < 0.001, and almost all genes showed germline expression by in situ hybridization. [cgc6390]:intrinsic
  Transcripts that showed significantly increased expression in set-2(tm1630) animals at embryo stage, comparing to in N2 animals. DESeq2 (v2.1.8.3) was used to determine DE genes and to generate principal component and scatter plots. DE genes with FDR < 0.05 were analysed using g:Profiler with Bonferroni correction. WBPaper00060014:set-2(tm1630)_upregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Embryonic class (E): genes that significantly increase in abundance at some point during embryogenesis. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_E
  Transcripts enriched in germline by comparing dissected germline tissue with dissected intestine tissue, both injected with empty RNAi vector. Genes were determined germline-enriched if the lowest expression value (log2(FPKM+1)) observed in the germline empty vector samples was at least 2-fold higher than the highest expression value observed in the intestine empty vector samples. WBPaper00051039:germline_enriched
Bacteria infection: Xenorhabdus nematophila Caenorhabditis elegans Genes with expression levels changed significantly after treatment of Xenorhabdus nematophila. Differential expression were calculated by empirical eBayes method using eBayes function. P_value <= 0.01 and log2 fold change > 1 were used to call differentially expressed genes in all datasets. WBPaper00041606:CE_X.nematophila_regulated
control(maintained under normal lab light (mostly dark, in incubators).) vs EtBr-exposed(maintained under normal lab light (mostly dark, in incubators) and exposed to EtBr (5ug/mL in agar).) at just prior to the second UVC dose (24h). Genes differentially expressed in control vs under EtBr treatment without UVC exposure, at the -25h timepoint. Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:control_vs_EtBr-exposed_24h
control(maintained under normal lab light (mostly dark, in incubators).) vs UVC-EtBr-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total) and exposed to EtBr (5ug/mL in agar).) at just prior to the second UVC dose (24h). Genes differentially expressed in control vs after UVC exposure and EtBr treatment at the -25h timepoint (just prior to the second UVC dose (24h)). Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:control_vs_UVC-EtBr-exposed_24h
control(maintained under normal lab light (mostly dark, in incubators).) vs UVC-EtBr-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total) and exposed to EtBr (5ug/mL in agar).) at just prior to the third UVC dose (48h). Genes differentially expressed in control vs after UVC exposure and EtBr treatment at the -1h timepoint (just prior to the third UVC dose (48h)). Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:control_vs_UVC-EtBr-exposed_48h
UVC-EtBr-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total) and exposed to EtBr (5ug/mL in agar).) vs UVC-exposed(exposed to 7.5 J/m2 UVC radiation 3 times, 24 h apart (48 h total).) at just prior to the third UVC dose (48h). Genes differentially expressed under EtBr treatment and UVC exposure vs under UVC exposure but without EtBr treatment at the -1h timepoint (just prior to the third UVC dose (48h)). Transcripts were defined as fold-change >1.2, p < 0.05 based on Rosetta Resolver analysis for all pairwise treatment comparisons. The fold-change refers to the second intensity over the first. WBPaper00041939:UVC-EtBr-exposed_vs_UVC-exposed_48h
  Genes identified as down-regulated at a 5% false discovery rate through RNAseq experiments with three tatn-1(qd182) and three N2 RNA samples. ANOVA with FDR <= 0.05. WBPaper00044656:tatn-1(qd182)_downregulated
  Genes expressed in embryonic motor neurons (identified by unc-4::GFP expressing cells). Genes called Present by MAS 5.0 in 2 out of 3 unc-4::GFP hybridizations. WBPaper00025141:unc-4::GFP_Expressed_Genes
  Transcripts that showed significantly decreased expression after animals were treated with 50uM Rifampicin and 100uM Psora from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Psora_downregulated
  Genes with differential expression under 0.5mg/l Chlorpyrifos (CPF) and 0.5mg/l Diazinon (DZN) treatment at 24 centigrade. To identify the differentially expressed genes in each treatment authors used linear models per toxicant and temperature (gene expression = Toxicant (effect) + error). The lm function in R stats package was used to implement the linear models analysis with recommended default options. For threshold determination authors used a permutation approach. For each of the 23,232 permutations used authors randomly picked a transcript (array spot), which could only be picked once. Authors combined all the expression values of this transcript and randomly distributed them over the replicates and used them in the linear model. In this way authors obtained a threshold for each of the toxicants. Authors used a -log10 p-value 2 as common threshold for the analysis, which resembles to the following FDR per toxicant: 0.0155 for CPF at 24 centigrade, 0.0148 for DZN at 24 centigrade, 0.0168 for CPF+DZN at 24 centigrade, 0.0142 for CPF at 16 centigrade, 0.0151 for DZN at 16 centigrade, and 0.0148 for CPF+DZN, at 16 centigrade. WBPaper00040210:Chlorpyrifos_Diazinon_24C_regulated
Bacteria diet: complex environmental microbiotas Transcripts that showed significantly decreased expression after L1 larva grew on soil microbiota at 25C for 3 days. N.A. WBPaper00056139:soil-microbiota_downregulated

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1031453 Tiling arrays expression graphs  
    Expr12402   LSM-4 was expressed both in the nucleus and cytoplasm in somatic and germ cells.
    Expr2013291 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1010323 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1149969 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2031522 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

19 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  part_of
  part_of
  part_of
  part_of
  located_in
  located_in
  located_in
  located_in
  part_of
  enables
  enables

6 Homologues

Type
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00003078 7252232 7253163 -1

19 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  part_of
  part_of
  part_of
  part_of
  located_in
  located_in
  located_in
  located_in
  part_of
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
932

1 Sequence Ontology Term