WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00003639 Gene Name  nhr-49
Sequence Name  ? K10C3.6 Brief Description  nhr-49 encodes a nuclear hormone receptor (NHR) related to the mammalian HNF4 (hepatocyte nuclear factor 4) family of NHRs; nhr-49 functions as a key regulator of fat metabolism and lifespan by regulating induction of beta-oxidation genes upon food deprivation and activation of stearoyl-CoA desaturase in fed animals, respectively; NHR-49 activates transcription in conjunction with the MDT-15 mediator subunit with which it physically interacts.
Organism  Caenorhabditis elegans Automated Description  Enables DNA-binding transcription factor binding activity. Involved in determination of adult lifespan; positive regulation of transcription by RNA polymerase II; and regulation of fatty acid metabolic process. Located in cytoplasm and nucleus. Expressed in hypodermis; intestinal cell; intestine; muscle cell; and neurons. Used to study alcohol use disorder and obesity. Human ortholog(s) of this gene implicated in several diseases, including Fanconi renotubular syndrome 4; glucose metabolism disease (multiple); and liver disease (multiple). Is an ortholog of human HNF4A (hepatocyte nuclear factor 4 alpha).
Biotype  SO:0001217 Genetic Position  I :3.96404 ±0.005389
Length (nt)  ? 5030
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00003639

Genomics

7 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:K10C3.6b.2 K10C3.6b.2 2119   I: 9869651-9874673
Transcript:K10C3.6a.2 K10C3.6a.2 2128   I: 9869652-9874680
Transcript:K10C3.6c.1 K10C3.6c.1 1506   I: 9869718-9874124
Transcript:K10C3.6d.1 K10C3.6d.1 1461   I: 9869767-9874124
Transcript:K10C3.6a.1 K10C3.6a.1 1902   I: 9871228-9874590
Transcript:K10C3.6b.1 K10C3.6b.1 1935   I: 9871228-9874626
Transcript:K10C3.6e.1 K10C3.6e.1 1065   I: 9872822-9874124
 

Other

5 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:K10C3.6a K10C3.6a 1434   I: 9871230-9871325
CDS:K10C3.6e K10C3.6e 1065   I: 9872822-9873046
CDS:K10C3.6b K10C3.6b 1431   I: 9871230-9871325
CDS:K10C3.6c K10C3.6c 1506   I: 9869718-9869787
CDS:K10C3.6d K10C3.6d 1461   I: 9869767-9869791

49 RNAi Result

WormBase ID
WBRNAi00102848
WBRNAi00102849
WBRNAi00034245
WBRNAi00050502
WBRNAi00025936
WBRNAi00025937
WBRNAi00087333
WBRNAi00087335
WBRNAi00087334
WBRNAi00064636
WBRNAi00003976
WBRNAi00025938
WBRNAi00114385
WBRNAi00071048
WBRNAi00087306
WBRNAi00113001
WBRNAi00113231
WBRNAi00114381
WBRNAi00107530
WBRNAi00064840
WBRNAi00064533
WBRNAi00116851
WBRNAi00086179
WBRNAi00086181
WBRNAi00086192
WBRNAi00086193
WBRNAi00102843
WBRNAi00087307
WBRNAi00087886
WBRNAi00110686

78 Allele

Public Name
gk962858
gk962706
gk963902
gk963849
h12339
h9072
gk963112
WBVar01432636
WBVar01826425
gk119623
gk119624
gk119625
gk119626
nr2041
gk119619
gk119620
gk119621
gk119622
gk119631
gk119632
gk119633
gk119627
WBVar01343190
gk119628
gk119629
gk119630
gk405
WBVar01343179
ttTi39505
ok2165

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00003639 9869651 9874680 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_9874681..9874697   17 I: 9874681-9874697 Caenorhabditis elegans

104 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  oocyte proteins identified by two or more unique peptides during proteomics study. In the pooled data set, 1453 C. elegans proteins were identified with a probability >= 0.9 according to ProteinProphet, of which 1165 proteins were identified by more than one unique peptide. WBPaper00038289:oocyte_protein
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
Bacteria infection: Enterococcus faecalis Genes with increased expression after 24 hours of infection by E.faecalis Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:E.faecalis_24hr_upregulated_TilingArray
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Genes significantly enriched in NSM neurons (isolated by FACS) versus the reference, according to RNAseq analysis towards total RNA. Gene expression quantification and differential expression was analyzed using cufflinks v2.2.1. Enriched contains only genes significantly enriched (differentially expressed >= 2.4 fold in total RNA or >= 3.2 fold in DSN treated total RNA) in the NSM neurons versus the reference. WBPaper00045974:NSM_enriched_totalRNA_RNAseq
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_6h
Bacteria infection: Staphylococcus aureus MW2. 4 hours of exposure. Transcripts that showed significantly increased expression after N2 animals had 4 hours of infection by Staphylococcus aureus (MW2). DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:S.aureus-4h_upregulated_N2
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
  Transcripts that showed significantly increased expression in daf-2(e1370) comparing to in N2. Differential gene expression analysis was performed using the quasi-likeli-hood framework in edgeR package v. 3.20.1 in R v. 3.4.1. WBPaper00053810:daf-2(e1370)_upregulated
  Transcripts that showed significantly increased expression in nuo-6(qm200) comparing to in N2. Differential gene expression analysis was performed using the quasi-likeli-hood framework in edgeR package v. 3.20.1 in R v. 3.4.1. WBPaper00053810:nuo-6(qm200)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:excretory-cell_L2-larva_expressed
  Transcripts of coding genes that showed significantly increased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_enriched_coding-RNA
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Transcripts that showed significantly increased expression in sma-2(rax5) comparing to in N2 at 1-day post-L4 adult hermaphrodite HTseq-count was used to count reads mapped to each gene and counting data was imported to EdgeR for statistical analysis. Statistical significance was defined by adjusted P value (false discovery rate, FDR) of <0.05. WBPaper00053184:sma-2(rax5)_upregulated
  Embryonic class (E): genes that significantly increase in abundance at some point during embryogenesis. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_E
  Genes depleted in muscle cells (24hr muscle dataset). Dissociated myo-3::GFP embryos were cultured for 24 hours before FACS sorting. A two-class unpaired analysis was performed to identify genes that are elevated 1.7-fold or greater when compared with the reference for each dataset, at a false discovery rate of 1.8% or less for M0 and 1.2% or less for the M24 datasets. WBPaper00031003:24hr_muscle_depleted

9 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1031683 Tiling arrays expression graphs  
    Expr12144 Animals expressing NHR-49::GFP showed widespread fluorescence throughout embryonic and larval development. In adults, it was visible in all somatic tissues (such as neurons, muscle, and hypodermis), localized to both nuclei and cytoplasm, with highest expression in intestinal cells.  
    Expr16231 Both NHR-49::GFP and mRuby::NHR-79 could be detected in the nuclei of intestinal and hypodermal cells.  
    Expr10378 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr1021226 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
Operon: CEOP1566   Expr9486   Sub-cellular localization within the body wall muscle: Nucleus only
    Expr1154213 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2014198 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr2032439 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

42 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
has_input(WB:WBGene00009801),part_of(GO:0042594) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
part_of(WBbt:0006804) located_in
  located_in
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables

8 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00003639 9869651 9874680 1

42 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
has_input(WB:WBGene00009801),part_of(GO:0042594) involved_in
  involved_in
  involved_in
  involved_in
  involved_in
part_of(WBbt:0006804) located_in
  located_in
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables

4 Regulates Expr Cluster

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly decreased expression in AGP22 [nhr-49(nr2041)I;glp-1(e2141)III] comparing to in CF1903 [glp-1(e2144)III] at Day 2 adults. Fold change > 2, p Value of < 0.05 and a false discovery rate (FDR) of < 0.05. WBPaper00061530:nhr-49(e2144)_downregulated
  Transcripts that showed significantly increased expression in AGP22 [nhr-49(nr2041)I;glp-1(e2141)III] comparing to in CF1903 [glp-1(e2144)III] at Day 2 adults. Fold change > 2, p Value of < 0.05 and a false discovery rate (FDR) of < 0.05. WBPaper00061530:nhr-49(e2144)_upregulated
  Proteins found to be significantly regulated by quantitative proteomics upon RNAi knock down of nhr-49 compared to control worms. N.A. WBPaper00040134:nhr-49(RNAi)_regulated_protein
  Transcripts that showed more than four fold increase of expression after 4 hours of exposure to 7.5mM tBOOH. Top Tags in edgeR with log-fold change of (+2) and FDR <0.05 cutoffs. WBPaper00053798:nhr-49(nr2041)_tBOOH_upregulated

1 Sequence

Length
5030

1 Sequence Ontology Term