WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID: WBGene00003378
Gene Name: mml-1
Sequence Name: T20B12.6
Brief Description: mml-1 encodes, by alternative splicing, two isoforms of a bHLH-ZIP protein orthologous to human MLX (OMIM:602976), MLXIP (OMIM:608090), and MLXIPL (OMIM:605678, deleted in Williams-Beuren syndrome); MML-1 has five N-terminal Mondo Conserved Regions, an N-terminal nuclear localization sequence, and a C-terminal bHLHZip domain; with MXL-2, MML-1 is probably required for normal migration of ray 1 precursor cells in the male tail and for proper epidermal expression of extracellular matrix component genes; MML-1 is expressed in epidermal cells from 50-100 cell embryos onward, and in intestinal cells at the 4E stage, until adulthood; MML-1 requires MXL-2 for protein stability; MML-1 binds MXL-2 but not MXL-1 in two-hybrid assays; either coexpressed MML-1/MXL-2 or MML-1 alone can activate transcription via CACGTG E-boxes.
Organism: Caenorhabditis elegans
Automated Description: Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Involved in determination of adult lifespan and negative regulation of transcription by RNA polymerase II. Located in mitochondrion and nucleus. Expressed in excretory cell; hypodermis; intestine; and muscle cell. Human ortholog(s) of this gene implicated in myocardial infarction. Is an ortholog of human MLXIP (MLX interacting protein) and MLXIPL (MLX interacting protein like).
Biotype: SO:0001217
Genetic Position: III: -0.747815 ± 0.001114
Length (nt): 4602

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00003378

Genomics

2 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:T20B12.6b.1 T20B12.6b.1 3570   III: 7370130-7374725
Transcript:T20B12.6a.1 T20B12.6a.1 3337   III: 7370131-7374731
 

Other

2 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:T20B12.6a T20B12.6a 3030   III: 7370432-7370518
CDS:T20B12.6b T20B12.6b 2814   III: 7370886-7371054

9 RNAi Result

WormBase ID
WBRNAi00066962
WBRNAi00067454
WBRNAi00053585
WBRNAi00018880
WBRNAi00005429
WBRNAi00035688
WBRNAi00071312
WBRNAi00115787
WBRNAi00073277

58 Allele

Public Name
gk964518
gk963887
WBVar02069626
gk509620
gk724319
gk894011
WBVar00060160
WBVar00060165
gk639275
WBVar00060170
gk402844
gk451428
gk907925
gk901144
gk842345
WBVar00060175
gk657881
gk527189
gk907493
gk808078
gk780356
WBVar00060180
gk466642
WBVar01628787
gk599258
gk907492
gk549153
gk449279
gk178280
gk178279

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature Primary Identifier Start End Strand
WBGene00003378 7370130 7374731 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_7369539..7370129   591 III: 7369539-7370129 Caenorhabditis elegans

138 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using Benjamini-Hochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzing fluorescence-activated cell sorted (FACS) neurons. DESeq. False discovery rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer larva in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Transcripts that showed significantly higher expression in somatic gonad precursor cells (SGP) vs. head mesodermal cells (hmc). DESeq2, fold change >= 2, FDR <= 0.01. WBPaper00056826:SGP_biased
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non-pathogenic BT (BT247(1 to 10 mix) vs BT407 6h), according to RNAseq. Cuffdiff, adjusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non-pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, adjusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non-pathogenic BT (BT247(1 to 2 mix) vs BT407 6h), according to RNAseq. Cuffdiff, adjusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non-pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, adjusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
Dietary restriction Transcripts that showed significantly decreased expression after N2 animals were under dietary restriction (DR, OP50 OD = 0.1) from 3-day post L4 till 6-day post L4 adult hermaphrodite stage, comparing to under ad libitum (AL, OP50 OD = 3) condition. Bioconductor package edgeR, p < 0.05. WBPaper00056443:DietaryRestriction_downregulated
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
Bacteria infection: Staphylococcus aureus MW2. 4 hours of exposure. Transcripts that showed significantly increased expression after N2 animals had 4 hours of infection by Staphylococcus aureus (MW2). DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:S.aureus-4h_upregulated_N2
  Transcripts that showed significantly decreased expression in N2 animals exposed to 0.1mM Paraquat from hatching to reaching adult stage. DESeq2 version 1.22.2, p < 0.05 WBPaper00064716:paraquat_downregulated
  Transcripts that showed significantly increased expression in daf-2(e1370) comparing to in control animals. NOIseq(v2.34.0), fold change >= 1.5, Differentially expressed genes (DEGs) were defined as having a probability of differential expression > 95%. WBPaper00064727:daf-2(e1370)_upregulated
  Transcripts that showed significantly increased expression in xrep-4(lax137). DESeq2. Genes were selected if their p value < 0.01. WBPaper00066062:xrep-4(lax137)_upregulated
  Proteins that showed significantly decreased expression after 1-day-old wild type adults were exposed to cisplatin (300ug per mL) for 6 hours. The differential expression analysis was performed in R. Differentially expressed proteins were identified by using a two-sided t-test on log-transformed data. WBPaper00065373:Cisplatin_downregulated_WT
  Transcripts that showed significantly increased expression in daf-2(e1370) comparing to in N2. Differential gene expression analysis was performed using the quasi-likelihood framework in edgeR package v. 3.20.1 in R v. 3.4.1. WBPaper00053810:daf-2(e1370)_upregulated
  Transcripts that showed significantly increased expression in nuo-6(qm200) comparing to in N2. Differential gene expression analysis was performed using the quasi-likelihood framework in edgeR package v. 3.20.1 in R v. 3.4.1. WBPaper00053810:nuo-6(qm200)_upregulated
  Transcripts that were regulated by both set-6(ok2195) and baz-2(tm0235) at 2-day post L4 adult hermaphrodite stage. N.A. WBPaper00059356:set-6(ok2195)_baz-2(tm0235)_regulated
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L3-L4-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:excretory-cell_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:GABAergic-motor-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 0hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:germline-precursors_blastula-embryo_expressed
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Transcripts that showed significantly increased expression in animals lacking P granules by RNAi experiments targeting pgl-1, pgl-3, glh-1 and glh-4, and unc-119-GFP(+), comparing to in control animals, at 2-day post L4 adult hermaphrodite stage. DESeq2, Benjamini-Hochberg multiple hypothesis corrected p-value < 0.05 and fold change > 2. WBPaper00050859:upregulated_P-granule(-)GFP(+)_vs_control_day2-adult
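
Several of the cluster definitions above select transcripts by a fold-change cutoff combined with a Benjamini-Hochberg corrected p-value (for example fold change > 2 and adjusted p < 0.05). The following is a minimal sketch of that kind of filter, not the code used by any of the cited papers. The function names, the tuple layout, and the toy gene labels are assumptions made for illustration only.

```python
from typing import List, Tuple


def benjamini_hochberg(pvalues: List[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    n = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


def select_upregulated(
    records: List[Tuple[str, float, float]],
    min_fold_change: float = 2.0,
    max_padj: float = 0.05,
) -> List[str]:
    """Keep genes with fold change > min_fold_change and adjusted p < max_padj.

    Each record is (gene label, fold change, raw p-value); this layout is
    hypothetical and chosen only to keep the example small.
    """
    padj = benjamini_hochberg([p for _, _, p in records])
    return [
        gene
        for (gene, fc, _), q in zip(records, padj)
        if fc > min_fold_change and q < max_padj
    ]


# Toy input with made-up values; the labels are placeholders, not real results.
toy = [
    ("geneA", 3.1, 0.01),
    ("geneB", 2.5, 0.04),
    ("geneC", 1.2, 0.03),
    ("geneD", 4.0, 0.005),
]
selected = select_upregulated(toy)
```

With these toy values, geneC fails the fold-change cutoff and the other three pass both criteria. Real pipelines such as DESeq2 or edgeR compute the p-values and fold changes themselves; only the final thresholding step is sketched here.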

11 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
Picture: Fig 5.   Expr4908 MML-1::GFP was observed in epidermal cells as early as the 50 to 100 cell stage of embryogenesis and in intestinal cells at the 4E stage. Expression persisted in these two cell types through all larval stages and adulthood. Nuclear at all stages.
    Expr1031556 Tiling arrays expression graphs  
    Expr12923 A rescuing MML-1::GFP translational reporter was widely expressed and found in the cytoplasm and nuclei of the intestine, neurons, muscle, hypodermis, excretory cell and other tissues. A closer examination of the subcellular localization revealed that MML-1::GFP also co-localized with the mitochondria, comparable to mammalian MondoA.  
    Expr1200050 Data from the TransgeneOme project  
    Expr10365 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr10366 Inferred expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr2013627 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1025664 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1157145 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2031861 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr11744   MML-1::GFP was observed to reside in both the nucleus and the cytoplasm under basal conditions, contrary to previous reports.
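
The "peak expression" metric described for the EPIC entries (Expr10365, Expr10366) can be sketched as follows: for each terminal cell, take the maximum reporter intensity seen in that cell or any ancestor, then call the cell "expressing" if the peak exceeds background (2000 intensity units) and is at least 10% of the maximum expression in the embryo. This is an illustrative sketch only. It assumes one summary intensity per cell, assumes the embryo maximum is the largest intensity over all recorded cells, and uses a tiny made-up lineage rather than the real 671 leaf cells.

```python
from typing import Dict, List, Optional

BACKGROUND = 2000.0   # background threshold in intensity units (from the description)
MIN_FRACTION = 0.10   # peak must reach at least 10% of the embryo maximum


def peak_expression(
    cell: str,
    parent: Dict[str, Optional[str]],
    intensity: Dict[str, float],
) -> float:
    """Maximal intensity observed in `cell` or any of its ancestors."""
    peak = intensity.get(cell, 0.0)
    node = parent.get(cell)
    while node is not None:
        peak = max(peak, intensity.get(node, 0.0))
        node = parent.get(node)
    return peak


def expressing_cells(
    leaves: List[str],
    parent: Dict[str, Optional[str]],
    intensity: Dict[str, float],
) -> List[str]:
    """Leaf cells whose peak expression passes both thresholds."""
    embryo_max = max(intensity.values(), default=0.0)
    result = []
    for leaf in leaves:
        peak = peak_expression(leaf, parent, intensity)
        if peak > BACKGROUND and peak >= MIN_FRACTION * embryo_max:
            result.append(leaf)
    return result


# Toy lineage: parent links and intensities are invented for illustration.
parent = {"P0": None, "AB": "P0", "P1": "P0", "ABa": "AB", "ABp": "AB"}
intensity = {"P0": 0.0, "AB": 30000.0, "P1": 2500.0, "ABa": 500.0, "ABp": 1000.0}
leaves = ["ABa", "ABp", "P1"]
expressing = expressing_cells(leaves, parent, intensity)
```

In the toy data, ABa and ABp inherit the strong signal of their ancestor AB, which is the "transposing earlier expression forward in time" effect the description mentions. P1 is above background but below 10% of the embryo maximum, so it is not counted.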

22 GO Annotation

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
has_input(WB:WBGene00003073) involved_in
  involved_in
  enables
  enables
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  enables
  enables
  enables
  enables

6 Homologues

Type
least diverged orthologue
least diverged orthologue
orthologue
orthologue
orthologue
least diverged orthologue

1 Locations


Feature Primary Identifier Start End Strand
WBGene00003378 7370130 7374731 -1

22 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
has_input(WB:WBGene00003073) involved_in
  involved_in
  enables
  enables
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  located_in
  enables
  enables
  enables
  enables

2 Regulates Expr Cluster

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly decreased expression in neuron specific mml-1(RNAi); glp-1(e2141); TU3401 animals, comparing to in glp-1(e2141); TU3401 animals fed with control vector. Fold change > 2, FDR < 0.01. WBPaper00065993:mml-1(RNAi)-neuron_downregulated
  Transcripts that showed significantly increased expression in neuron specific mml-1(RNAi); glp-1(e2141); TU3401 animals, comparing to in glp-1(e2141); TU3401 animals fed with control vector. Fold change > 2, FDR < 0.01. WBPaper00065993:mml-1(RNAi)-neuron_upregulated

1 Sequence

Length
4602

1 Sequence Ontology Term

Identifier Name Description
gene  

1 Strains

WormBase ID
WBStrain00031665

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_7374732..7377850   3119 III: 7374732-7377850 Caenorhabditis elegans
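
The lengths and positions reported in this record are consistent with inclusive start and end coordinates, so a feature's length equals end - start + 1, and the two intergenic regions abut the gene with no gap. The short check below uses only numbers taken from the record; the helper name `span_length` is an assumption introduced for the example.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end - start + 1


# Coordinates on chromosome III, copied from the record above.
gene = (7370130, 7374731)        # WBGene00003378, strand -1
downstream = (7369539, 7370129)  # intergenic_region_chrIII_7369539..7370129
upstream = (7374732, 7377850)    # intergenic_region_chrIII_7374732..7377850

gene_length = span_length(*gene)
downstream_length = span_length(*downstream)
upstream_length = span_length(*upstream)

# The regions flank the gene directly: no gap and no overlap.
downstream_adjacent = downstream[1] + 1 == gene[0]
upstream_adjacent = gene[1] + 1 == upstream[0]
```

Because the gene lies on the -1 strand, the "downstream" region sits at lower coordinates than the gene and the "upstream" region at higher coordinates.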