WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00006807 Gene Name  unc-75
Sequence Name  ? C17D12.2 Brief Description  unc-75 encodes an RNA-binding protein with two N-terminal RNA recognition motifs (RRMs), a glutamine/asparagine-rich linker domain, and a third C-terminal RRM that is orthologous to the mammalian CELF/BrunoL proteins that control pre-mRNA splicing; UNC-75 is required for neuron-specific splicing of unc-32 mRNA; UNC-75 binds the unc-32 intron 7a in vitro; both UNC-75 and EXC-7 are required in parallel for normal cholinergic neurotransmission; UNC-75 is expressed in all neurons and in neurosecretory gland cells, and is required for normal modulation of GABA- and acetylcholine-mediated neurotransmission; UNC-75 protein is found with other RRM proteins in dynamic nuclear speckles, consistent with a role in alternative mRNA splicing; unc-75 mutations can be rescued in vivo by a human unc-75 transgene, but not by exc-7 or W02D3.11, indicating that UNC-75 acts on evolutionarily conserved but highly specific pre-mRNA substrates.
Organism  Caenorhabditis elegans Automated Description  Enables single-stranded RNA binding activity. Involved in several processes, including cholinergic synaptic transmission; positive regulation of synaptic transmission; and regulation of alternative mRNA splicing, via spliceosome. Located in nuclear speck. Expressed in neurons and pharyngeal gland cell. Is an ortholog of human CELF5 (CUGBP Elav-like family member 5) and CELF6 (CUGBP Elav-like family member 6).
Biotype  SO:0001217 Genetic Position  I :9.28192 ±0.021291
Length (nt)  ? 9467
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00006807

Genomics

4 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C17D12.2a.1 C17D12.2a.1 1990   I: 11592246-11601712
Transcript:C17D12.2b.1 C17D12.2b.1 1395   I: 11592302-11601323
Transcript:C17D12.2c.1 C17D12.2c.1 1251   I: 11596947-11601323
Transcript:C17D12.2d.1 C17D12.2d.1 1101   I: 11596947-11601323
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C17D12.2d C17D12.2d 1101   I: 11596947-11597108
CDS:C17D12.2a C17D12.2a 1545   I: 11592302-11592406
CDS:C17D12.2b C17D12.2b 1395   I: 11592302-11592406
CDS:C17D12.2c C17D12.2c 1251   I: 11596947-11597108

10 RNAi Result

WormBase ID
WBRNAi00089748
WBRNAi00040788
WBRNAi00040789
WBRNAi00002961
WBRNAi00002963
WBRNAi00028998
WBRNAi00116889
WBRNAi00090005
WBRNAi00090164
WBRNAi00090323

250 Allele

Public Name
gk962858
gk962706
gk963849
h8722
h4606
h4893
gk748668
gk815266
gk905256
gk737570
gk932907
gk360877
gk922886
gk710275
gk896522
gk345310
gk326181
gk499256
gk389678
gk883659
gk411977
gk458846
gk757892
gk383865
gk410180
gk896523
gk428972
gk484890
gk913395
gk529593

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00006807 11592246 11601712 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_11601713..11601861   149 I: 11601713-11601861 Caenorhabditis elegans

164 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes significantly enriched (> 2x, FDR < 5%) in a particular cell-type versus a reference sample of all cells at the same stage. A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_larva_enriched
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
  Neuronally enriched transcripts according to a comparison of neuronal nuclei IP samples to total nuclei using isolation of nuclei from tagged specific cell types (INTACT) technology. DESEQ2, fold change > 2 and FDR < 0.01. WBPaper00062103:neuron_enriched
  Transcripts that showed significantly increased expression in csr-1a(tor159) comparing to in N2 at 25C. DESeq2, fold change > 2, p-value < 0.05. WBPaper00061753:csr-1(tor159)_upregulated_25C
Bacteria: E.faecalis strain OG1RF Transcripts that showed significantly increased expression after infection by E. faecalis OG1RF. Ballgown was used to calculate differential expression of genes using FPKM data and to generate tables with fold change and P values. Genes were shortlisted with a cutoff of 2-fold change and P values of less than 0.05. WBPaper00059754:E.faecalis_OG1RF_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin_upregulated
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Genes significantly enriched in NSM neurons (isolated by FACS) versus the reference, according to RNAseq analysis towards total RNA. Gene expression quantification and differential expression was analyzed using cufflinks v2.2.1. Enriched contains only genes significantly enriched (differentially expressed >= 2.4 fold in total RNA or >= 3.2 fold in DSN treated total RNA) in the NSM neurons versus the reference. WBPaper00045974:NSM_enriched_totalRNA_RNAseq
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts that showed significantly decreased expression in day 3 adult hermaphrodite comparing to in L4 larva daf-16(mu86);glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_downregulated_daf-16(mu86);glp-1(e2141)
  Transcripts that showed significantly increased expression in rrf-3(pk1426) comparing to in N2 at embryo stage. DESeq2v 1.18.1, fold change > 1.5, adjusted p-value < 0.01. WBPaper00056169:rrf-3(pk1426)_upregulated_embryo
  Transcripts that showed significantly increased expression after four-day-old young adult worms were placed on NGM plates seeded with OP50 in the presence 5% Agaro-oligosaccharides(AGO) for 24 h, comparing to animals grown in the absence of AGO. Fold change > 2. WBPaper00064306:Agaro-oligosaccharides_upregulated
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
Bacteria infection: Staphylococcus aureus MW2. 4 hours of exposure. Transcripts that showed significantly increased expression after N2 animals had 4 hours of infection by Staphylococcus aureus (MW2). DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:S.aureus-4h_upregulated_N2
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Top 300 transcripts enriched in ABalppppppa, ABpraaapppa according to single cell RNAseq. Top 300 enriched transcripts were determined by log2.ratio of the tpm in the cell type vs the tpm in the other cells * the log2 of the cell.type tpm. WBPaper00061340:ASE_parent
  Genes that showed significantly increased expression in daf-2(e1370) comparing to in N2. To identify DEGs, Students t test and the log2 median ratio test were performed to compute t values and median ratios for all the annotated genes. The adjusted P values from each test were computed using an empirical distribution of the null hypothesis, which was obtained from random permutations of the samples. Finally, the adjusted P values from the individual tests were combined to compute the overall P values using Stouffers method , and genes with overall P < 0.05 and fold change > 1.5 were selected as DEGs. WBPaper00047131:daf-2(e1370)_upregulated_N2-background
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Transcripts depleted in purified oocyte P bodies comparing to in the whole animal. DESeq2, FDR < 0.05, fold change > 2. WBPaper00065975:P-body_vs_WholeAnimal_depleted

7 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
unc-75 mutant animals expressing an unc-75 cDNA under control of the unc-75 promoter are rescued for their locomotory defects, demonstrating that the 5' regulatory region of the gene contains all the elements required for unc-75 function.   Expr2668 Transgenic animals that express the gfp gene under control of 5 kb of upstream regulatory region from the unc-75 locus (punc-75::gfp) show expression in all neurons as well as in neurosecretory gland cells, but in no other tissue. Reporter gene expression can first be observed around the 300-cell stage when neurons are generated and expression is maintained throughout adulthood.  
    Expr2669 UNC-75::GFP localizes exclusively to the nucleus of all neurons examined. UNC-75::GFP is not distributed uniformly within the nucleus but is localized to distinct subnuclear puncta. These puncta are variable in size and number and are evident in all neuronal nuclei. The puncta are excluded from nucleoli and do not cluster below or travel toward the nuclear envelope.
    Expr2036040 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1032874 Tiling arrays expression graphs  
    Expr2017904 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1013866 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1144828 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

20 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  part_of
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in

15 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
orthologue
orthologue
orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00006807 11592246 11601712 1

20 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  part_of
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in

1 Regulates Expr Cluster

Regulated By Treatment Description Algorithm Primary Identifier
  mRNAs that bind with neuronally expressed FLAG-tagged UNC-75 in unc-75(null) mutants CZ14662[unc-75(e950); Prgef-1-UNC-75S(juIs369)], according to CLIP-seq analysis. N.A. WBPaper00049626:UNC-75_target

1 Sequence

Length
9467

1 Sequence Ontology Term

Identifier Name Description
gene  

32 Strains

WormBase ID
WBStrain00022558
WBStrain00022481
WBStrain00023970
WBStrain00023509
WBStrain00023514
WBStrain00023515
WBStrain00023516
WBStrain00023517
WBStrain00023510
WBStrain00023511
WBStrain00023512
WBStrain00023513
WBStrain00023518
WBStrain00023519
WBStrain00023520
WBStrain00023521
WBStrain00023522
WBStrain00026573
WBStrain00026574
WBStrain00027078
WBStrain00027081
WBStrain00027269
WBStrain00033388
WBStrain00034583
WBStrain00055176
WBStrain00055055
WBStrain00006226
WBStrain00006227
WBStrain00005473
WBStrain00004220

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_11587758..11592245   4488 I: 11587758-11592245 Caenorhabditis elegans