WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00000527 Gene Name  cle-1
Sequence Name  ? C36B1.1 Brief Description  cle-1 encodes the only vertebrate type XV/XVIII collagen homolog in C. elegans and generates multiple developmentally regulated protein isoforms via alternative splicing; cle-1 is required for neuron migrations, including the migration of the HSN, ALM and AVM neurons, axon guidance of the sensory and motor neurons, distal tip cell migration, and for proper morphogenesis of the mail tail; cle-1 is also involved in synaptogenesis and cholinergic synaptic transmission; RNA interference of cle-1 causes significant embryonic and larval lethality; cle-1 is strongly expressed in neurons and at the junctions between body wall muscles and localizes to basement membranes.
Organism  Caenorhabditis elegans Automated Description  Predicted to be an extracellular matrix structural constituent conferring tensile strength. Involved in several processes, including cholinergic synaptic transmission; generation of neurons; and inductive cell migration. Located in basement membrane and cell junction. Expressed in several structures, including basal lamina; head mesodermal cell; neurons; smooth muscle; and somatic nervous system. Human ortholog(s) of this gene implicated in primary angle-closure glaucoma. Is an ortholog of human COL15A1 (collagen type XV alpha 1 chain) and COL18A1 (collagen type XVIII alpha 1 chain).
Biotype  SO:0001217 Genetic Position  I :3.03569 ±0.003514
Length (nt)  ? 18612
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00000527

Genomics

4 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C36B1.1c.1 C36B1.1c.1 2271   I: 8705668-8710955
Transcript:C36B1.1a.1 C36B1.1a.1 3734   I: 8705669-8724279
Transcript:C36B1.1b.1 C36B1.1b.1 2743   I: 8705670-8719456
Transcript:C36B1.1d.1 C36B1.1d.1 3528   I: 8705986-8724279
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C36B1.1c C36B1.1c 1953   I: 8705986-8706054
CDS:C36B1.1a C36B1.1a 3417   I: 8705986-8706054
CDS:C36B1.1b C36B1.1b 2337   I: 8705986-8706054
CDS:C36B1.1d C36B1.1d 3528   I: 8705986-8706054

10 RNAi Result

WormBase ID
WBRNAi00001257
WBRNAi00042032
WBRNAi00042033
WBRNAi00042034
WBRNAi00027899
WBRNAi00003096
WBRNAi00003100
WBRNAi00003641
WBRNAi00110300
WBRNAi00106825

206 Allele

Public Name
gk962858
gk962706
gk963902
gk963849
gk964316
h11175
otn9345
gk963349
gk963350
gk390
gk421
WBVar00155305
gk727112
gk461222
tm10587
WBVar01432355
otn13992
WBVar01957402
WBVar01957399
ttTi28313
WBVar02016350
ju34
tm1465
WBVar01346468
WBVar01346462
ttTi40494
WBVar00155290
WBVar00155293
WBVar00155294
WBVar00155291

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00000527 8705668 8724279 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

0 Downstream Intergenic Region

245 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes significantly enriched (> 2x, FDR < 5%) in a particular cell-type versus a reference sample of all cells at the same stage. A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_larva_enriched
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
  Neuronally enriched transcripts according to a comparison of neuronal nuclei IP samples to total nuclei using isolation of nuclei from tagged specific cell types (INTACT) technology. DESEQ2, fold change > 2 and FDR < 0.01. WBPaper00062103:neuron_enriched
  Transcripts that showed significantly increased expression glp-1(e2141); TU3401 animals comparing to in TU3401 animals. Fold change > 2, FDR < 0.01. WBPaper00065993:glp-1(e2141)_upregulated
Bacteria infection: Enterococcus faecalis Genes with increased expression after 24 hours of infection by E.faecalis Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:E.faecalis_24hr_upregulated_TilingArray
  Coexpression clique No. 60, 176662_at-Y53F4B.16, on the genome-wide coexpression clique map for the nematode GPL200 platform. All available microarray datasets for the GPL200 platform (Affymetrix C. elegans Genome Array) were obtained from the GEO repository. This included 2243 individual microarray experiments. These were normalized against each other with the software RMAexpress (Bolstad, 2014). Based on these normalized values, Pearsons correlation coefficients were obtained for each probe-probe pair of the 22,620 probes represented on this array type. The resulting list of correlation coefficients was then ranked to generate the ranked coexpression database with information on each probe represented on the GPL200 platform. WBPaper00061527:176662_at-Y53F4B.16
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Genes significantly enriched in NSM neurons (isolated by FACS) versus the reference, according to RNAseq analysis towards total RNA. Gene expression quantification and differential expression was analyzed using cufflinks v2.2.1. Enriched contains only genes significantly enriched (differentially expressed >= 2.4 fold in total RNA or >= 3.2 fold in DSN treated total RNA) in the NSM neurons versus the reference. WBPaper00045974:NSM_enriched_totalRNA_RNAseq
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at old adults stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_aging
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva daf-16(mu86);glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_daf-16(mu86);glp-1(e2141)
  Genes with expression level regulated by genotype (N2 vs CB4856) at old adults stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_aging
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_fem-3(q20)
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_glp-1(e2141)
  Genes with expression level regulated by genotype (N2 vs CB4856) at Late reproduction stage (96 hours at 24 centigrade). Authors permuted transcript values and used a genome-wide threshold of log10 P-value = 2, which resembles a false discovery rate (FDR) of 0.0118. WBPaper00040858:eQTL_regulated_reproductive
  Transcripts that showed significantly increased expression in day 3 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_upregulated_fem-3(q20)
  Transcripts that showed significantly increased expression after four-day-old young adult worms were placed on NGM plates seeded with OP50 in the presence 5% Agaro-oligosaccharides(AGO) for 24 h, comparing to animals grown in the absence of AGO. Fold change > 2. WBPaper00064306:Agaro-oligosaccharides_upregulated
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
Dietary restriction Transcripts that showed significantly decreased expression after N2 animals were under dietary restriction (DR, OP50 OD = 0.1) from 3-day post L4 till 6-day post L4 adult hermaphrodite stage, comparing to under ad libtum (AL, OP50 OD = 3) condition. Bioconductor package edgeR, p < 0.05. WBPaper00056443:DietaryRestriction_downregulated
  Transcripts that showed significantly increased expression in hrde-1(tm1200) animals, comparing to in N2, after growing at 25C for five generations (late generation). CuffDiff2 WBPaper00051265:F4_hrde-1(tm1200)_upregulated

8 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1030314 Tiling arrays expression graphs  
    Expr859 cle-1AGFP expression was first detected in comma stage ( 390min) embryos in cephalic neurons. Expression in interneurons and rectal epithelial cells is seen by the 2-fold stage, 60 min later, and continues through larval and adult stages. cle-1BGFP expression was first observed at the 3-fold stage ( 520 min) of embryogenesis in four neurons of the lateral ganglia and in the DD dorsal motorneurons. The postembryonic VD dorsal motorneurons also express cle-1BGFP starting in the second larval stage, and this pattern persists through larval and adult stages. cle-1CGFP expression is first seen in comma stage embryos in pharynx and body wall muscles and later appears in the canal-associated neurons (CAN), head mesodermal cell, accessory muscles, and intestine. During larval development, expression is restricted to the ventral GLR glia like cells, the canal-associated neurons, and the head mesodermal cell. Beginning in the late fourth larval stage and continuing through adult, cle-1CGFP expression appears in somatic sheath cells of the gonad and reappears in body wall muscles.  
    Expr860 CLE-1 strongly accumulates in a punctate pattern associated with the nerve ring and the dorsal and ventral nerve cords. This pattern is first visible at the two-fold stage of embryogenesis (~460 min) and persists in all subsequent stages. Beginning in the L4 larval stage, CLE-1 is also observed in basement membranes surrounding the gonad, pharynx, and intestine, and under body wall muscles where it preferentially accumulates at junctions between muscle cells and along dense body lines. The apparently more limited distribution of CLE-1 in early development may result from low sensitivity of detection with the antibody. Under body wall muscles where it preferentially accumulates at junctions between muscle cells and along dense body lines
    Expr2010047 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1020118 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2028288 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr2567   At the light microscope level, it is not possible to determine whether NID-1 or CLE-1 is present within or excluded from synaptic clefts. However, these molecules are present at higher concentrations along the lateral region of the nerve cord, where synaptic junctions are abundant, relative to the medial region, which has many fewer synapses. The CLE-1 and NID-1 proteins appear to be present on different faces of the nerve cords relative to synaptic domains. NID-1 is present lateral to the region where synapses form, between the nerve cord and the muscle. CLE-1 is concentrated between the nerve cords and the pseudocoelomic space or overlying muscle arms, i.e., along the dorsal face of the ventral nerve cord and the ventral face of the dorsal nerve cord. Although CLE-1 and NID-1 are found overlapping or closely apposed to UNC-17 along the nerve cords, no consistent association of NID-1 or CLE-1 with UNC-17 puncta was observed, and vice versa. UNC-17 is not present at all of the presynaptic zones in the nerve cords. Similar distributions of CLE-1 and NID-1 relative to a marker for GABAergic presynaptic zones, juIs1, was observed. NID-1 and CLE-1 are also detected in nonsynaptic regions of the nerve cords, but generally at significantly lower levels.
    Expr1146029 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

15 GO Annotation

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  involved_in
  located_in
  part_of
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  part_of

36 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00000527 8705668 8724279 -1

15 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  involved_in
  located_in
  part_of
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  part_of

0 Regulates Expr Cluster

1 Sequence

Length
18612

1 Sequence Ontology Term

Identifier Name Description
gene  

5 Strains

WormBase ID
WBStrain00036123
WBStrain00036159
WBStrain00036191
WBStrain00050770
WBStrain00005031

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_8724280..8727217   2938 I: 8724280-8727217 Caenorhabditis elegans