WormMine

WS296

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004746 Gene Name  sdc-2
Sequence Name  ? C35C5.1 Brief Description  The sdc-2 gene encodes a protein that represses transcription of X chromosomes to achieve dosage compensation, and that also represses the male sex-determination gene her-1 to elicit hermaphrodite differentiation.
Organism  Caenorhabditis elegans Automated Description  Involved in dosage compensation by hypoactivation of X chromosome and negative regulation of transcription by RNA polymerase II. Located in nuclear chromosome.
Biotype  SO:0001217 Genetic Position  X :4.34757 ±0.06798
Length (nt)  ? 10744
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004746

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:C35C5.1.1 C35C5.1.1 9247   X: 11522707-11533450
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:C35C5.1 C35C5.1 8889   X: 11522739-11523857

35 RNAi Result

WormBase ID
WBRNAi00070592
WBRNAi00024740
WBRNAi00008489
WBRNAi00102094
WBRNAi00064646
WBRNAi00064806
WBRNAi00092118
WBRNAi00064749
WBRNAi00092100
WBRNAi00092110
WBRNAi00092114
WBRNAi00113286
WBRNAi00068532
WBRNAi00068533
WBRNAi00068534
WBRNAi00113057
WBRNAi00063915
WBRNAi00063917
WBRNAi00070055
WBRNAi00063916
WBRNAi00000770
WBRNAi00041963
WBRNAi00064561
WBRNAi00064630
WBRNAi00064730
WBRNAi00064847
WBRNAi00068977
WBRNAi00068976
WBRNAi00068978
WBRNAi00070590

134 Allele

Public Name
gk964260
gk964029
gk962707
gk964028
gk963810
gk963816
WBVar01759475
WBVar01690529
WBVar01690530
WBVar01690531
WBVar00243489
gk963817
gk294361
gk635001
gk294360
gk294359
WBVar01620604
gk651409
gk670627
gk692128
gk434301
gk440000
gk478739
gk882288
gk645526
gk358852
gk638433
gk445972
gk327688
WBVar01825275

1 Chromosome

WormBase ID Organism Length (nt)
X Caenorhabditis elegans 17718942  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004746 11522707 11533450 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrX_11533451..11533592   142 X: 11533451-11533592 Caenorhabditis elegans

140 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Coexpression clique No. 256, pek-1_22220-pek-1_33, on the genome-wide coexpression clique map for the nematode GPL200 platform. All available microarray datasets for the GPL200 platform (Affymetrix C. elegans Genome Array) were obtained from the GEO repository. This included 2243 individual microarray experiments. These were normalized against each other with the software RMAexpress (Bolstad, 2014). Based on these normalized values, Pearsons correlation coefficients were obtained for each probe-probe pair of the 22,620 probes represented on this array type. The resulting list of correlation coefficients was then ranked to generate the ranked coexpression database with information on each probe represented on the GPL200 platform. WBPaper00061527:pek-1_22220-pek-1_33
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts that showed significantly decreased expression in atfs-1(cmh15) (null allele) animals comparing to in N2 animals at L4 larva stage. edgeR, fold change > 2, FDR < 0.05 WBPaper00060909:atfs-1(cmh15)_downregulated
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
Bacteria infection: Staphylococcus aureus Transcripts that showed significantly decreased expression in animals experimentally colonised by a wild microbiota community and infected by the widespread animal pathogen, Staphylococcus aureus, comparing to animals not colonized by microbiota and not infected by pathogen. DeSeq2 (v. 1.42.0), Wald analyses testing against a null hypothesis of < |1.5|-fold change in gene expression between treatments (BenjaminiHochberg adjusted false detection rate of p <= 0.05. WBPaper00067479:Microbiota-Pathogen_vs_control_downregulated
Bacteria infection: Staphylococcus aureus Transcripts that showed significantly decreased expression in animals experimentally colonised by a wild microbiota community and infected by the widespread animal pathogen, Staphylococcus aureus, comparing to animals colonized by microbiota but not infected by pathogen. DeSeq2 (v. 1.42.0), Wald analyses testing against a null hypothesis of < |1.5|-fold change in gene expression between treatments (BenjaminiHochberg adjusted false detection rate of p <= 0.05. WBPaper00067479:Microbiota-Pathogen_vs_Microbiota_downregulated
  Transcripts that showed significantly increased expression in alg-1(gk214), comparing to in N2. DESeq2, Fold change > 1.5. WBPaper00051404:alg-1(gk214)_upregulated
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Transcripts that showed significantly increased expression after exposure to 75uM paraquat(PQ) from L1 to day 2 adult stage in skn-1(lax188) animals fold change > 2 WBPaper00058711:paraquat_upregulated
  Transcripts that showed significantly decreased expression in 10-days post L4 adult hermaphrodite npr-8(ok1439) animals grown at 20C, comparing to in N2 animals. CuffDiff, fold change > 2. WBPaper00065096:npr-8(ok1439)_downregulated_Day10_20C
  Transcripts that showed significantly increased expression in xrep-4(lax137). DESeq2. Genes were selected if their p value < 0.01. WBPaper00066062:xrep-4(lax137)_upregulated
Growth temperature Transcripts that are significantly downregulated at 15C compared to both 25C and 20C, with no statistical difference between 25C and 20C, in worms feeding B. subtilis PY79. DESeq2 and EdgeR, adjusted p-value < 0.05. WBPaper00053814:15C_downregulated_PY79
  Transcripts that showed significantly decreased expression in eat-2(ad1116) comparing to in N2 at 3-days post L4 adult hermaphrodite animals. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:eat-2(ad1116)_downregulated
  Transcripts that showed significantly decreased expression after animals were treated with 50uM Rifampicin and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Allantoin_downregulated
  Proteins identified in extracellular vesicle. N.A. WBPaper00062669:extracellular-vesicle_protein
  Transcripts that showed significantly increased expression in animals lacking P granules by RNAi experiments targeting pgl-1, pgl-3, glh-1 and glh-4, and unc-119-GFP(+), comparing to in control animals, at 2-day post L4 adult hermaphrodite stage. DESeq2, Benjamini-Hochberg multiple hypothesis corrected p-value < 0.05 and fold change > 2. WBPaper00050859:upregulated_P-granule(-)GFP(+)_vs_control_day2-adult
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_L1-larva_expressed
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Transcripts that showed significantly increased expression in animals fed with JM103 bacteria producing Cry5B, comparing to control animals fed with JM103. ANOVA, p-value < 0.05. WBPaper00056167:Cry5B_upregulated
  Transcripts that showed significantly increased expression in sma-4(rax3) comparing to in N2 at 1-day post-L4 adult hermaphrodite HTseq-count was used to count reads mapped to each gene and counting data was imported to EdgeR for statistical analysis. Statistical significance was defined by adjusted P value (false discovery rate, FDR) of <0.05. WBPaper00053184:sma-4(rax3)_upregulated
  Transcripts that showed significantly increased expression in lin-22(ot269) comparing to in N2 at L3 larva. Differences in gene expression were then calculated using the negative binomial test in the DESeq package (FDR = 0.1). WBPaper00053295:lin-22(ot269)_upregulated
  Expression Pattern Group D, enriched for genes involved in catabolic processes. The significance (P 0.0001) of the relative age (time) was used to determine if a gene was differentially expressed between the three age (time) groups. The effect of this factor explaining gene expression differences was used to determine if the expression went up or down during the two age/time periods (t1 - t2 and t2 -t3). Authors used a permutation approach to determine the thresholds for the different mapping strategies. For each of the used models for eQTL mapping, authors used 23,000 permutations. For each permutation, authors randomly picked a spot; each spot could only be picked once. The gene expression and relative lifespan values were than randomly distributed over the RILs (and time points) and used for mapping. In this way, authors obtained a threshold for each of the explaining factors. For the single time points, authors used a FDR of 0.01 to adjust for multiple testing. The genome-wide threshold for this FDR is -log10 P = 3.8 for each of the three time points. For the combined models (t1 to t2 and t2 to t3), authors used a genome-wide threshold of -log10 P = 4, which resembles an FDR of 0.006, 0.001, and 0.006 for marker, age, and the interaction between marker and age, respectively. To determine the threshold for the single gene examples, authors used 1000 permutations as in the genome-wide threshold. The difference is that they use the gene under study in all of the permutations. The P-values for the gene specific thresholds were determined at FDR = 0.05. WBPaper00036286:Pattern_D
  Embryonic class (E): genes that significantly increase in abundance at some point during embryogenesis. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_E
  Genes upregulated in sma-2 L4 (3 arrays) or sma-4 L4 (1 array) vs. N2 L4. FDR = 0%, SAM. WBPaper00037682:sma-2_sma-4_upregulated

7 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1032344 Tiling arrays expression graphs  
    Expr10429 Inferred Expression. EPIC dataset. http://epic.gs.washington.edu/ Large-scale cellular resolution compendium of gene expression dynamics throughout development. This reporter was inferred to be expressing in this cell or one of its embryonic progenitor cells as described below. To generate a compact description of which cells express a particular reporter irrespective of time, the authors defined a metric "peak expression" for each of the 671 terminal ("leaf") cells born during embryogenesis. For each of these cells, the peak expression is the maximal reporter intensity observed in that cell or any of its ancestors; this has the effect of transposing earlier expression forward in time to the terminal set of cells. This metric allows straightforward comparisons of genes' cellular and lineal expression overlap, even when the expression occurs with different timing and despite differences in the precise time point that curation ended in different movies, at the cost of ignoring the temporal dynamics of expression, a topic that requires separate treatment. For simplicity, the authors use the term "expressing cells" to mean the number of leaf cells (of 671) with peak expression greater than background (2000 intensity units) and at least 10% of the maximum expression in that embryo. Quantitative expression data for all cells are located here: ftp://caltech.wormbase.org/pub/wormbase/datasets-published/murray2012/  
    Expr2015659 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1028636 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1145975 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2033892 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr11875   The highly charged SDC-2 protein, which bears a coiled-coil motif, localized to X chromosomes.

10 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in

1 Homologues

Type
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004746 11522707 11533450 1

10 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  located_in
  located_in

0 Regulates Expr Cluster

1 Sequence

Length
10744

1 Sequence Ontology Term

Identifier Name Description
gene  

4 Strains

WormBase ID
WBStrain00035100
WBStrain00035105
WBStrain00035098
WBStrain00035166

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrX_11519237..11522706   3470 X: 11519237-11522706 Caenorhabditis elegans