WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004025 Gene Name  phy-2
Sequence Name  ? F35G2.4 Brief Description  phy-2 encodes a catalytic alpha subunit of collagen prolyl 4-hydroxylase (P4H) orthologous to human P4H alpha (II) isoform (OMIM: 600608); phy-2 is responsible for hydroxylation of cuticle collagen; loss-of-function mutations result in phenotypically wild-type animals, however phy-1; phy-2 double mutants arrest as embryos and larvae, suggesting that the PHY-1 and PHY-2 P2H alpha subunits function redundantly during development; phy-2 in complex with phy-1 is essential for the survival of the C. elegans; PHY-2, are expressed in the cuticle collagen-synthesizing hypodermal cells in a cyclical fashion that corresponds the moulting cycle and the times of maximal cuticle collagen synthesis; phy-2 is also expressed excretory cell, and spermatheca.
Organism  Caenorhabditis elegans Automated Description  Enables peptidyl-proline 4-dioxygenase activity. Located in endoplasmic reticulum. Expressed in excretory cell; hypodermis; and spermatheca. Human ortholog(s) of this gene implicated in myopia. Is an ortholog of human P4HA1 (prolyl 4-hydroxylase subunit alpha 1).
Biotype  SO:0001217 Genetic Position  IV :5.80229 ±0.00735
Length (nt)  ? 4891
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004025

Genomics

1 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:F35G2.4.1 F35G2.4.1 1736   IV: 12211097-12215987
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:F35G2.4 F35G2.4 1620   IV: 12211212-12211394

6 RNAi Result

WormBase ID
WBRNAi00115626
WBRNAi00002076
WBRNAi00046413
WBRNAi00014372
WBRNAi00090781
WBRNAi00098167

93 Allele

Public Name
gk964278
gk964078
gk964500
gk962765
gk964475
gk964320
WBVar02122961
WBVar02122962
h5572
WBVar01454464
WBVar01454463
WBVar01454465
gk340604
gk796127
WBVar01454462
gk538929
gk583815
gk879796
gk697684
gk417260
gk679980
gk347451
WBVar01859150
gk683226
WBVar01859151
gk842959
gk1916
WBVar01859152
gk1917
gk719146

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004025 12211097 12215987 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_12210797..12211096   300 IV: 12210797-12211096 Caenorhabditis elegans

250 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  oocyte proteins identified by two or more unique peptides during proteomics study. In the pooled data set, 1453 C. elegans proteins were identified with a probability >= 0.9 according to ProteinProphet, of which 1165 proteins were identified by more than one unique peptide. WBPaper00038289:oocyte_protein
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_upregulated
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
  Transcripts that showed significantly increased expression glp-1(e2141); TU3401 animals comparing to in TU3401 animals. Fold change > 2, FDR < 0.01. WBPaper00065993:glp-1(e2141)_upregulated
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts that showed significantly decreased expression in atfs-1(cmh15) (null allele) animals comparing to in N2 animals at L4 larva stage. edgeR, fold change > 2, FDR < 0.05 WBPaper00060909:atfs-1(cmh15)_downregulated
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts that showed significantly increased expression in rrf-3(pk1426) comparing to in N2 at embryo stage. DESeq2v 1.18.1, fold change > 1.5, adjusted p-value < 0.01. WBPaper00056169:rrf-3(pk1426)_upregulated_embryo
  Transcripts expressed in vulva. FPKM >= 1. WBPaper00064122:vulva_transcriptome
Bacteria infection: Bacillus thuringiensis Transcripts that showed significantly increased expression in N2 animals infected by bacteria BMB171/Cry5Ba, an acrystalliferous Bt mutant BMB171 transformed with toxin gene cry5Ba on the shuttle vector pHT304, comparing to N2 animals infected by BMB171/pHT304. N.A. WBPaper00064229:B.thuringiensis-Cry5Ba_upregulated
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
Dietary restriction Transcripts that showed significantly decreased expression after N2 animals were under dietary restriction (DR, OP50 OD = 0.1) from 3-day post L4 till 6-day post L4 adult hermaphrodite stage, comparing to under ad libtum (AL, OP50 OD = 3) condition. Bioconductor package edgeR, p < 0.05. WBPaper00056443:DietaryRestriction_downregulated
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
  Genes that showed significantly increased expression in daf-2(e1370);hel-1(gk148684) comparing to in hel-1(gk148684) To identify DEGs, Students t test and the log2 median ratio test were performed to compute t values and median ratios for all the annotated genes. The adjusted P values from each test were computed using an empirical distribution of the null hypothesis, which was obtained from random permutations of the samples. Finally, the adjusted P values from the individual tests were combined to compute the overall P values using Stouffers method , and genes with overall P < 0.05 and fold change > 1.5 were selected as DEGs. WBPaper00047131:daf-2(e1370)_upregulated_hel-1(gk148684)-background
  Transcripts that showed significantly decreased expression in N2 animals exposed to 0.1mM Paraquat from hatching to reaching adult stage. DESeq2 version 1.22.2, p < 0.05 WBPaper00064716:paraquat_downregulated
  Transcripts that showed significantly increased expression in daf-2(e1370) comparing to in control animals. NOIseq(v2.34.0), fold change > = 1.5, Differentially expressed genes (DEGs) were defined as having a probability of differentialexpression > 95%. WBPaper00064727:daf-2(e1370)_upregulated
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Expression Pattern Group C, enriched for genes involved in metabolic processes. The significance (P 0.0001) of the relative age (time) was used to determine if a gene was differentially expressed between the three age (time) groups. The effect of this factor explaining gene expression differences was used to determine if the expression went up or down during the two age/time periods (t1 - t2 and t2 -t3). Authors used a permutation approach to determine the thresholds for the different mapping strategies. For each of the used models for eQTL mapping, authors used 23,000 permutations. For each permutation, authors randomly picked a spot; each spot could only be picked once. The gene expression and relative lifespan values were than randomly distributed over the RILs (and time points) and used for mapping. In this way, authors obtained a threshold for each of the explaining factors. For the single time points, authors used a FDR of 0.01 to adjust for multiple testing. The genome-wide threshold for this FDR is -log10 P = 3.8 for each of the three time points. For the combined models (t1 to t2 and t2 to t3), authors used a genome-wide threshold of -log10 P = 4, which resembles an FDR of 0.006, 0.001, and 0.006 for marker, age, and the interaction between marker and age, respectively. To determine the threshold for the single gene examples, authors used 1000 permutations as in the genome-wide threshold. The difference is that they use the gene under study in all of the permutations. The P-values for the gene specific thresholds were determined at FDR = 0.05. WBPaper00036286:Pattern_C
  Transcripts depleted in purified oocyte P bodies comparing to in whole oocytes. DESeq2, FDR < 0.05, fold change > 2. WBPaper00065975:P-body_vs_oocyte_depleted
  Transcripts that showed differential expression between 24 and 26 hours post hatching L2d and dauer committed larvae of daf-9(dh6), triggered by the dafachronic acid (DA) growth hormone. Cluster 3 genes increased expression transiently at 26 hour post hatching. Benjamini Hochberg corrected q-value < 0.01. WBPaper00053388:dauer_regulated_Cluster3
  Transcripts depleted in purified oocyte P bodies comparing to in the whole animal. DESeq2, FDR < 0.05, fold change > 2. WBPaper00065975:P-body_vs_WholeAnimal_depleted
  Transcripts that showed significantly increased expression in wdr-5(ok1417);skn-1(lax188) comparing to in skn-1(lax188) at day 2 adult stage. fold change > 2 WBPaper00058711:wdr-5(ok1417)_upregulated
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_L2-larva_expressed
Reduced humidity (98% relative humidity). Genes that were down-regulated after one day exposure to reduced humidity (98% relative humidity) according to microarray analysis. Multiple hypothesis testing with the Benjamini-Hochberg correction was applied on calculated p-values. A change in the expression level was considered to be significant if the adjusted p-value was less than 0.001. WBPaper00044578:reduced-humidity_downregulated_microarray

9 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1031927 Tiling arrays expression graphs  
    Expr1150278 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr3473 phy-2:GFP was strongly expressed in the hypodermis, excretory cell, and spermatheca.  
Expression pattern for phy-1, phy-2, pdi-2 are identical, see Expr1928, Expr1930. Reporter gene fusion type not specified.   Expr1929 Exclusively expressed in the hypodermal cells that synthesize cuticle collagens.  
Expression pattern for phy-1, phy-2, pdi-2 are identical, see Expr1931, Expr1933. No detailed description on expression pattern at other life stages.   Expr1932 Immunostaining patterns in the hypodermal cells were observed at pre-elongated 1.5-fold embryo stage when the collagenous cuticle has not been formed. In elongated embryos the first larval cuticle has been synthesized, the hypodermal ER location was maintained. The nuclear-excluded pattern observed probably represents an endoplasmic reticulum (ER) location.
Original chronogram file: chronogram.1990.xml [F35G2.4:gfp] transcriptional fusion. Chronogram943    
    Expr1025120 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2014914 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr2033150 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

18 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
  involved_in
  enables
  enables
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  located_in
  involved_in
  located_in

34 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004025 12211097 12215987 -1

18 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
  involved_in
  enables
  enables
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  located_in
  involved_in
  located_in

0 Regulates Expr Cluster

1 Sequence

Length
4891

1 Sequence Ontology Term

Identifier Name Description
gene  

3 Strains

WormBase ID
WBStrain00022588
WBStrain00031639
WBStrain00002845

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_12215988..12217376   1389 IV: 12215988-12217376 Caenorhabditis elegans