WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00001999 Gene Name  hrpa-1
Sequence Name  ? F42A6.7 Brief Description  hrp-1 encodes a putative heterogeneous nuclear ribonucleoprotein (hnRNP) required for mRNA surveillance, normally rapid growth, normally short lifespan, normally short body length, vulval development, DAF-18 function, fertility, and general viability; HRP-1 has two N-terminal RRM domains and a low-complexity C-terminal domain; HRP-1's homologs include HNRNPA0 (OMIM:609409), HNRNPA1 (HNRPA1; OMIM:164017; overexpressed in vitamin D resistance); HNRPA3 (OMIM:605372); and HNRNPA1L2.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable mRNA 3'-UTR binding activity. Involved in hemidesmosome assembly. Located in chromosome, telomeric region. Expressed in several structures, including germ line. Human ortholog(s) of this gene implicated in several diseases, including Alzheimer's disease; amyotrophic lateral sclerosis type 20; carcinoma (multiple); distal myopathy 3; and inclusion body myopathy with early-onset Paget disease of bone with or without frontotemporal dementia 2. Is an ortholog of human HNRNPA1 (heterogeneous nuclear ribonucleoprotein A1) and HNRNPA3 (heterogeneous nuclear ribonucleoprotein A3).
Biotype  SO:0001217 Genetic Position  IV :-4.68426 ±0.025935
Length (nt)  ? 1948
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00001999

Genomics

10 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:F42A6.7b.1 F42A6.7b.1 1542   IV: 3336920-3338851
Transcript:F42A6.7a.2 F42A6.7a.2 1555   IV: 3336920-3338867
Transcript:F42A6.7b.2 F42A6.7b.2 1371   IV: 3336923-3338849
Transcript:F42A6.7b.3 F42A6.7b.3 1498   IV: 3336924-3338550
Transcript:F42A6.7c.1 F42A6.7c.1 1380   IV: 3336925-3338863
Transcript:F42A6.7b.4 F42A6.7b.4 1547   IV: 3336926-3338521
Transcript:F42A6.7c.2 F42A6.7c.2 1322   IV: 3336927-3338300
Transcript:F42A6.7a.1 F42A6.7a.1 1473   IV: 3336940-3338544
Transcript:F42A6.7d.2 F42A6.7d.2 1187   IV: 3337233-3338548
Transcript:F42A6.7d.1 F42A6.7d.1 1223   IV: 3337233-3338845
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:F42A6.7b F42A6.7b 930   IV: 3337233-3337510
CDS:F42A6.7c F42A6.7c 927   IV: 3337233-3337510
CDS:F42A6.7d F42A6.7d 1044   IV: 3337233-3337510
CDS:F42A6.7a F42A6.7a 1041   IV: 3337233-3337510

17 RNAi Result

WormBase ID
WBRNAi00095616
WBRNAi00047067
WBRNAi00075257
WBRNAi00075256
WBRNAi00066294
WBRNAi00066142
WBRNAi00083104
WBRNAi00028212
WBRNAi00061560
WBRNAi00080106
WBRNAi00008781
WBRNAi00080105
WBRNAi00025470
WBRNAi00066340
WBRNAi00080187
WBRNAi00064327
WBRNAi00069874

34 Allele

Public Name
gk963722
gk963907
WBVar02123769
WBVar02123805
tm781
WBVar00187084
WBVar00187083
WBVar00187082
gk603470
gk326272
gk595119
gk502727
gk798626
gk852507
gk373546
gk626039
gk760112
gk373909
gk400721
gk588053
gk706015
gk723577
gk688538
gk776024
WBVar00571077
WBVar01451655
ok963
WBVar01726924
zen91
gk197810

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00001999 3336920 3338867 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_3336874..3336919   46 IV: 3336874-3336919 Caenorhabditis elegans

203 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  oocyte proteins identified by two or more unique peptides during proteomics study. In the pooled data set, 1453 C. elegans proteins were identified with a probability >= 0.9 according to ProteinProphet, of which 1165 proteins were identified by more than one unique peptide. WBPaper00038289:oocyte_protein
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) at old adults stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_aging
  Genes with expression level regulated by genotype (N2 vs CB4856) at Late reproduction stage (96 hours at 24 centigrade). Authors permuted transcript values and used a genome-wide threshold of log10 P-value = 2, which resembles a false discovery rate (FDR) of 0.0118. WBPaper00040858:eQTL_regulated_reproductive
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
Gamma irradiation 100 mGY per hour for 72 hours since L1 larva. Transcripts that showed significantly increased expression after exposure to 100mGy per hour gamma irradiation from L1 to day 1 adult hermaphrodite stage. DESeq2, FDR <= 0.05, log2 fold change >= 0.3 or <= -0.3. WBPaper00058958:100mGy-irradiation-72h_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:glr-1(+)-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:intestine_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:PVD-OLL-neurons_L3-L4-larva_expressed
Bacteria infection: Streptococcus gordonii Transcripts that showed significantly decreased expression after L4 larva animals were exposed to wild type S. gordonii for 2-3 hours, comparing to animals exposed to S. gordonii delta-spxB. Fold change > 2, FDR corrected p-value < 0.05. WBPaper00055049:S.gordonii_downregulated

7 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1031164 Tiling arrays expression graphs  
    Expr1150931 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2940 HRP-1 was localized at the ends of the chromosomes of the germ cells. Localized at the ends of the chromosomes.
    Expr2939 HRP-1 was expressed in all nuclei in somatic cells. nuclei
    Expr2030835 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr2012599 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1025031 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  

16 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  enables
  involved_in
  part_of
  part_of
  located_in
  located_in
  enables
  enables
  enables
  enables

24 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
orthologue
orthologue
least diverged orthologue
orthologue
orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00001999 3336920 3338867 -1

16 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  enables
  involved_in
  part_of
  part_of
  located_in
  located_in
  enables
  enables
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
1948

1 Sequence Ontology Term