Data Access

Download

The complete piRNAome dataset is available as bulk flat files and as a ready-to-query SQLite database. A REST API that requires no authentication provides programmatic access.

piRNAome v1.0  ·  Homo sapiens  ·  genome assembly hg38  ·  integrated from seven source databases.
9,718,659 piRNA sequences
3,830,170 isoform families
9,655,926 with genomic locus
4,957,731 ping-pong piRNAs
1,022,931 SNV-annotated
10,988,020 source aliases
📦
Before going live: the buttons below point to DOWNLOAD_BASE (set in the page script, default /download/files/). Place the generated files at that path (or repoint it to a Zenodo/figshare/S3 URL), drop in the real md5 sums and sizes, then delete this note.
Data files
🧬
piRNAome_hg38_sequences.fa.gz
All 9,718,659 non-redundant piRNA sequences integrated from seven source databases (piRBase, piRNABank, piRNAdb, piRNAQuest, NCBI, piOxiDB, proTRAC), in FASTA format. Headers carry the canonical ID, display ID, length, and quality tier.
FASTA  ·  gzip  ·  ~800 MB
md5 <to be filled at release>
Download
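As a minimal sketch, the sequences file can be streamed in Python without decompressing it to disk first. The function name read_fasta is illustrative, and the header is returned as a raw string because the exact layout of its fields (canonical ID, display ID, length, quality tier) is not specified on this page.

```python
import gzip
from typing import Iterator, Tuple


def read_fasta(path: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from a gzip-compressed FASTA file."""
    header = None
    chunks = []
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            if line.startswith(">"):
                # A new record starts: emit the previous one, if any.
                if header is not None:
                    yield header, "".join(chunks)
                header, chunks = line[1:], []
            else:
                chunks.append(line)
    # Emit the final record.
    if header is not None:
        yield header, "".join(chunks)
```

Iterating with for header, seq in read_fasta("piRNAome_hg38_sequences.fa.gz") keeps memory use flat, since only one record is held at a time.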
📋
piRNAome_hg38_annotation.tsv.gz
Full annotation table, one row per piRNA: sequence, verified coordinates, quality, anchor, motif, isoform family, ping-pong partners, TE / gene / miRNA overlap, SNV interpretation and quality flags, and all source-database aliases.
TSV  ·  gzip  ·  ~2 GB
md5 <to be filled at release>
Download
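The annotation table is large, so reading it row by row is usually preferable to loading it whole. The sketch below assumes the first line of the file is a tab-separated header row; that assumption, the function name iter_annotation, and any column names you look up in the resulting dicts should be checked against the released file.

```python
import csv
import gzip
from typing import Dict, Iterator


def iter_annotation(path: str) -> Iterator[Dict[str, str]]:
    """Yield one dict per piRNA row, keyed by the names in the header line."""
    # Assumption: line 1 is a tab-separated header; fields are not quoted.
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            yield row
```

Calling next(iter_annotation("piRNAome_hg38_annotation.tsv.gz")).keys() is a quick way to list the actual column names before writing any filters.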
📍
piRNAome_hg38_coordinates.bed.gz
BED6 file of all verified genomic loci (VERIFIED + REALIGNED entries), covering 9,655,926 located piRNAs. Compatible with the UCSC Genome Browser, IGV, and bedtools.
BED6  ·  gzip  ·  hg38
md5 <to be filled at release>
Download
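For users who want the loci in Python rather than in a genome browser, here is a small sketch of a BED6 reader. It relies only on the standard BED convention (six tab-separated columns, 0-based half-open coordinates); the Locus type and the read_bed6 name are illustrative, and what the name and score columns contain in this particular file is not stated here, so both are kept as strings.

```python
import gzip
from typing import Iterator, NamedTuple


class Locus(NamedTuple):
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # 0-based, exclusive
    name: str
    score: str
    strand: str


def read_bed6(path: str) -> Iterator[Locus]:
    """Yield one Locus per data line of a gzip-compressed BED6 file."""
    with gzip.open(path, "rt") as handle:
        for line in handle:
            # Skip blank lines, comments, and optional browser/track lines.
            if not line.strip() or line.startswith(("#", "track", "browser")):
                continue
            fields = line.rstrip("\r\n").split("\t")
            chrom, start, end, name, score, strand = fields[:6]
            yield Locus(chrom, int(start), int(end), name, score, strand)
```

Because BED intervals are half-open, the length of a locus is simply end - start.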
🗄️
piRNAome_hg38.db.gz
The complete SQLite database used by this website. Tables: pirna, aliases, loci, clusters, denovo_clusters, junction_candidates, ping_pong. Full-text indexed with FTS5; once decompressed, it can be queried with any SQLite client.
SQLite  ·  gzip  ·  ~18 GB  ·  FTS5
md5 <to be filled at release>
Download
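Since the column layout of each table is not listed on this page, a reasonable first step after decompressing the database is to inspect its schema. The sketch below uses Python's built-in sqlite3 module and opens the file read-only; the helper names are illustrative, and the decompressed file name piRNAome_hg38.db is an assumption based on the archive name.

```python
import sqlite3
from contextlib import closing
from pathlib import Path
from typing import List, Tuple


def open_readonly(db_path: str) -> sqlite3.Connection:
    """Open a SQLite file in read-only mode so it cannot be modified by accident."""
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    return sqlite3.connect(uri, uri=True)


def list_tables(db_path: str) -> List[str]:
    """Return the names of all tables recorded in the database schema."""
    with closing(open_readonly(db_path)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [row[0] for row in rows]


def table_columns(db_path: str, table: str) -> List[Tuple[str, str]]:
    """Return (column name, declared type) pairs for one table."""
    with closing(open_readonly(db_path)) as conn:
        rows = conn.execute(
            "SELECT name, type FROM pragma_table_info(?)", (table,)
        ).fetchall()
    return [(name, col_type) for name, col_type in rows]
```

For example, table_columns("piRNAome_hg38.db", "pirna") shows which columns are available before you write a query. The table list will likely also include the internal shadow tables that FTS5 creates for its indexes.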
Integrity

Checksums

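Verify each download against its published MD5 checksum. A combined manifest is provided as CHECKSUMS.md5; with the manifest and the downloaded files in the same directory, run md5sum -c CHECKSUMS.md5 to check them. If you downloaded only some of the files, GNU md5sum accepts --ignore-missing to skip the rest.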

CHECKSUMS.md5
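On systems without md5sum, the same check can be sketched in Python. This assumes the manifest uses the usual md5sum layout of one "<hash>  <filename>" pair per line, which the md5sum -c instruction implies but which should be confirmed against the released file; the function names are illustrative.

```python
import hashlib
from pathlib import Path
from typing import Dict


def md5_of(path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_manifest(manifest_path) -> Dict[str, bool]:
    """Map each listed file found next to the manifest to True (match) or False."""
    manifest = Path(manifest_path)
    results = {}
    for line in manifest.read_text().splitlines():
        if not line.strip():
            continue
        # Assumed layout: "<hash>  <filename>" (a leading "*" marks binary mode).
        expected, name = line.split(maxsplit=1)
        name = name.lstrip("*")
        target = manifest.parent / name
        if target.exists():  # files that were not downloaded are skipped
            results[name] = md5_of(target) == expected.lower()
    return results
```

A file that maps to False should be downloaded again before use.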
API access (available now)

REST API

All data is accessible programmatically through the REST API. No authentication required; responses are JSON.

GET /api/entry/{canonical_id}                          Full record for one piRNA
GET /api/search?q={query}&quality={tier}&chr={chr}     Search with filters, paginated
GET /api/family/{family_id}                            All members of a piRNA family
GET /api/stats                                         Database-wide summary statistics
GET /api/ppseed/{seed}                                 All piRNAs sharing a ping-pong seed
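The endpoints listed above can be called from Python using only the standard library. The sketch below is a minimal client under stated assumptions: the base URL is whatever host serves this site (not given on this page, so it is a required argument), the helper names are illustrative, and the JSON response schema and pagination parameters are undocumented here, so the result is returned as parsed JSON without further interpretation.

```python
import json
from typing import Any, Optional
from urllib.parse import quote, urlencode
from urllib.request import urlopen


def entry_url(base_url: str, canonical_id: str) -> str:
    """Build the URL for GET /api/entry/{canonical_id}."""
    return f"{base_url.rstrip('/')}/api/entry/{quote(canonical_id, safe='')}"


def search_url(
    base_url: str,
    query: str,
    quality: Optional[str] = None,
    chrom: Optional[str] = None,
) -> str:
    """Build the URL for GET /api/search, omitting filters that are not set."""
    params = {"q": query}
    if quality is not None:
        params["quality"] = quality
    if chrom is not None:
        params["chr"] = chrom
    return f"{base_url.rstrip('/')}/api/search?{urlencode(params)}"


def get_json(url: str, timeout: float = 30.0) -> Any:
    """Fetch a URL and parse the response body as JSON."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

For instance, get_json(search_url(base, "TGAC", chrom="chr1")) would run a filtered search, where base is the site's root URL. Percent-encoding the path and query values keeps identifiers with unusual characters from breaking the request.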
Use & citation

License

piRNAome is freely available for academic and non-commercial research. Final licensing terms accompany the manuscript release; until then, please contact the authors before redistribution.

How to cite

If you use piRNAome in your work, please cite:

[Authors]. piRNAome: a unified human piRNA database with isoform-family integration and SNV-level annotation. Nucleic Acids Research (2026). DOI: to be assigned upon publication.