Data Access
Download
The complete piRNAome dataset is available as bulk flat files and a ready-to-query
SQLite database, plus a no-authentication REST API for programmatic access.
piRNAome v1.0 · Homo sapiens · genome assembly hg38 · integrated from seven source databases.
3,830,170 isoform families
9,655,926 with genomic locus
4,957,731 ping-pong piRNAs
📦
Before going live: the buttons below point to DOWNLOAD_BASE (set in the page script, default /download/files/).
Place the generated files at that path (or repoint it to a Zenodo/figshare/S3 URL),
drop in the real md5 sums and sizes, then delete this note.
Data files
🧬
piRNAome_hg38_sequences.fa.gz
All 9,718,659 non-redundant piRNA sequences integrated from seven source
databases (piRBase, piRNABank, piRNAdb, piRNAQuest, NCBI, piOxiDB, proTRAC),
in FASTA format. Headers carry the canonical ID, display ID, length, and
quality tier.
FASTA · gzip · ~800 MB · md5 <to be filled at release>
Download
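As a rough illustration of working with the sequence file, the sketch below streams a gzip-compressed FASTA file and tallies sequence lengths without loading the whole file into memory. It assumes only standard FASTA conventions (a `>` header line followed by one or more sequence lines). The header is returned as a single string because the delimiter between the canonical ID, display ID, length, and quality tier is not specified here. The function names are illustrative, not part of piRNAome.

```python
import gzip
from collections import Counter
from typing import Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def length_counts(records: Iterable[Tuple[str, str]]) -> Counter:
    """Count how many sequences occur at each length."""
    return Counter(len(seq) for _header, seq in records)


def fasta_length_counts(path: str) -> Counter:
    """Length distribution of a gzip-compressed FASTA file, read as a stream."""
    with gzip.open(path, "rt") as handle:
        return length_counts(read_fasta(handle))
```

Calling `fasta_length_counts("piRNAome_hg38_sequences.fa.gz")` on the downloaded file would then return a mapping from sequence length to count.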
📋
piRNAome_hg38_annotation.tsv.gz
Full annotation table, one row per piRNA: sequence, verified coordinates,
quality, anchor, motif, isoform family, ping-pong partners, TE / gene / miRNA
overlap, SNV interpretation and quality flags, and all source-database aliases.
TSV · gzip · ~2 GB · md5 <to be filled at release>
Download
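The annotation table is large, so it is usually more practical to stream it row by row than to load it whole. The following is a minimal sketch using only the Python standard library. It assumes the first line of the file is a header row naming the columns; that is not confirmed here, and the actual column names are not listed, so the code reads whatever names the file provides.

```python
import csv
import gzip
from itertools import islice
from typing import Dict, Iterable, Iterator, List


def iter_rows(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict per data row, keyed by the names in the header line.

    Assumes tab-separated fields without quoting.
    """
    yield from csv.DictReader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)


def peek(path: str, n: int = 5) -> List[Dict[str, str]]:
    """Return the first n rows of a gzip-compressed TSV file."""
    with gzip.open(path, "rt", newline="") as handle:
        return list(islice(iter_rows(handle), n))
```

Running `peek("piRNAome_hg38_annotation.tsv.gz")` first is a cheap way to discover the real column names before writing any filters against them.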
📍
piRNAome_hg38_coordinates.bed.gz
BED6 of all verified genomic loci (VERIFIED + REALIGNED entries), covering
9,655,926 located piRNAs. Compatible with the UCSC genome browser, IGV, and
bedtools.
BED6 · gzip · hg38 · md5 <to be filled at release>
Download
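BED coordinates are 0-based and half-open, which is easy to get wrong when converting to the 1-based region strings used in genome browsers. The sketch below parses the six standard BED6 columns (chrom, chromStart, chromEnd, name, score, strand) and formats a record as a 1-based inclusive region. What the file stores in the name and score columns is not specified here, so both are kept as plain strings.

```python
import gzip
from typing import Iterable, Iterator, NamedTuple


class Bed6(NamedTuple):
    chrom: str
    start: int   # 0-based, inclusive
    end: int     # 0-based, exclusive
    name: str
    score: str
    strand: str


def parse_bed6(lines: Iterable[str]) -> Iterator[Bed6]:
    """Parse BED6 lines, skipping blank, comment, track, and browser lines."""
    for line in lines:
        if not line.strip() or line.startswith(("#", "track", "browser")):
            continue
        f = line.rstrip("\r\n").split("\t")
        yield Bed6(f[0], int(f[1]), int(f[2]), f[3], f[4], f[5])


def to_region(rec: Bed6) -> str:
    """Format a record as a 1-based inclusive region string."""
    return f"{rec.chrom}:{rec.start + 1}-{rec.end}"


def read_bed6(path: str) -> Iterator[Bed6]:
    """Stream records from a gzip-compressed BED6 file."""
    with gzip.open(path, "rt") as handle:
        yield from parse_bed6(handle)
```

The length of a locus is simply `rec.end - rec.start`, with no off-by-one adjustment, because the end coordinate is exclusive.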
🗄️
piRNAome_hg38.db.gz
The complete SQLite database used by this website. Tables: pirna,
aliases, loci, clusters,
denovo_clusters, junction_candidates,
ping_pong. FTS5 full-text indexed; queryable with any SQLite client.
SQLite · gzip · ~18 GB · FTS5 · md5 <to be filled at release>
Download
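The database is distributed gzip-compressed, so it has to be decompressed before SQLite can open it. The sketch below decompresses it as a stream, opens it read-only, and lists each table with its columns. The column names of the tables are not documented on this page, so the code inspects the schema instead of assuming any. Note that FTS5 indexes typically show up as additional tables in this listing.

```python
import gzip
import shutil
import sqlite3
from typing import Dict, List


def gunzip(src: str, dst: str) -> None:
    """Stream-decompress src to dst without loading it into memory."""
    with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)


def open_readonly(path: str) -> sqlite3.Connection:
    """Open an existing database file read-only via a SQLite URI."""
    return sqlite3.connect(f"file:{path}?mode=ro", uri=True)


def describe(conn: sqlite3.Connection) -> Dict[str, List[str]]:
    """Map each table name to the list of its column names."""
    tables = [row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )]
    return {
        # Column 1 of each PRAGMA table_info row is the column name.
        table: [col[1] for col in conn.execute(f'PRAGMA table_info("{table}")')]
        for table in tables
    }
```

A typical session would call `gunzip("piRNAome_hg38.db.gz", "piRNAome_hg38.db")` once, then `describe(open_readonly("piRNAome_hg38.db"))` to see the schema before writing queries. Expect the decompressed file to be larger than the ~18 GB download.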
Integrity
Checksums
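Verify each download against the published md5 sums. A combined manifest is
provided at CHECKSUMS.md5; with the downloaded files in the same directory, run
md5sum -c CHECKSUMS.md5 to check every file it lists (with GNU md5sum, add
--ignore-missing to skip files you did not download).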
CHECKSUMS.md5
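On systems without `md5sum`, the same check can be done in Python. This is a minimal sketch that assumes CHECKSUMS.md5 uses the usual `md5sum` output format (a hex digest, whitespace, then the file name, optionally prefixed with `*`). The manifest layout is not confirmed on this page.

```python
import hashlib
from typing import Dict, Iterable


def md5_of_chunks(chunks: Iterable[bytes]) -> str:
    """Hex md5 digest of a sequence of byte chunks."""
    digest = hashlib.md5()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hex md5 digest of a file, read in 1 MiB chunks."""
    with open(path, "rb") as handle:
        return md5_of_chunks(iter(lambda: handle.read(chunk_size), b""))


def parse_manifest(text: str) -> Dict[str, str]:
    """Parse md5sum-style lines into a {filename: digest} mapping."""
    expected = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        digest, name = line.split(maxsplit=1)
        # md5sum marks binary mode with a leading '*' on the file name.
        expected[name.strip().lstrip("*")] = digest.lower()
    return expected


def verify(path: str, manifest: Dict[str, str]) -> bool:
    """True if the file's digest matches its manifest entry."""
    return md5_of_file(path) == manifest[path]
```

Reading in fixed-size chunks keeps memory use flat, which matters for the multi-gigabyte files listed above.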
API access (available now)
REST API
All data is accessible programmatically through the REST API. No authentication
required; responses are JSON.
GET  /api/entry/{canonical_id}                        Full record for one piRNA
GET  /api/search?q={query}&quality={tier}&chr={chr}   Search with filters, paginated
GET  /api/family/{family_id}                          All members of a piRNA family
GET  /api/stats                                       Database-wide summary statistics
GET  /api/ppseed/{seed}                               All piRNAs sharing a ping-pong seed
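The endpoints above can be called from any HTTP client. The sketch below builds request URLs for the entry and search endpoints and fetches JSON using only the Python standard library. The host in `BASE_URL` is a placeholder, since the public address is not given here. The sketch also assumes the `quality` and `chr` filters may be omitted, and it does not handle pagination because the paging parameters are not documented on this page.

```python
import json
from typing import Any, Optional
from urllib.parse import quote, urlencode
from urllib.request import urlopen

BASE_URL = "https://pirnaome.example.org"  # hypothetical host; replace with the real one


def entry_url(canonical_id: str, base: str = BASE_URL) -> str:
    """URL for the full record of one piRNA."""
    return f"{base}/api/entry/{quote(canonical_id, safe='')}"


def search_url(query: str, quality: Optional[str] = None,
               chrom: Optional[str] = None, base: str = BASE_URL) -> str:
    """URL for a search; filters are included only when given."""
    params = {"q": query}
    if quality is not None:
        params["quality"] = quality
    if chrom is not None:
        params["chr"] = chrom
    return f"{base}/api/search?{urlencode(params)}"


def fetch_json(url: str, timeout: float = 30.0) -> Any:
    """GET a URL and decode the JSON response body."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

For example, `fetch_json(search_url("TGAGG", chrom="chr1"))` would issue a search restricted to chromosome 1. The query string is illustrative, and the shape of the returned JSON should be checked against a real response.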
Use & citation
License
piRNAome is freely available for academic and non-commercial research. Final
licensing terms accompany the manuscript release; until then, please contact the
authors before redistribution.
How to cite
If you use piRNAome in your work, please cite:
[Authors]. piRNAome: a unified human piRNA database with isoform-family
integration and SNV-level annotation. Nucleic Acids Research (2026).
DOI: to be assigned upon publication.