Insights in Biology and Medicine (IBM) maintains a multi-layered digital preservation and archiving program that safeguards the Version of Record (VoR), associated metadata, supplementary materials, and machine-readable renditions (e.g., JATS XML) across independent, geographically distributed systems. Preservation is aligned with community best practices for scholarly publishing, enabling verification, citation, and reuse over decades.

  • Scope Articles, issues, supplements, editorials, corrections, retractions, datasets (where hosted), and supplementary files.
  • Formats PDF/PDF-A, HTML, XML/JATS, image assets (SVG/PNG/JPEG), audio/video, and ancillary data packages.
  • Redundancy At least three preserved copies in distinct networks with regular fixity checks and integrity reporting.

Preservation Goals

  1. Integrity: Ensure bit-level and content-level integrity through checksums, version control, and documented change logs.
  2. Authenticity: Maintain provenance and unbroken DOI resolution to the canonical landing page.
  3. Accessibility: Provide reliable access with reasonable accommodations (accessible PDFs/HTML, alternative text, captions).
  4. Interoperability: Offer standards-based exports (JATS, Crossref, DOAJ XML, OAI-PMH) for libraries and aggregators.
  5. Continuity: Guarantee content availability via independent preservation networks in case of platform changes or cessation.

Preservation Systems & Layers

Primary Platform Preservation

Articles are hosted on the journal platform with routine backups and off-site replicas. Storage utilizes RAID redundancy and snapshotting with scheduled exports of PDFs, HTML, and JATS XML to cold storage.

  • Nightly database and file-store backups retained with a tiered retention policy (daily/weekly/monthly).
  • Immutable “vault” snapshots to protect against accidental deletion and ransomware.
  • Automated DOI link checking and landing-page audits to validate persistence.
Community Preservation Networks

IBM participates in community archiving networks (e.g., LOCKSS/CLOCKSS–style programs and PKP PN equivalents) to ensure long-term survivability independent of the publisher’s infrastructure.

  • Distributed, library-operated nodes with consensus repair using peer-to-peer polling.
  • Triggered access policies that make preserved content accessible if the journal becomes unavailable.
  • Regular content audits and fixity verification.
Repository & Index Backstops

Where applicable, VoR or AAM copies are deposited in trusted repositories and index-linked services to increase resilience and discovery.

  • Institutional and subject repositories with DOI-based canonical linking (see Repository Policy).
  • Abstracting and indexing services that maintain metadata mirrors and archival thumbnails.
  • Open metadata exposure via OAI-PMH and Crossref deposits for reference continuity.
Research Data Preservation

Datasets and code associated with articles should be preserved in domain-relevant repositories (e.g., Dryad, Zenodo, SRA, GEO, PRIDE), with persistent identifiers cited in the article’s Data Availability statement.

  • Preferred: repository-assigned DOIs/accessions; immutable releases for code.
  • Checksums and version labels included; metadata aligns with FAIR principles.
  • Licensing consistent with dataset terms (e.g., CC0 for metadata, CC BY or ODC for data).

Preservation Formats & Normalization

IBM maintains the VoR in the original publisher formats and, where feasible, normalized archival variants for long-term readability.

Content Type Preferred Format(s) Archival Variant Notes
Article text HTML, PDF PDF/A-2b; JATS XML Semantic tagging in JATS for machine readability.
Figures PNG/JPEG TIFF (for raster), SVG (for vector) ≥300 dpi for raster; embed fonts for vector.
Tables HTML JATS XML tables Preserve captions and footnotes.
Supplementary files PDF/CSV/ZIP Checksumed ZIP; README.txt Document file origins and software versions.
Audio/Video MP3/MP4 (H.264) Lossless masters where supplied Include captions/transcripts where available.
Code Repository release (ZIP/TAR) Immutable release + DOI Dependency manifest preserved.
Fixity: Files are assigned SHA-256 checksums; periodic fixity checks run on all preserved locations with alerts on mismatch.

Disaster Recovery & Continuity

Our disaster-recovery plan ensures continuity of access and editorial operations in adverse events (infrastructure failures, data loss, legal transitions, force majeure).

  • RPO/RTO: Recovery Point Objective ≤ 24 hours; Recovery Time Objective ≤ 72 hours for read access to VoR.
  • Failover: Read-only mirror service to expose preserved VoR and metadata while primary systems recover.
  • Runbooks: Documented steps for DNS failover, cache invalidation, and DOI update verification.
  • Incident Logs: Post-incident reports shared with stakeholders and preservation partners.
Triggered Access: If the journal ceases operations or becomes unavailable for an extended period, preservation networks can provide triggered access to the VoR until a new steward is designated.

Corrections, Retractions & Takedowns

IBM follows the principle of preserving the scholarly record while transparently signaling amendments.

  • Corrections: Linked from the VoR with bidirectional metadata; preserved as separate citable items.
  • Retractions: VoR retained with clear watermark and explanatory notice; metadata flags (retraction) propagated to Crossref and repositories.
  • Takedowns: In rare legal/ethical cases, access may be restricted; tombstone pages remain with reason codes and DOI continuity.

Roles & Responsibilities

Publisher & Editorial Office
  • Maintain preservation SLAs and audit schedules.
  • Deposit Crossref metadata, updates, and relationships (corrections/retractions).
  • Coordinate with preservation networks and repositories for content verification.
  • Monitor DOI resolution, OAI-PMH exposure, and JSON-LD correctness on article pages.
Authors
  • Provide final files in preferred archival formats where possible.
  • Use recognized data/code repositories with persistent identifiers.
  • Ensure third-party permissions are documented for long-term availability.
Libraries & Repositories
  • Harvest metadata via OAI-PMH; maintain DOI backlinks to the canonical VoR.
  • Honor license terms; expose dc.rights.uri and licenseRef fields.
  • Propagate amendments to maintain record integrity.

Security & Integrity Controls

  • Checksums & Manifests: Per-object checksums; release manifests signed by the system.
  • Access Controls: Principle of least privilege; immutable logs for preservation actions.
  • Versioning: Every file replacement generates a new version with diff notes; prior versions preserved.
  • Monitoring: Uptime, DOI resolve rates, OAI-PMH response health, and fixity dashboards.
Format Obsolescence: Periodic reviews identify at-risk formats; migration pathways are tested (e.g., PDF → PDF/A; proprietary images → TIFF/SVG).

Metadata, Identifiers & Interoperability

Rich metadata underpins preservation. IBM publishes machine-readable metadata on article pages and through aggregators.

  • Identifiers: DOI for articles and components; ORCID for contributors; funder IDs; grant numbers.
  • Exchanges: Crossref deposits (relations for updates), DOAJ export, OAI-PMH (oai_dc, jats), and JSON-LD/Schema.org on the VoR.
  • Rights: CC BY 4.0 license exposed in machine-readable fields; third-party exceptions labeled.

Testing, Audits & Reporting

IBM runs scheduled tests to validate recoverability and content completeness.

  1. Quarterly restore drills from off-site snapshots to a sandbox environment.
  2. Monthly fixity verification reports with exception handling and remediation notes.
  3. Annual preservation statement with metrics: object counts, failed fixity events, restore test outcomes.
Sample Metrics
Metric Target Status Signal
DOI resolution uptime ≥ 99.95% OK if ≥ target; investigate if ↓
Fixity mismatches per month 0 Any > 0 triggers incident workflow
Restore drill success 100% Report steps and timings

Accessibility & Inclusive Preservation

Preserved content aims to meet accessibility expectations over time. PDFs include tagged structures and bookmarks; HTML includes ARIA roles, alt text for figures, and captions/transcripts for media when provided. Preservation packages retain these features to avoid regressions during migrations.

Policy Updates & Review Cycle

This policy is reviewed annually or upon significant platform changes. Amendments are versioned and time-stamped; historic policy versions remain accessible for auditability.

Frequently Asked Questions

What happens if the journal changes platforms?

Content and metadata are migrated with preservation copies validated against checksums; DOIs continue to resolve to the canonical pages.

Can authors retain personal copies?

Yes. Authors are encouraged to keep personal copies and deposit AAM/VoR in institutional repositories per our Repository Policy.

How are data supplements preserved?

Where hosted by IBM, supplements are packaged with README, checksums, and versioning; otherwise, we reference the external repository DOI/accession.

Sources and References

  • Journal website: https://www.biologymedjournal.com/
  • OAI-PMH v2.0 & JATS (NISO Z39.96) preservation-aligned practices
  • LOCKSS/CLOCKSS community preservation principles
  • Crossref schema documentation (relations for updates/corrections)
  • FAIR Data Principles; repository best practices (Dryad, Zenodo, NCBI, EMBL-EBI)
  • DOAJ/COPE guidance on preserving the scholarly record

Last updated: September 29, 2025 · Approx. word count: 1,700 · Content slug: preservation-archiving-ibm-2025-09