USFDA Technical Specifications: Submitting Next-Generation Sequencing Data to the Division of Antivirals
- Sharan Murugan
- 37 minutes ago
- 3 min read
Next-generation sequencing (NGS) has become an essential tool in antiviral drug development, enabling detailed analysis of viral genetic changes and the identification of resistance-associated variants. Compared with traditional sequencing methods, NGS generates extensive sequence data that provides a deeper understanding of viral populations throughout clinical development.

To promote consistent and high-quality regulatory submissions, the U.S. Food and Drug Administration (FDA) published the technical specifications document Submitting Next-Generation Sequencing Data to the Division of Antivirals. The document provides recommendations for sponsors on preparing, analyzing, documenting, and submitting NGS protocols, datasets, and resistance analyses to support antiviral drug development and regulatory review.
Why This Guidance Is Important
The FDA's Division of Antivirals performs independent analyses of NGS data submitted during antiviral drug development to evaluate viral resistance. These analyses support regulatory decision-making, contribute to antiviral product labeling, and help monitor the emergence of drug-resistant viral variants.
Because NGS generates highly complex datasets, the guidance aims to standardize data submission practices, improve the consistency of bioinformatics analyses, and ensure that resistance data can be efficiently reviewed throughout the drug development process.
Planning NGS Studies
The guidance recommends that sponsors engage with the FDA early during drug development, preferably before initiating pivotal clinical trials. Sponsors are encouraged to submit their NGS protocol, including sequencing methods and bioinformatics analysis plans, in advance for regulatory discussion.
To verify that datasets are formatted appropriately, sponsors are also encouraged to submit a small mock NGS dataset containing representative sequencing files before providing large clinical datasets.
Information to Include in an NGS Submission
A complete NGS submission should include a detailed sequencing protocol, raw sequencing data, consensus sequences, reference sequences, amino acid frequency tables, and an NGS report summarizing resistance analyses.
The sequencing protocol should describe laboratory procedures such as sample collection, nucleic acid extraction, amplification, library preparation, sequencing methods, quality control criteria, and data analysis workflows. Sponsors are also encouraged to provide validation information for their sequencing methods whenever available.
Sequencing Data and Bioinformatics Analysis
The guidance recommends submitting all raw sequencing data in FASTQ format together with consensus sequences in FASTA format. Additional reference sequences, primer sequences, and adapter sequences should also be provided to support FDA review.
Sponsors should clearly describe the bioinformatics pipeline used for sequence analysis, including methods for read filtering, quality trimming, mapping reads to reference sequences or assembling contigs, variant detection, and mutation filtering. The guidance emphasizes that mutation frequency thresholds should be scientifically justified and supported through assay validation to distinguish true viral variants from sequencing errors.
NGS Reporting Requirements
An NGS report should summarize resistance analysis results and provide sufficient information for FDA review. This includes descriptions of baseline viral polymorphisms, treatment-emergent substitutions, resistance-associated mutations, and their relationship to virologic or clinical outcomes.
The report should also summarize sequencing success rates, quality metrics, missing sequencing data, and overall dataset characteristics. Where applicable, sponsors should include analyses evaluating sequence conservation and identify viral variants that may influence antiviral resistance.
Quality Control Considerations
The guidance places strong emphasis on quality assurance throughout the sequencing process. Sponsors should establish quality control criteria for samples, sequencing libraries, sequencing runs, and sequencing results while incorporating appropriate positive and negative controls.
For sequencing protocols that include PCR amplification, sponsors are encouraged to validate assay performance using samples containing known variant frequencies. The guidance also recommends documenting approaches used to minimize sequencing errors and confirm the integrity of submitted data.
File Formats and Submission Procedures
To facilitate efficient regulatory review, the FDA recommends standardized file formats and submission methods for all NGS-related information. Documents such as the NGS protocol, NGS report, and reference sequences should be submitted electronically through the FDA Electronic Submissions Gateway (ESG), while large sequencing datasets, including FASTQ and FASTA files, should generally be provided on an external hard drive.
The guidance also recommends standardized naming conventions for sequencing files and consensus sequences so that participant identifiers, study visits, sequencing reads, and viral genotypes can be easily identified during regulatory review.
Following these recommendations can improve the efficiency of regulatory review while supporting accurate assessment of antiviral resistance throughout clinical development.