Input¶
Uploading Data¶
BugSeq has two options for submitting data to the platform. All data is stored securely and encrypted, and is only accessible to your own account.
Uploading Directly¶
To upload files from your computer/server, drag and drop all files into the upload box, or select your files by clicking on the upload box.
Tip
Submit all files for your analysis together (ie. multiple samples) to speed up processing time and perform cross-sample analyses.
BaseSpace Integration¶
For BaseSpace users, we have an integration to retrieve data directly from BaseSpace. This is significantly easier, faster and less error-prone.
Once it is set up, your projects will appear in BugSeq. When you click on a BaseSpace project, you will see the samples in the project and have the ability to add them to an analysis:
We require your BaseSpace API key to link BugSeq with BaseSpace. API keys are stored securely and encrypted within the BugSeq infrastructure, and are not accessible to anyone.
To get onboarded to our BaseSpace integration:
- Please reach out to support@bugseq.com so we can securely obtain your BaseSpace API key.
- We will enable the integration on your account.
- We will confirm once this is complete by responding to your email.
- You’re ready to go!
Format¶
BugSeq supports all standard sequencing file formats. The following file formats are currently accepted:
FASTQ
: Basecalled nanopore, PacBio or Illumina sequences- Acceptable file extensions:
- Uncompressed:
.fq
,.fastq
- Compressed:
.fq.gz
,.fastq.gz
- Uncompressed:
- Note: only DNA/RNA sequencing data with IUPAC notation is acceptable.
- Acceptable file extensions:
FAST5
: Raw nanopore sequencing data- Acceptable file extensions:
.fast5
- Acceptable file extensions:
Info
Illumina data must be paired-end and contain separate files for forward and reverse reads. These files must have names with format {SAMPLE_NAME}_R[1/2].fastq[.gz]
, {SAMPLE_NAME}_[1/2].fastq[.gz]
or {SAMPLE_NAME}_L[/d]+_R[1/2]_001.fastq[.gz]
.
Tip
Uploading gzip
compressed files will reduce transfer time of large files to BugSeq.
Read Preprocessing¶
As BugSeq performs automatic read preprocessing and leverages all aspects of the input data to produce optimal results, we recommend against any data manipulation before submitting to BugSeq. For example, BugSeq uses FASTQ
quality information to correct reads, yielding optimal assemblies. BugSeq is built to handle raw data, saving our users time and complexity before data submission while ensuring optimal results.
Experimental Design¶
Platform¶
You may submit files from any Oxford Nanopore, PacBio or Illumina sequencer.
BugSeq will automatically detect the sequencing platform and perform tailored, best-practice analyses. The following processes are adjusted based on platform:
- Basecalling
- Demultiplexing
- Quality evaluation
- Adapter trimming
- Quality trimming and filtering
- Metagenomic classification
- Alignment
- Assembly
- Taxonomic binning
- Post-assembly analyses, including:
- Antimicrobial resistance prediction
- Strain typing
- Other pathogen-specific analyses
Sequencing Strategy¶
BugSeq supports all widely accepted sequencing strategies. These include:
- Metagenomic/Metatranscriptomic Sequencing
- Whole Genome Sequencing (WGS)
- Amplicon Sequencing
- 16S/ITS
- MLST
- Viral amplicons
Sequenced material can be DNA or RNA. Nucleic acid can be sequenced directly or amplified with techniques such as PCR.
Multiplexing¶
BugSeq automatically performs demultiplexing and adapter trimming on nanopore sequencing data. Illumina and PacBio data should already be demultiplexed before submission, but may have adapters, as BugSeq trims these automatically.
Strict (dual-barcode) nanopore demultiplexing
BugSeq parses FASTQ
headers for barcoding data. Occasionally, users may want BugSeq to perform strict demultiplexing of nanopore data, looking for barcodes on both ends of reads. Users should either perform strict demulitplexing before submitting to BugSeq, or perform no demultiplexing before submitting to BugSeq. Files which have already been demultiplexed with default (single-ended barcode) demultiplexing will not be further demultiplexed by BugSeq.
New Designs¶
Don’t see your experimental design combination here? Get in touch with us via support.