Data Submission¶
Uploading Data¶
BugSeq has two options for submitting data to the platform. All data is stored securely and encrypted, and is only accessible to your own account.
Uploading Directly¶
To upload files from your computer/server, drag and drop all files into the upload box, or select your files by clicking on the upload box.
Tip
Submit all files for your analysis together (ie. multiple samples) to speed up processing time and perform cross-sample analyses.
BaseSpace Integration¶
For BaseSpace users, we have an integration to retrieve data directly from BaseSpace. This is significantly easier, faster and less error-prone.
Once it is set up, your projects will appear in BugSeq. When you click on a BaseSpace project, you will see the samples in the project and have the ability to add them to an analysis:
We require your BaseSpace API key to link BugSeq with BaseSpace. API keys are stored securely and encrypted within the BugSeq infrastructure, and are not accessible to anyone.
To get onboarded to our BaseSpace integration:
- Please reach out to support@bugseq.com so we can securely obtain your BaseSpace API key.
- We will enable the integration on your account.
- We will confirm once this is complete by responding to your email.
- You’re ready to go!
File Format¶
BugSeq supports all standard sequencing file formats. The following file formats are currently accepted:
FASTQ
: Basecalled ONT, PacBio, Ion Torrent or Illumina sequences- Acceptable file extensions:
- Uncompressed:
.fq
,.fastq
- Compressed :
.fq.gz
,.fastq.gz
- Uncompressed:
- Note: only DNA/RNA sequencing data with IUPAC notation is acceptable.
- Acceptable file extensions:
BAM
: Basecalled nanopore or PacBio sequences- Acceptable file extensions:
.bam
- Note: all reads must be unmapped, such as those directly output by the sequencer or basecalling software. BAM files with mapped reads will fail analysis.
- Acceptable file extensions:
Info
We accept both single and paired-end Illumina data. Paired-end data must contain separate files for forward and reverse reads with format {SAMPLE_NAME}_R[1/2].fastq[.gz]
, {SAMPLE_NAME}_[1/2].fastq[.gz]
or {SAMPLE_NAME}_L[/d]+_R[1/2]_001.fastq[.gz]
.
Read Preprocessing¶
As BugSeq performs automatic read preprocessing and leverages all aspects of the input data to produce optimal results, we recommend against any data manipulation before submitting to BugSeq. For example, BugSeq uses FASTQ
quality information to correct reads, yielding optimal assemblies. BugSeq is built to handle raw data, saving our users time and complexity before data submission while ensuring optimal results.
Experimental Design¶
Platform¶
You may submit files from any Oxford Nanopore, PacBio or Illumina sequencer.
BugSeq will automatically detect the sequencing platform and perform tailored, best-practice analyses. The following processes are adjusted based on platform:
- Basecalling
- Demultiplexing
- Quality evaluation
- Adapter trimming
- Quality trimming and filtering
- Metagenomic classification
- Alignment
- Assembly
- Taxonomic binning
- Post-assembly analyses, including:
- Antimicrobial resistance prediction
- Strain typing
- Other pathogen-specific analyses
Sequencing Strategy¶
BugSeq supports all widely accepted sequencing strategies. These include:
- Metagenomic/Metatranscriptomic Sequencing
- Whole Genome Sequencing (WGS)
- Amplicon Sequencing
- 16S/ITS
- MLST
- Viral amplicons
Sequenced material can be DNA or RNA. Nucleic acid can be sequenced directly or amplified with techniques such as PCR.
New Designs¶
Don’t see your experimental design combination here? Get in touch with us via support.