Login

FastQC: A quality control tool for high throughput sequence data

NL
Nour Larifi
Feb 12, 2024

Introduction


FastQC is a widely used and powerful quality control tool that aims to simplify the process of checking the quality of raw sequence data generated by high-throughput sequencing pipelines. It offers a modular set of analyses that allow users to quickly assess the quality of their data and identify potential issues that may need to be addressed before proceeding with downstream analyses. It is easy to use and is compatible with a variety of sequencing platforms, making it a valuable tool for researchers working with different types of sequencing data.


Main functions of FastQC :


  • Providing a quick overview to tell you in which areas there may be problems
    • Summary graphs and tables to quickly assess your data
      • Export of results to an HTML based permanent report

        Steps to follow


        Ensure that the


        1. version 0.1.2 brick is loaded
          1. So first, upload your fastqc folder to the Databox.
            1. Then, create a new experiment.
              1. Import your resource

                Link it to the


                task available in the


                1. brick.
                  1. Run your experiment
                    1. Get your HTML report that will provide detailed information on the quality of the input sequencing data


                      Description of output file


                      The reports include a variety of visualizations and statistical analyses, such as quality score plots, sequence length distribution plots, per-base sequence quality scores, per-base sequence content, GC content plots, adapter content, and overrepresented sequences, among others.