Skip to content

How to visualize on a map the geographical origins of samples of Bombyx mori nucleopolyhedrovirus from Thailand.

License

Notifications You must be signed in to change notification settings

wennj/bmnpv-diversity-thailand

Repository files navigation

Analysis of the geographical and genetic distance of samples of the Bombyx mori nucleopolyhedrovirus from Thailand

The figures and content in this repository are derived from the following publication:

  • Wennmann, J.T., Senger, S., Ruoff, B., Jehle, J.A., Suraporn, S. (2024). Distribution and genetic deviersity of Bombyx mori nucleopolyhedrovirus in mass-reared silkworms in Thailand. Journal of Invertebrate Pathology, https://doi.org/10.1016/j.jip.2024.108221.

Aim of this repository

  • Demonstration of the R code for creating a map of Thailand to mark sampling sites of baculovirus samples.

  • Learn how R can be used to analyse the correlation between geographical and genetic distance.

  • Provide a Galaxy workflow to extract BmNPV homologous genes from BmNPV genome sequences.

Map of Thailand

As part of the study, 21 locations in Thailand were sampled to collect BmNPV (Wennmann et al., 2024). For a better visualization, a map showing all the provinces of Thailand was created. The provinces were dividgied into four regions. The regions were coloured according to the intensity of agricultural cultivation of mulberry trees. The locations of the BmNPV samples were marked with different symbols, depending on whether they were analysed using whole genome sequencing (WGS) or only partial gene sequencing (Sanger sequencing). Phylogenetic analyses were performed based on the sequence data, and depending on the phylogenetic clade, the symbols on the map were coloured accordingly. This complex representation conveys a lot of interesting information (Figure).

Click here for the R code used to create the figure.

Map Preview

Click to expand the full-sized map Full Map of Thailand

BmNPV Phylogeny

Two phylogenies were constructed based on partially sequenced genes and the sequence of 138 open reading frames. The phylogenies should also be linked to the map and thus to the location where the BmNPV samples were collected. The phylogeny itself was carried out using MEGA and the trees were read into R. Details of the sequence and phylogenetic analysis can be found in the publication (Wennmann et al., 2024).

Click here for the R code used to create the figure.

Phylogeny

Correlation between geographic and genetic distance

The geographical information (longitude/latitude) can be used to calculate the distances between the BmNPV collection sites. Genetic distances, in this study the Kimura 2 parameter, can be calculated from the sequence data. A correlation can be used to determine whether there is a correlation between these two.

Click here for the R code used to create the next two figures.

Map Preview

The correlation can also be checked for all CDS individually. The alignments of the individual CDS serve as a basis for this. Output is only generated for the CDS that show a positive correlation.

Map Preview

Click to expand the full-sized map Full Map of Thailand

Analysis of the genetic distance at the CDS level

The alignments of all 138 CDSs of the fully sequenced genomes also allowed the genetic K2P distance to be considered at the level of individual CDSs. This was done by calculating the K2P distance for each CDS and displaying the result in a matrix. It was also possible to check whether there was a correlation between geographical and genetic distance.

Click here for the R code used to create the figure.

Click to expand the full-sized map Full Map of Thailand

A Galaxy-Workflow for extracting homologous ORFs

Next is a figure of a workflow that can be run on a Galaxy server (usegalaxy.org, usegalaxy.eu). Two inputs are required: (i) BmNPV genomes in FASTA format and (ii) a FASTA file with all CDS of a singe BmNPV (extracted from gff3 or genbank file). During this workflow, homologous genes are searched using Blast and combined with each other. In the last step, all homologous genes are aligned and concatenated. Based on resulting data, a phylogenetic tree can then be constructed or genetic distanced be calculated (see above).

The Galaxy workflow file can be found here.

About

How to visualize on a map the geographical origins of samples of Bombyx mori nucleopolyhedrovirus from Thailand.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published