3.3 Identification of a Large Deletion Event in the ASFV Genomes within Pig 8
Due to the failure of DNA isolated samples from pig 8 to yield PCR products for fragments 01 and 02, EDTA-blood, bone-marrow and spleen samples of pig 8 were sequenced directly by Illumina MiSeq. No reads were present that mapped to pos. 6364-16878 of the ASFV Podlaskie reference sequence, therefore we designed primers that spanned across this region, from pos. 6188-17145 (del-PCR). Use of these primers generated a PCR product of ~500 bp in all of the pig 8 samples (
Figure 3B). The EDTA-blood samples from the other pigs and from the inoculum were screened for the presence of this deletion. Samples from pigs 1 and 7 also displayed prominent products of ~500 bp, consistent with the presence of the deletion, while the inoculum displayed a very weak product of this size (
Figure 3A). All samples, except those from pig 7 and pig 8, also yielded products at ~11 kb, consistent with the expected full-length PCR products (without the deletion). The ca. 500bp PCR product from pig 8 was sequenced using the Nanopore and Sanger systems, which each revealed that the deletion occurred between positions 6362 and 16849, a total of 10487 bp. This deletion results in the loss of 21 complete genes, the majority are members of the MGF 110 (2L-14L) family, with a 3’ truncation of the MGF 110-1L gene, and ASFV G ACD 00090, 00120, 00160, 00190, 00240, and 00270, as well as MGF 360-4L. To confirm the deletion of nucleotides 6362-16849, we designed primers located within this region at pos. 6708-7668 (noDel-PCR), and screened all samples as above. All samples, except those from pig 8, yielded the expected product at ~1000 bp (
Figure 3C), consistent with the ability to produce full-length PCR fragments 01 and 02. Screening of the other samples from pig 8, revealed a very weak ~1000 bp product in the bone-marrow and serum samples (
Figure 3D).
In order to elucidate when this deletion variant arose in the inoculum, the first passage sample of the ASFV stock was also screened using these PCRs. They yielded only a full-length PCR product for the del-PCR, and the expected ~1000 bp product for the noDel-PCR (
Figure S1), indicating that the deletion event took place during the production of the second passage virus stock, in porcine pulmonary alveolar macrophages (PPAM), which was used as the inoculum in the infection study.
3.6 Indel Analysis of EDTA Blood Samples
All samples were screened for the presence of indels above 5 bp in the full-length Nanopore reads derived from the various PCR products in order to detect larger deletions. As Nanopore reads contain a lot of noise, indels below 5 bp were ignored.
The identified 10487 bp deletion in the ASFV DNA from pig 8, which spans the region 6362-16849, results in the failure of the PCRs that generate fragments 01 and 02. This is due to the locations of their reverse and forward primers, respectively, within the deleted region. Consequently, only full-length versions of this region of the genome are successfully amplified. This means that the large deletion in the virus from pig 8 (and pigs 1 and 7) does not appear in this indel analysis.
The inoculum had a total of 4 indels above the cut-off, where 3 of these were 5 bp at 3.1-7.4% frequency and located in coding homopolymer regions, causing frameshifts in EP1242L, C717R, and G1340L (
Figure 6 and
Table S3). Each of these deletions were found in the EDTA-blood samples from at least one of the infected pigs. The 10 bp insertion, a tandem repeat ‘TATATAGGAA’, at 172423 was located in a non-coding region between the I73R and I329L genes with a frequency of 43.6% and was shared at similar frequencies (49-55%) in all the other samples. This insertion was also found in the aligned Miseq data, but at lower apparent frequencies. Due to this discrepancy, we counted the number of tandem repeats in the trimmed reads (pre-alignment), which revealed that that 96.3%-100% of the reads in both the Nanopore and MiSeq reads corresponded to the IGR-II variant. The disparity between the raw reads and alignment is due to reference bias (see Discussion). The published ASFV/POL/2015/Podlaskie reference strain sequence [
26] does not contain this tandem repeat, however the results from this current study demonstrate that the ASFV/POL/2015/Podlaskie strain is actually an IGR-II variant. This disparity is discussed below.
In pig 1, the virus population exhibited 5 deletions, including a unique deletion of 5 bp with a frequency of 6%, located at position 115012 in a homopolymer region. This particular deletion caused a frameshift within the G1211R coding sequence. Within pig 2, the ASFV genomes displayed 2 deletions. One was a unique 5 bp deletion at position 116281 in a homopolymer region with a frequency of 3%, which also results in a frameshift within the G1211R coding sequence. The second deletion, of 5bp, was shared with pigs 4, 10, and 12 at position 85921, and ranged in frequency between 3.2% to 7%. This shared deletion caused a frameshift in the C475L gene coding sequence.
In the sample from pig 4, there were an additional 6 deletions. Two of these deletions were unique, with frequencies of 6.8% and 2.1%, and sizes of 6 bp and 5 bp, respectively. The 5 bp deletion led to a frameshift in the C717R gene, while the 6 bp deletion was situated in a non-coding homopolymer region. The remaining 4 deletions were 5 bp in size, found at frequencies of 2.2% to 2.5%, and were shared with at least one other pig, albeit sometimes at frequencies below the designated cut-off value.
The virus population in the sample from pig 7 exhibited 6 additional deletions, 3 of which were unique. The 6 bp deletion at pos. 167679 resulted in an in-frame deletion within the E248R gene at a 2.1% frequency. Another 5 bp deletion at pos. 139760, located in a homopolymeric region within the D339L ORF, and a 5 bp deletion at pos. 173667 in a non-coding region were also identified. Additionally, a 6 bp deletion at 170815 in a homopolymeric region caused an in-frame deletion in the I226R ORF at a 5.8% frequency and was present in all samples, albeit some below the cut-off level. The 5 bp deletion at 93795 at a 5.8% frequency caused a frameshift in the B962L ORF and was also detected below the cut-off level in pig 1.
The sample from pig 8 had 3 additional deletions, which were also detected in the sample from pig 12, though some below the cut-off level. Two of these deletions caused frameshifts within Q706L, in homopolymeric regions, present at frequencies of 2.3-2.5%
and 1.4-2.3%, respectively, and the third was in a non-coding homopolymer region. In contrast to all other samples, the virus in the sample from pig 9 did not have any unique indels.
The virus population in pig 10 showed 3 unique deletions, all resulting in frameshifts within the F1055L, G1340L, and R298L ORFs, in homopolymeric regions, at frequencies of 3%, 2.1%, and 4.3%, respectively. Another 4 deletions (5-6 bp) were identified in at least one other pig, though some were below the designated cut-off value. Two of these deletions caused in-frame deletions located in homopolymeric regions, in the C257L gene (pos. 85015), and the D1133L gene (pos. 141628), with frequencies of 2% and 4%, respectively. Another deletion occurred at a 2.3% frequency, and was situated in a homopolymeric region, and caused a frameshift in the B962L ORF.
The ASFV genomes within pig 11, displayed 4 unique 5 bp deletions, all leading to frameshifts within the C475L, G1211R , CP2475L, and H108R ORFs. There was also a 5 bp deletion in a non-coding homopolymeric region and an additional 6bp deletion in G1211R. These 6 deletions occurred at frequencies of 2-5.2%. The remaining 11 deletions (5-7 bp) were present in at least one other sample, although below the cut-off value. Of these deletions, 5 caused frameshifts in the MGF 505-2R, B962L, B169L, B602L, and H339R genes. Another 5 were located in homopolymeric regions within genes F165R, C962R, B175L, G1211R, and D1133L, with the four former changes causing frameshifts, and the latter being an in-frame deletion.
The virus in EDTA-blood from pig 12 had 1 unique deletion located at pos. 424 in a homopolymeric non-coding region with a frequency of 77 %. Another 5 bp frameshift deletion located at nt 158292 affecting the G706L ORF with a 2.1% frequency, was also detected in the inoculum but at below the designated cut-off level.
3.7 Samples from Pig 2
As mentioned above, the virus inoculum had a total for 4 indels present at above the cut-off frequency of 2%, where 3 of these were 5 bp in length at 3.1-7.4% frequency and located in coding regions, causing frameshifts in EP1242L, C717R, and G1340L (
Figure 7 and
Table S4). Two of these deletions were located in homopolymeric regions. The deletion at pos. 112741 (G1340L) was not present in any of the samples from pig 2. The 10 bp tandem repeat insertion (IGR-II) was present in each of the different samples from pig 2 at 46.7%-53.3% in the aligned Nanopore reads, and subsequent screening of the trimmed Nanopore and MiSeq reads also revealed a 99.9%-100% frequency. The 5 bp deletion at 83946 in the C717R gene, as found in the inoculum, was only detected in the serum sample, but was present below the cut-off level, and the 5 bp deletion at 69772 in the EP1242L ORF in the inoculum was only detected in the spleen sample, but at below the designated cut-off level.
In the EDTA-blood sample from pig 2, a distinctive 5 bp deletion at nt 85921 in the C475L gene, occurring at a frequency of 3.19%, was identified. This deletion was not found in other samples from pig 2, but was also present in the EDTA-blood of pigs 4, 10, and 12. Additionally, the 5 bp deletion at pos. 116281 in the G1211R ORF was observed at a 3.1% frequency in the EDTA-blood sample and at 2.5% in the bone-marrow sample.
In the bone marrow sample from pig 2, a unique 5 bp deletion causing a frameshift within the MGF 505-4R ORF, in a homopolymeric region, at a frequency of 2.3% was identified. Additionally, an in-frame deletion of 6 bp affecting the I226R gene, in a homopolymeric region, at a 2.4% frequency was present in the inoculum and in other samples from pig 2, although below the cut-off value.
Figure 6.
Indel frequency plots of insertions (red) and deletions (blue). Light grey background indicates areas with no PCR coverage, and dark grey background indicates areas where with failed PCRs (due to the deletion). Genes on the forward strand are indicated in blue, and genes on the reverse strand are indicated in gold, with affected genes in dark grey. (A) Indel frequencies in Inoculum. (B) Indel frequencies in sample E1. (C) Indel frequencies in sample E2. (D) Indel frequencies in sample E4. (E) Indel frequencies in sample E7. (F) Indel frequencies in sample E8. (G) Indel frequencies in sample E9. (H) Indel frequencies in sample E10. (I) Indel frequencies in sample E11. (J) Indel frequencies in sample E12.
Figure 6.
Indel frequency plots of insertions (red) and deletions (blue). Light grey background indicates areas with no PCR coverage, and dark grey background indicates areas where with failed PCRs (due to the deletion). Genes on the forward strand are indicated in blue, and genes on the reverse strand are indicated in gold, with affected genes in dark grey. (A) Indel frequencies in Inoculum. (B) Indel frequencies in sample E1. (C) Indel frequencies in sample E2. (D) Indel frequencies in sample E4. (E) Indel frequencies in sample E7. (F) Indel frequencies in sample E8. (G) Indel frequencies in sample E9. (H) Indel frequencies in sample E10. (I) Indel frequencies in sample E11. (J) Indel frequencies in sample E12.
Figure 7.
Indel frequency plots of silent (green) and missense (red) mutations. Light grey background indicates areas with no PCR coverage. Genes on forward strand are indicated in blue, and genes on reverse strand in gold, with affected genes in dark grey. (A) Indel frequencies in the Inoculum. (B) Indel frequencies in sample E2. (C) Indel frequencies in sample B2. (D) Indel frequencies in sample SP2. (E) Indel frequencies in sample S2.
Figure 7.
Indel frequency plots of silent (green) and missense (red) mutations. Light grey background indicates areas with no PCR coverage. Genes on forward strand are indicated in blue, and genes on reverse strand in gold, with affected genes in dark grey. (A) Indel frequencies in the Inoculum. (B) Indel frequencies in sample E2. (C) Indel frequencies in sample B2. (D) Indel frequencies in sample SP2. (E) Indel frequencies in sample S2.
The spleen sample from pig 2 exhibited two unique 5 bp deletions causing frameshifts in the MGF 360-1La and B385R ORFs, in homopolymeric regions, with frequencies of 2.9% and 2.6%, respectively. The serum sample from pig 2 had no unique indels above the cut-off value.