Investigating Annual and Seasonal Diet Preference of Southern Resident Killer Whales in the Salish Sea: A Genetic Metabarcoding Approach

Author

Sofia Kaiaua1, Deborah Giles2,1, Amy Van Cise1

  1. University of Washington, Seattle, WA
  2. Wild Orca, San Juan, WA


Corresponding author email: avancise@uw.edu


Background

The endangered southern resident killer whale (Orcinus orca ater, hereafter SRKW) population has not recovered since the 1970s despite protective measures (NOAA/NMFS, 2016). SRKW exhibit culturally learned behaviors, including diet preferences, primarily consuming Chinook (Oncorhynchus tshawytscha), chum (O. keta), and coho (O. kisutch) salmon (Hanson et al. 2021; Van Cise et al. 2024). Decadal decline in size and abundance of preferred prey may contribute to malnutrition or starvation in this population (Hanson et al. 2021); additionally, prey limitation can increase their vulnerability to cumulative environmental stressors (Wade, Reeves, and Mesnick 2012). Comprehensive analysis of seasonal and annual variability in SRKW diet is vital to constructing recovery strategies for this endangered species; genetic metabarcoding of prey DNA from historically collected fecal samples promises to fill existing data gaps, providing vital insight regarding foraging behavior during non-summer months as well as greater resolution into seasonal variability in foraging behavior within pods, which is currently poorly understood (Van Cise et al. 2024). In this pilot study, we sequence a set of fecal samples collected by Wild Orca between 2019 and 2024. The primary goals of this pilot study are to explore the feasibility of applying a genetic metabarcoding approach to samples collected and processed using a variety of methods, and to investigate seasonal and annual variability in the prey species identified in this sample set.

Methods

We collaborated with Wild Orca to collect fecal samples from SRKW, and supplemented samples collected during the 2024 field season with historical samples collected between 2019-2023. Samples were either swabbed using a cotton-tipped swab or scooped out of the water using a sterilized scoop. A subset of wet scoop samples were later lyophilized (dried), and some were stored as wet samples. Whole genomic DNA was extracted from swabbed, wet scooped, and lyophilized samples using the QIAmp Fast DNA Stool Mini Kit (Qiagen). We quantified the amount of DNA extracted from each sample, then amplified the 16S mitochondrial region using published primers and protocols (Van Cise et al. 2024). We determined PCR success through visual examination using gel electrophoresis, and re-amplified unsuccessful samples under a variety of parameters to increase sample success rate. Those parameters included: 1:10 dilution, 1:2 dilution, DNA cleaned using a Zymo Clean and Concetrate kit, and 2x concetration of cleaned DNA. After all iterations of PCR1 were run, we collected those samples with successfully amplified product to continue through the library preparation protocol.

Next we cleaned the amplified PCR product using the AMPure bead clean protocol and indexed each sample using Unique Dual Indexes (UDI, Illumina, Inc). Because a significant amount of DNA can be lost during each bead cleaning step, we added 8 duplicate samples to the indexing reaction using product that had not been cleaned to determine whether this AMPure bead clean step significantly affects indexing and downstream results. We then performed a second AMPure bead clean to remove all unused reagents, and pooled all samples to contain an equimolar mix of prey DNA from each sample.

Once the amplified product was pooled, we performed a final gel cleaning step using the MinElute PCR purificaton kit (Qiagen, Inc) to remove any non-target DNA fragments, then diluted and denatured the DNA according to the MiSeq sequencing protocol and loaded the pooled DNA and 20% PhiX control onto an in-house MiSeq sequencer.

We quantified the concentration of DNA (ng/ul) in all samples using a Qubit flourometer (ThermoFisher, Inc.) immediately post extraction and post indexing, both in order to guide decision-making regarding the volume of DNA extract to use in the pipeline, and to draw inference regarding the effect of post-amplification cleaning on DNA concentration.

We processed the raw sequence data using a custom dada2-based bioinformatic pipeline (version 1.34.0), implemented in R (version 4.4.2). Briefly, this pipeline removes low quality and short sequence reads before merging sequences and identifying unique individual sequences, called amplicon sequence variants (ASVs), based on sequence similarity. We then compared ASVs to a custom 16S reference database of North Pacific fish species, targeting SRKW prey (Van Cise et al. 2024), using a naive Bayesian algorithm to classify each ASV to the lowest possible taxonomic level (Wang et al. 2007). Following this, we removed all killer whale sequences from the dataset, then removed any samples that only contained killer whale sequences.

We performed all downstream data analysis in R using phyloseq (version 1.50.0), vegan, and ggplot.

Results

Genetic sequencing

Of the 76 SRKW samples provided by Wild Orca, we successfully amplified 37 samples. Most of these (n = 21) were successful in the first iteration of amplification (i.e. original concentration). We gained 1 additional sample in the 1:10 dilution, 2 samples in the 1:2 dilution, 5 samples from the Zymo-cleaned extract, and 8 samples at 2x concentration of cleaned extract. All 38 successfully amplified samples were then cleaned and indexed, and 8 duplicates were indexed without cleaning. This resulted in 45 samples that underwent a post-index bead clean and were pooled into a final product for sequencing.

After aligning and merging forward and reverse raw sequence reads, we retained 36 of the 45 samples; the remaining 9 samples were removed due to low sequence quality. Table 1 shows the mean number of reads retained after each step in the dada2 quality control pipeline; the final number of reads per sample ranged from 458 to 368,431. Of the 9 samples that were not successfully sequenced, 5 were cleaned samples with successful uncleaned duplicates.

Table 1. Mean number of reads retained at each step in the dada2 QAQC pipline.
input filtered denoisedF denoisedR merged nonchim
239,011 68,439 68,400 68,381 67,791 67,259

After removing host sequences, we additionally removed WADE-002-076 and WADE-002-077 which did not have any sequence reads assigning to prey species. This resulted in a final total of 34 samples with prey sequence data that were used in downstream data analyses and visualization.

Identification of prey species

Chinook salmon was the most common prey species across all samples during all time periods (Figure 1). Coho salmon was the next most common prey species across all samples with proportions up to 28.4% (Table 2). Chum salmon was the third most common prey species across all samples. Lingcod (Ophiodon elongatus) was present in early summer months in proportions up to 5.2%. Uncommonly detected species included arrowtooth flounder (Atheresthes stomias), steelhead (Oncorhynchus mykiss), swordspine rockfish (Sebastes ensifer), Pacific dover sole (Microstomus pacificus), and Alaska polluck (Gadus chalcogrammus), were in low frequencies throughout all months (≤1%). Great white shark (Carcharodon carcharias) represented approximately 20% of one sample collected on Sept 8, 2022 near San Juan Island.

Fig 1. Relative abundance of prey items detected in southern resident killer whale samples. Species are only included if they make up >1% of at least one sample. Each sample shown on x axis, proportion of each species (indicated by color) indicated on the y axis.

Seasonal and annual variability in diet

When samples are organized into summer and non-summer months (Figure 2), we see that, in general, diet is more variable in non-summer months than in summer months. Chinook is the most common species in both seasons, and we observed that chum and coho both increased in late summer to early fall months. Notably, we observed lingcod, steelhead, sockeye salmon and arrowtooth flounderin the summer diet, which have not been identified as part of the summer diet in previous studies.

Fig 2. Seasonal variability in southern resident killer whale diet (top) in summer vs. non-summer months and (bottom) per species-month.

We also report mean monthly abundance for each species in table format below:

Table 2. Mean proportion of each prey species by month.
Month Atheresthes stomias Hippoglossus stenolepis Oncorhynchus keta Oncorhynchus kisutch Oncorhynchus tshawytscha Ophiodon elongatus
5 0.01766 0 0.0002672 0.003044 0.9239 0.04009
6 0.009371 0 0.001937 0.0002625 0.9263 0.05952
7 0 0 0 0.000919 0.9947 0
8 0.007008 0.002434 0.1975 0.2197 0.56 0
9 0 0 0.07135 0.2839 0.6214 0
10 0 0 0 0 1 0
11 0 0.1868 0 0 0.8132 0

Finally, we observed a large amount of diet variability among years, with no immediately discernible trends in diversity or content (Figure 3).

Fig 3. Annual variability in souther resident killer whale diet composition (top) and diversity (bottom).

Discussion

Our findings align with previous studies indicating significant diet contributions from major prey species including Chinook, coho, and chum salmon (Hanson et al. 2021; Van Cise et al. 2024). Similar to those previous studies, we observed a decrease in Chinook proportions in August and September compared to early summer months, made up for by increasing proportions of chum and coho salmon increase.

This pilot study successfully generated data for 34 samples collected from all three SRKW pods across a seven month time period and spanning five years; because we know that SRKW diet can vary substantially seasonally and among pods, it is important to remember that this small sample size may have biased our results. With that said, we observed several patterns in this initial subset of data that merit follow-up once the full sample set from Wild Orca is sequenced and combined with the existing sequence data from NOAA samples. We mention these observations below.

Notably, our data suggests a greater prey species contribution from coho salmon compared to existing studies identifying chum salmon as the second greatest contributor to SRKW diet. This may be an effect of bias due to small sample size, or may reflect annual fluctuations in the proportion of coho and chum in SRKW diet.

The detection of steelhead and lingcod in the summer diet supplements existing data indicating that these species were only part of the diet in winter months (Jan-Mar; Van Cise et al. (2024)). Similarly, sockeye salmon have previously been identified in the diet of SRKW in February and September; here we observe sockeye salmon in samples collected in July and November. Arrowtooth, previously observed in SRKW diet primarily in October, was observed in this sample set in samples collected in May, June, and August. These findings expand our understanding of SRKW seasonal foraging ecology, suggesting that a number of non-salmonid species might be consumed opportunistically throughout the year.

The most surprising observed species in this sample set is the great white shark, which is a species that is not known to inhabit the inland waters of the Salish Sea. In order to confirmed this observation we conducted a search of NCBI Genbank, and international database of genetic sequence data, and found that our sequence is a 100% DNA match to existing great white shark species, with no alternative matches. The sample was collected on September 8, 2022 from a large group of SRKW comprising both J and K pod individuals located off San Juan Island, from and individual that was genetically determined to be J40. J pod was near the San Juan Islands on September 6 and 7, and near the Fraser River on September 5. However, K pod was along the outer coast of Washington until the night of September 7, and were acoustically observed to enter the Salish Sea during the night of September 7. Because some matrilines of J pod will sometimes travel with K pod, we don’t know whether J40 was offshore or inland waters in the days before this sample was collected. Similarly, because we don’t know gut transit time for killer whales, we don’t have an accurate estimate for when the shark may have been eaten.

The observation of great white shark in the SRKW diet has a number of implications. SRKW have previously been observed to eat skate (Hanson et al. 2021; Van Cise et al. 2024), and dogfish sharks (Ford et al. 2016), but no white shark (i.e. mako sharks, salmon sharks, porbeagle sharks, and great white sharks) has ever been observed in the diet of these animals. We know from morphological observations that offshore killer whales, which are thought to primarily eat sharks, suffer significant wearing and damage to their teeth over their lifetimes. The lack of similar wear in SRKW teeth suggests that shark is a rare prey item - it is possible that this predation event was an anomaly, or that it was precipitated by climate change (northward shifting prey distributions), lack of their regular prey base, or both. On a broader scale, annual or secular trends in diet diversity may reflect shifts in foraging ecology reflecting a shifting and deteriorating prey base; additional sequencing of historical and contemporary samples will be important to testing this hypothesis.

We also detected small amounts of Alaska polluck and swordspine rockfish. Both of these fishes were detected in relatively small proportions, and are known prey of SRKW prey items (e.g. salmonids, flounder, halibut, sablefish), therefore we hypothesize that we are observing these species as secondary prey items (i.e. fishes that were eaten by fishes eaten by killer whales).

Recent studies (Hanson et al. 2021, Van Cise et a. 2024) and this pilot study illustrate that genetic metabarcoding of SRKW fecal samples continues to provide new and valuable insight into seasonal variation in diet and foraging ecology. While this is certainly true for winter months, for which we have very few samples, increasing our observational power during summer months has also proven valuable. Our results not only expand our understanding of opportunistic foraging throughout the year in this population, but also add new species to the diet of this population, and may indicate that SRKW are expanding their diet in response to shifts in their prey base.

The sparsity of diet samples over time, space, and social groups makes it so that any new samples enrich our understanding of SRKW foraging ecology in multiple dimensions. Future work by Wild Orca and the San Diego Zoo Wildlife Alliance to sequence historical and contemporary samples, in collaboartion with NOAAs ongoing diet monitoring, promises to provide insight on forgaing ecology that will inform the management of the endangered SRKW population.

References

Ford, Michael J., Jennifer Hempelmann, M. Bradley Hanson, Katherine L. Ayres, Robin W. Baird, Candice K. Emmons, Jessica I. Lundin, Gregory S. Schorr, Samuel K. Wasser, and Linda K. Park. 2016. “Estimation of a Killer Whale (Orcinus Orca) Population’s Diet Using Sequencing Analysis of DNA from Feces.” PLoS ONE 11 (1): 1–14. https://doi.org/10.1371/journal.pone.0144956.
Hanson, M Bradley, Candice K Emmons, Michael J Ford, Meredith Everett, Kim Parsons, Linda K Park, Jennifer Hempelmann, et al. 2021. “Endangered Predators and Endangered Prey: Seasonal Diet of Southern Resident Killer Whales.” Edited by David Hyrenbach. PLOS ONE 16 (3): e0247031. https://doi.org/10.1371/journal.pone.0247031.
Van Cise, Amy M., M. Bradley Hanson, Candice Emmons, Dan Olsen, Craig O. Matkin, Abigail H. Wells, and Kim M. Parsons. 2024. “Spatial and Seasonal Foraging Patterns Drive Diet Differences Among North Pacific Resident Killer Whale Populations.” Royal Society Open Science 11 (9): rsos240445. https://doi.org/10.1098/rsos.240445.
Wade, Paul R., Randall R. Reeves, and Sarah L. Mesnick. 2012. “Social and Behavioural Factors in Cetacean Responses to Overexploitation: Are Odontocetes Less ‘Resilient’ Than Mysticetes?” Journal of Marine Biology 2012: 1–15. https://doi.org/10.1155/2012/567276.
Wang, Qiong, George M. Garrity, James M. Tiedje, and James R. Cole. 2007. “Naive Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy.” Applied and Environmental Microbiology 73 (16): 5261–67. https://doi.org/10.1128/AEM.00062-07.

Appendix I: Protocol Optimization

Because sample collection and processing differs between Wild Orca and NOAA, it was neccessary to optimize the lab protocol used to generate genetic sequence data. The following tests were conducted:

  1. Analysis of extracted DNA concentration by sample type: wet scoop, swab, or lyophilized (dried).
  2. Analysis of the difference in extracted DNA concentration before and after cleaning using a Zymo Clean and Concentrate kit (Zymo, Inc.).
  3. Analysis of PCR success by DNA concentration.
  4. Analysis of the effect of a post-amplification cleaning step on final DNA concentration, sequencing depth (read count), and resulting analysis of prey species in each sample.

Results

Results of each of these analyses are shown below, and in a poster presented at the UW Summer Research Symposium in August 2024.

Based on these results, the WADE Lab genetic metabarcoding protocol was optimized for WO samples in the following way:

  1. Ideal sample input is 0.1g of lyophilized (dried) sample material. Fecal samples collected using a plastic scoop should be passed through a net or sieve before centrifugation to remove seawater, and/or lyophilized to remove water content. Swab samples had a high rate of failure due to very low sample volume.
  2. All samples failing the amplification step should be cleaned and re-amplified using 2x input volume (up to 10 ul).
  3. Samples that still fail the amplification step may be diluted to a DNA concentration of 12-45 ng/ul to increase likelihood of amplification success.
  4. Post amplification cleaning step should be removed, retaining the cleaning step after indexing and the final gel cleaning for size selection.
  5. Samples with a final DNA concentration too low for detection should not be pooled for sequencing.

Supplemental Figure 1. Top: Extracted DNA concentration by sample type (left) and before and after cleaning (middle), and amplification success by DNA concentration. Bottom: Comparison among duplicated samples with and without a post-amplification cleanup step. Left: DNA concentration (ng/ul) after the post-index cleanup step. Middle: Final read count post-QAQC pipeline. Right: Proportional abundance of prey by species.

Supplemental Figure 1 (top left) shows the difference in extracted DNA concentration across each of three difference sample types collected by Wild Orca. Wet scoops are collected from the sea surface using a sterilized plastic scoop and centrifuged before pouring off as much seawater as possible. Extracted DNA concentrations from this type sample were highly variable, likely due to variation in seawater content. Swabbed samples are a small volume of fecal matter smeared onto a cotton tip, and resulted in low DNA concentrations and a high rate of sample failure. Lyophilized (dried) samples had the highest consistency in producing ideal DNA concentration.

Supplemental Figure 1 (top) also shows the difference in extracted DNA concentration before and after cleaning (middle), indicating a high rate of DNA loss, and the amplification success rate by extracted DNA concentration (right), indicating that samples with ~12-45 ng/ul of extracted DNA had the highest rate of success.

Supplemental Figure 1 (bottom) shows the difference in final DNA concentration (left), read count (middle), and proportional abundance (right) between duplicated samples with and without a post-amplification cleaning step. Although highly variable, post-amplification cleaning on average resulted in a loss of DNA. In 5 of the 8 duplicates, DNA concentration was reduced to near 0 ng/ul (below Qubit detection levels), resulting in a failure to sequence these samples. In the three samples for which both duplicates were successfully sequenced, the proportional abundance of prey species was identical between duplicates.

Sofia Kaiaua presented the intermediate results of this research at the UW Summer Research Symposium following a 10-week internship program. This presentation is included as Supplemental Figure 2.

Supplemental Figure 2. Poster presented by Sofia Kaiaua at the UW Summer Research Symposium in August 2024, detailing intermediate results of genetic metabarcoding protocol optimization.