For ethical and logistical reasons, population-genetic studies of parasites often rely on the non-invasive sampling of offspring shed from their definitive hosts. However, if the sampled offspring are naturally derived from a small number of parents, then the strong family structure can result in biased population-level estimates of genetic parameters, particularly if reproductive output is skewed. Here, we document and correct for the strong family structure present within schistosome offspring (miracidia) that were collected non-invasively from humans in western Kenya. By genotyping 2,424 miracidia from 12 patients at 12 microsatellite loci and using a sibship clustering program, we found that the samples contained large numbers of siblings. Furthermore, reproductive success of the breeding schistosomes was skewed, creating differential representation of each family in the offspring pool. After removing the family structure with an iterative jacknifing procedure, we demonstrated that the presence of relatives led to inflated estimates of genetic differentiation and linkage disequilibrium, and downwardly-biased estimates of inbreeding coefficients (F IS). For example, correcting for family structure yielded estimates of F ST among patients that were 27 times lower than estimates from the uncorrected samples. These biased estimates would cause one to draw false conclusions regarding these parameters in the adult population. We also found from our analyses that estimates of the number of full sibling families and other genetic parameters of samples of miracidia were highly intercorrelated but are not correlated with estimates of worm burden obtained via egg counting (Kato-Katz). Whether genetic methods or the traditional Kato-Katz estimator provide a better estimate of actual number of adult worms remains to be seen. This study illustrates that family structure must be explicitly accounted for when using offspring samples to estimate the genetic parameters of adult parasite populations.
Genetic epidemiology uses genetic data to uncover patterns of disease processes. To acquire data for these analyses, individual pathogens are collected and scored at genetic markers, and the resultant data are analyzed to infer biological patterns about the pathogen populations. In lieu of invasive sampling of adult pathogens in humans, researchers have relied on non-invasive sampling of parasite offspring (often shed in fecal samples). One potential problem with this approach is that analyses using the offspring data will be biased because many of the offspring are related and family sizes are likely to be unequal. We show that this sampling issue is relevant in a natural transmission zone in western Kenya and that it yields biases in three important parameters: genetic differentiation, inbreeding coefficients, and estimates of the amount of non-random association between loci (linkage disequilibrium). We also develop a method to remove these biases by removing the sibling structure present in the dataset. Finally, we suggest that our measure of family number, as well as other genetic measures, may be useful measures of the worm burdens in patients.