Introduction

Novel studies have revealed high molecular variability in the azoospermia factor C (AZFc) region among Y chromosomes of different haplogroups (Repping et al. 2006; reviewed by Carvalho and Santos 2005). This region, which spans 3.5 Mb along the long arm of the Y chromosome, contains testis-specific genes involved in male spermatogenesis (Kuroda-Kawaguchi et al. 2001; Jobling and Tyler-Smith 2003). The genomic region has a complex structure full of palindromes and repeated segments prone to internal rearrangements (Kuroda-Kawaguchi et al. 2001; Machev et al. 2004; Repping et al. 2003, 2006). Recombination of the direct repetitions b2/b4 results in complete deletion of AZFc region, causing spermatogenic failure in men (Kuroda-Kawaguchi et al. 2001). Partial deletions also occur frequently (3.8×10−4; Repping et al. 2006), producing a polymorphic Y chromosome (Jobling et al. 1996; Fernandes et al. 2004). A specific and recurrent partial AZFc deletion, called gr/gr, removes 1.6 Mb of the AZFc and it has been identified as a risk factor in spermatogenic failure (Repping et al. 2003; de Llanos et al. 2005; Lynch et al. 2005; Ravel et al. 2006), although this has not been confirmed by other groups (Machev et al. 2004; Hucklenbroich et al. 2005; Carvalho et al. 2006). According to Repping et al. (2003) the D2b lineage, a D haplogroup sub-lineage particularly prevalent in Japan, should result in a risk of spermatogenic failure, because they would have the gr/gr deletion fixed.

Remarkably, Kuroki et al. (1999) found fertile haplogroup D Japanese males with low sperm count. They suggested spermatogenic failure was more likely to occur among haplogroup D than among other Y chromosome types. Our group has previously published an association study of Japanese azoospermic and unknown fertility males (Carvalho et al. 2003) that did not support the findings of Kuroki et al (1999). In contrast with their results, we observed no statistical difference between Y chromosome haplotype frequencies in fertile and infertile males. However, the detailed characterization of the haplogroup D sub-lineages as well as the occurrence of the gr/gr deletion were not established in those samples. The current literature also provides few information about that issue (Repping et al. 2003). We therefore decided to investigate in detail the frequency of gr/gr deletion in Japanese fertile and infertile males. We used our previously published fertile and infertile Japanese males (Carvalho et al. 2003) and typed for the new biallelic markers inside the D branch, in accordance with Jobling and Tyler-Smith (2003). The gr/gr deletion was analyzed in both fertile and infertile groups using the same single-tagged sequence (STS) markers proposed by Repping et al. (2003).

This study shows that all gr/gr deleted chromosomes found in Japanese samples can be classified as haplogroup D2, suggesting a founder effect. The subtype D2b1 was most frequent among the Japanese, both in control and infertile samples. Some haplogroups, D2b2, D*, and D1, were not found in any population group and no significant association of haplogroup D with increased risk of infertility was observed in this study. We were also able to establish the ancestral chromosome that suffered a gr/gr deletion, and propose a new Y chromosome phylogeny for haplogroup D and its derivatives. In summary, we were able to define the frequency of gr/gr deletion in Japanese males and showed that the gr/gr deletion was probably already present among the ancestral Y chromosome population that entered Japan at least 12,000 years ago.

Materials and methods

The Japanese subjects are described elsewhere (Fujisawa et al. 2001; Carvalho et al. 2003). Briefly, we studied 78 infertile males, and 56 males of unknown fertility as control. The infertile males were subdivided into two groups—49 patients were classified as azoospermic (Sertoli cell only, severe hypospermatogenesis, maturation arrest) on the basis of clinical and biopsy data and 29 were classified generally as infertile because they did not provide biopsy data. These samples had previously been tested for the occurrence of AZFa, b, and c deletions (Carvalho et al. 2003). All samples were submitted to molecular analysis of Y chromosome gr/gr deletion in accordance with Repping et al. (2003). First, we screened samples for the presence or absence of the STS markers sY1291 and 50f2/C. Both define the partial deletions commonly found in the Y chromosome, which are gr/gr, b1/b3 (Repping et al. 2003), and b2/b3 (Fernandes et al. 2004; Repping et al. 2004). The polymorphic deletion markers 50f2/C or DYS7/C (Genbank accession number Y07728) and sY1191 lie in the proximally located u3 block. In a second round, the individuals with deletions were tested by STS typing: sY1161, sY1191, sY1206, sY1201, and sY1258. All STSs primers were used at 0.5 μmol L−1 concentration for PCR. To avoid false negative results we performed all STS PCR assays in duplex format containing primers for the Yp marker SRY+465 (Shinka et al. 1999) at 0.25 μmol L−1. Results were always repeated to confirmation. The primer sequences for STSs are available from GenBank under the accession numbers: sY1161, G66148; sY1191, G73809; sY1291, G72340; sY1206, G68331; sY1201, G67170; and sY1258, G75479. The procedure for detection of the 50f2/C polymorphic deletion has been described elsewhere (Jobling et al. 1996). The PCR products were resolved in 6% polyacrylamide gels and silver-stained in accordance with Santos et al. (1996).

All samples had previously been assigned to the Y chromosome haplogroups (Carvalho et al. 2003). In this study haplogroup D (defined by the presence of the Alu insertion, YAP+) samples were also typed by minisequencing (Carvalho and Pena 2005) using the single nucleotide polymorphisms (SNPs) M174, M64, M116.1, M151, and M125. We used the nomenclature proposed by Jobling and Tyler-Smith (2003). Other insertion/deletion polymorphisms M15 and 12f2.2 were typed using 6% polyacrylamide gel as described above. The procedure for analysis of the 12f2.2 marker is described elsewhere (Rosser et al. 2000). The primers used to detect the M15 insertion/deletion and other SNP markers are reported in the supplementary information. Statistical tests were performed using Arlequin ver. 2.0 software (Schneider et al. 2000). Specifically, we used an exact test of population differentiation to verify whether haplogroup frequencies were statistically different among groups. We also used AMOVA (population genetic structure inferred by analysis of variance) to quantify the distribution of variation between and within fertile and infertile groups. We used the PS power and sample-size program of Dupont and Plummer (Version 1.0.17; freely available at: http://www.mc.vanderbilt.edu/prevmed/ps.htm) for sample-size estimation.

Results

Among 134 Japanese males, 41 had gr/gr deleted Y chromosomes: 19 control, 11 azoospermic, and 11 infertile males without biopsy data (Table 1). Using an exact test of population differentiation, the presence of gr/gr deletions was compared among azoospermic, infertile without biopsy data, and control males (P=0.26235±0.01458). The exact test revealed no statistically significant difference at the 5% level for all groups.

Table 1 Y chromosome deletions detected in Japanese infertile (with and without biopsy data) and control males. Infertile samples with the biopsy data are described in detail

We also observed other kinds of partial AZFc deletion in both groups. We did not use sY1197 and sY142 (Repping et al. 2003, 2004) to identify b2/b3 and b1/b3 deletions, but analysis of the Y chromosome map indicates that sY1258 can provide the same information. We identified four b2/b3-deleted samples, two in the control and two in the infertile group (Table 1). All were typed as haplogroup N, which have the b2/b3 deletion fixed (Fernandes et al. 2004; Repping et al. 2004). The b1/b3 deletion was detected in one control sample (Table 1). None of these is believed to be associated with spermatogenic failure.

Interestingly, a new type of deletion was detected in this study and is marked by the absence of STS sY1191, 50f2/C, and sY1258. This deletion has not previously been observed when using these STS markers. We detected two control samples with this new deletion, both typed as Y chromosome haplogroup C and bearing an atypical αhI polymorphism type (Carvalho et al. 2003). Using several probes annealing to Yp and Yq (including 50f2), Jobling et al. (1996) reported that at least two different deletion events, both involving the absence of 50f2/C band, could be observed for haplogroup C chromosomes. The most frequent one (nine out of ten deleted individuals) was called “large”, because it co-deleted bands from other probes present at Yq11. The 50f2/C “large” deletion has been detected in several populations, including Mongolian and Japanese. We therefore speculate that the 50f2/C “large” polymorphism found in haplogroup C males using southern blot (Jobling et al. 1996) is probably the same as the deletion detected by us using STS sY1191, 50f2/C, and sY1258.

Analysis of gr/gr deleted Y chromosomes revealed that all were classified as haplogroup D. The biallelic markers M15, M174, M116.1, M64, M125, M151, and 12f2.2 were used to resolve the D branch in both control and infertile samples. The haplotyping results can be seen in Table 2 and in the phylogenetic tree in Fig. 1. We observed that all Japanese D males (n=41) also belong to the lineage D2 (M174 and M64-derived alleles and gr/gr deletion). According to our haplotyping results (Fig. 1), the D2 lineage could be further divided into D2b* (defined as derived allele at M116.1 and ancestral at M125, 12f2.2, and M151), D2b1 (defined as derived alleles at M116.1, M125, and 12f2.2 and ancestral at M151) ,and D2b2 (defined as derived alleles at M116.1 and M151 and ancestral at M125 and 12f2.2). The frequency was highest for D2b1, both in control (53%) and infertile samples (73%), and D2b* was present in 37% of control individuals and in 18% of infertile males. The frequency of D2* (defined as derived at M64, and ancestral at M116.1, M151, M125, and 12f2.2) was about 9–10% in both groups. Haplogroups D2b2, D*, and D1 were not found in any population group (Fig. 1). The haplogroup frequencies were compared among control, azoospermic, and infertile males without biopsy data using the exact test of population differentiation (P=0.39550±0.01292). The test revealed no statistically significant differences at the 5% level for all group comparisons. The AMOVA results also indicated that 100% of the variation was found within the population, i.e. no differentiation could be attributed to a comparison of Y the chromosomes of control and infertile patients. That we did not find an association between the presence of the gr/gr deletion in Y chromosomes and the infertile phenotype in Japanese males might be because of the small sample size of infertile males. To achieve a significance level of 0.05 with a statistical power of 80%, a sample size of over 1,274 well-characterized patients would have been required. Replication studies in larger cohorts, and functional studies, are needed.

Table 2 Frequency distribution of infertile and control samples from the D branch. All samples are gr/gr deleted
Fig. 1
figure 1

Detailed D branch phylogeny of the human Y chromosome tree. The biallelic markers specified were tested in all control and infertile males. a D branch according to Jobling and Tyler-Smith (2003)*. b D branch proposed by this study. The markers altered are highlighted in the figure. The designations for the branches are based on Jobling and Tyler-Smith (2003). *Although we have not tested markers to detect the D2a haplogroup specifically, we depict it here to show the discrepancies between the previous published YCC tree and the most likely tree according to our data

To define when the gr/gr deletion event occurred along the D lineage, we analyzed all haplogroup D subdivisions. A haplogroup D1 Mongolian male (M15 derived) does not have the gr/gr deletion, suggesting that the deletion appears only in chromosomes of the M64-derived branch, the origin of the D2 lineage. On the basis of these results, a new organization of the D branch was proposed (Fig. 1). This also changes the current phylogeny (The Y Chromosome Consortium 2002; Jobling and Tyler-Smith 2003) concerning the position of the 12f2.2 mutation. According to our analysis of haplogroup D and its derived sublineages, the 12f2.2 deletion occurred more recently in an ancestral chromosome of the D2b1 lineage. We have also used a sample (YCC-76) from the Y Chromosome Consortium, which has been re-classified in the new tree (Fig. 1). Although we have not typed P42, this sample has the derived state in the marker M125, which defines the lineage D2b1, the most common in Japan. This confirms a previous doubt about the haplogroup classification of YCC-76 male that should bear a derived state in M125 (F.R. Santos, personal data).

Discussion

Although this work showed that the gr/gr deletion is fixed in D2b haplogroup, as suggested by Repping et al. (2003), we also showed it occurred in an older ancestral Y chromosome lineage, which is D2. Approximately 83–88% of D Japanese males can be classified into clade D2b and 100% into D2 (Table 2). The D2 chromosomes are mainly (if not exclusively) found in Japanese males, in which its frequency of occurrence is high. A hypothetical selection against D2b gr/gr deleted chromosomes (Repping et al. 2003) can hardly explain the high frequency of this chromosome in Japan. As proposed by Underhill et al. (2001) and Hammer et al. (2006), genetic drift is the most likely factor underlying the distribution of haplogroup D (including derived ones) in Asia. Molecular studies have proposed that haplogroup D entered Japan approximately 12,000–20,000 years ago from central Asia (Hammer et al. 2006; Katoh et al. 2005), probably brought by the Jomon, the first colonizers of the Japanese islands. Among the ancestral Jomon Y chromosomes there was probably a D2 haplogroup carrier. The gr/gr deletion present in D2 chromosomes could have originated in an ancestral male before or just after the migration to Japan. Thus, it seems the gr/gr deletion found in D2 chromosomes has not restricted the spread of the chromosome. D2 chromosome frequencies are highly variable, depending on the island or the population analyzed (Tajima et al. 2002): Ainu (from Hokkaido: 87.5%), Honshu (36.6%), Kyushu (28%), and Okinawa (55.6%). D2 is, however, the most representative haplogroup found in Japan.

Other clues also indicate that the sub-lineage D2 (defined as M64-derived and the gr/gr deletion) is found exclusively in Japanese males. The D clade is rare in Asian populations other than in Japan and Tibet (Underhill et al. 2000, 2001; Tajima et al. 2002; Jin et al. 2003; Deng et al. 2004; Katoh et al. 2005; Hammer et al. 2006). Available data also indicate that D*(×D2), but not D2, occurs in Tibet (Deng et al. 2004; Hammer et al. 2006). The population differences found in Japan can be attributed to another important migration wave which occurred 2,300 years ago, bringing the Yayoi people from the Korean peninsula, carrying Y lineages like O, but not D lineages. The admixture hypothesis proposed by Hammer and Horai (1995) suggested that modern Japanese people are derived from both Jomon and Yayoi people. The haplotype differences observed among the islands would probably reflect the impact of this later migration. Thus, even after the more recent arrival of Yayoi people, who also brought the language spoken today, D2 Y chromosomes still remain in the Japanese population at high frequency, despite the occurrence of intensive gene flow bringing non-gr/gr deleted Y-chromosomes. We suggest that possible differentiation observed among Japanese populations, or fertile/infertile comparisons, can be more easily explained by drift, population substructure, and differential gene flow. Taken together, our results suggest a lack of significant evidence associating the presence of the gr/gr deletion in Y chromosomes and the infertile phenotype in Japanese males.

Unexpectedly we discovered a different organization of the D2 sub-lineage tree. The previous tree suggests the 12f2.2 deletion as a marker of the D2 lineage; we showed here this is not correct. The Y Chromosome Consortium (2002) organized the phylogenetic Y chromosome tree using information provided by several research groups, many times using different markers. Thus, some nodes in the tree are not well resolved, so fine scale revision is required, as we did for D2 (Fig. 1). It is important to re-consider the haplotyping of the YCC sample panel in the light of the new revisions, as we did for the YCC-76 sample (Fig. 1). It was re-classified as D2b1 haplogroup instead of D2a, as previously reported. All related markers should be retyped, however, to confirm the actual shape of the tree around the D haplogroup.