UNIVERSITY PARK, Pa. — Newly generated, complete “end-to-end” reference genomes for the sex chromosomes of five great ape species and one lesser ape species — produced by an international collaborative team led by researchers at Penn State, the National Human Genome Research Institute and the University of Washington — highlight extremely rapid changes on the male-specific Y chromosome among ape species. These findings shed light on the evolution of sex chromosomes and inform understanding of diseases related to genes on these chromosomes in both apes and humans.
“The Y chromosome is important for human fertility, and the X chromosome harbors genes critical for reproduction, cognition and immunity,” said Kateryna Makova, Verne M. Willaman Chair of Life Sciences, professor of biology at Penn State and leader of the research team. “Our study opens doors for many future investigations of sex chromosomes, how they evolved and diseases associated with them. The living non-human great ape species we studied are all endangered. The availability of their complete sex chromosome sequences will facilitate studies of their sex-specific dispersal in the wild and of their genes important for reproduction and fertility.”
Such reference genomes act as a representative example that are useful for future studies of these species. The team found that, compared to the X chromosome, the Y chromosome varies greatly across ape species and harbors many species-specific sequences. However, it is still subject to purifying natural selection — an evolutionary force that protects its genetic information by removing harmful mutations.
The new study appeared today (May 29) in the journal Nature.
"Researchers sequenced the human genome in 2001, but it wasn’t actually complete,” Makova said. “The technology available at the time meant that certain gaps weren’t filled in until a renewed effort led by the Telomere-to-Telomere, or T2T, Consortium in 2022-23. We leveraged the experimental and computational methods developed by the Human T2T Consortium to determine the complete sequences for the sex chromosomes of our closest living relatives—great apes.”
The team produced complete sex chromosome sequences for five species of great apes — chimpanzee, bonobo, gorilla, Bornean orangutan and Sumatran orangutan, which comprise most great ape species living today — as well as a lesser ape, siamang. They generated sequences for one individual of each species. The resulting reference genomes act as a map of genes and other chromosomal regions, which can help researchers sequence and assemble the genomes of other individuals of that species. Previous sex chromosome sequences for these species were incomplete or — for the Bornean orangutan and siamang — did not exist.
“The Y chromosome has been challenging to sequence because it contains many repetitive regions, and, because traditional short-read sequencing technology decodes sequences in short bursts, it is difficult to put the resulting segments in the correct order,” said Karol Pál, postdoctoral researcher at Penn State and a co-first author of the study. “T2T methods use long-read sequencing technologies that overcome this challenge. Combined with advances in computational analysis, on which we collaborated with Adam Phillippy’s group at the NHGRI, this allowed us to completely resolve repetitive regions that were previously difficult to sequence and assemble. By comparing the X and Y chromosomes to each other and among species, including to the previously generated human T2T sequences of the X and the Y, we learned many new things about their evolution.”
High variability on the Y chromosome
“Sex chromosomes started like any other chromosome pair, but the Y has been unique in accumulating many deletions, other mutations and repetitive elements because it does not exchange genetic information with other chromosomes over most of its length,” said Makova, who is also the director of the Center for Medical Genomics at Penn State.
As a result, across the six ape species, the research team found that the Y chromosome was much more variable than the X over a variety of characteristics, including size. Among the studied apes, the X chromosome ranges in size from 154 million letters of the ACTG alphabet — representing the nucleotides that make up DNA — in chimpanzee and human to 178 million letters in gorilla. In contrast, the Y chromosome ranges from 30 million DNA letters in siamang to 68 million letters in Sumatran orangutan.
The amount of DNA sequence shared between species was also more variable on the Y. For example, about 98% of the X chromosome aligns between human and chimpanzee, but only about a third of the Y aligns between them. The researchers found that this is in part because the Y chromosome is more likely to be rearranged or have portions of its genetic material duplicated.
Additionally, the percentage of the chromosome occupied by sequences that are repeated is highly variable on the Y. Whereas, depending on the species, 62% to 66% of the X chromosomes are occupied by repetitive elements, 71% to 85% of the Y chromosomes are occupied by them. These percentages are higher on both the X and the Y than in other chromosomes in the human genome.
How the Y has survived
“We found the ape Y to be shrinking, accumulating many mutations and repeats, and losing genes,” Makova said. “So why hasn’t the Y chromosome disappeared, as some previous hypotheses suggested? In collaboration with Sergei Kosakovsky Pond from Temple University and others, we found that the Y chromosome still has a number of genes evolving under purifying selection — a type of natural selection that keeps gene sequences intact. Many of these genes are important for spermatogenesis. This means that the Y chromosome is unlikely to disappear any time soon.”
The researchers found that many genes on the Y chromosome seem to use two strategies to survive. The first takes advantage of genetic redundancy — the presence of multiple copies of the same gene on a chromosome — so that intact copies of the gene can compensate for copies that might acquire mutations. The team quantified this genetic redundancy by completing the landscape of multi-copy gene families on ape sex chromosomes for the first time.
The second survival strategy takes advantage of palindromes, where the sequence of letters in the DNA alphabet is followed by the same, but inverted sequence, for example, ACTG-GTCA. When located within a palindrome, genes benefit from the palindrome’s ability to correct mutations.
“We found that the Y chromosome can exchange genetic information with itself between the repeated sequences of the two palindrome arms, which fold so that the inverted sequences align,” Pál said. “When two copies of the same gene are located within palindromes, and one copy is hit by a mutation, the mutation can be rescued by the genetic exchange with another copy. This can compensate for the Y’s lack of genetic information exchange with the other chromosomes.”
The research team obtained the complete sequences of palindromes on ape sex chromosomes also for the first time, as they were previously difficult to sequence and study. They found that palindromes are particularly abundant and long on the ape Y chromosome, yet they are usually only shared among closely related species.