The structure of human prions

A.I.G.S.S. Association



The structure of human prions

Written by: Claudia Y. Acevedo-Morantes and Holger Wille -Department of Biochemistry and Centre for Prions and Protein Folding Diseases, University of Alberta, Edmonton, AB T6G 2M8, Canada - Eric O. Freed -20 October 2014

1. Introduction
The prion protein can exist in multiple isoforms, predominantly the cellular, non-infectious PrPC and the disease-causing PrPSc. The prion protein is most highly expressed in the central nervous system (CNS), but it can be found in other tissues and cell types as well. The misfolded PrPSc causes the prion diseases, also known as transmissible spongiform encephalopathies (TSEs), a group of progressive, neurodegenerative disorders in humans that include kuru, Creutzfeldt-Jakob disease (CJD), Gerstmann-Sträussler-Scheinker (GSS) syndrome, and fatal familial insomnia (FFI). PrPSc is also responsible for chronic wasting disease (CWD) in cervids; bovine spongiform encephalopathy (BSE) in cattle, and its analogues in antelopes and wild felids; scrapie in sheep, mufflons, and goats; transmissible mink encephalopathy (TME) in ranch-reared mink; and feline spongiform encephalopathy (FSE) in domestic cats.

2. The Human Prion Protein Gene (PRNP)
The human PRNP gene is located on the short arm of chromosome 20 between the end of this arm and the position 12 (p12-pter). The structure of the prion gene for all species of mammals studied to date contains three exons. The open reading frame (ORF) lies entirely within exon 3 and transcribes an mRNA of 2.1–2.5 kb in length, with approximately 50 copies/cell in neurons. To date, there are no reported prion disease-associated mutations in either exon 1 or 2, or within any of the introns. Codons 51 to 91 of the PRNP gene encode a nonapeptide (PQGGGGWGQ) followed by four octapeptide repeats (PHGGGWGQ), which are almost identical to the nonapeptide except for the omission of a glycine and the presence of a histidine instead of a glutamine at the second position. A variety of insertion mutations have been found in the octapeptide repeat region. These insertions so far comprise one to nine extra octapeptide repeats. Raman spectroscopy studies have demonstrated that the binding of copper ions (Cu2+) to the octapeptide repeats in PrP is coordinated by histidine residues, suggesting that this region may be involved in the transport of extracellular Cu2+ ions to an endosomal compartment. Mutations in the PRNP gene can cause the development of prion diseases and lead to different clinical phenotypes, including CJD, GSS and FFI. Mutations of the prion protein gene can be classified as: (1) point mutations (i.e., single nucleotide substitutions), which can cause an amino change (missense mutation), can be silent (do not cause alteration in the amino acid sequence), or less commonly can cause the coding to prematurely terminate (stop or nonsense mutation); and (2) insertions and deletions, which are associated with prion diseases.

3. Evolutionary Origins of PRNP
Recently it has been proposed that the emergence of the prion protein founder gene was based on two genomic rearrangements that occurred hundreds of millions of years ago. The first event occurred within the N-terminal ectodomain of a ZIP predecessor gene, when metazoans first emerged on the planet. As a consequence of this rearrangement a cysteine-flanked core (CFC) region was generated within this ectodomain. The second event, which allowed the creation of the prion protein founder gene to be traced back, occurred approximately a half-billion years ago (around the Cambrian explosion). This later event resulted in the apparent genomic retro-insertion of a spliced and C-terminally truncated ZIP gene transcript, indicating that the prion protein founder gene is equivalent to a retrogene.

4. Polymorphisms and Mutations in the PRNP Gene ORF
More than 30 mutations in the PRNP gene have been linked to inherited prion diseases, including CJD, GSS, and FFI. In addition, several common polymorphisms (variations) were identified in the PRNP gene as well (Figure 1). Mutations can lead to a change of a single amino acid in PrP, insert additional residues, or cause an abnormally short version of PrP to be expressed. These mutations affect the primary structure of PrP, may lead to changes in the secondary and tertiary structure, and could result in the emergence of PrPSc conformers. In contrast, polymorphisms do not cause prion disease, but may influence a person’s risk of developing a prion disease. The most common and best-studied polymorphism in human PrP occurs at codon 129 and acts as a predisposing factor for sporadic, iatrogenic, and variant CJD. Either methionine (M) or valine (V) is encoded at this position. M/M homozygosity at this position appears to be associated with an earlier age of onset and rapidly progressive dementia; whereas a more prolonged course with an ataxic onset is most often associated with the V/V allele. It is important to highlight that the allele frequencies at codon 129 differ across populations, for example compared with Caucasian frequencies of M (0.66): V (0.34), the allelic frequencies reported in Japan are M (0.96): V (0.04), and although the incidence of disease has not been reported to be higher in Japan, it may predispose an individual to develop a particular clinical phenotype.

Figure 1. Schematic representation of human PrP mutations and polymorphisms. The 0.76 kb ORF of the PRNP gene encodes a 253 amino acid protein, PrPC. Human PrP consist of a cleaved signal peptide (1–22), an octapeptide repeat-containing unfolded domain (OR, 51–91), three α-helices (H1, H2, and H3), one small, antiparallel β-sheet (β1 and β2), and a GPI-anchor signal (232–253). The N-terminal octapeptide repeat motif is comprised of eight residues: P(H/Q) GGG(-/G) WGQ. Normal PrP contains five copies of this motif; a single OR deletion is considered a non-pathogenic polymorphism. However, insertional mutations consisting of one to nine additional OR are pathogenic. Polymorphisms and pathogenic mutations of the PRNP gene are represented above and below the schematic, respectively. Letters preceding the numbers indicate the normal amino acid residue for the position and letters following the numbers designate the new residue. An earlier investigation using a quantitative interactome analysis identified ZIP6 and ZIP10 as potential molecular interactors of PrP at the cell surface, revealing that the known prion proteins are structurally related to an extracellular domain of ZIP6 and ZIP10. Similarly, other authors using interactome analyses have identified multiple interactions between PrPC and other molecules, like the neural cell adhesion molecule (NCAM), the laminin receptor precursor, Na/K ATPases and protein disulfide isomerases (PDI). Together, these findings suggest that PrPC organizes its molecular environment by binding adhesion molecules, which in turn recognize oligomannosebearing membrane proteins. While M or V homozygosity at codon 129 in human PrP results in a predisposition to sporadic or iatrogenic CJD, the glutamate (E) to lysine (K) substitution at codon 219 appears to have a protective effect against s-CJD. The E219K polymorphism has been reported to occur in the Japanese population with an allelic frequency of 6%. This polymorphism was also reported on the same allele as the P102L mutation (see below) in a Japanese family in which dementia rather than ataxia was prominent and cerebellar plaque pathology was less prominent compared with cases of the P102L mutation alone. Recent studies in knock-in mice have shown

4.1. Mutations Associated with GSS This subtype of prion disease is very rare and associated with autosomal-dominant inheritance. The clinical features include ataxia of gait and/or dysarthria, dementia, spastic paraparesis, and variable degrees of pyramidal and extrapyramidal symptoms. The incidence is not clear but was estimated to be 1 to 10 per 100 million. GSS has been associated with the following missense mutations: P102L; P105L; A117V; F198S; D202N, Q212P; Q217R, and in some cases of octapeptide repeat insertions (OPRI), especially those with larger inserts (approximately 192 bp).

4.2. Mutations Associated with CJD
LCJD is another neurodegenerative disorder characterized by failing memory, behavioral changes, lack of coordination, and visual disturbances. As the disease progresses, mental deterioration becomes more pronounced, jerky movements, blindness, and coma may occur. This disease leads to death usually within 1 year of onset of illness. There are three major categories of CJD:

  • Sporadic CJD (s-CJD): the most common type of CJD, accounting for at least 85% of cases. The disease manifests even though the person has no known risk factors for the disease (e.g., no mutation in the PRNP gene)
  • Familial CJD (f-CJD): accounting for about 5% to 10% of cases of CJD. The individual has a family history of the disease and/or tests positive for a PRNP mutation
  • Iatrogenic CJD (i-CJD): accounting for less than 1% of CJD cases. This form of CJD is transmitted by exposure to brain or tissue from an infected person, usually through a medical procedure, such as a blood transfusion or dura mater transplant.

f-CJD has been associated with the following missense mutations: D178N, V180I, T188A, E196K , E200K, V203I, R208H, V210I, E211Q, M232R; and one to nine octapeptide inserts.

4.3. Mutations Associated with FFI
Fatal familial insomnia (FFI) is a rare genetic sleep disorder, diagnosed in less than 40 families worldwide. This disease is caused by the missense mutation D178N, but it can also be developed spontaneously in patients with a sporadic variant called sporadic fatal insomnia (s-FI). The clinical profile includes intractable insomnia that may be observed for several months prior to the obvious onset of disease, which may include dysautonomia, ataxia, variable pyramidal and extrapyramidal signs and symptoms, and relatively spared cognitive function until late in the course. The average survival for FFI patients after the onset of symptoms is 18 months.

5. Biochemistry and Structure of Prion Proteins
Dalla scoperta dei prioni, sono stati fatti molti tentativi per chiarire le differenze strutturali tra PrPC e PrPSc e la conversione patogena che porta alla formazione di PrPSc.
Tecniche biofisiche e biochimiche a bassa risoluzione sono state utilizzate da molti gruppi di ricerca per ottenere approfondimenti sulle conformazioni sia di PrPC che di PrPSc, tuttavia questi approcci presentano limitazioni, come verrà discusso di seguito. PrPC e PrPSc differiscono per quanto riguarda solubilità, tendenza alla formazione di fibrille e resistenza alla proteinasi K.

Il PrPC è una proteina solubile con un'alta suscettibilità alla digestione proteolitica, mentre il PrPSc è caratterizzato dalla sua insolubilità nei detergenti e dalla parziale resistenza alla proteolisi nella sua forma aggregata. Queste differenze sembrano essere basate sulle loro differenze strutturali, poiché studi precedenti basati sul dicroismo circolare e la spettroscopia infrarossa hanno dimostrato che il PrPC è dominato dalle eliche α (42%) e ha solo una piccola parte del contenuto di fogli β (3%), mentre PrPSc contiene prevalentemente fogli β (> 43%). Il PrPSc troncato N-terminale, designato PrP 27–30 ha un contenuto di fogli β superiore (> 54%). Contrariamente al PrP 27–30, che polimerizzava in amiloidi a forma di bastoncino, PrPC non forma aggregati rilevabili mediante microscopia elettronica.

5.1. Structure of PrPC
The prion protein exists in multiple conformations and its cellular isoform, PrPC, which is found in healthy organisms, is among the most extensively studied proteins. In humans, the newly synthesized and unprocessed PrPC is approximately 253 amino acids in length and has a molecular weight of 35–36 kDa (Figure 2). PrPC is GPI-anchored and resides in rafts at the cell membrane. After ~5 h at the cell surface (its average half-life) it is internalized through a caveolae-dependent mechanism and is degraded in the end lysosome compartment. It has been speculated that the PrPC conversion to PrPSc may occur in caveolae-like domains.

Figure 2. Organization of human PrP. The unprocessed PrP is 253 amino acid residues in length and includes a signal peptide (1–22), four OR, a hydrophobic region (113–135), one disulphide bond between cysteine residues (179–214), two N-linked glycosylation sites (at residues 181 and 197), and a GPI-anchor attached to the C-terminus of PrP replacing the GPI-anchor signal (residues 232 to 254). The four OR in the N-terminal domain have a high affinity for copper ions (Cu2+), while a preceding nonapeptide (PQGGGGWGQ) lacks the histidine that is necessary to bind a Cu2+ ion. Mutated forms of PrP can contain insertions of one to nine additional OR or a deletion of one OR. A palindromic region, AGAAAAGA (113–120), lies in the hydrophobic region (113–135) and is thought to be important in the conversion of PrPC to PrPSc. OR: Octapeptide repeat; GPI: glycophosphatidylinositol; PK: proteinase K; CHO: carbohydrates. The structure of PrPC could not be studied using NMR spectroscopy or X-ray crystallography due to the low yield of protein that is obtained during the purification process. This limitation was overcome by using recombinant PrP (rec PrP) as a surrogate for PrPC, since NMR studies have shown that rec PrP appears to have the same molecular architecture as PrPC. A first attempt to predict the three-dimensional structure of PrPC was based on circular dichroism and infrared spectroscopy measurements, proposing a X-bundle α-helical structure. Later the groups of Wüthrich and James determined the structure of the C-terminal, folded domain of PrP (amino acids 121–231) by NMR spectroscopy. For all species analyzed, the structure of rec PrP consists of three α-helices (amino acids 144–154, 175–193 and 200–219) and a small antiparallel β-sheet (amino acids 128– 131 and 161–164). It has an amino-terminal signal peptide (amino acids 1–23), a more or less flexible N-terminal domain (amino acids 23–120), a folded C-terminal domain (amino acids 121– 231), and a signal peptide for membrane attachment via a glycophosphatidylinositol (GPI) anchor (amino acids 232–254). It contains a single disulfide bond linking the cysteine residues at positions 179 and 214, connecting helices 2 and 3 and two N-linked glycosylation sites at asparagines 181 and 197. The N-terminal domain contains an eight amino acid repetitive motif comprised of the residues PHGGGWGQ, which has a high affinity for copper ions (Cu2+). Therefore, it has been hypothesized that PrPC is involved in copper metabolism, but its ancestral relationship to the ZIP proteins also suggests functional interactions with zinc ions (Zn2+). The latter assumption is supported by recent findings from Glenn Millhauser’s group suggesting that electrostatic interactions stabilize the PrP-Zn2+ complex. These results indicate that Zn2+ drives a tertiary structure contact between the Zn2+-bound octapeptide and the C-terminal domain surface that includes many of the PrP mutations that give rise to familial prion diseases. Other research findings have suggested that the octapeptide repeats and the N-terminal polybasic region in PrPC mediated the zinc influx into neuronal cells via α-amino-3-hydroxy-5-methyl-4-isoxazolepropionate (AMPA) receptors, with PrPC acting as a zinc sensor and the AMPA receptor as a zinc transporter. These results also suggest that PrP-mediated zinc uptake may contribute to neurodegeneration in prion and other neurodegenerative diseases. A hydrophobic sequence in the middle of the protein (amino acids 113–135) may serve as a transmembrane domain in some prion protein isoforms. Within this sequence stretch lies a palindromic region (113–120) consisting of an alanine-rich tract of amino acids, AGAAAAGA, which is highly conserved across a wide variety of mammalian species. Previous studies targeting this region have shown that deletion of this palindrome prevents conversion of both mutant and co-expressed wild-type (wt) PrP. Moreover, peptides generated from this palindrome interfere with in vitro formation of PK-resistance. Figure 3 shows a schematic representation of PrPC-bound to the cell membrane.

5.2. Structure of PrPSc
Despite a large interest in determining the structure of PrPSc, little is known about the molecular details of this isoform. The high-resolution structure of PrPSc and its proteolytically truncated variant, PrP 27–30, have evaded experimental determination due to their insolubility and propensity to aggregate. However, it is widely accepted that: (1) during the conversion of PrPC to PrPSc the β- strand content increases substantially; (2) fibrillar assemblies of PrPSc display a typical cross-β sheet architecture, which is characteristic for amyloid; (3) conversion of PrPC to PrPSc increases its proteinase K resistance, although this feature is not obligatory. To date, data generated by biochemical and biophysical methods, such as spectroscopy analysis, electron microscopy, X-ray fiber diffraction, small-angle X-ray scattering, limited proteolysis, hydrogen/deuterium (H/D) exchange, and surface reactivity measurements, have led to several 3D structural models of PrPSc. However, a recent review exposed discrepancies between the experimental data and these models, indicating that all of these models fail to accommodate the experimental data.

Figure 3. Schematic diagram of the structure of PrPC. The carbohydrate moieties that are linked to Asn 181 (down) and Asn 197 (up) are shown in pink. The C-terminal GPI-anchor is shown in green and is extending into the cell membrane in blue and red. OR residues in the N-terminal domain are known to bind copper ions (shown in blue). The composite data file for the structure [90,91] was kindly provided by Dr. Glenn Millhauser (University of California, Santa Cruz) and includes coordinates for the carbohydrates, the GPI-anchor, and the membrane that were provided by Dr. Valerie Daggett (University of Washington, Seattle).

5.3. Molecular Models for the Structures of PrPSc and PrP 27–30
The first structural model proposed for PrPSc was based on Fourier-transform infrared (FTIR) spectroscopy measurements on PrP 27–30. This model predicted a four-stranded β-sheet covered on one face by two C-terminal α-helices. However, later experimental observations indicated that PrPSc appears not to contain any residual α-helical structure. Downing and Lazo proposed that in PrPSc β-strands would project from an antiparallel, intertwined β-sheet core, which itself spans the height of eight β-strands equivalent to ~38.4 Å. However, this model contradicts the X-ray fiber diffraction data, which indicated a molecular height of ~19.2 Å Furthermore, this unusual model is different from any known protein fold in the PDB database, which makes it difficult to assess its plausibility. Based on electron micrographs of 2D crystals of PrP 27–30 and a redacted miniprion, PrPSc106, Govaerts and colleagues proposed a parallel β-helix model for the core of the infectious conformer. The authors detected a compatibility between the N-terminal region of PrP 27–30 (amino acids 89– 176) and a parallel β-helix structure, and proposed that this region of PrP could get converted into a triangular β-helix containing four turns (four-rung model). Furthermore, they proposed that the β- helix associated with an intact C-terminal α2-α3 bundle, which retained its α-helical structure from the PrPC fold. Individual PrP 27–30 models were then assembled into a trimeric unit (p3 symmetry) by using the sidewall of the β-helix as an interface. The β-helical PrP trimer can then easily be stacked to form amyloid fibrils. As mentioned above, the C-terminal α-helices that are retained in the model are no longer supported by experimental data. Using molecular dynamics (MD) simulations and sequence comparisons, Langedijk and colleagues suggested the next and alternative model. It proposed a reduced number of β-helix rungs per PrPSc molecule going from four to two rungs with eight residues in one edge of the triangle, in contrast to the five residues proposed in the previous model. This model helped to overcome problems with the packing density involving a stretch of the PrP sequence (amino acids 106–126) that is rich in small amino acids (glycine and alanine). However, the drawbacks for this model are: First, this model keeps the C-terminal α- helices, which are no longer supported by experimental data. Second, the reduction in the number of β-helix rungs results in a molecular height of 9.6 Å for each molecule, which contradicts the experimentally observed molecular height of 19.2 Å (corresponding to four β-helix rungs). De Marco and Daggett proposed a “spiral model” where all three α-helices from the original PrP structure were retained, while the number of β-strands was extended to four. Using the structure of rec PrP as a starting point, the conformational fluctuations during its conversion under amyloidogenic conditions (acidic pH) were simulated using molecular dynamics. The resulting model proposed a spiraling core of extended structure, consisting of three short β-strands (spanning amino acids 116–119, 129–132 and 160–164) and the recruitment of a nascent β-strand (amino acids 135–140). The molecular dynamics results suggested that the conformational change occurred in the N-terminal region of the folded core of PrP, while the C-terminal α2 and α3 helices remained intact. In a fibrillar model, which was built up from individual monomers, the β-strands are oriented at angles that are not perpendicular to the fibril axis, which made this model incompatible with the X-ray fiber diffraction results. Similarly, this model contains of high proportion of α-helical structure, which again is inconsistent with recent experimental results. Tattum and colleagues used electron micrographs of amyloid fibrils grown in vitro from rec PrP (amino acids 91–231) to generate a 3D reconstruction. The authors found densities of two intertwined protofilaments with a repeating distance of ~60 Å. The repeating unit along each protofilament suggested a distinct, ladder-shaped fibril structure. Other researchers have supported this ladder-shaped model, by proposing that both the N-terminal region (amino acids 89–143) and the C-terminal region (amino acids 154–227) of the folded core of PrP were prone to conversion into β-structure independently. A common structural trait for these models is the presence of a long unstructured loop connecting N- and C-terminal β-helical domains, which were introduced to fit with the ladder-like appearance of the rec PrP amyloid fibers. However, this ladder-like appearance is exclusive to this particular sample preparation, which has not been reported to generate infectivity. Furthermore, the ladderlike configuration has not been found in PrPSc or PrP 27–30 fibril samples. Cobb and colleagues analyzed rec PrP amyloid using site-directed spin labeling and electron paramagnetic resonance (EPR). Based on interresidue distances (structural constraints), they proposed a parallel in-register β-sheet model, where the rec PrP amyloid consists of β-strands and relatively short turns and/or loops with no residual α-helices. Furthermore, each molecule of PrP contributes only with 4.8 Å to the length of the amyloid fibril. The authors also postulate that the C-terminal region (amino acids 159–219 in mouse PrP) is converted to a hairpin structure and that the fibrils are formed as a long two-layered β-sheet. Although some researchers have adopted this model to explain the structure of PrPSc, this rec PrP amyloid preparation has not been reported to produce infectivity and the model does not correlate with the repeating unit size of 19.2 Å observed by X-ray fiber diffraction experiments on PrPSc and PrP 27–30. Similarly, a recent study using solid-state NMR spectroscopy (ssNMR) to analyze the formation of rec PrP amyloid fibrils that were seeded with PrPSc, proposed that the C-terminal, cysteine-flanked core that contains the α-helices 2 and 3 also adopts a parallel in-register β-sheet architecture. The authors determined the space between labeled isoleucine (Ile-1- 13C), phenylalanine (Phe-1-13C), and leucine (Leu-1-13C) residues in rec PrP amyloid and observed a distance of ~0.5 nm for four Ile residues (182, 184, 203 and 205). Based on this observation, the authors concluded that these residues refold from a α-helical structure into a parallel in-register intermolecular β-sheet. The three Phe residues (140, 174 and 197) showed an intermolecular distance of 0.5–0.6 nm, suggesting that two-thirds of the Phe residues were stacked in-register, but that one third was not. However, these findings do not correlate with the molecular height of 19.2 Å that has been observed in PrPSc and PrP 27–30 amyloid fibrils. Furthermore, it has not been shown to what degree PrPSc-seeded rec PrP amyloid resembles brain-derived PrPSc with respect to infectivity and structure.

© 2020 A.I.G.S.S.
Italian Association against the disease of GERSTMANN STRAUSSLER SCHEINKER

Headquarter: via Alfieri, 2 - 20813 Bovisio Masciago (MB) ITALY