Introduction
Frontotemporal dementia (FTD) is a clinically and pathologically heterogeneous type of early-onset dementia, typically characterized by atrophy of the frontal and/or temporal lobes [
1]. The clinical profile of FTD shows behavioural and language disturbances, with cognitive deficits in executive function and relative sparing of memory and visuospatial abilities [
2]. Up to 40% of FTD cases have an autosomal dominant pattern of inheritance. Mutations in the progranulin (
GRN), microtubule-associated protein tau (
MAPT) and chromosome 9 open reading frame 72 (
C9orf72) genes are the most common causes [
3]. Early diagnosis—albeit difficult due to the heterogeneous symptoms and overlap with other forms of dementia and psychiatric disorders—is essential for proper patient management and planning, non-pharmacological treatment, and patient stratification in upcoming disease-modifying clinical trials [
4].
Research in the genetic FTD field has been increasingly moving towards the presymptomatic and early prodromal stages, as the critical time-window for treatment most likely lies prior to overt symptom onset. With promising avenues opening for clinical trials, identifying robust biomarkers is of utmost importance [
5]. Previous neuropsychological studies showed that subtle cognitive deficits and decline are present in the presymptomatic stage, and gene-specific cognitive profiles can be detected [
6‐
8]. These findings suggest that neuropsychological assessment in the presymptomatic and early prodromal stages can provide sensitive cognitive markers for FTD.
The semantic fluency test is one of the most widely used tests in neuropsychological assessments. In this brief, easy-to-apply test, people have to generate items from a particular semantic category (e.g., animals, foods) in 1 min [
9]. The semantic fluency test presents high sensitivity and specificity for dementia diagnosis [
9], with impaired performance found in both symptomatic [
10] and presymptomatic FTD [
7,
8]. Although the total number of items generated is commonly used to quantify test performance, qualitative, psycholinguistic information embedded in the output can also be investigated, including clusters (number of multiword strings
1), switches (number of transitions between clusters), age of acquisition (AoA; the age at which a word is learned), and lexical frequency (LF; how often a word occurs in daily language) [
12,
13]. Previous research demonstrated the prognostic value of qualitative fluency measures in cognitively healthy subjects at-risk for and in conversion from prodromal to overt Alzheimer’s Dementia (AD) [
14,
15]. This approach has been underexplored in presymptomatic and/or prodromal genetic FTD, while this psycholinguistic information may be able to detect the subtle development of FTD’s characteristic language symptoms at an early stage.
The aim of this study was therefore to investigate longitudinal changes in five qualitative aspects of semantic fluency (i.e., number of clusters and switches, cluster size, AoA, and LF) in mutation carriers that developed FTD (phenoconverters), presymptomatic mutation carriers, and non-carriers from autosomal dominant GRN- and MAPT-FTD families. We were specifically interested in the inflection point (i.e., when in the disease trajectory) at which the qualitative measures start to deviate from normal. Additionally, we explored the co-correlation between the qualitative measures, and their associations with cognitive decline and grey matter (GM) volume loss, and the prognostic value of decline in qualitative measures in predicting symptomatic onset.
Discussion
This study examined longitudinal changes in qualitative aspects of the semantic fluency task in a large cohort of FTD phenoconverters, presymptomatic mutation carriers, and non-carriers from GRN- and MAPT-FTD families. Phenoconverters showed a decline in the total score from at least 4 years pre-phenoconversion, with individually-varying inflection points and longitudinal trajectories in qualitative fluency measures in GRN and MAPT. At least 4 years pre-phenoconversion, GRN phenoconverters started producing fewer but larger clusters, and switched less between clusters, which was correlated with executive dysfunction. A decline in switching was predictive of phenoconversion. At least 4 years pre-phenoconversion, MAPT phenoconverters demonstrated an increase in LF and a decline in AoA, which was correlated with semantic deficits. A decline in cluster size was predictive of phenoconversion. Increase in LF and decline in AoA were associated with GM volume loss of predominantly temporal areas, while decline in the number of clusters, cluster size, and switches correlated with GM volume loss of predominantly frontal areas.
The semantic fluency total score is strongly intertwined with the qualitative aspects of the task. For accurate and timely word retrieval, both AoA and LF [
26], and clustering and switching components [
27] are required. Coinciding with the results of our principal component analysis, in which we found a AoA-LF component and a clusters-switches component, previous studies found strong correlations between AoA and LF, and between the number of clusters and the number of switches. With respect to the relation between AoA and LF, correlations have found to be high in natural languages, as early-acquired words tend to occur more frequently than late-required words [
28]. The number of switches and the number of clusters are correlated, as by identifying the number of clusters, one can generate the number of switches (corresponds to the number of clusters minus one) [
27].
Irrespective of the underlying FTD mutation, we showed a decline in the total score in mutation carriers from at least 4 years prior to phenoconversion. Decline in semantic fluency was found to be an early cognitive marker in other neurodegenerative diseases. For instance, in preclinical AD and Huntington’s disease, as early as 12 years before the onset of dementia, decline in a measure of semantic memory was found [
29,
30]. In another study, it was amongst the most statistically sensitive cognitive measures of symptomatic conversion [
31]. Studies into semantic fluency decline in presymptomatic FTD have shown somewhat contrasting results. One study demonstrated decline in semantic fluency in
MAPT mutation carriers from 6 years before estimated symptom onset [
7], whereas another only found decline at estimated symptom onset [
6]. It should be noted that both studies used estimated years to symptom onset as a proxy for actual onset, which can be less reliable in familial FTD [
6]. Utilizing a similar research design as our current study, decline of semantic fluency from 4 years before symptom onset in
MAPT converters, and decline of semantic fluency was found to be the best predictor for having an
MAPT mutation [
8]. Although semantic fluency did not decline in
GRN converters, decline on phonemic fluency was found to be predictive of an
GRN mutation, confirming the value of fluency tasks in presymptomatic FTD as they can distinguish the underlying genotype [
8].
Starting at least 4 years pre-phenoconversion,
GRN mutation carriers produced fewer but larger clusters, and had fewer switches, than
MAPT phenoconverters and controls. A likely explanation for the decline in their total score is that
GRN phenoconverters deteriorate in cognitive flexibility [
32], and thereby lose the ability to switch between semantic clusters in order to generate more words. Indeed, in our study the decline in the number of clusters and switches, and the increase in cluster size, correlated with decline in executive function. Executive dysfunction is known to be a distinctive cognitive feature in
GRN mutations, demonstrating deficits in symptomatic mutation carriers [
33], extending to the presymptomatic stage [
7,
8]. When converting to the symptomatic stage,
GRN mutation carriers also show the most decline in executive function [
34].
Starting at least 4 years pre-phenoconversion,
MAPT mutation carriers produced words with a higher LF and a lower AoA, and had fewer clusters and smaller clusters than
GRN phenoconverters and controls. These findings point towards deterioration of the semantic system as an explanation as to why qualitative fluency measures change in
MAPT. First, semantic decline is most likely to affect words with a lower LF and a higher AoA first, as the categorical organization of the system retrieves the ‘typical’ exemplars faster and more accurately, and they are better represented and more interconnected to other concepts than those that enter the semantic system later in life [
35]. The reliance of both processes on semantic processing is further supported by their correlation with the verbal SAT which assesses verbal semantic deficits [
36]. Clustering relies on lexical retrieval, vocabulary size and lexical access, and thus is mainly supported by the integrity of the semantic system [
13].
We demonstrated that—irrespective of the underlying mutation—decline in the number of clusters, cluster size, and switches correlated with GM volume loss of predominantly frontal areas, while worse LF and AoA performance was associated with GM volume loss of predominantly temporal areas. These neuroanatomical correlates are in line with the predominant frontal involvement in
GRN mutation carriers [
6,
34], and link the degradation of the fronto-insula network to less cognitive flexibility—and as a consequence early clustering-switching impairment—as the most likely underpinning of declining fluency performance in conversion to
GRN-associated FTD. The finding that LF and AoA rely on temporal lobe functioning could explain why these qualitative features are changing early in
MAPT mutation carriers, as temporal volume loss is considered the neuroimaging hallmark of
MAPT [
37,
38], being present up to several decades before symptom onset [
39]. Although PPA is not a frequent clinical phenotype, semantic impairments are well-described in
MAPT-related FTD [
40]. Our cohort includes three
P301L and three
G272V phenoconverters, which is too small to investigate differences between the two tau mutations. Nevertheless, with larger sample sizes it would be interesting to explore if there is clinical heterogeneity across the
MAPT mutations [
41], as the
P301L mutation often presents with a language phenotype with semantic deficits [
42]. Impairments in semantic fluency are found to be common as the result of cerebellar pathology, as this subcortical region plays a crucial role in motor performance and executive processes necessary for organizing and monitoring word output [
43].
The key strength of our study is our longitudinal design, spanning up to 6 years of follow-up in a large single-centre sample of participants from
GRN and
MAPT FTD-families. This design allowed the investigation of mutation carriers as they were converting to the symptomatic stage, which provides us more accurate information about the underlying disease process than previous studies that used estimated years to onset as a proxy [
40]. We chose multilevel linear modelling to handle potential missing data and unbalanced time-points that were the result of our ongoing prospective study. The small sample of phenoconverters, in combination with the large fluctuations in the data of the qualitative measures, are the largest drawback of the study, which has hampered our statistical power and interpretation of results. As the multilevel model assumes a linear relationship between genetic status and fluency performance it is possible that we have missed non-linear effects. Ideally, the semantic fluency test should not have been used in determining phenoconversion, however in our multidisciplinary approach we have used all available clinical information—e.g., MR brain imaging, anamnestic and heteroanamnestic information, questionnaires—so that symptom onset did not solely depend on the neuropsychological assessment. Although theoretically a cluster can consist of a single word, one cannot measure interword intervals for fewer than two words, so that a single-word “cluster” cannot be corroborated by the measurement of interword intervals. Following Ledoux et al. [
11], we therefore defined clusters not as single words, but only as multiword strings (i.e. two or more consecutive words) whose relationship is defined by one of the scoring rules, but realize this could have penalized patients with a low total output. Lastly, the analyses on the presymptomatic mutation carriers were performed using the original baseline and follow-up visits, regardless of years from potential phenoconversion, therefore they might have lost some sensitivity to detect decline. Future directions include replication of our findings in larger multicentre cohorts, including
C9orf72 mutation carriers. Moreover, using qualitative measures in discriminative event-based models could help us understand the dynamics of disease progression and how other biomarkers (e.g., NfL) fit into this [
44]. Lastly, future studies could look into the effect of using time-bins next to the usual 60-s output, as most people start with readily available animals and produce less familiar exemplars as the task develops (which affects the qualitative measures), and was found to be particularly sensitive to mutation status in presymptomatic
APOE-ε4 carriers at-risk of developing AD [
14].
Conclusion
Our pilot study shows that qualitative aspects of semantic fluency change in presymptomatic FTD, and shows different profiles and inflection points depending on the mutation involved. This could provide important insight into the mechanisms as to why the “traditional” total score is declining. Its brief and easy-to-apply nature makes the total score of the semantic fluency test a likely candidate cognitive biomarker for upcoming clinical trials for FTD, but more research with a larger sample of phenoconverters is needed to replicate our findings, and to explore the additional value of qualitative measures in identifying and tracking mutation carriers as they convert to the symptomatic stage.