Test-retest reliability of functional near-infrared spectroscopy during a finger-tapping and postural task in healthy older adults

Veerle de Rond; Moran Gilat; Nicholas D’Cruz; Femke Hulzinga; Jean-Jacques Orban de Xivry; Alice Nieuwboer

doi:10.1117/1.NPh.10.2.025010

26 May 2023

Neurophotonics, Vol. 10, Issue 2, 025010 (May 2023). https://doi.org/10.1117/1.NPh.10.2.025010

Abstract

Significance

Functional near-infrared spectroscopy (fNIRS) is increasingly employed in studies requiring repeated measurements, yet test-retest reliability is largely unknown.

Aim

To investigate test-retest reliability during a postural and a finger-tapping task with and without cap-removal.

Approach

Twenty healthy older adults performed a postural and a finger-tapping task. The tasks were repeated twice in one session and once the next day. A portable fNIRS system measured cortical hemodynamics (HbO₂) in five regions of interest for the postural task and in the hand motor region for finger-tapping.

Results

Test-retest reliability without cap-removal was excellent for the prefrontal cortex (PFC), the premotor cortex (PMC) and the somatosensory cortex (SSC) (intraclass correlation coefficient (ICC) ≥ 0.78), and fair for the frontal eye fields (FEF) and the supplementary motor area (SMA) (ICC ≥ 0.48). After cap-removal, reliability was reduced for PFC and SSC (ICC ≥ 0.50), became poor for SMA (ICC = 0.01) and PMC (ICC = 0.00) and was good for FEF (ICC = 0.64). Similarly, good reliability (ICC = 0.66) was apparent for the hand motor region without cap-removal, which deteriorated after cap-removal (ICC = 0.38).

Conclusions

Test-retest reliability of fNIRS measurements during two separate motor tasks in healthy older adults was fair to excellent when the cap remained in place. However, removing the fNIRS cap between measurements compromised reliability.

1. Introduction

Functional near-infrared spectroscopy (fNIRS) is a neuroimaging technique that uses the principle of neuro-vascular coupling to estimate the blood oxygen-level dependent (BOLD) response as a surrogate for neural activation and deactivation.¹ Brain oxygenation is measured through light-emitting and receiving optodes using two different wavelengths within the “optical window” of 700 to 900 nm, differentiating between oxygenated (${HbO}_{2}$) and deoxygenated hemoglobin (HHb) absorption.²,³ fNIRS is a non-invasive and user-friendly technique, which has been validated against functional magnetic resonance imaging.⁴ Among the drawbacks of fNIRS are its limited spatial resolution and cortical penetration.⁵ However, fNIRS systems are mobile and less sensitive to motion artifacts than other mobile neuroimaging systems, such as electroencephalography (EEG),⁶ allowing for measurements during actual whole-body movements in natural environments.⁷

For these reasons, fNIRS is becoming increasingly popular for investigating the cortical activation patterns during postural and gait-related tasks.⁵,⁸ In addition, more and more intervention studies are implementing fNIRS as a primary outcome, comparing cortical activation before and after training in repeated measures designs.⁹–¹³ For example, studies have used fNIRS to investigate learning-induced neural changes in the prefrontal cortex (PFC) as well as in other motor regions during balance⁹,¹²,¹³ and manual tasks¹⁰,¹¹ in young adults. These studies, however, did not take test-retest reliability into account, which is crucial for interpreting changes in the BOLD signal as meaningful rather than as measurement error. There are several potential sources of error, including the limited spatial specificity of fNIRS,¹⁴ as well as the systemic changes in physiological measures that may arise during movement due to heart rate and blood flow alterations.¹⁵ Other possible inaccuracies may arise from day-to-day variability in hemodynamic oscillations and from whether the fNIRS cap was repositioned precisely by the operator.¹⁶

So far, fNIRS test-retest reliability has been shown to be fair to excellent during resting-state.¹⁷–¹⁹ Seven studies investigated fNIRS test-retest reliability of the PFC or the contralateral motor cortex during motor tasks, based on the intraclass correlation coefficient (ICC).²⁰–²⁴ The ICC provides an overall estimate of the correlation and agreement between measurements. It is calculated as the proportion of between-subject variance over the total variance.²⁵ Four studies reported good to excellent reliability during manual tasks ($ICC \geq 0.60$), of which two studied task-specific motor channels ($ICC \geq 0.62$).²⁰,²² One study reported very poor test-retest reliability ($ICC = 0.002$) during passive hand movements imposed by a robot.²¹ Only two studies investigated test-retest reliability during gait-related tasks,²⁶,²⁷ showing fair to good reliability for straight walking ($ICC > 0.40$;²⁶ $ICC = 0.71$²⁷) and turning ($ICC = 0.67$²⁷) in the PFC in young and middle-aged adults. None of these studies were conducted in older adults, who may present lower signal to noise ratios.²⁸ Two studies pertained to patients with multiple sclerosis²⁶ and traumatic brain injury,²³ resulting in poor ($ICC < 0.40$) and good ($ICC = 0.70$) reliability in the PFC, respectively.

Given the above-described gaps in the literature, we set out to investigate the test-retest reliability of fNIRS in healthy older adults during two different motor tasks, namely a postural weight-shifting and a finger tapping task. First, test-retest reliability of five regions of interest (ROIs) was determined during a weight-shifting task by comparing tests repeated on the same day and on two consecutive days, the latter following cap removal and repositioning. Next, the task-specificity of test-retest reliability was investigated during a control finger tapping task, focusing on the contralateral finger region of the motor cortex.²⁹ Based on earlier work during both gross and fine motor tasks in young adults²⁶,²⁷ and the anticipated lower signal to noise ratios in older people,²⁸ we hypothesized that test–retest reliability would be more compromised than that obtained in young adults. Furthermore, we expected lower reliability after cap removal compared to when the cap remained stationary.

2. Materials and Methods

2.1. Participants

Healthy older adults were recruited via an existing local database, compliant with the general data protection regulation (GDPR), as part of a larger randomized controlled trial (RCT; clinicaltrial.gov ID: NCT04594148). Participants had to be at least 65 years old, right handed (self-reported) and be able to independently stand upright for at least 5 min. Participants were excluded if they had a self-reported history of neurological disorders, balance impairments (i.e., vestibular disorders), uncorrected visual impairment, chronic musculoskeletal problems (e.g., osteoarthritis, osteoporosis), cardiovascular (e.g., uncontrolled hypertension, peripheral vascular disease) or respiratory (e.g., chronic obstructive pulmonary disease) disease, or diabetes related polyneuropathy. Additionally, participants with a cognitive impairment (Montreal Cognitive Assessment (MoCA) score < 26) were excluded. Written informed consent was obtained from all participants prior to enrolment. The study was approved by the Ethics Committee Research UZ/KU Leuven (study ID: S62917).

2.2. Experimental Procedure and Tasks

2.2.1. Procedure

Only participants who were randomized to the passive control group of the RCT, who received no intervention, were included. Participants were assessed on 2 consecutive days. Motor tests were assessed twice on day 1, before and after a resting period of 30 min while keeping the fNIRS cap and optodes in place. After a complete removal of the fNIRS system, the cap was re-attached the next day and the third test was conducted around the same time of day as the first measurement the day before. Cap position across days was standardized (see Sec. 2.2.4). Motor and fNIRS assessments were conducted using a block design with each block consisting of seven trials of 20 sec of rest followed by 20 sec of movement. Prior pilot testing showed an 80% true positive rate when using this design. The start of each trial was marked in the fNIRS signal with a task-synchronized trigger. The 20-sec resting period, conducted in stance for the postural task and in sitting for the tapping task, served as a baseline for the following movement trial. The postural task blocks were conducted before the tapping task in a fixed order.
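The timing of one block can be laid out as a minimal sketch. The trial counts and durations come from the text; the function name `trial_schedule` and the choice to start the clock at the first rest period are illustrative assumptions, not part of the study software.

```python
# Sketch of the block design: seven trials, each 20 s of rest followed by 20 s of movement.
# Assumption: the block clock starts at the onset of the first rest period.
N_TRIALS = 7
REST_S = 20.0
MOVE_S = 20.0


def trial_schedule(n_trials=N_TRIALS, rest_s=REST_S, move_s=MOVE_S):
    """Return (rest_onset, movement_onset, trial_end) in seconds for each trial."""
    schedule = []
    t = 0.0
    for _ in range(n_trials):
        rest_onset = t
        movement_onset = rest_onset + rest_s
        trial_end = movement_onset + move_s
        schedule.append((rest_onset, movement_onset, trial_end))
        t = trial_end
    return schedule


schedule = trial_schedule()
block_duration_s = schedule[-1][2]  # total duration of one block in seconds
```

Each rest window directly precedes the movement window it serves as a baseline for, which mirrors the description above.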

2.2.2. Postural task

The postural task consisted of a non-immersive virtual reality weight-shifting task,³⁰ as described in detail elsewhere (see Figure S1 in the Supplementary Material).³¹ In short, participants were standing in front of a screen at approximately three meters distance. They were instructed to shift their weight mediolaterally beyond 80% of their a priori determined individual limits of stability, which activated a virtual water jet. With the water jet, they attempted to hit as many virtual wasps as possible, which appeared on the left and right side of the screen alternately. Participants’ center of mass (CoM) was captured within Nexus software (Vicon, Oxford Metrics, United Kingdom) by recording reflective markers placed bilaterally on the acromia, posterior superior iliac spines, lateral epicondyles, and lateral malleoli. Calculation of the CoM was done online within the D-Flow software (Motek Medical BV, Amsterdam, The Netherlands; version 3.28), based on the formulation by Winter (2009).³² During rest, subjects were asked to stand and watch the screen, displaying a video recording of the wasp game.

2.2.3. Finger tapping task

The finger-tapping task, as part of the cloud-UPDRS application (version 1.3.0),³³ was performed with the right dominant hand on a smartphone. It consisted of two visual targets with a diameter of 0.6 inch, positioned at a 2-inch distance from each other (see Figure S2 in the Supplementary Material). Participants were seated on a chair in front of the smartphone placed on a table. They were instructed to alternately tap the left and right targets with their right index finger while holding the smartphone still with their left hand. To prevent any learning effect, tapping frequency was set at 180 beats per minute imposed by a metronome beat. The researcher verbally indicated the start and termination of the task. Participants were instructed to sit as still as possible and place their right hand flat on the table during the 20-sec resting periods in between tapping trials.

2.2.4. fNIRS assessment

Brain hemodynamics were recorded following recent consensus guidelines.⁵ A continuous wave, single-phase fNIRS system (NIRSport2, NIRx, Berlin, Germany), using light emitting diodes (LEDs) with wavelengths of 760 and 850 nm at a sampling frequency of 7.81 Hz, recorded brain hemodynamics within the Aurora software (version 2020.7). First, participants’ head circumference was measured and matched to the closest corresponding cap size, varying from 54 to 60 cm. The Cz anatomical landmark was then determined using the inion, nasion, and pre-auricular points after which the lightweight cap was carefully placed on the head. To ensure similar cap placement on day two, the Cz, CP2, and FC1 anatomical landmarks were marked. Participants were asked not to wash their hair in between measurement days.

A total of 32 optodes (16 sources, 16 detectors) were used to cover predefined ROIs, including the prefrontal cortex (PFC; Brodmann areas 9, 10, and 46), frontal eye fields (FEF; Brodmann area 8), supplementary motor area (SMA; Brodmann area 6, medial), premotor cortex (PMC; Brodmann area 6, lateral), and somatosensory cortex (SSC; Brodmann areas 1, 3, 5 and 7). The 10-10 layout in the fNIRS Optodes’ Location Decider (fOLD) toolbox³⁴ was used to specify the optode locations (see Table S1 in the Supplementary Material) with a source-detector separation of $\sim 3$ cm. Sixteen short-separation channels with a source-detector separation of 8 mm were included, one over each source, to correct for physiological noise in the fNIRS signal.³⁵ Furthermore, two accelerometers were placed at the back of the cap to correct for movement artefacts related to head movements. At the onset of testing, signal quality was visually checked and improved by moving the hair aside from underneath the optodes, if needed. An additional opaque cap was then placed over the fNIRS set-up to prevent external light from interfering with the hemodynamic measurements. For the tapping task, the motor channel in between optode C3-C1 of the contralateral (left) hemisphere was chosen as the ROI. As the fOLD toolbox does not provide specific information on anatomical brain representations, the C3-C1 hand motor channel was based on the EEG topography³⁶ in accordance with previous data on the homunculus hand position.³⁷

2.3. Data Processing

2.3.1. Behavioral data

Weight-shifting data were exported by D-Flow and analyzed within MATLAB 2018b (MathWorks, Natick, Massachusetts, United States). Similar to our previous analysis,³¹ CoM data were first low-pass filtered with a fourth-order Butterworth filter (cut-off = 6 Hz). Weight-shifting was then determined as the movement from the 80% stability limits threshold on the right to the 80% stability limits threshold on the left and vice versa. Outcome measures included weight-shifting speed and accuracy (CoM error) averaged over the seven trials.
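As an illustration of the filtering step, the following is a minimal sketch of a fourth-order Butterworth low-pass filter built from two second-order sections via the bilinear transform. It is not the authors' MATLAB code. The 100 Hz sampling rate is an assumption (the source does not state the CoM sampling rate), the filter is applied in a single forward pass (the source does not say whether zero-phase filtering was used), and the function names are hypothetical.

```python
import math


def butter4_lowpass_sections(cutoff_hz, fs_hz):
    """Coefficients of the two biquads that form a 4th-order Butterworth low-pass."""
    k = math.tan(math.pi * cutoff_hz / fs_hz)  # pre-warped cut-off for the bilinear transform
    sections = []
    # The two conjugate pole pairs of a 4th-order Butterworth filter lie at
    # 22.5 and 67.5 degrees from the negative real axis; Q = 1 / (2 cos(angle)).
    for angle_deg in (22.5, 67.5):
        q = 1.0 / (2.0 * math.cos(math.radians(angle_deg)))
        norm = 1.0 / (1.0 + k / q + k * k)
        b0 = k * k * norm
        b = (b0, 2.0 * b0, b0)
        a = (2.0 * (k * k - 1.0) * norm, (1.0 - k / q + k * k) * norm)
        sections.append((b, a))
    return sections


def lowpass(signal, cutoff_hz=6.0, fs_hz=100.0):
    """Filter a sequence of samples; fs_hz = 100.0 is an assumed sampling rate."""
    out = list(signal)
    for (b0, b1, b2), (a1, a2) in butter4_lowpass_sections(cutoff_hz, fs_hz):
        x1 = x2 = y1 = y2 = 0.0
        filtered = []
        for x in out:
            y = b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
            filtered.append(y)
            x2, x1 = x1, x
            y2, y1 = y1, y
        out = filtered
    return out
```

A constant input passes through with unit gain once the start-up transient has decayed, while content at the Nyquist frequency is removed; a single forward pass does introduce a phase lag.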

The cloud-UPDRS smartphone data were analyzed with Microsoft Excel (version 2016). Outcome measures included the average number of taps per trial and the accuracy of target taps in pixels (target error), calculated as

Eq. (1)

$\sqrt{(X_{position} - X_{target})^{2} + (Y_{position} - Y_{target})^{2}}$,

where $X_{position}$ and $Y_{position}$ refer to the coordinates of the finger on the screen, and $X_{target}$ and $Y_{target}$ refer to the coordinates of the targets on the screen.
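Eq. (1) is the Euclidean distance between the tap and the target. A minimal sketch follows; the function names and the per-trial averaging helper are illustrative assumptions, since the source only states that the data were analyzed in Excel.

```python
import math


def target_error(x_position, y_position, x_target, y_target):
    """Euclidean distance (in pixels) between a tap and its target, as in Eq. (1)."""
    return math.hypot(x_position - x_target, y_position - y_target)


def mean_target_error(taps, targets):
    """Average target error over paired (x, y) taps and (x, y) targets (hypothetical helper)."""
    errors = [target_error(px, py, tx, ty) for (px, py), (tx, ty) in zip(taps, targets)]
    return sum(errors) / len(errors)
```

For example, a tap at (3, 4) aimed at a target at (0, 0) has a target error of 5 pixels.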

2.3.2. fNIRS data

Brain hemodynamic data were analyzed with the open access NIRS toolbox (https://github.com/huppertt/nirs-toolbox)³⁸ implemented in MATLAB 2018b (MathWorks, Natick, Massachusetts, United States). Raw intensity signals were checked visually, resampled to 5 Hz, and converted into optical density using the Beer-Lambert law and a partial path length correction factor of 0.1, thereby correcting for light scatter caused by the brain tissue that the near-infrared light was travelling through, as stated in previous research.³⁹,⁴⁰ A general linear model, including short separation channels and accelerometers, was used to estimate the task hemodynamic response, thereby correcting for signal variations due to physiological noise and movement artefacts.²⁰,³⁵ The autoregressive iteratively reweighted least squares method was implemented to correct for motion and autocorrelated noise.⁴¹ This method, including short separation channel regression, outperformed other filtering methods³⁵ and was shown to improve data reproducibility.³⁵ Accelerometers identified and corrected for changes in the fNIRS signal that corresponded with the accelerometer signal over a time-window of 15 s.⁴²

Channels were averaged within the predefined ROIs. A channel was included if the spatial specificity, as determined with the fOLD toolbox,³⁴ was at least 50% within a ROI⁴³ (see Fig. 1). As the SMA and PMC ROIs were grouped within the fOLD output, the medial channels were defined as SMA and the lateral channels as PMC.⁴⁴ The primary motor cortex (M1) was not included as ROI in this analysis, because spatial specificity did not exceed 50%. Midline-traversing channels were excluded, as cerebrospinal fluid running through the superior sagittal sinus could have interfered with the measurement of the underlying brain hemodynamics.⁴⁵ In addition, the tapping task-specific hand motor channel C3-C1 was excluded from the ROIs specified for the postural task, as spatial specificity was lower than 50% (35% M1 and 35% PMC). Trial-averaged relative oxygenated hemoglobin (${HbO}_{2}$ active trial − ${HbO}_{2}$ rest trial, in μmol/L) was used as the primary outcome for each ROI. As secondary outcomes, HHb (μmol/L) and ${HbO}_{2}$ (μmol/L) values for the left and right ROIs were investigated separately.
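The channel inclusion rule and ROI averaging can be sketched as below. This is a simplification: the study estimated responses with a general linear model, whereas the sketch simply subtracts a rest value from an active value per channel. The dictionary keys, channel names, and all numbers are made up for illustration.

```python
def roi_relative_hbo2(channels, roi, min_specificity=50.0):
    """Average (active - rest) HbO2 over channels whose specificity for `roi` is >= 50%.

    `channels` is a list of dicts with hypothetical keys:
    'name', 'roi', 'specificity' (percent), 'active' and 'rest' (both in umol/L).
    Returns None when no channel meets the inclusion criterion.
    """
    included = [
        c for c in channels
        if c["roi"] == roi and c["specificity"] >= min_specificity
    ]
    if not included:
        return None
    return sum(c["active"] - c["rest"] for c in included) / len(included)


# Illustrative, made-up values (not study data).
example_channels = [
    {"name": "ch1", "roi": "PFC", "specificity": 72.0, "active": 2.0, "rest": 0.5},
    {"name": "ch2", "roi": "PFC", "specificity": 55.0, "active": 1.0, "rest": 0.5},
    {"name": "ch3", "roi": "PFC", "specificity": 41.0, "active": 9.0, "rest": 0.0},
    {"name": "C3-C1", "roi": "PMC", "specificity": 35.0, "active": 3.0, "rest": 1.0},
]
```

In this toy example the third PFC channel and the C3-C1 channel fall below the 50% threshold and therefore do not contribute to any ROI average.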

Fig. 1

fNIRS lay-out with ROIs according to the fOLD toolbox for the postural task with a specificity level $\geq 50\%$. The PFC is represented by the blue colored channels, the FEF by the green channels, the SMA by the orange channels, the PMC by the purple channels, and the SSC by the yellow channels. The channels situated at the midline were excluded (indicated by black crosses). The C3-C1 channel (black circle; brown channel) was included as the task-specific hand motor channel for the finger tapping task, consistent with the EEG topography. Nz, nasion; Iz, inion; LPA, left preauricular point; and RPA, right preauricular point.

To check a posteriori which channels were active during the postural or the finger-tapping task, signal changes in individual channels were assessed in a complementary analysis. A channel was classified as active when there was a significant average change during the task compared to rest in both ${HbO}_{2}$ (positive change) and HHb levels (negative change) over the three time points, with false discovery rate (FDR) correction for the number of channels.
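The classification rule can be sketched as follows. The source only states that p-values were FDR-corrected; the Benjamini-Hochberg procedure used here is an assumption, as are the function names and the 0.05 threshold.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans: True where the null is rejected under BH-FDR control."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_rank = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


def active_channels(hbo2_change, hbo2_p, hhb_change, hhb_p, alpha=0.05):
    """A channel is active if HbO2 rises and HHb falls, both significant after correction."""
    hbo2_sig = benjamini_hochberg(hbo2_p, alpha)
    hhb_sig = benjamini_hochberg(hhb_p, alpha)
    return [
        hbo2_sig[i] and hbo2_change[i] > 0 and hhb_sig[i] and hhb_change[i] < 0
        for i in range(len(hbo2_p))
    ]
```

Requiring both chromophores to change in opposite directions makes the criterion stricter than testing ${HbO}_{2}$ alone.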

2.4. Statistical Analysis

Statistical analyses were performed with SPSS (IBM SPSS Statistics, version 26). After checking the data distribution, test-retest differences in behavioral performance on both the postural and tapping task were investigated using a repeated measures analysis of variance (ANOVA) with time (test 1, 2, and 3) as within-subject factor. The assumption of sphericity was checked, and the Greenhouse-Geisser correction was used in case of violation. The same approach tested differences between time points for the fNIRS outcomes, and post-hoc tests were Bonferroni corrected for multiple comparisons. Next, single and average ICCs, as well as the corresponding confidence intervals (CIs), were calculated using a two-way mixed model with absolute agreement.²⁵,⁴⁶ Single measure ICCs are reported as the primary measure of test-retest reliability, as only one fNIRS measurement (consisting of seven trials) was performed during each test session.²⁵,⁴⁶ ICCs were obtained between test 1 and 2 and between test 1 and 3, investigating test-retest reliability on the same day and on consecutive days after cap removal, respectively. ICCs were interpreted as poor ($ICC < 0.40$), fair ($0.40 \leq ICC < 0.60$), good ($0.60 \leq ICC < 0.75$), or excellent ($0.75 \leq ICC < 1.00$).²⁵ To allow for a meaningful interpretation, negative ICC values were replaced by zero.²⁵ Additionally, the standard error of measurement (SEM) was calculated as

Eq. (2)

$SEM = SD_{pooled} \times \sqrt{1 - ICC}$,

where $SD_{pooled}$ refers to the average standard deviation (SD) across measurements.⁴⁷,⁴⁸ Finally, Bland–Altman plots are presented to visualize the mean difference in relation to the average ${HbO}_{2}$ levels of the two tests for each subject separately.⁴⁹
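The reliability statistics described in this section can be sketched as follows. The analysis itself was run in SPSS; this is only an illustration. The ICC function uses the standard mean-squares formula for a single-measure, absolute-agreement ICC from a two-way layout, which is assumed to correspond to the SPSS setting named above. Function names are hypothetical.

```python
import math


def icc_single_absolute(scores):
    """Single-measure, absolute-agreement ICC; negative values are replaced by zero.

    `scores` holds one row per participant and one column per test session.
    """
    n = len(scores)
    k = len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    ms_rows = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    ms_cols = n * sum((m - grand) ** 2 for m in col_means) / (k - 1)
    ss_error = sum(
        (scores[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n)
        for j in range(k)
    )
    ms_error = ss_error / ((n - 1) * (k - 1))
    icc = (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k / n * (ms_cols - ms_error)
    )
    return max(0.0, icc)


def classify_icc(icc):
    """Interpretation thresholds as listed in the text."""
    if icc < 0.40:
        return "poor"
    if icc < 0.60:
        return "fair"
    if icc < 0.75:
        return "good"
    return "excellent"


def standard_error_of_measurement(sds, icc):
    """Eq. (2): average SD across measurements times sqrt(1 - ICC)."""
    sd_pooled = sum(sds) / len(sds)
    return sd_pooled * math.sqrt(1.0 - icc)
```

As a check against Table 2, the PFC total ROI has SDs of 8.28 and 6.04 μmol/L at tests 1 and 2 and an ICC of 0.87, which gives an SEM of about 2.58 μmol/L, matching the tabulated value.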

3. Results

3.1. Participants’ Characteristics

Twenty-two healthy older adults were recruited for this study, as part of the control group for a larger RCT (clinicaltrial.gov ID: NCT04594148). Two were excluded due to not meeting the in-/exclusion criteria (one had a MoCA score < 26 and one was aged below 65 years). Participant characteristics of the included 20 healthy older adults are shown in Table 1. They were all right-handed (self-reported).

Table 1

Participant characteristics.

	Dataset (N=20)
Gender (m/f)	10/10
Age (years)	71.00 (67.3 - 75.0)
Height (m)	1.69 ± 0.1
Weight (kg)	71.17 ± 11.8
MiniBEST (0-28)	24.95 ± 1.9
FES-I (16-64)	19.00 (17.0 - 23.5)
MoCA (0-30)	28.00 (26.3 - 29.8)

Note: Normally distributed data are displayed as mean ± SD, and not normally distributed data as median (first quartile - third quartile). MiniBEST, Mini Balance Evaluation Systems Test; FES-I, Falls Efficacy Scale International; and MoCA, Montreal Cognitive Assessment. MiniBEST, FES-I, and MoCA are presented for descriptive purposes only.

3.2. Behavioral Results

Even though the postural task was not standardized, weight-shifting performance was similar across the three test sessions, as no differences were found in speed [$F_{(1.53)} = 0.35$, $p = 0.65$, $η_{p}^{2} = 0.02$; see Figure S3(a) in the Supplementary Material] and accuracy [$F_{(2)} = 0.52$, $p = 0.60$, $η_{p}^{2} = 0.03$; see Figure S3(b) in the Supplementary Material]. Similar results were found for tapping performance [accuracy: $F_{(1.50)} = 1.70$, $p = 0.21$, $η_{p}^{2} = 0.09$, see Figure S3(c) in the Supplementary Material; number of taps: $F_{(1.49)} = 1.30$, $p = 0.28$, $η_{p}^{2} = 0.07$, see Figure S3(d) in the Supplementary Material]. Exploratory post-hoc tests with Bonferroni corrections for multiple testing also did not show any difference between tests (postural: corrected $p$-values $> 0.30$; tapping: corrected $p$-values $> 0.07$). It should be noted, however, that there was a trend towards a better accuracy for tapping in test 2 compared to test 1 ($p = 0.07$).

3.3. fNIRS Results

Figure 2 gives an overview of the ICCs and CIs between test 1-2 versus test 1-3 for both ${HbO}_{2}$ and HHb levels. For both outcomes, ICCs were higher for test 1-2 than for test 1-3, particularly in the SMA and PMC. The sections below describe these results in more detail.

Fig. 2

Intraclass correlation coefficients (ICCs) and CIs of test 1 versus test 2 and test 1 versus test 3 for both (a) oxygenated (${HbO}_{2}$) and (b) deoxygenated (HHb) hemoglobin. ROIs of the weight-shifting task are displayed with a green dot (ICC) and line (CI), and the task-specific ROI of the right finger tapping task (channel C3-C1) is represented by a purple dot (ICC) and line (CI).

3.3.1. ROI reliability during the postural task without cap repositioning

In the total ROIs (left plus right channels), no differences in ${HbO}_{2}$ were found across time points [PFC: $F_{(1.41)} = 0.62$, $p = 0.49$; FEF: $F_{(2)} = 0.25$, $p = 0.78$; SMA: $F_{(2)} = 0.73$, $p = 0.49$; PMC: $F_{(1.20)} = 0.39$, $p = 0.58$; SSC: $F_{(1.35)} = 0.83$, $p = 0.40$; see Figs. 3(a)–3(e)]. Exploratory post-hoc tests also did not show differences between the three tests (all corrected $p$-values $> 0.17$). The ${HbO}_{2}$ test-retest reliability, as captured by the ICC-values and CIs between test 1-2, was excellent for the PFC ($ICC = 0.87$, $CI = [0.71, 0.95]$), PMC ($ICC = 0.78$, $CI = [0.53, 0.91]$) and SSC ($ICC = 0.78$, $CI = [0.51, 0.91]$), and fair for the FEF ($ICC = 0.51$, $CI = [0.10, 0.77]$) and SMA ($ICC = 0.48$, $CI = [0.05, 0.76]$) (see Table 2). Table 2 further illustrates that the reliability for the ROIs per hemisphere showed lower values, but still within an acceptable range (PFC: $ICC \geq 0.76$; FEF: $ICC \geq 0.49$; SMA: $ICC \geq 0.40$; PMC: $ICC \geq 0.74$; SSC: $ICC \geq 0.60$). Bland–Altman plots are shown in Fig. 4. Points fell largely within the limits of agreement, were equally distributed around zero, and showed no bias, corroborating the findings above. The pattern, which was overall similar for the different ROIs, did show that one participant with extreme activation (PFC) or deactivation (PMC, SMA) values also had the least consistency between time points. As for the SEMs, values ranged between 1.89 and 4.09 μmol/L for the total ROIs and between 2.51 and 4.91 μmol/L for the ROIs per hemisphere.

Fig. 3

(a)–(e) Relative ${HbO}_{2}$ levels (active trial - rest trial) for total (left plus right channels) ROIs at test 1, 2, and 3 for the postural task. Note that the scale on the $x$-axis differs between ROIs so as to visualize the slightest differences. (f) Relative ${HbO}_{2}$ levels for all total ROIs averaged over the three test moments. ROI, region of interest; PFC, prefrontal cortex; FEF, frontal eye fields; PMC, premotor cortex; SMA, supplementary motor area; and SSC, somatosensory cortex.

Table 2

Test-retest reliability based on single ICCs for relative HbO2 and HHb levels during the postural task.

			Mean ± SD (μmol/L)			ICC (95% CI)		SEM (μmol/L)
			Test 1	Test 2	Test 3	Test 1-2	Test 1-3	Test 1-2	Test 1-3
${HbO}_{2}$	PFC	Total	1.78 ± 8.28	1.53 ± 6.04	2.92 ± 6.02	0.87 (0.71, 0.95)**	0.50 (0.08, 0.77)*	2.58	5.13
		Left	2.39 ± 10.47	1.12 ± 7.61	1.94 ± 6.58	0.77 (0.50, 0.90)**	0.66 (0.32, 0.85)**	4.44	5.08
		Right	1.17 ± 7.78	1.94 ± 6.48	3.90 ± 6.78	0.84 (0.64, 0.93)**	0.00 (0.00, 0.39)	2.89	7.30
	FEF	Total	1.15 ± 4.63	1.70 ± 3.05	1.58 ± 3.50	0.51 (0.10, 0.77)*	0.64 (0.28, 0.84)**	2.74	2.47
		Left	1.40 ± 4.47	1.77 ± 4.16	0.72 ± 4.65	0.49 (0.06, 0.76)*	0.31 (0.00, 0.65)	3.10	3.80
		Right	0.90 ± 5.98	1.63 ± 4.13	2.45 ± 4.02	0.59 (0.21, 0.81)**	0.48 (0.08, 0.76)*	3.29	3.66
	SMA	Total	2.15 ± 5.79	1.42 ± 5.57	3.20 ± 5.74	0.48 (0.05, 0.76)*	0.01 (0.00, 0.45)	4.09	5.73
		Left	1.82 ± 6.55	1.73 ± 4.90	3.43 ± 5.67	0.49 (0.07, 0.77)*	0.22 (0.00, 0.60)	4.11	5.40
		Right	2.48 ± 5.97	1.11 ± 6.72	2.98 ± 6.91	0.40 (0.00, 0.71)*	0.00 (0.00, 0.43)	4.91	6.45
	PMC	Total	4.08 ± 5.95	3.68 ± 4.77	5.05 ± 5.30	0.78 (0.53, 0.91)**	0.00 (0.00, 0.14)	2.51	5.63
		Left	4.16 ± 7.29	3.62 ± 5.73	5.11 ± 6.35	0.74 (0.45, 0.89)**	0.00 (0.00, 0.09)	3.36	6.83
		Right	4.00 ± 5.69	3.75 ± 5.29	5.00 ± 5.53	0.74 (0.46, 0.89)**	0.01 (0.00, 0.45)	2.78	5.60
	SSC	Total	3.54 ± 3.75	2.40 ± 4.26	3.20 ± 4.45	0.78 (0.51, 0.91)**	0.51 (0.09, 0.78)*	1.89	2.87
		Left	2.98 ± 4.07	2.37 ± 3.85	3.29 ± 5.46	0.60 (0.23, 0.82)**	0.06 (0.00, 0.49)	2.51	4.66
		Right	4.09 ± 5.41	2.43 ± 5.27	3.12 ± 5.34	0.73 (0.43, 0.89)**	0.65 (0.31, 0.85)**	2.76	3.18
HHb	PFC	Total	0.97 ± 3.24	0.56 ± 2.98	0.88 ± 3.31	0.47 (0.03, 0.75)*	0.61 (0.23, 0.83)**	2.28	2.05
		Left	0.64 ± 3.93	0.75 ± 4.36	1.05 ± 3.60	0.30 (0.00, 0.66)	0.37 (0.00, 0.70)	3.47	2.98
		Right	1.31 ± 3.46	0.37 ± 2.24	0.72 ± 3.28	0.63 (0.28, 0.83)**	0.72 (0.43, 0.88)**	1.78	1.78
	FEF	Total	0.57 ± 3.21	0.03 ± 1.55	0.25 ± 1.74	0.37 (0.00, 0.69)	0.39 (0.00, 0.71)*	2.00	2.02
		Left	−0.06 ± 1.48	0.22 ± 1.65	0.27 ± 2.19	0.29 (0.00, 0.65)	0.34 (0.00, 0.67)	1.32	1.52
		Right	1.20 ± 5.64	−0.15 ± 2.14	0.24 ± 1.91	0.44 (0.03, 0.73)*	0.28 (0.00, 0.64)	3.18	3.56
	SMA	Total	−0.31 ± 3.78	−0.74 ± 2.52	−0.10 ± 2.28	0.28 (0.00, 0.64)	0.19 (0.00, 0.58)	2.72	2.81
		Left	−0.53 ± 4.29	−0.62 ± 2.69	−0.12 ± 2.80	0.25 (0.00, 0.62)	0.25 (0.00, 0.62)	3.09	3.14
		Right	−0.08 ± 3.65	−0.86 ± 2.59	−0.09 ± 2.43	0.31 (0.00, 0.65)	0.14 (0.00, 0.55)	2.63	2.88
	PMC	Total	0.26 ± 2.26	−0.44 ± 2.52	0.14 ± 2.28	0.83 (0.57, 0.93)**	0.16 (0.00, 0.56)	0.99	2.08
		Left	0.14 ± 3.23	−0.26 ± 3.52	0.27 ± 2.63	0.86 (0.69, 0.94)**	0.00 (0.00, 0.45)	1.25	2.94
		Right	0.37 ± 1.86	−0.61 ± 2.18	0.01 ± 2.60	0.69 (0.25, 0.88)**	0.41 (0.00, 0.72)*	1.12	1.74
	SSC	Total	−1.00 ± 1.22	−1.28 ± 1.43	−1.08 ± 1.14	0.70 (0.40, 0.87)**	0.46 (0.02, 0.75)*	0.72	0.87
		Left	−0.98 ± 1.66	−1.22 ± 1.66	−1.13 ± 1.56	0.80 (0.58, 0.92)**	0.41 (0.00, 0.72)*	0.74	1.24
		Right	−1.01 ± 1.03	−1.34 ± 1.53	−1.03 ± 1.18	0.55 (0.17, 0.79)**	0.37 (0.00, 0.70)	0.88	0.88

Note: HbO2, oxygenated hemoglobin; HHb, deoxygenated hemoglobin; ROI, region of interest; SD, standard deviation; ICC, intraclass correlation coefficient; SEM, standard error of measurement; PFC, prefrontal cortex; FEF, frontal eye fields; SMA, supplementary motor area; PMC, premotor cortex; and SSC, somatosensory cortex.

*Significant at α < 0.05.

**Significant at α < 0.01.

Fig. 4

Bland–Altman plots for total (left plus right channels) ROIs during the postural task, visualizing the average ${HbO}_{2}$ levels ($x$-axis) and the difference in ${HbO}_{2}$ levels between test 1 and test 2 ($y$-axis). Dotted lines represent the 95% limits of agreement.
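The quantities behind a Bland–Altman plot can be sketched as below. The source does not state how the limits were computed; the conventional mean difference ± 1.96 SD of the differences is assumed, as are the direction of the difference (first test minus second test) and the function name.

```python
import statistics


def bland_altman(test_a, test_b):
    """Return per-subject means, per-subject differences, bias, and 95% limits of agreement."""
    differences = [a - b for a, b in zip(test_a, test_b)]
    means = [(a + b) / 2 for a, b in zip(test_a, test_b)]
    bias = statistics.mean(differences)
    sd = statistics.stdev(differences)  # sample SD of the differences
    limits = (bias - 1.96 * sd, bias + 1.96 * sd)
    return means, differences, bias, limits
```

Each subject contributes one point (mean on the $x$-axis, difference on the $y$-axis); points outside the limits indicate poor agreement between the two tests for that subject.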

Overall, HHb test-retest reliability was lower compared to ${HbO}_{2}$ (ICCs were 0.15 lower on average for total ROIs), with some exceptions (see Table 2 and Fig. 2). Averaged ICCs, representing the test-retest reliability of the averaged test outcomes,⁵⁰ showed higher values [see Table S2 in the Supplementary Material (${HbO}_{2}$) and Table S3 in the Supplementary Material (HHb)]. Additionally, individual channel-based analysis revealed that 13 out of 32 channels were activated during the postural weight-shifting task [see Figure S4(a) in the Supplementary Material], including those in the SSC, SMA, and PMC.

3.3.2. ROI reliability during the postural task with cap repositioning

Table 2 also illustrates that the repositioning of the cap between test 1 and 3 led to less stable test-retest values than between test 1 and 2. Reliability values were lower for the PFC ($ICC = 0.50$, $CI = [0.08, 0.77]$) and SSC ($ICC = 0.51$, $CI = [0.09, 0.78]$), but remained fair. Reliability became poor for the SMA ($ICC = 0.01$, $CI = [0.00, 0.45]$) and the PMC ($ICC = 0.00$, $CI = [0.00, 0.14]$). However, reliability was good for the FEF ($ICC = 0.64$, $CI = [0.28, 0.84]$). Table 2 also shows that reliability for the ROIs per hemisphere was similar for the motor areas (SMA: $ICC \leq 0.22$; PMC: $ICC \leq 0.01$), and lower for the FEF ($ICC \leq 0.48$). Interestingly, ICC-values were higher for the left PFC ($ICC = 0.66$, $CI = [0.32, 0.85]$) and right SSC ($ICC = 0.65$, $CI = [0.31, 0.85]$), but lower for the right PFC ($ICC = 0.00$, $CI = [0.00, 0.39]$) and left SSC ($ICC = 0.06$, $CI = [0.00, 0.49]$). Bland–Altman plots of test 1 and 3 largely confirm the above described results (see Fig. 5). Limits of agreement increased for total ROIs compared to the test 1-2 comparison, especially for the PMC and SMA, but not for the FEF. The same participant with extreme (de)activation as previously mentioned also showed the least consistency between tests 1 and 3 (PFC, PMC, SMA). SEM scores for the total ROIs were also larger between test 1 and 3 (range: 2.47 to 5.73 μmol/L) than between test 1 and 2 (range: 1.89 to 4.09 μmol/L), with the FEF as the only exception (change in SEM: −0.27 μmol/L). SEMs ranged between 3.18 and 7.30 μmol/L for the ROIs per hemisphere.

Fig. 5

Bland–Altman plots for total (left plus right channels) ROIs during the postural task, visualizing the average ${HbO}_{2}$ levels ($x$-axis) and the difference in ${HbO}_{2}$ levels between test 1 and test 3 ($y$-axis). Dotted lines represent the 95% limits of agreement.

The overall HHb test-retest reliability was comparable to ${HbO}_{2}$ (mean ICCs differed by 0.03 for total ROIs), though less stable between test 1 and 3 than between test 1 and 2 (see Table 2 and Fig. 2). Averaged ICCs showed higher values for both ${HbO}_{2}$ (see Table S2 in the Supplementary Material) and HHb (see Table S3 in the Supplementary Material).

3.3.3. Task-specific fNIRS reliability during finger tapping

For the task-specific motor channel representing the right hand (C3-C1), no differences in relative ${HbO}_{2}$ levels were found across time points during the tapping task [$F_{(1.53)} = 1.75$, $p = 0.20$; see Fig. 6(a)]. Exploratory post-hoc tests also did not show differences between test moments (all corrected $p$-values $> 0.46$), even though a trend towards improved tapping accuracy was seen from test 1 to test 2. Interestingly, however, participants who showed better accuracy (lower tapping error) were as stable in their fNIRS outcomes, as assessed with Pearson’s correlations (${HbO}_{2}$: $r = 0.30$, $p = 0.20$; HHb: $r = -0.33$, $p = 0.15$; see Figure S5 in the Supplementary Material). The ICC-values were good between tests 1-2 ($ICC = 0.66$, $CI = [0.31, 0.84]$), but became poor between tests 1-3 ($ICC = 0.38$, $CI = [0.00, 0.70]$) (see Table 3). Bland–Altman plots largely confirm these results [see Figs. 6(b) and 6(c)]. Points fell between the limits of agreement, which covered a wider range and were more spread out between test 1 and 3 than between test 1 and 2. In agreement, the SEM score was 2.00 μmol/L between test 1 and 2, and 2.93 μmol/L between test 1 and 3.

Fig. 6

(a) Relative ${HbO}_{2}$ levels (active trial − rest trial) at test 1, 2, and 3 for the hand motor channel C3-C1 during the finger tapping task; (b) Bland–Altman plot for channel C3-C1, visualizing the average ${HbO}_{2}$ levels and the difference between ${HbO}_{2}$ levels at test 1 and 2, and (c) test 1 and 3.

Table 3

Test-retest reliability based on single ICCs for relative HbO2 and HHb levels during the finger tapping task.

	Mean ± SD (μmol/L)			ICC (95% CI)		SEM (μmol/L)
	Test 1	Test 2	Test 3	Test 1-2	Test 1-3	Test 1-2	Test 1-3
HbO2	5.40 ± 3.38	4.97 ± 3.46	6.60 ± 4.04	0.66 (0.32-0.85)**	0.38 (0.00-0.70)*	2.00	2.93
HHb	−1.73 ± 1.46	−1.58 ± 1.28	−2.11 ± 2.05	0.80 (0.56-0.92)**	0.65 (0.31-0.84)**	0.62	1.06

Note: Values are determined for the task-specific motor channel C3-C1. HbO2, oxygenated hemoglobin; HHb, deoxygenated hemoglobin; SD, standard deviation; ICC, intraclass correlation coefficient; and SEM, standard error of measurement.

*Significant at α < 0.05.

**Significant at α < 0.01.
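The SEM values in Table 3 can be approximately reproduced from the tabulated SDs and ICCs. The sketch below assumes the common definition SEM = SD × √(1 − ICC), with the SD pooled over the two tests being compared; the equation the authors actually used appears in their statistical methods and is not shown in this section, so the pooling step and the function name `sem` are assumptions.

```python
from math import sqrt


def sem(sd_a, sd_b, icc):
    """Standard error of measurement from two test SDs and their ICC."""
    # Assumption: the SD is pooled over the two tests being compared
    pooled_sd = sqrt((sd_a ** 2 + sd_b ** 2) / 2)
    return pooled_sd * sqrt(1 - icc)


# SDs (umol/L) and single ICCs copied from Table 3
print(round(sem(3.38, 3.46, 0.66), 2))  # HbO2, test 1-2
print(round(sem(3.38, 4.04, 0.38), 2))  # HbO2, test 1-3
print(round(sem(1.46, 1.28, 0.80), 2))  # HHb, test 1-2
print(round(sem(1.46, 2.05, 0.65), 2))  # HHb, test 1-3
```

Under these assumptions the results agree with the tabulated SEMs to within about 0.01 μmol/L, a gap that is consistent with rounding of the tabulated SDs and ICCs.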

HHb test-retest reliability, as well as the ${HbO}_{2}$ and HHb averaged ICCs (see Table S4 in the Supplementary Material), were better than the single ICCs for ${HbO}_{2}$ , ranging from fair to excellent. Additionally, individual channel-based analysis revealed that five channels of the left hemisphere were active during the right finger-tapping task [see Fig. S4(b) in the Supplementary Material], including the C3-C1 channel, which confirmed the anatomical location of the right-hand motor region.

4. Discussion

4.1. Main Study Findings

This is the first study to investigate fNIRS test-retest reliability of a postural weight-shifting task and a finger-tapping control task in healthy older adults. When two measurements were separated by a 30-min rest period and the cap remained in place, results showed fair (FEF, SMA) to excellent (PFC, PMC, and SSC) reliability during a postural task in five predefined ROIs and good reliability during tapping in one task-specific motor channel. Contrary to our hypothesis, these results are similar to published results in young healthy adults. However, as expected, when testing on separate days after removal of the cap, reliability was reduced: it became fair (PFC and SSC) to good (FEF) for the non-motor areas, and poor for the motor areas (SMA, PMC, and C3-C1 hand motor channel). These results are largely in line with previous studies, as detailed in Table 4, with the exception that we found poor reliability in the motor areas.

Table 4

Comparison between the current work and prior studies on fNIRS test-retest reliability in various motor tasks.

Study	Participants	Task	Brain region	Test interval^a	Cap replacement^b	Reliability (ICC)^c
Current study	20 older adults, 67.3 ± 75.0 years	VR weight-shifting; right finger-tapping	PFC, FEF, SMA, PMC, SSC; right hand motor area	30 min, 24 hr	Markings on head (Cz, CP2, FC1)	${HbO}_{2} = 0.48$ to 0.87 (30 min); ${HbO}_{2} = 0.00$ to 0.64 (24 hr); HHb = 0.28 to 0.83 (30 min); HHb = 0.16 to 0.65 (24 hr)
1. Wyser et al.²⁰	15 young adults, 27.0 ± 4.6 years	Self-paced, isometric, hand grasping practiced with metronome (∼1 Hz); right and left hand	Primary motor cortex, bilateral (task-specific)	5.5 ± 3.1 days	TMS localization	${HbO}_{2} = 0.62$ to 0.79; HHb = 0.73 to 0.81
2. Bae et al.²¹	5 young adults, mean: 21.8 years, range: 21-23 years	Robotic passive hand movement; right hand	Primary sensorimotor cortex, left	10 sessions; 1, 3, 7, 23 days, 15 min, 6 hr	3D digitizer	${HbO}_{2} = 0.002$ (overall)
3. Broscheid et al.²⁶	20 healthy adults, 42.2 ± 9.8 years; 20 MS-patients, 41.0 ± 12.0 years	Self-paced walking back and forth 12 m for 6 min	PFC, bilateral	24 hr	Not mentioned	${HbO}_{2} = 0.39$ to 0.74 (healthy); HHb = 0.39 to 0.63 (healthy); ${HbO}_{2} = 0.00$ to 0.39 (MS); HHb = 0.20 to 0.56 (MS)
4. Stuart et al.²⁷	25 young adults, 32.3 ± 7.5 years	2 min walking; 2 min turning	PFC, bilateral	5 to 10 min	3D digitizer	${HbO}_{2} = 0.67$ (walking); ${HbO}_{2} = 0.71$ (turning)
5. Dravida et al.²²	14 young adults, 26.9 ± 9.5 years	Cued digit manipulation tasks; right hand, four different tasks	Motor cortex, bilateral (task-specific)	1 day	3D digitizer	${HbO}_{2} = 0.51$ to 0.83; HHb = 0.38 to 0.67
6. Plichta et al.²⁴	12 young adults, 29.1 ± 6.0 years	Visually cued index finger-tapping (∼2.5 Hz); left and right	Motor cortex, bilateral	3 weeks	Not mentioned	${HbO}_{2} = 0.70$ to 0.74; HHb = 0.67 to 0.77
7. Bhambhani et al.²³	13 healthy adults, 31.5 ± 4.5 years; 25 TBI-patients, 31.6 ± 9.8 years	Maximal rhythmic handgrip exercises; right hand	PFC, left	24 to 48 hr	Not mentioned	${HbO}_{2} = 0.83$ (healthy); ${HbO}_{2} = 0.70$ (TBI)

Note: ICC, intraclass correlation coefficient; HbO2, oxygenated hemoglobin; HHb, deoxygenated hemoglobin; and TMS, transcranial magnetic stimulation.

^aCap removed between tests in all studies;

^bmethod used for ensuring similar cap (re)placement at the different tests;

^csingle-measure; ICC, intraclass correlation coefficient

Behaviorally, performance on the postural and tapping tasks did not show significant improvements over time, suggesting that no test-induced learning effects were present. However, a trend towards more accurate tapping was shown from test 1 to test 2, despite the auditory pacing provided. Interestingly, this improvement in behavioral accuracy was not associated with a change in fNIRS outcomes. Previous investigations on fNIRS test-retest reliability did not report on the stability of the behavioral correlates.²⁰–²⁴,²⁶,²⁷ Four of these studies (studies 1, 2, 5, and 6 in Table 4) standardized performance to some extent to prevent a learning effect, for instance by using cues,²² robotic passive hand movements,²¹ practicing the task with a metronome,²⁰ or using a very small time-window (1200 ms).²⁴ Furthermore, one study (study 3 in Table 4) checked for possible differences in exhaustion via a Borg scale, showing no differences between test moments.²⁶

4.2. fNIRS Reliability

4.2.1. Reliability during postural control without cap removal

Test-retest analysis of five predefined and commonly applied ROIs during the postural task revealed an overall fair to excellent reliability in ${HbO}_{2}$ levels when repeating two tests on the same day. This suggests that, when not removing the fNIRS cap and optodes, the fNIRS signal is fairly reliable over time for the FEF and SMA ( $ICC \geq 0.48$ ) and highly reliable for the PFC, PMC, and SSC ( $ICC \geq 0.78$ ). Similar results were found for the left and right hemispheres separately. To date, only one other study specifically investigated fNIRS test-retest reliability when testing twice on the same day (study 4 in Table 4),²⁷ although with cap removal. These authors studied only the PFC in young adults and reported good reliability during both straight walking and turning ( $ICC \geq 0.67$ ). The higher reliability found in the current study for the postural task may be explained by not removing the cap, the lack of activity in this area, the shorter duration of signal extraction (20 s versus 2 min) and the higher number of trials (7 versus 1) in which the task was performed.⁵ Pilot testing prior to data collection also showed that adding two trials increased the true positive rate by 13 percentage points (67% at five trials and 80% at seven trials). Additionally, other task-related aspects, such as heel strikes during gait, may induce movement artifacts, which were absent during the present postural task. As Stuart et al.²⁷ did not report on HHb, the overall lower HHb reliability relative to ${HbO}_{2}$ found in this study cannot be directly compared with their results.

4.2.2. Reliability during postural control with cap removal

An important finding of the present study is that when measuring fNIRS ${HbO}_{2}$ levels on different days after cap removal and repositioning, reliability deteriorated. Interestingly, non-motor cortical areas seemed less affected by cap removal, indicated by fair to good reliability in the PFC, FEF, and SSC ( $ICC \geq 0.50$ ), which is comparable to the reliability found during walking²⁶ and handgrip exercises²³ (studies 3 and 7 in Table 4). On the other hand, poor reliability was revealed in the motor areas ( $ICC \leq 0.01$ ), in agreement with a reliability study during passive hand movements²¹ (study 2 in Table 4), possibly driven by the lack of active movement. The finding that the non-motor areas proved to be more stable across consecutive days could be due to the fact that this virtual reality-based postural task required strong non-motor involvement. The relatively poor reliability found for the motor areas during the postural task ( $ICC \leq 0.01$ ) is in contrast with previous research on hand grasping and finger tapping, which showed fair to excellent reliability after cap removal ( $ICC \geq 0.50$ ) (studies 1, 5, and 6 in Table 4).²⁰,²²,²⁴ Besides cap repositioning, this finding may alternatively be explained by the fact that the postural task required whole body movements, resulting in wider and more variable cortical recruitment. The single channel-based analysis underscored this statement, as no clear activation pattern was found for the PMC and SMA. In addition, fNIRS signals pertaining to the lower limbs are more prone to systemic artifacts²⁹ and likely more difficult to capture as the motor representation of the lower extremities is located deeply within the interhemispheric fissure,³⁷ which could result in more inconsistent fNIRS measurements. Note that, similar to the present findings, HHb reliability after cap removal was overall comparable to ${HbO}_{2}$ findings in healthy adults (see Table 4).

4.2.3. Task-specific fNIRS reliability during finger tapping

For finger tapping, reliability of the C3-C1 hand motor channel was good when assessed on the same day ( $ICC = 0.66$ ). This supports the notion that fNIRS is a sensitive neuroimaging tool for capturing ${HbO}_{2}$ changes during specific motor tasks requiring localized cortical activation.²⁹ It should be noted that there was a trend towards better behavioral tapping accuracy from test 1 to test 2 ( $p = 0.07$ ). However, when exploring this result, we found neither differences in ${HbO}_{2}$ levels nor evidence that participants with improved accuracy had better reliability. Similar to the motor cortical ROIs of the postural task, reliability during tapping became poor when assessed on two consecutive days ( $ICC = 0.38$ ), though the decline was less steep ( $-0.47$ (SMA) and $-0.78$ (PMC) for the postural task versus $-0.28$ (C3-C1) for the tapping task). This highlights that using fNIRS on two consecutive days results in suboptimal reproducibility, despite efforts to standardize cap replacement. In contrast, previous studies investigating task-specific motor areas showed fair to excellent ICC-values ( $\geq 0.51$ ) during digit manipulation²² and hand grasping²⁰ when assessed on different days (studies 1 and 5 in Table 4). These results, however, were found in young adults, who might show higher signal-to-noise ratios and therefore higher reliability.²⁸ It could also be that the localization methods used in these studies, TMS localization²⁰ and registration of 3D-coordinates,²² provided a more accurate foundation for cap replacement after removal.
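The declines quoted above follow directly from the reported single ICCs, as the short sketch below shows. The qualitative labels assume cut-offs of 0.40, 0.60, and 0.75, which are consistent with how ICCs are labeled in this article but are an assumption here because the thresholds are defined outside this section. The same-day PMC value of 0.78 is inferred from the reported decline of −0.78 to an ICC of 0.00.

```python
def label(icc):
    """Qualitative label, assuming cut-offs of 0.40, 0.60 and 0.75."""
    if icc < 0.40:
        return "poor"
    if icc < 0.60:
        return "fair"
    if icc < 0.75:
        return "good"
    return "excellent"


# (same-day ICC, next-day ICC) for HbO2; the PMC same-day value is inferred
iccs = {"SMA": (0.48, 0.01), "PMC": (0.78, 0.00), "C3-C1": (0.66, 0.38)}

for region, (same_day, next_day) in iccs.items():
    change = next_day - same_day
    print(region, label(same_day), "->", label(next_day), f"{change:+.2f}")
```

All three channels end in the poor category after cap removal, but the tapping channel starts lower and falls less far than the two postural motor ROIs.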

4.2.4. Suggestions for improving fNIRS reliability

First, other methods of cap standardization could be explored, such as using a 3D-digitizer, a neuronavigation system⁵¹ or structural MRI scans for co-registration.⁵² Interestingly, we found that averaged ICCs were generally higher than single ICCs.²⁵,⁴⁶ Especially for study protocols that involve cap removal, multiple fNIRS blocks within one testing session and the use of averaged ${HbO}_{2}$ and HHb values for between-test comparisons may be required. Second, future studies should investigate whether shortening trial duration while increasing the number of trials within a block design could enhance the stability of fNIRS outcomes, as pilot testing revealed increased true positive rates using this procedure. Third, behavioral tasks should be standardized as much as possible and any learning effects counteracted, as was done in the present study. Measuring at the same time of day and including large sample sizes are also recommended, especially when using a complex (motor) task. Finally, person-specific variables, such as vigilance, motivation, and effort, could additionally be assessed, as they can possibly affect reliability.
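Why averaged ICCs tend to exceed single ICCs can be illustrated with the Spearman–Brown relation between single-measure and average-measure reliability. This is a general sketch and not the authors' computation: the averaged ICCs in the Supplementary Material may have been derived differently, and the function name `average_measure_icc` is hypothetical.

```python
def average_measure_icc(single_icc, k):
    """Spearman-Brown step-up: reliability of the mean of k measurements."""
    return k * single_icc / (1 + (k - 1) * single_icc)


# Single ICC of the C3-C1 channel between test 1 and 3 (HbO2, Table 3)
for k in (1, 2, 3):
    print(k, round(average_measure_icc(0.38, k), 2))
```

With a single ICC of 0.38, averaging two measurements gives about 0.55 and averaging three gives about 0.65, which is the sense in which multiple fNIRS blocks per session could raise between-test reliability.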

4.3. Strengths and Limitations

Despite a relatively small sample size and the inherently variable nature of fNIRS signals, we found an overall acceptable test-retest reliability of brain oxygenated hemoglobin levels as measured in five commonly used ROIs, as well as in a task-specific motor channel. Although this is the first study investigating test-retest reliability in older adults, future research should investigate whether the findings generalize to patient populations as well. Unlike previous studies, we examined the behavioral test-retest results to take any possible learning effects into account. Moreover, short-separation channels were used to correct for physiological noise in the fNIRS signal.³⁵ A limitation of this study is that we used the same differential path length factor for all participants. It has been shown that age influences this factor during the conversion of optical density into ${HbO}_{2}$ , though its value is not known for adults older than 50 years of age.⁵ Finally, fNIRS measurements are associated with high between-subject variance, as are the outcomes of other neuroimaging techniques, on which the ICC-value is dependent.²⁵ Moreover, the range of CIs accompanying ICCs varied. Therefore, future studies in larger samples are needed to verify fNIRS test-retest reliability in healthy older adults and other study populations.
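For context on the path length limitation, the conversion of optical density into hemoglobin concentrations is commonly written as the modified Beer–Lambert law. The form below is the standard textbook expression and is given as an assumption about the conversion used, since the processing equations are not part of this section; the symbols are introduced here for illustration only.

$$\Delta OD(\lambda) = \left( \varepsilon_{HbO_2}(\lambda)\,\Delta[HbO_2] + \varepsilon_{HHb}(\lambda)\,\Delta[HHb] \right) \cdot d \cdot DPF(\lambda)$$

Here $\Delta OD(\lambda)$ is the change in optical density at wavelength $\lambda$, $\varepsilon$ denotes the extinction coefficients, $d$ is the source-detector distance, and $DPF(\lambda)$ is the differential path length factor. Because the estimated concentration changes scale inversely with the DPF, a value that is too high for a given participant would shrink the estimated ${HbO}_{2}$ change, and a value that is too low would inflate it.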

5. Conclusions

The present results indicate that repeated fNIRS measurements have fair to excellent reliability in healthy older adults during motor tasks, though reliability became poorer when measuring on multiple days after repositioning the cap.

Disclosures

The authors declare that there are no conflicts of interest.

Acknowledgments

The authors would like to thank Marie Bogaerts, Annelien Loverix, Hanne Vandenbossche, and Ella Copermans for their help in data collection. This study could not have been done without the contribution of our participants, for which the authors are truly grateful. The work was supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement (Grant No. 721577) and the Research Foundation Flanders (FWO) (Grant No. G0A5619N). The funding sources had no involvement in the study design; in the collection, analysis, and interpretation of data; the writing of the report; and in the decision to submit the article for publication.

Code, Data, and Materials Availability

Study data and code are not publicly available due to privacy and ethical reasons. Data and code can be made available upon reasonable request to the corresponding author. Supplementary materials are provided online.

References

1.

X. Cui, S. Bray and A. L. Reiss, “Functional near infrared spectroscopy (NIRS) signal improvement based on negative correlation between oxygenated and deoxygenated hemoglobin dynamics,” Neuroimage, 49 (4), 3039–3046 (2010). https://doi.org/10.1016/j.neuroimage.2009.11.050

2.

S. C. Bunce et al., “Functional near-infrared spectroscopy,” IEEE Eng. Med. Biol. Mag., 25 (4), 54–62 (2006). https://doi.org/10.1109/MEMB.2006.1657788

3.

F. Jöbsis, “Noninvasive, infrared monitoring of cerebral and myocardial oxygen sufficiency and circulatory parameters,” Science, 198 (4323), 1264–1267 (1977). https://doi.org/10.1126/science.929199

4.

J. A. Noah et al., “fMRI validation of fNIRS measurements during a naturalistic task,” J. Vis. Exp., 100 (e52116), 5–9 (2015). https://doi.org/10.3791/52116

5.

J. C. Menant et al., “A consensus guide to using functional near-infrared spectroscopy in posture and gait research,” Gait Posture, 82, 254–265 (2020). https://doi.org/10.1016/j.gaitpost.2020.09.012

6.

S. Lloyd-Fox, A. Blasi and C. E. Elwell, “Illuminating the developing brain: the past, present and future of functional near infrared spectroscopy,” Neurosci. Biobehav. Rev., 34, 269–284 (2010). https://doi.org/10.1016/j.neubiorev.2009.07.008

7.

V. Quaresima and M. Ferrari, “Functional near-infrared spectroscopy (fNIRS) for assessing cerebral cortex function during human behavior in natural/social situations: a concise review,” Organ. Res. Methods, 22 (1), 46–68 (2019). https://doi.org/10.1177/1094428116658959

8.

F. Herold et al., “Functional near-infrared spectroscopy in movement science: a systematic review on cortical activity in postural and walking tasks,” Neurophotonics, 4 (4), 041403 (2017). https://doi.org/10.1117/1.NPh.4.4.041403

9.

Y. Ono et al., “Motor learning and modulation of prefrontal cortex: an fNIRS assessment,” J. Neural Eng., 12 (6), 066004 (2015). https://doi.org/10.1088/1741-2560/12/6/066004

10.

M. A. Immink et al., “Prefrontal cortex activation during motor sequence learning under interleaved and repetitive practice: a two-channel near-infrared spectroscopy study,” Front. Hum. Neurosci., 15 (May), 1–15 (2021). https://doi.org/10.3389/fnhum.2021.644968

11.

R. Alves Heinze et al., “Hand motor learning in a musical context and prefrontal cortex hemodynamic response: a functional near-infrared spectroscopy (fNIRS) study,” Cogn. Process., 20, 507–513 (2019). https://doi.org/10.1007/s10339-019-00925-y

12.

O. Seidel et al., “Motor learning in a complex balance task and associated neuroplasticity: a comparison between endurance athletes and nonathletes,” J. Neurophysiol., 118 (3), 1849–1860 (2017). https://doi.org/10.1152/jn.00419.2017

13.

M. Hiyamizu et al., “Effects of self-action observation on standing balance learning: a change of brain activity detected using functional near-infrared spectroscopy,” NeuroRehabilitation, 35 (3), 579–585 (2014). https://doi.org/10.3233/NRE-141153

14.

X. Cui et al., “A quantitative comparison of NIRS and fMRI across multiple cognitive tasks,” Neuroimage, 54, 2808–2821 (2010). https://doi.org/10.1016/j.neuroimage.2010.10.069

15.

I. Tachtsidis and F. Scholkmann, “False positives and false negatives in functional near-infrared spectroscopy: issues, challenges, and the way forward,” Neurophotonics, 3 (3), 031405 (2016). https://doi.org/10.1117/1.NPh.3.3.031405

16.

F. Scholkmann et al., “A review on continuous wave functional near-infrared spectroscopy and imaging instrumentation and methodology,” Neuroimage, 85, 6–27 (2014). https://doi.org/10.1016/j.neuroimage.2013.05.004

17.

H. Niu et al., “Test-retest reliability of graph metrics in functional brain networks: a resting-state fNIRS study,” www.plosone.org (2013).

18.

H. Zhang et al., “Is resting-state functional connectivity revealed by functional near-infrared spectroscopy test-retest reliable?,” J. Biomed. Opt., 16 (6), 067008 (2011). https://doi.org/10.1117/1.3591020

19.

H. Zhang et al., “Test-retest assessment of independent component analysis-derived resting-state functional connectivity based on functional near-infrared spectroscopy,” Neuroimage, 55 (2), 607–615 (2011). https://doi.org/10.1016/j.neuroimage.2010.12.007

20.

D. G. Wyser et al., “Characterizing reproducibility of cerebral hemodynamic responses when applying short-channel regression in functional near-infrared spectroscopy,” Neurophotonics, 9 (1), 015004 (2022). https://doi.org/10.1117/1.NPh.9.1.015004

21.

S. Bae, Y. Lee and P. H. Chang, “There is No test–retest reliability of brain activation induced by robotic passive hand movement: a functional NIRS study,” Brain Behav., 10 (10), 1–13 (2020). https://doi.org/10.1002/brb3.1788

22.

S. Dravida et al., “Comparison of oxyhemoglobin and deoxyhemoglobin signal reliability with and without global mean removal for digit manipulation motor tasks,” Neurophotonics, 5 (1), 011006 (2017). https://doi.org/10.1117/1.NPh.5.1.011006

23.

Y. Bhambhani et al., “Reliability of near-infrared spectroscopy measures of cerebral oxygenation and blood volume during handgrip exercise in nondisabled and traumatic brain-injured subjects,” J. Rehabil. Res. Dev., 43 (7), 845–856 (2006). https://doi.org/10.1682/JRRD.2005.09.0151

24.

M. M. Plichta et al., “Event-related functional near-infrared spectroscopy (fNIRS) based on craniocerebral correlations: reproducibility of activation?,” Hum. Brain Mapp., 28 (8), 733–741 (2007). https://doi.org/10.1002/hbm.20303

25.

L. Li and Z. Lin, “Tutorial on use of intraclass correlation coefficients for assessing intertest reliability and its application in functional near-infrared spectroscopy–based brain imaging,” J. Biomed. Opt., 20 (5), 050801 (2015). https://doi.org/10.1117/1.JBO.20.5.050801

26.

K. C. Broscheid et al., “Inter-session reliability of functional near-infrared spectroscopy at the prefrontal cortex while walking in multiple sclerosis,” Brain Sci., 10 (9), 1–15 (2020). https://doi.org/10.3390/brainsci10090643

27.

S. Stuart et al., “Pre-frontal cortical activity during walking and turning is reliable and differentiates across young, older adults and people with Parkinson’s disease,” Front. Neurol., 10 (MAY), 1–11 (2019). https://doi.org/10.3389/fneur.2019.00536

28.

M. Dadar, V. S. Fonov and D. L. Collins, “A comparison of publicly available linear MRI stereotaxic registration techniques,” Neuroimage, 174, 191–200 (2018). https://doi.org/10.1016/j.neuroimage.2018.03.025

29.

H. Cockx et al., “Functional near-infrared spectroscopy is sensitive to leg activity in the primary motor cortex, but requires rigorous correction for systemic fluctuations induced by movements,” in OSF Prepr., 1–35 (2022). https://doi.org/10.31219/osf.io/xgnfh

30.

J. Willaert et al., “Does a novel exergame challenge balance and activate muscles more than existing off-the-shelf exergames?,” J. Neuroeng. Rehabil., 3 (1), 1–13 (2020). https://doi.org/10.1186/s12984-019-0628-3

31.

V. de Rond et al., “Compromised brain activity with age during a game-like dynamic balance task: single- vs. dual-task performance,” Front. Aging Neurosci., 13 (July), 1–13 (2021). https://doi.org/10.3389/fnagi.2021.657308

32.

D. A. Winter, “Biomechanics and Motor Control of Human Movement,” John Wiley & Sons, Inc. (2009).

33.

C. Stamate et al., “The cloudUPDRS app: a medical device for the clinical assessment of Parkinson’s disease,” Pervasive Mob. Comput., 43, 146–166 (2018). https://doi.org/10.1016/j.pmcj.2017.12.005

34.

G. Augusto et al., “fNIRS optodes’ location decider (fOLD): a toolbox for probe arrangement guided by brain regions-of-interest,” Sci. Rep., 8, 3341 (2018). https://doi.org/10.1038/s41598-018-21716-z

35.

H. Santosa et al., “Quantitative comparison of correction techniques for removing systemic physiological signal in functional near-infrared spectroscopy studies,” Neurophotonics, 7 (3), 035009 (2020). https://doi.org/10.1117/1.NPh.7.3.035009

36.

V. Jurcak, D. Tsuzuki and I. Dan, “10/20, 10/10, and 10/5 systems revisited: their validity as relative head-surface-based positioning systems,” Neuroimage, 34, 1600–1611 (2007). https://doi.org/10.1016/j.neuroimage.2006.09.024

37.

W. Penfield and E. Boldrey, “Somatic motor and sensory representation in the cerebral cortex of man as studied by electrical stimulation,” Brain, 60 (4), 389–443 (1937). https://doi.org/10.1093/brain/60.4.389

38.

H. Santosa et al., “The NIRS brain AnalyzIR toolbox,” Algorithms, 11 (5), 73 (2018). https://doi.org/10.3390/a11050073

39.

G. Strangman, M. A. Franceschini and D. A. Boas, “Factors affecting the accuracy of near-infrared spectroscopy concentration calculations for focal changes in oxygenation parameters,” Neuroimage, 18, 865–879 (2003). https://doi.org/10.1016/S1053-8119(03)00021-1

40.

R. D. Hoge et al., “Simultaneous recording of task-induced changes in blood oxygenation, volume, and flow using diffuse optical imaging and arterial spin-labeling MRI,” Neuroimage, 25 (3), 701–707 (2005). https://doi.org/10.1016/j.neuroimage.2004.12.032

41.

J. W. Barker, A. Aarabi and T. J. Huppert, “Autoregressive model based algorithm for correcting motion and serially correlated errors in fNIRS,” Biomed. Opt. Express, 4 (8), 1366 (2013). https://doi.org/10.1364/BOE.4.001366

42.

J. Virtanen et al., “Accelerometer-based method for correcting signal baseline changes caused by motion artifacts in medical near-infrared spectroscopy,” J. Biomed. Opt., 16 (8), 087005 (2011). https://doi.org/10.1117/1.3606576

43.

P. H. S. Pelicioni et al., “Cortical activation during gait adaptability in people with Parkinson’s disease,” Gait Posture, 91, 247–253 (2022). https://doi.org/10.1016/j.gaitpost.2021.10.038

44.

M. A. Mayka et al., “Three-dimensional locations and boundaries of motor and premotor cortices as defined by functional brain imaging: a meta-analysis,” Neuroimage, 31 (4), 1453–1474 (2006). https://doi.org/10.1016/j.neuroimage.2006.02.004

45.

V. Letchuman and C. Donohoe, Neuroanatomy, Superior Sagittal Sinus, StatPearls, Treasure Island (2019).

46.

T. K. Koo and M. Y. Li, “A guideline of selecting and reporting intraclass correlation coefficients for reliability research,” J. Chiropr. Med., 15 (2), 155 (2016). https://doi.org/10.1016/j.jcm.2016.02.012

47.

J. P. Weir, “Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM,” J. Strength Cond. Res., 19 (1), 231–240 (2005). https://doi.org/10.1519/15184.1

48.

C. Strouwen et al., “Test-retest reliability of dual-task outcome measures in people with parkinson disease,” Phys. Ther., 96 (8), 1276–1286 (2016). https://doi.org/10.2522/ptj.20150244

49.

J. Martin Bland and D. G. Altman, “Statistical methods for assessing agreement between two methods of clinical measurement,” Lancet, 327 (8476), 307–310 (1986). https://doi.org/10.1016/S0140-6736(86)90837-8

50.

T. Johnstone et al., “Stability of amygdala BOLD response to fearful faces over multiple scan sessions,” Neuroimage, 25 (4), 1112–1123 (2005). https://doi.org/10.1016/j.neuroimage.2004.12.016

51.

S. L. Novi et al., “Integration of spatial information increases reproducibility in functional near-infrared spectroscopy,” Front. Neurosci., 14, 746 (2020). https://doi.org/10.3389/fnins.2020.00746

52.

M. A. Yücel et al., “Best practices for fNIRS publications,” Neurophotonics, 8 (1), 1–34 (2021). https://doi.org/10.1117/1.NPh.8.1.012101

Biography

Veerle de Rond is a PhD candidate at the Neurorehabilitation Research Group (eNRGy) at KU Leuven. She obtained her BSc and MSc degrees in human movement sciences at the University of Groningen. Her research interests include the behavioral, neural (fNIRS), and neuromuscular (EMG) correlates of postural control and motor learning in ageing.

Moran Gilat is an assistant professor at the Parkinson Rehabilitation Research Lab (PRO-labo) within the Neurorehabilitation Research Group (eNRGy) at KU Leuven. His research focuses on the assessment, prediction, and treatment of freezing of gait in patients with Parkinson’s disease. Additionally, he is interested in developing interventions that improve motor learning and the accompanying neuroplasticity (fMRI, EEG, and fNIRS) in Parkinson’s disease.

Nicholas D’Cruz is a physical therapist and postdoctoral fellow at the Parkinson Rehabilitation Research Lab within the Neurorehabilitation Research Group and at the Motor Control and Neuroplasticity Research Group at KU Leuven. His research focuses on the behavioral and neural correlates (fMRI and fNIRS) of freezing of gait in patients with Parkinson’s disease and the effects of split-belt training.

Femke Hulzinga is a PhD candidate at the Parkinson Rehabilitation Research Lab within the Neurorehabilitation Research Group at KU Leuven. She obtained her BSc degree in human movement sciences at VU University Amsterdam and her MSc degree in biomedical sciences at Radboud University Nijmegen. Her research mainly focuses on split-belt training as a potential tool for improving gait in Parkinson’s disease. She is also interested in the underlying neural (fNIRS) and neuromuscular mechanisms.

Jean-Jacques Orban de Xivry is a full professor at the Motor Control and Neuroplasticity Research Group at KU Leuven. He is an engineer in applied mathematics by background and earned his PhD in computational neuroscience. His research interests include understanding the neural control (EEG and fNIRS) of movement and motor learning, primarily in the context of ageing and age-related diseases.

Alice Nieuwboer is a full professor at the Parkinson Rehabilitation Research Lab at KU Leuven. She is a physiotherapist by background and made several original contributions to gait and motor control research in Parkinson’s disease (PD), more specifically on freezing of gait. She oversees several clinical trials, investigating whether cueing and split-belt training are effective and whether consolidated motor learning can be achieved in PD and how this imprints on the brain.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.


Veerle de Rond, Moran Gilat, Nicholas D’Cruz, Femke Hulzinga, Jean-Jacques Orban de Xivry, and Alice Nieuwboer "Test-retest reliability of functional near-infrared spectroscopy during a finger-tapping and postural task in healthy older adults," Neurophotonics 10(2), 025010 (26 May 2023). https://doi.org/10.1117/1.NPh.10.2.025010

Received: 10 November 2022; Accepted: 14 April 2023; Published: 26 May 2023


