Skip to main content

METHODS article

Front. Neuroanat., 17 December 2019

Automated Individualization of Size-Varying and Touching Neurons in Macaque Cerebral Microscopic Images

  • 1CEA-CNRS-UMR 9199, Laboratoire des Maladies Neurodégénératives, MIRCen, Université Paris-Saclay, Fontenay-aux-Roses, France
  • 2School of Computer Science and Engineering, Xi’an University of Technology, Xi’an, China

In biomedical research, cell analysis is important to assess physiological and pathophysiological information. Virtual microscopy offers the unique possibility to study the compositions of tissues at a cellular scale. However, images acquired at such high spatial resolution are massive, contain complex information, and are therefore difficult to analyze automatically. In this article, we address the problem of individualization of size-varying and touching neurons in optical microscopy two-dimensional (2-D) images. Our approach is based on a series of processing steps that incorporate increasingly more information. (1) After a step of segmentation of neuron class using a Random Forest classifier, a novel min-max filter is used to enhance neurons’ centroids and boundaries, enabling the use of region growing process based on a contour-based model to drive it to neuron boundary and achieve individualization of touching neurons. (2) Taking into account size-varying neurons, an adaptive multiscale procedure aiming at individualizing touching neurons is proposed. This protocol was evaluated in 17 major anatomical regions from three NeuN-stained macaque brain sections presenting diverse and comprehensive neuron densities. Qualitative and quantitative analyses demonstrate that the proposed method provides satisfactory results in most regions (e.g., caudate, cortex, subiculum, and putamen) and outperforms a baseline Watershed algorithm. Neuron counts obtained with our method show high correlation with an adapted stereology technique performed by two experts (respectively, 0.983 and 0.975 for the two experts). Neuron diameters obtained with our method ranged between 2 and 28.6 μm, matching values reported in the literature. Further works will aim to evaluate the impact of staining and interindividual variability on our protocol.

Introduction

The brain is the main part of the central nervous system and controls most body functions. It is constituted by a network of billions of neurons that range from 5 to 30 μm in diameter (Andersen et al., 2016). Information about the number, morphology (size and shape, etc.), and distribution (density and orientation, etc.) of neurons is essential to study brain development in health and disease. Such studies include development, aging (Williams and Herrup, 1988; Pakkenberg and Gundersen, 1997; Larsen et al., 2006; Pelvig et al., 2008; Karlsen and Pakkenberg, 2011; Walløe et al., 2014), cyto-architecture (Andrey et al., 2010; Andersen et al., 2016), and neurodegeneration (Thu et al., 2010; Waldvogel et al., 2015). However, these studies are challenging because of the varying size and color intensity of the neurons, their high density in certain regions, and the large information content of cellular-scale images. In this article, we define touching neurons as neurons that are in contact and/or neurons that seem to overlap because of the implicit projection of a three-dimensional (3-D) sample into a two-dimensional (2-D) image. In practice, stereology (Gundersen, 1986; West et al., 1991; Jelsing et al., 2006) is the reference method used by neurobiologists to estimate the number of cells in a region of interest (ROI). This technique is robust and unbiased when properly used but relies on long and tedious manual interventions. Moreover, the accuracy of the measurement depends mainly on three factors: (1) the complexity of the anatomical regions (cell density and organization), (2) the parameters of the method (such as the number or the size of sections to be considered and sampling of the optic dissectors that are necessary to be adjusted), and (3) the experience of the operator in stereology.

Today, more advanced techniques and more exhaustive studies on cell individualization are under development using new automated image processing methods. Conversely, to stereology, which takes into account the whole thickness of a sample and can deal with cells that are only partially embedded in the field of view, image processing methods only have access to flattened 2-D images. Mathematical morphology (Nedzved et al., 2000; Shu et al., 2013) can be applied to segment partially touching cells using the opening operation or the ultimate residues, but only strives in low-density regions. Approach based on concavity detection (Bai et al., 2008; Kothari et al., 2009; Zhang et al., 2013; Qi, 2014; Riccio et al., 2019) allows concave points on the contours of touching cells to be detected. Touching cells can be optimally separated using ellipse registration or a distance transformation algorithm, but false concave points due to noise are often present. It is not possible to apply this method to high-density cases where concavities may not be present. Region growing (Zucker, 1976; Adams and Bischof, 1994) can separate touching cells if appropriate seeds are detected. Otherwise, the non-detection or the overdetection of seeds will lead to, respectively, undersegmentation or oversegmentation. The Watershed algorithm is widely used to separate touching cells (Cousty et al., 2009) but easily generates oversegmentations and undersegmentations mainly due to noise in the images. To alleviate oversegmentation, algorithms have been proposed to select appropriate initial seeds (Yang et al., 2006; Lee et al., 2013; Shu et al., 2013; Dewan et al., 2014; Xu et al., 2014). Likewise, if appropriate initial contours are obtained, active contours can also avoid oversegmentation (Li et al., 2008). Graph-cut methods (Daněk et al., 2009; Al-Kofahi et al., 2010; Lou et al., 2012) are also popular today because they are robust to noise and integrate visual information and topological constraints. Nevertheless, these methods do not solve the problems of oversegmentation and undersegmentation even in simple cases where only a few cells are aggregated. The iCut algorithm (He et al., 2015) was proposed to individualize touching cells but fails to separate densely clustered cells in very dense regions [e.g., dentate gyrus (DG), a subregion of the hippocampus] due to the absence of concave points. In addition, this method produces non-natural regular individualization results because of the a priori fixed cell size. In recent years, the emergence of deep learning techniques has led to several applications for the analysis of complex cells of histology sections (Kainz et al., 2015; Zhang et al., 2015; Xie et al., 2016). In Kainz et al. (2015), a function of the distance to the center of the closest cell is designed to identify cell centers. However, a parameter corresponding to the average object size needs to be fixed a priori and cannot be adapted, making it poorly adapted to size-varying cells or very dense regions. A Fully Convolutional Regression Network (FCRN, Xie et al., 2016) was proposed to perform a regression of a cell spatial density map, providing an estimate of the number of cells. Nevertheless, the model considers a fixed model of Gaussian at the center of each cell (with σ = 2) that cannot be adapted to size-varying cells either. Furthermore, the authors reported that their method gives incorrect prediction in the case where a roughly rounded cell is clumped with four or more cells. Yet, regions like the DG contain thousands of aggregated cells. Deep learning, in addition, requires a large number of manually segmented training images and is computationally expensive. So far, a limited number of methods have been proposed for individualizing touching cells due to the complexity of the problem and the diversity of the configurations (cell type, immunohistochemistry staining, and digitization systems, etc.). In the case of a large number of aggregated neurons (e.g., DG), none of these methods can produce satisfactory results. Furthermore, most of the previous studies have been performed on specific data presenting stable object size or density that make these methods neither generic nor adapted to size-varying objects such as neurons.

This article reports a new image processing protocol aiming to automatically individualize size-varying and touching neurons and offers a rigorous and extensive validation. The experiment was performed on macaque brain sections stained by immunohistochemistry using the neuronal nuclei (NeuN) antibody. Noise in the digitized images was reduced by Gaussian filtering. Due to the large uncertainty about neuron sizes, this denoising step should be self-adaptive. Through an original enumeration approach, values of the Gaussian filter width were tested in a realistic range, and the optimal one was selected when locally stable individualization results were produced at the cellular level. Neuron center location and boundary information were enhanced by min-max filtering (You et al., 2016). Finally, neuron individualization was performed using a contour-based model. The individualization results obtained in this study are promising. The F-score of neurons counting using our approach is equal to 0.816 ± 0.062 in the ROIs and can achieve a higher score of 0.905 ± 0.001 in the subiculum. To the best of our knowledge, the proposed method presents the unique ability to process massive touching neurons in the DG with an F-score superior to 0.85. Moreover, the proposed method was ultimately compared with an adapted stereology technique, which is the gold standard technique used in biology. The obtained results show a high linear correlation with stereology (respectively, 0.983 and 0.975 for the two experts) which makes our approach promising for further biological studies.

Our contributions are as follows. (1) We proposed an original multiscale protocol to individualize size-varying and numerous touching neurons. (2) We built three reliable image datasets that are large, highly variable, and representative of the complexity of the macaque brain (anatomical regions, neuron density, neuron size) from 17 major anatomical regions. (3) We manually generated four large reference databases as ground truth with the contributions of five experts in the domain of biology and image processing (segmentation and counting). (4) The proposed method was compared with Watershed visually and quantitatively with several indexes. (5) Originally, the proposed method was compared with adapted stereology.

Materials and Methods

Biological Material

All animal studies were conducted according to French regulations (Directive 2010/63/EU–French Act Rural Code R 214-87 to 131). The animal facility is authorized by veterinarian inspectors (authorization no. D 92-032-02) and complies with Standards for Humane Care and Use of Laboratory Animals of the Office of Laboratory Animal Welfare (OLAW–no. #A5826-01).

This work was performed on a 9-year-old healthy adult male cynomolgus macaque, Macaca fascicularis. After euthanasia, its brain was extracted from skull and frozen. The brain was then cut into 40-μm-thick serial coronal sections. Eight interleaved series were produced, leading to 0.32-mm interspace between two consecutive sections of a series. About 133 brain sections of one series were stained using a neuronal marker NeuN. These sections were digitized using a whole-slide imaging bright field scanner (Axio Scan.Z1, Zeiss) with an in-plane resolution of 0.22 μm (×20 magnification). In the current work, a subset of three brain sections, the 81st, the 91st, and the 101st section (noted Nos. 81, 91, and 101; Figure 1), along the rostro-caudal axis from the frontal pole to the caudal end of the cerebral cortex were selected and processed. They are representative of the most frequent neuron distributions found in the entire brain. They contain 17 major anatomical regions (Supplementary Table 1): cortex, hippocampus, thalamus, and so on. Each digitized brain section corresponds to about 140 GB of data.

FIGURE 1
www.frontiersin.org

Figure 1. (A) Location along the rostro-caudal axis of the three selected coronal sections (sagittal view). A, anterior; P, posterior. (B) Image of brain section No. 91 of 230,000 pixels × 188,000 pixels (50.74 mm × 41.33 mm), ∼145 GB.

Datasets

As the staining protocol was performed simultaneously for all analyzed histological sections, the staining is assumed to be similar among the sections treated, without bias in staining intensity. Sections Nos. 81, 101 were selected for learning segmentation of neuronal staining and section No. 91 for testing the individualization method. Then, three different datasets were produced for this study.

To learn how to segment neuronal staining, a segmentation dataset of a hundred representative images (512 pixels × 512 pixels) from sections Nos. 81, 101 were extracted (Supplementary Figures 1–3 and Supplementary Table 2) and manually segmented into three classes (neuronal staining, non-stained tissue, and background) by an expert.

To validate the individualization method, an individualization dataset of 58 images (5,000 pixels × 5,000 pixels) extracted from section No. 91 (Supplementary Figures 4, 5 and Supplementary Table 3) presenting different neuron densities and different anatomical regions was created. In these images, each neuron was manually identified by an expert by a point in its center, noted centroid whose intensity is the darkest among its neighbors, providing information about the location and the number of neurons (about 0–4,000 neurons per image).

To compare our method with stereological neuron counting, an anatomical stereology dataset including eight large images (including from 3 × 106 to 1.2 × 108 pixels) selected from six anatomical regions presenting a wide range of neuron densities were generated (Supplementary Figure 6). Neuron counting methods based on 2-D image processing deal with flattened images (single focus setting), which do not allow us to differentiate entire neurons present in the histological section from partial neurons produced at the surface of the section during the cutting process. We chose the optical dissector method as a reference to evaluate our methodology, but we did not take into account dead zones to be able to compare the two approaches. In this way, it was possible to evaluate the individualization process, but a small bias was introduced in the counting process, possibly resulting in overdetection due to partially embedded neurons. Two biologist experts estimated the number of neurons on these images directly under a microscope (Leica DM6000) with the software Mercator (Explora Nova, Supplementary Figure 7) using an adapted stereology technique.

The histological sections were stained by NeuN whose expression is present in the nucleus as well as in the cytoplasm of the neurons. The staining is darker in the nucleus and lighter in the cytoplasm. Figure 2 shows the transverse profiles of pixel intensity in three histological images ranging from low to high neuron densities. Neurons stained by NeuN are considered to be compact in this study (disk shape approximation). The reverse intensity of the transverse profile was considered to be similar to a Gaussian distribution. Therefore, we modeled this spatial profile by a Gaussian distribution with σn as parameter, which can be seen as the probability density of a pixel being a neuron pixel. σn can be evaluated by the diameter d of a neuron of interest (NOI) according to the three-sigma rule (Pukelsheim, 1994):

FIGURE 2
www.frontiersin.org

Figure 2. Transverse profiles of pixel intensity (B1–B3) in three histological images from anatomical regions of thalamus (A1), cortex (A2), and dentate gyrus (DG) (A3).

P r ( μ - 3 σ n x μ + 3 σ n ) 0.9773 2 × 3 σ n + 1 d (1)

where μ is the expectation, interpreted as neuron centroid, and σn is the spatial standard deviation, related to the neuron size. x is an observation from Gaussian distributed random variable.

Considering that the diameter of the largest neuron is 30 μm (Andersen et al., 2016) (about 136 pixels at × 20 magnification), we calculated that the value of σn is inferior to 23 pixels according to the spatial resolution used in this work. Taking into account the appropriate sampling for computation test, we investigated integer values of σn ranging from 1 to 23 pixels.

To segment individual neurons, the general idea was to first extract pixels corresponding to the neuron class and then separate touching neurons, which can be transformed into a problem of detection of the centroids of the neurons. Prior to the detection, a previous denoising step was performed by applying Gaussian filter parameterized by σ. Local minima corresponding to expected centroids can then be detected by the proposed min-max filter, which is able to enhance in a robust way the information of the centroids as well as the contours of neurons (see section “Centroid Detection Based on Min-Max Filter”). The choice of the parameter σ influences the final detection result. A small value of σ cannot remove all of the noise leading to the overdetection of the centroids by min-max filter, whereas a large value of σ will over smooth the image leading to the underdetection of the centroids. We supposed in this paper that the centroids can be correctly detected when the parameter of the Gaussian filter is adapted to that of Gaussian model of the NOI considered. Thus, the parameter σ of the Gaussian filter was defined by the σn estimated on the NOI. In the case of touching neurons, the estimation of σn was calculated by analyzing all of the individualization results, associated to all possible values of σ (ranging from 1 to 23 pixels). We assumed that σn was the one associated with the local stable individualization result. In practice, this was obtained by considering at least two consecutive σ, which gave stable individualization results (see section “Estimation of the Optimal σ”). Once the optimal σ to be used for each neuron was estimated, we applied the adaptive Gaussian filter to reduce noise to detect centroids of neurons of different sizes, and we performed the final individualization. The general workflow of the proposed methodology is presented in Figure 3.

FIGURE 3
www.frontiersin.org

Figure 3. Global strategy to perform the individualization of neurons.

Tissue Segmentation

A Random Forest (RF) model (Breiman, 2001), including 100 decision trees for tissue segmentation, was generated using the segmentation dataset, which was divided into two subsets, a learning set and a validation set. Seventeen features (R, G, B, H, S, V, X, Y, Z, L, a, b, M, V, LBP10, LBP40, and LBP63) were studied to produce the RF model. Thereinto, four main color spaces defined by CIE (Commission Internal de l’Eclairage) were considered. They are RGB (Red, Green, and Blue), HSV (Hue, Saturation, Value), XYZ (one CIE color space, which represents perceptual uniformity), and Lab (a typical CIE color space transformed non-linearly from XYZ) (Cheng et al., 2001). M and V are, respectively, the mean value and variance of gray-level intensity computed in a cross of 10 pixels. Local binary pattern (LBP) is a texture feature (Wang and He, 1990; Ojala et al., 2002) computed in a disk. The radius of this disk is fixed to 10, 40, and 63 pixels in this study, which correspond to the radiuses of small, average, and large neurons. To produce the RF model with optimal features while keeping satisfactory segmentation performance, we selected four features, L, M, V, and LBP40, which were progressively selected while keeping the best candidate based on successive tests performed from the entire set of features. This selection of features was confirmed in previous work, aiming to objectively determine optimal features and which pointed out that numerous combinations were able to produce proposer segmentation (Vandenberghe et al., 2015; Bouvier et al., 2018). This strategy of selection provided relevant, limited, and intelligible features compared to deep learning techniques, which can be assimilated to black boxes. Previous works performed on histological sections supported our choice as well based on the robustness of this approach (Vandenberghe et al., 2015, 2018).

The individualization dataset was then segmented into three classes based on the RF model produced. In practice, a median filter (7 pixels × 7 pixels) was applied to reduce impulsional noise and to regularize neuron contours in binary images. Then, connected components presenting an area inferior to a third of the surface of the smallest neuron (estimated to 127 pixels) were removed. This step can possibly remove partial neurons detected, as their size is insufficient to be counted. The binary image of neuron class produced was noted by Im.

Selection of the Optimal σ for Each Expected Neuron

The following procedure was performed on each connected component, which was calculated based on Im.

As neuron size is variable in the brain (ranging from 5 to 30 μm in diameter), it is important to estimate the optimal σ of the Gaussian filter to be applied to properly detect the centroids of size-varying neurons in multiple brain regions using min-max filter. The parameter of the optimal σ of Gaussian filter was estimated in the following three steps.

Centroid Detection Based on Min-Max Filter

The detection of neuron centroids was performed on the grayscale image Ig, which is the average of R, G, and B channels of the original color image (Hanbury, 2008). The grayscale image was filtered with a Gaussian kernel (Igf), and non-neuron pixels were masked out (Igfm). This image was used as an input to an original min-max filter (You et al., 2016) resulting in an extrema map (Ie). Briefly, for each pixel o of Igfm, let D(r, o) be the disk (not including o) of radius r centered on o, N(D) the number of pixels in D(r, o), Max(o) the number of pixels whose intensity value in D(r, o) is inferior or equal to Igfm(o) and Min(o) the number of pixels whose intensity value is superior to Igfm(o). The calculated value was then:

I e ( o ) = Max ( o ) - Min ( o ) N ( D ) (2)

Consequently, pixels whose value is −1 are defined as minima in the local disk while those whose value is 1 are defined as maxima. r was fixed to 10 pixels corresponding to the expected minimum neuron radius (2.5 μm). The two-step process (Gaussian filtering combined with min-max filtering) can be repeated iteratively n times to refine the extrema map. For densely clustered neurons, performing this process one time is not sufficient to distinguish them since their staining intensities are too similar or even equal to extract ideal minima. In addition, in a few cases, if the intensity of one neuron is entirely higher than that of its neighboring neurons, its corresponding minima could not be determined. On the contrary, multiple iterations would lead to excessive enhancement since equation (2) is iteratively computed in the disk of fixed radius r. Therefore, a compromise needs to be found. n, the number of iterations, was set to n = 2 (see section “Number of Iterations–n”). For a given value of σ, pixels of value −1 were selected as neuron centroids, and each centroid was assigned a unique label (id).

Neuron Individualization by Competitive Region Growing

Neuron individualization was performed based on the use of a discrete contour-based model. Contours were initialized as r0-pixel-radius (1<r0<10) circles centered on each centroid, and all pixels inside the contours were assigned with their centroids’ label (id). As the intensities of neuron boundaries in Ie are close to its maxima, we proposed to give each contour point an expanding speed that depends on the contour curvature and Ie’s intensity. If the contour curvature on a point is smaller and/or the Ie’s intensity on a point is darker (distant from the boundary), the expansion speed should be faster, and vice versa. Let p be a contour point, κ(p) the curvature-dependent term, h(p) the intensity-dependent term, and o the position of the neuron centroid. The next position p’ of p was calculated as:

op = op + op op × κ ( p ) × h ( p ) (3)

Let k(p) be the contour curvature at point p and c a predefined curvature chosen by the user. K(p) was calculated by equation (1), making the contour expanding for curvatures smaller than c (k(p) < c) and shrinking for curvatures greater than c (k(p) > c), which was calculated as:

κ ( p ) = c - k ( p ) (4)

The intensity-dependent term was inspired by the work of Perona and Malik (1990). Let t be a coefficient empirically set to 0.8 × max(Ie). h(p) was calculated as:

h ( p ) = exp ( - ( I e ( p ) + 1 2 t ) 2 ) (5)

After each progression, the contour was smoothed by a mean filter. It was implemented for each contour point by taking the average position information of its two adjacent neighbor points. Then, the distance between two consecutive points p and q was examined; if it was superior to a predefined maximal distance dmax, new points were interpolated automatically according to:

{ N = pq d max - 1 op i = op + pq N + 1 × i 1 i N (6)

where N represents the number of new points to interpolate, and pi is the ith point to interpolate.

Pixels around each contour point p within dmax distance were examined. Those pixels nearer to their centroid compared to p and not yet labeled were assigned their centroid’s id.

All cell contours in a connected component produced during neuron class segmentation were simultaneously expanded, and contour crossing was forbidden. The possibility to perform individualization on connected components makes it possible to massively parallelize the process. In our experiment, we fixed the number of expanding iterations at 100. This number can be arbitrarily high and should be chosen so that the computation time is low enough to enable the contours to reach every neuron boundary. When several cell contours were expanded too close, the new expansion location may jump into another cell. It was forbidden, and the expansion stopped. At the end of this process, unlabeled pixels may exist, and they were assigned to a label according to their neighbors in a 3 × 3 window by majority voting.

Estimation of the Optimal σ

This individualization method using a single σ-fixed parameter for the entire dataset has resulted in overindividualization and under individualization due to inappropriate value of σ to treat neurons presenting different sizes. A multiscale strategy was thus integrated in this work to deal with this major issue. We then tested all possible values of σ to obtain information related to centroids, labels, and contours of neurons, and we estimated the optimal σ for every expected neuron inside each connected component. We analyzed the relationship between σ-dependent contour image and the map of accumulated contours calculated by summing all σ-dependent contours to estimate the optimal value of σ for an expected neuron. Our hypothesis was that consistent consecutive values of σ should produce similar or close contours that will be accumulated in the same location.

Figure 4 shows the individualization results depending on σ values. Figure 4A presents the original image of a connected component including three neurons. Figures 4B–J display the σ-dependent label images overlapped with detected centroids. Figures 4N–V display the σ-dependent contour images. Figure 4M is the map of accumulated contours, and Figure 4K is the map of the optimal σ determined. Figure 4W illustrates the individualization result produced by the proposed method. In Figure 4I, close individualization results corresponding to the same number of neurons detected were obtained for consecutive values of σ ranging from 8 to 11. It was defined as a stable state (E). Figure 4J presents another stable state for σ ranging from 12 to 23. In this example, the stable states present when σ varies from 8 to 11 (E1, Figure 4I) and from 12 to 23 (E2, Figure 4J).

FIGURE 4
www.frontiersin.org

Figure 4. σ-dependent individualization results. (A) Original image of a connected component including three neurons. (B–H) Individualization results for σ ranging from 1 to 7. (I) Close individualization results for σ ranging from 8 to 11. (J) Individualization result for σ ranging from 12 to 23. (N–V) Contours of the individual neurons corresponding to (B–J). The different colors in (B–J) represent the individual neurons, and the white points represent the detected centroids. The white curves in (N–V) represent the contours of the individual neurons. (M) Map of accumulated contours, summation of all σ-dependent contours. Violet color represents the minimum value (1 contour); blue-like color represents overlapped contours, which are candidate for individualization of touching neurons; and red color represents the maximum value (23 cumulated contours). (K) Map of the optimal σ determined. The color blue represents σ = 5, and the red represents σ = 12. (W) Illustration of the individualization results produced by the proposed method. The white points represent the detected centroids, and the red curves are the final neuron contours.

Next, we analyzed the σ-dependent neuron contours. The idea was to first synthesize this information on a map of accumulated contours (Figure 4M), which was calculated by summing σ-dependent contour for all possible values of σ. Points of higher intensity value in the map describe zones of higher probability to be contours between touching neurons. Because a connected component consists of at least one or multiple neurons and no prior information about neuron size is known, intensity values corresponding to real contours in the map may be different. Therefore, to find the optimal contours, we proposed to study all of the intensity levels that can be encountered in the map of accumulated contours using a threshold s ranging from 1 to 23. This approach enabled us to detect both small and high number of consecutive values of σ producing stable contours of neurons. A similarity criterion between the thresholded maps and each σ-dependent contour result was then calculated according to the Dice score:

Dice ( σ , s ) = 2 × N c ( σ , s ) N i ( σ ) + N as ( s ) (7)

where Ni is the number of pixels in the σ-dependent contour result, Nas is the number of pixels in s-dependent thresholded map, and Nc is the number of pixels that exist in the same spatial location in both σ-dependent contour result and s-dependent thresholded map. This computation was performed at the connected component level. The Dice score varied between 0 and 1 (perfect superimposition).

Table 1 lists Dice scores computed for the connected component given in the example of Figure 4. The closer the value is to 1, the more the σ-dependent contour is similar to the s-dependent thresholded map. We then fixed a threshold of Dice value (Sdice), which defines a sufficient similarity between two contours. This threshold is calculated as follows:

TABLE 1
www.frontiersin.org

Table 1. Table of Dice scores computed in the example of Figure 4.

{ S dice = min E i E { σ E i s = 1 23 Dice ( σ , s ) / ( 23 × Card ( σ E i ) ) } S dice = 0.8 i f S dice > 0.8 (8)

where Ei represents a stable state of the set E = {E1, E2, …, Em} (m ≤ 11, theoretically up to 11 stable states). Card(σ ∈ Ei) is the number of σ belonging to the state of Ei. 0.8 is a threshold parameter defined empirically.

In Table 1, given a fixed s, the blue numbers are superior to Sdice and are local maxima. It means that the corresponding σ provides locally the best individualization result. Given a fixed σ, when s increases, the effect of overindividualization is reduced, but conversely the risk of under individualization increases. If at least two consecutive maximal values of Dice exist, this σ is selected as a candidate. In the example of Table 1, candidates for the optimal σ are 5, 8, 9, and 12 (red color in σ column).

Then we generated the map of the optimal σ to save the values of the optimal σ to be applied for each neuron when applying adaptive Gaussian filtering. The pixels corresponding to neurons in this map were initialized to σ = 23. The study of the optimal σ was performed from the largest value of σ candidate to the smallest. For each σ studied, individualization results were compared with σ+1 result because σ−1 result probably leads to overindividualization. If any of the two NOIs individualized by σ and σ+1 were sufficiently similar [Dice[NOIa(σ), NOIb(σ+1)] > 0.95, neuron region reproducibility, equation (9)], this value of σ was selected as the optimal one for the individualized NOI and was updated in the map of the optimal σ. If not, σ kept its previous value.

Dice ( NOI a ( σ ) , NOI b ( σ + 1 ) )
= 2 × S ( NOI a ( σ ) , NOI b ( σ + 1 ) ) S ( NOI a ( σ ) ) + S ( NOI b ( σ + 1 ) ) (9)

where NOIa(σ) and NOIb(σ+1) represent, respectively, one NOI individualized by σ and σ+1, S[NOIa(σ)] and S[NOIb(σ+1)] represent, respectively, the surface of NOIa individualized by σ and the surface of NOIb individualized by σ+1, and S[NOIa(σ), NOIb(σ+1)] is the number of pixels that exist in the same spatial location in both two NOIs individualized. 0.95 was defined empirically.

Note that neuron density in the DG is the highest in the brain. As thousands of neurons are aggregated, the DG forms an extremely large connected component. Moreover, the stained neurons are similar to each other according to size in this region. The detection of the neurons in DG region is therefore very sensitive to the parameter σ of the Gaussian filter. However, neuron sizes in the DG have a low variability, making it possible to apply a fixed value of σ. To evaluate this σ, we selected all of the images including the DG from the individualization dataset (seven out of 58 images). Then, new binary images were generated by keeping only the DG. With these binary images, we computed the 23 σ-dependent individualization results using the proposed method and the ground truth (manual counting in the same regions). Using the evaluation method described in section “Comparison of Automated and Manual Neuron Counting,” the optimal σ was determined, and details will be provided in section “Optimal σ for the Region of the DG.” The DG represents a very small fraction of the total amount of data processed and was the only region requiring the general protocol to be adapted.

Adaptive Neuron Individualization Using the Optimal σ

At this stage, we have defined an unsupervised strategy to automatically determine the optimal σ to be applied to each NOI. We then applied the two-step process (adaptive Gaussian filtering using the map of the optimal σ as parameter and min-max filtering) to detect the final centroids of neurons of different sizes. The two-step process was applied twice (optimized in section “Number of Iterations–n”) because according to the experimental results on real data, one iteration led to underdetection of centroids, and more than two iterations led to overdetection. The detected centroids were selected as initial seeds for region growing process based on the discrete contour-based model leading to the final individualization result.

Evaluation Methods of Neuron Individualization

The proposed method was compared with the Watershed algorithm (Cousty et al., 2009), which defines the watershed cuts based on the principle of the drop of water on a topographic surface. As Watershed is sensitive to noise, a preprocessing step is necessary. We used Gaussian filter to reduce noise in the images from the individualization dataset. The choice of σ is crucial because we have no a priori information about neuron size. Therefore, Gaussian filter with the optimal σ computed for each NOI by the proposed method was applied before using Watershed algorithm.

The quality of neuron individualization was evaluated by three evaluation methods, which are introduced in the following parts.

Comparison of Automated and Manual Neuron Counting

The relative count error ε was used to validate the neuron count (Kothari et al., 2009). It was defined as the absolute difference between the automated (Na) and expert count (Ne) divided by the expert count:

ε = | N a - N e | N e (10)

The smaller the relative count error is, the more accurate the automated method count.

In addition, a complementary individualization score that considers the colocalization of the neurons individualized using the automated method compared to the centroids pointed by the expert was defined. If exactly one expert centroid is superimposed on the neuron area automatically detected, this one is considered as a true positive. Otherwise, it is either oversegmented (zeros expert centroid) or undersegmented (more than 1 expert centroid). Recall (R), Precision (P), and F-score (F) were calculated according to:

R = N t N e ; P = N t N a ; F = 2 R × P R + P (11)

where Nt is the number of correctly automatically segmented neurons (true positive), Ne is the expert count, and Na is the automatic counting of neurons.

The bigger the value of the F-score is, the better performance the method reaches. This validation method was widely used in previously published works (Al-Kofahi et al., 2010; Shu et al., 2013; Zhang et al., 2013; He et al., 2015; Poulain et al., 2015; Molnar et al., 2016).

These two validation methods are generic but do not take into account certain properties of the segmented neurons (e.g., individualization quality and shape/contour accuracy). This led us to propose a second method.

Study of the Location of the Centroids and the Contours of the Neurons

The interest of this approach is to consider local spatial information at the individual neuron level. The expert manually segmented individual neurons by marking their centroids and drawing their contours. Because this is a tremendous work that is time-consuming, we proposed a trade-off approach to produce preliminary results based on certain representative images and to assess the quality using the following indexes: the distances of the contours (Distance_mean_contour) or the centroids (Distance_centroid) between automated and manual individualization results, the overlapping fraction between neurons automatically and manually individualized (Dicearea) [equation (12)]. Based on the individualization dataset, a representative subset of nine images corresponding to seven anatomical regions (caudate, claustrum, cortex, hippocampus, putamen, subiculum, and thalamus) was chosen. On each image, 100 neurons were randomly selected and segmented by an expert.

{ d min ( i , j ) = min j C auto { ( i x - j x ) 2 + ( i y - j y ) 2 } d mean = 1 N manual # i C manual # j C auto d min ( i , j ) Dice area = 2 × S common S manual + S auto (12)

where Nmanual is the number of points in the contour drawn by the expert, dmin is for each point i(ix,iy) of the contour drawn by the expert, the minimal distance with the point j(jx,jy) in the contour delineated by the automatic method, dmean is the average distance between neuron contours segmented manually and automatically, C represents neuron contour, Smanual is the neuron surface derived from the contour drawn by the expert, Sauto is the neuron surface segmented by the automatic method, and Scommon is the overlapped area between manual and automatically segmented neurons.

These metrics allow different automated individualization methods to be efficiently evaluated. However, this work was limited due to the important number of manual operations required. A posteriori, the methods listed above (section “Comparison of Automated and Manual Neuron Counting” and section “Study of the Location of the Centroids and the Contours of the Neurons”), respectively, implied to manually deal with about 112,000 neurons in 50 images (5,000 pixels × 5,000 pixels) and 900 neurons in nine images in the individualization dataset.

The gold standard method for estimating the number of cells in an anatomical region is stereology. This led us to the third evaluation method.

Comparison of Automated Neuron Counting Versus Stereology

Biologists were asked to estimate the number of neurons in the stereology dataset (eight regions) with quantitative stereological counting techniques based on optical dissectors. Compared to classical stereological approach, dead zone regions were not considered to make possible to compare the results with our counting methodology. Two experts counted the neurons in samples from every anatomical region directly under a microscope using the software Mercator. Then, the number of neurons was estimated according to the optimal sampling fraction (two settings tested), and the two counting results were compared (method vs. experts).

Results

Before presenting the results, it is important to mention that the individualization dataset consists of 58 images, in which eight images were discarded. They concern four blurred images (one image of caudate, two images of cortex, and one image of putamen) for which the expert cannot give ground truth and four images without staining tissue (two images of white matter and two images of ventricle).

Parameters Optimization

Number of Iterations–n

Example of neuron detection with different number of iterations is shown in Figure 5. We found that a consistent estimation between the expert and automated method was obtained for two iterations (10 centroids detected) because one iteration of min-max map computation leads to underdetection of neurons and three iterations to overdetection. The setting of two iterations of min-max map computation enabled to detect small, blurred, as well as bright neurons.

FIGURE 5
www.frontiersin.org

Figure 5. Min-max map computation (σ = 4). (A) Original color image. (B) One iteration Ie. (C) Two iterations Ie. (D) Three iterations Ie. Red crosses are neuron centroids marked by the expert, whereas green crosses are the detected neuron centroids figure from You et al. (2016).

To generalize this illustrative example, we extended this study to the individualization dataset. The relative count error ε (Table 2) and the F-score (Figure 6) were computed by grouping the images corresponding to the same anatomical regions (50 images grouped into seven groups: caudate, claustrum, cortex, hippocampus, putamen, subiculum, and thalamus).

TABLE 2
www.frontiersin.org

Table 2. Relative count error based on the number of iteration.

FIGURE 6
www.frontiersin.org

Figure 6. F-score computed with different number of iterations. The F-score presents the biggest values for all anatomical regions when we performed two iterations of the two-step process (adaptive Gaussian filtering and min-max filtering).

From Table 2, we observe that for all of the anatomical regions, the smallest average value of the relative count error (0.105 ± 0.114) is obtained for two iterations. Looking at the results region by region, we find that in the cortex, the relative error is similar for n = 2 and n = 3, but the standard deviation is smaller for n = 2, which provides more stable individualization results. In the claustrum and putamen, although the relative count error is smaller for n = 3, the standard deviation is greater. Considering the average relative count error, n = 2 is consequently the optimal processing to perform.

For F-score based on Figure 6, the value for n = 2 provides the best scores for all anatomical regions.

In summary, the optimal number of iterations selected is 2.

Optimal σ for the Region of the DG

It was mentioned previously that the DG is a particular case. σ = 5 is the optimal value of σ determined using F-score criteria [equation (11)].

Figure 7 shows the F-score for σ varying from 1 to 23 on the images including only DG regions. The average maximal value is 0.885 for σ = 5, and the F-score for every image including DG is superior to 0.850, which denotes a good quality of counting process. Values of σ between 4 and 6 provided high F-scores as well, which demonstrate the robustness of our choice.

FIGURE 7
www.frontiersin.org

Figure 7. F-score computed for different values of σ for the region of dentate gyrus (DG). The different colors represent the seven images of DG. “hip” is the abbreviation for the name of the anatomical region of the hippocampus.

The need to specifically process the DG is justified by the fact that it represents a particular case with a massive aggregation of small neurons. The size of the left DG on section No. 91 corresponds to 106 pixels. In practice, the identification of the related connected components of DG is simple due to its size compared to other connected components that are significantly smaller.

Comparison of Automated and Manual Neuron Counting

In terms of performance of the individualization of neurons, iCut, and Watershed were compared with the proposed method (for size-fixed neurons) in You et al., 2016. The results showed that our method gave the best individualization performance. However, due to the algorithmic complexity of iCut [O(n3), where n is the number of foreground pixels in the image], we could not compare it with other methods on the individualization dataset. In addition, for the individualization of size-varying neurons, iCut is no longer adapted because a unique parameter for object size is fixed a priori. Therefore, we compared the proposed method with Watershed on the individualization dataset.

Figure 8 presents typical results on three different images presenting different neuron densities. Image 1 illustrates image of thalamus region with a few individual neurons, image 2 represents moderately dense image of cortex region in which several neurons touch each other, and image 3 represents extremely dense image of DG region in which many neurons are aggregated. Figures 8A1–A3 are the ground truth (red dots centroids manually annotated). Figures 8B1–B3 represent neuron classification results that are insufficient to segment individual neurons when aggregated neurons are present. Figures 8C1–C3 present the images of the optimal σ evaluated by the proposed method (local values estimated at neuron level). Figures 8D1–D3 present two-iteration min-max maps in which neuron centers appear in dark and neuron contours appear in bright-intensity values. Figures 8E1–E3 show the final individualization results obtained by the proposed method. Figures 8F1–F3 present the individualization results obtained by Watershed algorithm using our optimal σ map. Both methods perform well in simple cases. However, in more complex situations, Watershed results present overindividualization and under individualization, and making the segmented contours unnatural (straight borders, large neurons detected). Visually, the proposed method provides better neuron individualization results.

FIGURE 8
www.frontiersin.org

Figure 8. Neuron individualization results shown in images of 512 pixels × 512 pixels extracted from those obtained on the individualization dataset. Images 1–3 represent the anatomical region of thalamus, cortex, and DG, presenting different neuron densities. (A1–A3) Original images. Red points represent neuron centroids marked by the expert. (B1–B3) Binary images of segmented neuron class. (C1–C3) Images of optimal σ. (D1–D3) Min-max maps obtained after the two-step process of adaptive Gaussian filtering and min-max filtering. (E1–E3) Individualization results obtained using the proposed method. (F1–F3) Individualization results obtained using Watershed algorithm.

Figure 9 shows the F-scores obtained with the proposed method for the 50 images of the individualization dataset demonstrating its ability to individualize the neurons with high F-scores, especially in the caudate, cortex, and subiculum, where average F-scores are, respectively, 0.874, 0.877, and 0.905. The F-scores computed in the hippocampus vary from 0.726 to 0.898 due to the heterogeneity of this region, which contains different kinds and distributions of neurons. Particularly, in the subregions CA2 and CA3, the intensity of certain neuron centers is lighter. These staining changes may be due to unsuitable staining marker (Mullen et al., 1992) corresponding to different levels of expression of the antigen. As the staining for this kind of neuron does not respect our hypothesis, the proposed method is not suitable in this case. All the same, the proposed method provides a good performance (F-score > 0.726).

FIGURE 9
www.frontiersin.org

Figure 9. F-score calculated using our method on 50 images of the individualization dataset. L (respectively, R) means that this image is extracted from the left (respectively, right) side of the section.

The same evaluation was realized on the results produced using the Watershed algorithm. The mean F-score and standard deviation for seven anatomical regions are displayed in Table 3, showing that the proposed method systematically provides higher F-score values for every anatomical region. Moreover, the standard deviation of the proposed method is smaller, which demonstrates a good robustness for both simple and complex cases.

TABLE 3
www.frontiersin.org

Table 3. F-scores computed using two neuron individualization methods, the proposed method and a Watershed algorithm.

Study of the Location of the Centroids and the Contours of the Neurons

The quality of the individualization process between the proposed method and the Watershed algorithm using the optimal σ map was compared.

Figure 10 presents a comparison synthesis of the results for the supplementary parameters investigated. It shows that neurons segmented by the proposed method superimpose better with the neurons segmented manually than those segmented with Watershed. The average value of Dicearea is 0.77 for the proposed method and 0.72 for Watershed. The detected centroids and segmented contours, compared to Watershed, are closer to those segmented manually by the expert. The average value of the distance between centroids (Distance_centroid) and of the distance between contours (Distance_mean_contour) are, respectively, 1.31 and 0.98 μm for the proposed method, and 1.45 and 1.26 μm for Watershed. Again, we observe that the quality of individualization of neurons using our method is superior compared to the one obtained with Watershed. These quantitative indexes are of major interest because they resume and confirm the qualitative evaluation of the individualization previously mentioned in Figure 9. They provide additional quantitative local and spatial information concerning neuron individualization quality.

FIGURE 10
www.frontiersin.org

Figure 10. Results obtained to compare the different individualization methods, the proposed method and Watershed using new criteria (Dicearea, Distance_centroid and Distance_mean_contour). Dicearea represents the overlapping fraction between neurons automatically and manually individualized, Distance_mean_contour and Distance_centroid represent, respectively, the distances of the contours and the centroids between automated and manual individualization results.

Comparison of Automated Neuron Counting Versus Stereology

We validated the proposed method by comparing our results with an adapted stereology technique using the method of the optical dissectors, which is the gold standard method in biology to count objects. No dead zones were considered for stereology technique to enable comparison in the same conditions (no depth information available in image processing approach). This technique is operator-dependent, two experts (b1 and b2) performed the counting twice by modifying the sampling fraction to obtain a stable estimation of the number of neurons and ensured the high counting accuracy. In this study, the sampling fraction varied from 1/100 to 1/4 on eight large images. The results obtained with stereology using the second sampling were selected (Supplementary Table 4). As the surface of eight images are very different, neuron density instead of neuron number was considered to normalize the results obtained using the proposed method and the adapted stereology technique. The analysis was performed with linear regression shown in Figure 11. The dark gray color represents the confidence interval around the solid line in dark blue color. The correlation coefficient between neuron densities was computed with the proposed method and the adapted stereology technique performed by two experts (b1 and b2). The high correlation values (0.983 for b1 and 0.975 for b2) and the low p-value (1.20e-05 for b1 and 3.76e-05 for b2) state that the proposed method is very promising to estimate the neuron density and the number of neurons in different anatomical regions.

FIGURE 11
www.frontiersin.org

Figure 11. Scatterplot of neuron density calculated by the proposed method compared with stereological estimations produced by the two experts. The unit is the number of neurons per square millimeter.

Average Neuron Radius in Different Anatomical Regions

Figure 12 shows the average radius of individualized neurons in different anatomical regions of the macaque brain. Six summary statistics (minimum, first quartile, average, median, third quartile, and maximum) are listed in Supplementary Table 5.

FIGURE 12
www.frontiersin.org

Figure 12. Average radius of individualized neurons.

We observe that most of the neurons, which remain in the range of the first and third quartiles, in the claustrum (average radius ranging from 2.99 to 4.52 μm) and the putamen (average radius varying from 3.31 to 4.73 μm) are smaller than the neurons of the other regions (average radius varying from 3.34 to 6.02 μm). The average radius of individualized neurons in our study varies from 1 to 14.3 μm (corresponding from 2 to 28.6 μm in diameter, five outliers for neurons were excluded), which corresponds quite well to the study by Andersen et al. (2016) who estimated that the size of neurons varies between 5 and 30 μm in diameter. The detection of the very small neurons may be due to the production of partial neurons during the cutting process or to a problem during the digitization (focal plane of scanning not centered on the neuron, position of the neurons in the depth of the cut, damaged cells during staining, etc.).

Execution Time

The proposed method was implemented in C++ on a 64-bit computer workstation under Linux (CPU: Intel Xeon E5-2630 v3 at 2.4 GHz, RAM: 128 GB). Twenty-three cores of the computer were used for the parallel calculation of 58 images of the individualization dataset [BrainVISA Soma-Workflow module (Laguitton et al., 2011), CPU parallelization]. The average execution time for the final individualization result was 15.6 h, that is, 16.1 min per image of 5,000 pixels × 5,000 pixels. This execution time is very reasonable compared to the necessary time to do stereology, and it is perfectly adapted to deal with large images especially with parallelization.

Discussion

We proposed a novel method for the individualization of size-varying and large number of touching neurons in microscopic macaque brain image. We first applied an RF classifier, a reliable segmentation approach, based on four features (L, M, V, and LBP40) to segment stained and non-stained tissue. This combination of features presented a limited number of color and texture features, which give satisfactory segmentation results. Alternative methods can also be considered and plugged into our protocol. In particular, convolutional neural networks (CNN)-based methods are promising approaches. As a preliminary test, we applied U-net (Ronneberger et al., 2015) on the segmentation dataset, and we obtained comparable segmentation results. To individualize the touching size-varying neurons, we model each neuron individually (centroid and contour) to deal with different sizes and densities. The proposed method investigates all possible individualization results parameterized by σ, which defines the size of the Gaussian kernel filter, and then determines the optimal σ for each NOI based on the Dice score. Using this adaptive Gaussian filter and an original min-max filter, the key information such as location (centroid) and boundary of each expected NOI is enhanced in a map of extrema. The multiple studies performed in the present work confirmed the genericity and interest to use the min-max filter. Finally, a discrete contour-based model is applied to achieve neuron individualization. This model is a robust region-based segmentation method. Instead of examining all neighboring pixels of initial seeds used in traditional region growing methods, this model examines several pixels of discrete contour. Taking advantage of information about pixel intensity in the extrema map and contour curvature at the pixel level, discrete contours expand through a competition approach. Besides, the use of discrete contour alleviates the noise on the neuron contour finally segmented and saves execution time. Considering the general satisfactory results obtained by U-net on other biological data, a future work will be initiated soon to test the ability of U-net to individualize touching cell as it was recently proposed (Falk et al., 2019).

The proposed method provides good performances both qualitatively and quantitatively. It is worth mentioning that our method is able to handle the most difficult cases involving massive touching neurons like in the DG, which is part of the hippocampus. This region of the hippocampus is extremely complex to analyze. An interesting biological result is that the size of the neurons in DG regions is stable based on the single optimal value of σ estimated (Figure 7). The hippocampus contains a wide range of neurons with different sizes and different kinds of neurons in several subregions (CA2 and CA3), which can lead to different staining and non-optimal individualization results. The F-scores in the CA2 and CA3 vary between 0.726 and 0.788. Nevertheless, the mean F-score still reaches 0.816 in the hippocampus, denoting that several subregions (CA1, CA4, and DG) of the hippocampus can be successfully treated by the proposed method. Most of the biological studies aiming to count neurons avoid to target this region due to its complexity and favor optic density for quantitative measurements (Brureau et al., 2017).

The proposed method was compared with Watershed, a reference individualization method in the literature. In addition to the F-score described above, three supplementary criteria were proposed in this article: mean distance between the centroids, mean distance between the contours, and Dice superimpositions. According to these criteria, the neurons individualized by the proposed method are better superimposed with manual segmentation results than Watershed (0.77 for the proposed method vs. 0.72 for Watershed). Moreover, the centroids and contours segmented by the proposed method are closer to those segmented by the experts (respectively, 9.98 and 21.89% closer) than Watershed. Therefore, the proposed method is proven to be an efficient method to properly identify centroids and accurately estimate the location of neuron contours. All datasets produced in this study will be made available to enable fair benchmarking.

To further validate the proposed method, we also applied stereology, the gold standard in the domain, onto a dedicated dataset with the participation of the two experts. They were asked to estimate the number of neurons in various anatomical ROIs, which were selected in accordance with their interest in biology. Therefore, we obtained a wide range of neuron densities present in the brain (ranging from 343 to 8,167 neurons per mm2). The neuron density calculated by the proposed method is highly correlated with those estimated by the stereology technique (0.983 and 0.975 respectively, for b1 and b2 experts). In this work, the dead zones classically used in stereology that exclude cut cells were not considered image processing approaches and cannot deal with this phenomenon (Z-stack can be envisioned but will dramatically increase the amount of data to deal with). In this situation, the results obtained by the experts and by our method were consistent. Nevertheless, it will be necessary to precisely estimate the bias introduced at this occasion to evaluate, based on a tolerance level to be defined, if our method could be or not an alternative to stereology in biological studies. This point is the main limitation of the proposed method compared to stereology. On the other hand, our method makes it possible to rapidly analyze large amounts of data, which can provide a first screening at brain-section level to identify and investigate regions of interest. Such a strategy is hardly feasible with stereology. This work is a preliminarily study in major anatomical ROIs on a single histological section and is certainly worth being extended to study entire brains (series of histological sections) and multiple animals (Amunts et al., 2013). Future work will also aim to evaluate the effects of staining variations as well as intersubject effects in our protocol. These preliminary results demonstrate the potential of the proposed method to individualize and count the neurons with high accuracy. To our knowledge, this is the first study in this field to use stereology to evaluate an automated analysis method.

The validation work performed in this study constitutes an original contribution and produced original processed data. In total, five experts participated in this time-consuming and tedious work. Four reference databases annotated by experts were derived from this study and can be used for future research and development: (1) manual segmentation of neuronal staining in 100 images (512 pixels × 512 pixels) in the segmentation dataset, (2) 111,971 neuron centroids manually marked in the individualization dataset concerning 50 images (5,000 pixels × 5,000 pixels), (3) 900 neuron centroids and contours drawn by experts in the individualization dataset concerning nine images, and (4) 37,000 neurons counted by experts using stereology technique in the stereology dataset.

With the ability to count neurons of the proposed method, it is possible to extend neuron counting to series of sections or to an entire brain. For instance, the section 91 consists of about 230,000 pixels × 188,000 pixels. We can divide this section into about 1,730 images of 5,000 pixels × 5,000 pixels. These images can be processed in parallel by the proposed method. The number of neurons in one section can then be calculated and mesoscopic quantitative heat maps of neuron density derived (Vandenberghe et al., 2016). The use of a supercomputer can dramatically decrease computation time, making it possible to significantly extend this study to produce original cartographies of neuron distribution through the brain.

Preliminary results of parallelization of segmentation codes have demonstrated a high scalability of our method. This work is carried out on a CPU-based architecture but may be extended to GPU architectures in the future, particularly with regard to the implementation of deep learning methods.

A first application aiming to compute neuron size was performed on the individualization dataset. Although neurons have various shapes and may be cut according to their location in the histological section, we can still estimate the average neuron radius with statistical tools, thanks to the large number of neurons processed. This information can be used to roughly estimate neuron size or to perform comparative studies considering that similar biases are introduced in various measurements. The statistical result shows that neuron size calculated by the proposed method (2–28.6 μm in diameter) fits well with the size that was read in the literature (5–30 μm in diameter). It is important to keep in mind that morphological measurements made in 2-D remain an estimate and that it is necessary to acquire volumes to obtain more precise measurements. In this context, the first strength of our approach is the new extracted information to enrich current knowledge of the available data. For instance, the proposed individualization method provides interesting descriptive information (location, size, intensity, shape, and network organization) on the population of neurons for each region. In anatomical regions, this information can be used to better describe brain development, aging, disease, or even evaluate a therapy by producing analyses at the cellular level. The second strength of our approach is the scalability of these technologies (to whole section, brain, and groups) as all processings are performed at the connected component level, which makes parallelization schemes extremely efficient. It will support interesting perspectives for biological studies targeting cytoarchitecture analysis.

Nowadays, neuron individualization is still an important and challenging subject of research in the field of neuroscience. In 2014, Schmitz et al. (2014) compared available methods for cell segmentation, showing that the individualization of cells is a complex problem and stressing the need to develop new approaches in this area. Although several works have been done over the past years, most of the proposed methods are based on mathematical morphology, detection of concavities, the Watershed algorithm, and graph-cut algorithms. Deep learning is a promising approach in the future but needs to be carefully investigated to evaluate its ability to robustly solve this question. Most of these methods have been focused on relatively simple or very specific cases. When we deal with complex cases, such as the DG, where thousands of neurons are aggregated, these methods cannot properly individualize neurons. In fact, the reason why the existing image processing methods do not perform well is that they are performed on 2-D images, with a single scanning focus plane, which increases the difficulty of individualizing touching neurons. It might be possible to acquire several planes associated to the depth in the sections when scanning using Z-stack technique so as to reconstruct a pseudo-3-D volume and apply an improved version of the proposed method. However, new problems would inevitably arise, in particular the increase of the volumetry of the data to deal with.

Nevertheless, this work is promising. The proposed tool can help address major biological challenges, such as improving our understanding of brain development or aging, deciphering pathology mechanisms, or evaluating novel therapies in neurodegenerative diseases.

Conclusion

This paper presents a robust and efficient method to individualize size-varying and touching neurons in microscopic images of macaque brain sections. The results obtained are promising. The accuracy of the proposed method is close to 0.816 in the hippocampus, which is a very complex anatomical region and superior to 0.841 in the other regions studied (caudate, claustrum, cortex, putamen, subiculum, and thalamus). Moreover, the accuracy and performance of the proposed method are significantly higher qualitatively and quantitatively compared to Watershed algorithm in the anatomical regions tested. Compared with an adapted stereology technique, we found that the counting results obtained with our method are highly correlated (0.983 for b1 and 0.975 for b2) and could be considered as a reliable alternative. Supplementary researches based on extended datasets (sections and animals) should be envisioned to confirm this point. The average neuron radius estimated by the proposed method is coherent compared to the acknowledged findings of literature. The automated detection of millions of neurons in the whole brain is still challenging. Perspectives of this work will be to extend the application of the proposed method to entire brains.

Data Availability Statement

The datasets generated for this study will not be made publicly available. The data will be available later in the form of an article and an access because it corresponds to a large amount of information.

Ethics Statement

The animal study was reviewed and approved by the Comité d’éthique agréé par le MESR dont relève l’EU: CETEA DSV – Comité n°44.

Author Contributions

ZY, CJ, A-SH, and TD created the datasets. ZY, YB, CB, NS, and TD developed the software and performed the image processing. A-SH and TD performed the tissue classification. ZY and TD located the neuron centroids and drew the neuron contours. PG and CJ estimated the neuron number with stereological techniques. ZY, A-SH, and TD performed the statistical analysis. A-SH, PG, CJ, PH, and TD planned the histology and imaging experiments. ZY and TD wrote the manuscript.

Funding

This work was partially supported by the French National Funds (PIA2’ program) under the contract number P112331-3422142 (3D NeuroSecure project) and the Fund of Doctoral Start-up of Xi’an University of Technology, number 112/256081811. This work was granted access to the HPC resources of TGCC under the allocation 2019-(A0040310374) made by the GENCI.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnana.2019.00098/full#supplementary-material

References

Adams, R., and Bischof, L. (1994). Seeded region growing. IEEE Trans. Pattern Anal. Mach. Intell. 16, 641–647. doi: 10.1109/34.295913

CrossRef Full Text | Google Scholar

Al-Kofahi, Y., Lassoued, W., Lee, W., and Roysam, B. (2010). Improved automatic detection and segmentation of cell nuclei in histopathology images. IEEE Trans. Biomed. Eng. 57, 841–852. doi: 10.1109/TBME.2009.2035102

PubMed Abstract | CrossRef Full Text | Google Scholar

Amunts, K., Lepage, C., Borgeat, L., Mohlberg, H., Dickscheid, T., Rousseau, M.-E., et al. (2013). BigBrain: an ultrahigh-resolution 3D human brain model. Science 340, 1472–1475. doi: 10.1126/science.1235381

PubMed Abstract | CrossRef Full Text | Google Scholar

Andersen, P., Morris, R., Amaral, D., Bliss, T., and O’Keefe, J. (2016). The Hippocampus Book. Oxford: Oxford University Press.

Google Scholar

Andrey, P., Kiêu, K., Kress, C., Lehmann, G., Tirichine, L., Liu, Z., et al. (2010). Statistical Analysis of 3D images detects regular spatial distributions of centromeres and chromocenters in animal and plant nuclei. PLoS Comput. Biol. 6:e1000853. doi: 10.1371/journal.pcbi.1000853

PubMed Abstract | CrossRef Full Text | Google Scholar

Bai, X., Sun, C., and Zhou, F. (2008). “Touching cells splitting by using concave points and ellipse fitting,” in Proceedings of the 2008 Digital Image Computing: Techniques and Applications, Canberra, ACT.

Google Scholar

Bouvier, C., Clouchoux, C., Souedet, N., Hérard, A. S., You, Z., Jan, C., et al. (2018). “Computational optimization for fast and robust automatic segmentation in virtual microscopy using brute-force-based feature selection,” in Proceedings of the International Conference on Pattern Recognition and Artificial Intelligence, Singapore.

Google Scholar

Breiman, L. (2001). Random Forests. Mach. Learn. 45, 5–32. doi: 10.1023/A:1010933404324

CrossRef Full Text | Google Scholar

Brureau, A., Blanchard-Bregeon, V., Pech, C., Hamon, S., Chaillou, P., Guillemot, J.-C., et al. (2017). NF-L in cerebrospinal fluid and serum is a biomarker of neuronal damage in an inducible mouse model of neurodegeneration. Neurobiol. Dis. 104, 73–84. doi: 10.1016/j.nbd.2017.04.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, H. D., Jiang, X. H., Sun, Y., and Wang, J. L. (2001). Color image segmentation: advances and prospects. Pattern Recognit. 34, 2259–2281. doi: 10.1016/s0031-3203(00)00149-7

CrossRef Full Text | Google Scholar

Cousty, J., Bertrand, G., Najman, L., and Couprie, M. (2009). Watershed cuts: minimum spanning forests and the drop of water principle. IEEE Trans. Pattern Anal. Mach. Intell. 31, 1362–1374. doi: 10.1109/TPAMI.2008.173

PubMed Abstract | CrossRef Full Text | Google Scholar

Daněk, O., Matula, P., Ortiz-de-Solórzano, C., Muñoz-Barrutia, A., Maška, M., and Kozubek, M. (2009). “Segmentation of touching cell nuclei using a two-stage graph cut model, in: image analysis, lecture notes in computer science,” in Paper Presented at the Scandinavian Conference on Image Analysis, Berlin.

Google Scholar

Dewan, M. A. A., Ahmad, M. O., and Swamy, M. N. S. (2014). A method for automatic segmentation of nuclei in phase-contrast images based on intensity, convexity and texture. IEEE Trans. Biomed. Circuits Syst. 8, 716–728. doi: 10.1109/TBCAS.2013.2294184

PubMed Abstract | CrossRef Full Text | Google Scholar

Falk, T., Mai, D., Bensch, R., Çiçek, O., Abdulkadir, A., Marrakchi, Y., et al. (2019). U-Net: deep learning for cell counting, detection, and morphometry. Nat. Methods 16, 67–70. doi: 10.1038/s41592-018-0261-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Gundersen, H. J. (1986). Stereology of arbitrary particles. a review of unbiased number and size estimators and the presentation of some new ones, in memory of William R. Thompson. 143(Pt 1), 3–45. doi: 10.1111/j.1365-2818.1986.tb02764.x

CrossRef Full Text | Google Scholar

Hanbury, A. (2008). Constructing cylindrical coordinate colour spaces. Pattern Recognit. Lett. 29, 494–500. doi: 10.1016/j.patrec.2007.11.002

CrossRef Full Text | Google Scholar

He, Y., Gong, H., Xiong, B., Xu, X., Li, A., Jiang, T., et al. (2015). iCut: an Integrative Cut Algorithm Enables Accurate Segmentation of Touching Cells. Sci. Rep. 5:12089. doi: 10.1038/srep12089

PubMed Abstract | CrossRef Full Text | Google Scholar

Jelsing, J., Nielsen, R., Olsen, A. K., Grand, N., Hemmingsen, R., and Pakkenberg, B. (2006). The postnatal development of neocortical neurons and glial cells in the göttingen minipig and the domestic pig brain. J. Exp. Biol. 209, 1454–1462. doi: 10.1242/jeb.02141

PubMed Abstract | CrossRef Full Text | Google Scholar

Kainz, P., Urschler, M., Schulter, S., Wohlhart, P., and Lepetit, V. (2015). “You should use regression to detect cells,” in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015 MICCAI 2015. Lecture Notes in Computer Science, eds N. Navab, J. Hornegger, W. Wells, and A. Frangi (Cham: Springer), 276–283. doi: 10.1007/978-3-319-24574-4_33

CrossRef Full Text | Google Scholar

Karlsen, A. S., and Pakkenberg, B. (2011). Total numbers of neurons and glial cells in cortex and basal ganglia of aged brains with down syndrome–a stereological study. Cereb. Cortex 1991, 2519–2524. doi: 10.1093/cercor/bhr033

PubMed Abstract | CrossRef Full Text | Google Scholar

Kothari, S., Chaudry, Q., and Wang, M. D. (2009). “Automated cell counting and cluster segmentation using concavity detection and ellipse fitting techniques,” in Proceedings 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, Boston, MA.

Google Scholar

Laguitton, S., Rivière, D., Vincent, T., Fischer, C., Geoffroy, D., Souedet, N., et al., (2011). “Soma-workflow: a unified and simple interface to parallel computing resources,” in MICCAI Workshop High Perform. Distrib. Comput. Med. Imaging.

Google Scholar

Larsen, C. C., Bonde Larsen, K., Bogdanovic, N., Laursen, H., Graem, N., Samuelsen, G. B., et al. (2006). Total number of cells in the human newborn telencephalic wall. Neuroscience 139, 999–1003. doi: 10.1016/j.neuroscience.2006.01.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, G. G., Lin, H., Tsai, M., Chou, S., Lee, W., Liao, Y., et al. (2013). Automatic Cell Segmentation and Nuclear-to-Cytoplasmic Ratio Analysis for Third Harmonic Generated Microscopy Medical Images. IEEE Trans. Biomed. Circuits Syst. 7, 158–168. doi: 10.1109/TBCAS.2013.2253463

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, G., Liu, T., Nie, J., Guo, L., Chen, J., Zhu, J., et al. (2008). Segmentation of touching cell nuclei using gradient flow tracking. J. Microsc. 231, 47–58. doi: 10.1111/j.1365-2818.2008.02016.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Lou, X., Koethe, U., Wittbrodt, J., and Hamprecht, F. A. (2012). “Learning to segment dense cell nuclei with shape prior,” in Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI.

Google Scholar

Molnar, C., Jermyn, I. H., Kato, Z., Rahkama, V., Östling, P., Mikkonen, P., et al. (2016). Accurate morphology preserving segmentation of overlapping cells based on active contours. Sci. Rep. 6:32412. doi: 10.1038/srep32412

PubMed Abstract | CrossRef Full Text | Google Scholar

Mullen, R. J., Buck, C. R., and Smith, A. M. (1992). NeuN, a neuronal specific nuclear protein in vertebrates. Dev. Camb. Engl. 116, 201–211.

Google Scholar

Nedzved, A., Ablameyko, S., and Pitas, I. (2000). “Morphological segmentation of histology cell images,” in Proceedings of the 15th International Conference on Pattern Recognition, Barcelona.

Google Scholar

Ojala, T., Pietikäinen, M., and Mäenpää, T. (2002). Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal. Mach. Intell. 24, 971–987. doi: 10.1109/TPAMI.2002.1017623

CrossRef Full Text | Google Scholar

Pakkenberg, B., and Gundersen, H. J. (1997). Neocortical neuron number in humans: effect of sex and age. J. Comp. Neurol. 384, 312–320. doi: 10.1002/(sici)1096-9861(19970728)384:2<312::aid-cne10>3.0.co;2-k

PubMed Abstract | CrossRef Full Text | Google Scholar

Pelvig, D. P., Pakkenberg, H., Stark, A. K., and Pakkenberg, B. (2008). Neocortical glial cell numbers in human brains. Neurobiol. Aging 29, 1754–1762. doi: 10.1016/j.neurobiolaging.2007.04.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Perona, P., and Malik, J. (1990). Scale-space and edge detection using anisotropic diffusion. IEEE Trans. Pattern Anal. Mach. Intell. 12, 629–639. doi: 10.1109/34.56205

CrossRef Full Text | Google Scholar

Poulain, E., Prigent, S., Soubies, E., and Descombes, X. (2015). “Cells detection using segmentation competition,” in Proceedings of the 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI), New York, NY.

Google Scholar

Pukelsheim, F. (1994). The Three Sigma Rule. Am. Stat. 48, 88–91. doi: 10.1080/00031305.1994.10476030

CrossRef Full Text | Google Scholar

Qi, J. (2014). Dense nuclei segmentation based on graph cut and convexity-concavity analysis. J. Microsc. 253, 42–53. doi: 10.1111/jmi.12096

PubMed Abstract | CrossRef Full Text | Google Scholar

Riccio, D., Brancati, N., Frucci, M., and Gragnaniello, D. (2019). A new unsupervised approach for segmenting and counting cells in high-throughput microscopy image sets. IEEE J. Biomed. Health Inform. 23, 437–448. doi: 10.1109/JBHI.2018.2817485

PubMed Abstract | CrossRef Full Text | Google Scholar

Ronneberger, O., Fischer, P., and Brox, T. (2015). “U-Net: convolutional networks for biomedical image segmentation,” in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Lecture Notes in Computer Science, eds N. Navab, J. Hornegger, W. M. Wells, and A. F. Frangi (Berlin: Springer International Publishing).

Google Scholar

Schmitz, C., Eastwood, B. S., Tappan, S. J., Glaser, J. R., Peterson, D. A., and Hof, P. R. (2014). Current automated 3D cell detection methods are not a suitable replacement for manual stereologic cell counting. Front. Neuroanat. 8:27. doi: 10.3389/fnana.2014.00027

CrossRef Full Text | Google Scholar

Shu, J., Fu, H., Qiu, G., Kaye, P., and Ilyas, M. (2013). Segmenting overlapping cell nuclei in digital histopathology images. Conf. Proc. IEEE Eng. Med. Biol. Soc. 2013, 5445–5448. doi: 10.1109/EMBC.2013.6610781

PubMed Abstract | CrossRef Full Text | Google Scholar

Thu, D. C., Oorschot, D. E., Tippett, L. J., Nana, A. L., Hogg, V. M., Synek, B. J., et al. (2010). Cell loss in the motor and cingulate cortex correlates with symptomatology in Huntington’s disease. Brain J. Neurol. 133, 1094–1110. doi: 10.1093/brain/awq047

PubMed Abstract | CrossRef Full Text | Google Scholar

Vandenberghe, M. E., Balbastre, Y., Souedet, N., Hérard, A.-S., Dhenain, M., Frouin, F., et al. (2015). Robust supervised segmentation of neuropathology whole-slide microscopy images, in 2015. Conf. Proc. IEEE Eng. Med. Biol. Soc. 2015, 3851–3854.

Google Scholar

Vandenberghe, M. E., Hérard, A.-S., Souedet, N., Sadouni, E., Santin, M. D., Briet, D., et al. (2016). High-throughput 3D whole-brain quantitative histopathology in rodents. Sci. Rep. 6:20958. doi: 10.1038/srep20958

PubMed Abstract | CrossRef Full Text | Google Scholar

Vandenberghe, M. E., Souedet, N., Hérard, A.-S., Ayral, A.-M., Letronne, F., Balbastre, Y., et al. (2018). Voxel-based statistical analysis of 3D immunostained tissue imaging. Front. Neurosci. 12:754. doi: 10.3389/fnins.2018.00754

CrossRef Full Text | Google Scholar

Waldvogel, H. J., Kim, E. H., Tippett, L. J., Vonsattel, J. P. G., and Faull, R. L. (2015). The Neuropathology of Huntington’s Disease. Curr. Top. Behav. Neurosci. 22, 33–80.

Google Scholar

Walløe, S., Pakkenberg, B., and Fabricius, K. (2014). Stereological estimation of total cell numbers in the human cerebral and cerebellar cortex. Front. Hum. Neurosci. 8:508. doi: 10.3389/fnhum.2014.00508

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., and He, D.-C. (1990). Texture classification using texture spectrum. Pattern Recognit. 23, 905–910. doi: 10.1016/0031-3203(90)90135-8

CrossRef Full Text | Google Scholar

West, M. J., Slomianka, L., and Gundersen, H. J. G. (1991). Unbiased stereological estimation of the total number of neurons in the subdivisions of the rat hippocampus using the optical fractionator. Anat. Rec. 231, 482–497. doi: 10.1002/ar.1092310411

PubMed Abstract | CrossRef Full Text | Google Scholar

Williams, R. W., and Herrup, K. (1988). The control of neuron number. Annu. Rev. Neurosci. 11, 423–453. doi: 10.1146/annurev.ne.11.030188.002231

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie, W., Noble, J. A., and Zisserman, A. (2016). Microscopy cell counting and detection with fully convolutional regression networks. Comput. Methods Biomech. Biomed. Eng. Imaging Vis. 6, 283–292. doi: 10.1186/s12859-019-3037-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, H., Lu, C., and Mandal, M. (2014). An Efficient Technique for Nuclei Segmentation Based on Ellipse Descriptor Analysis and Improved Seed Detection Algorithm. IEEE J. Biomed. Health Inform. 18, 1729–1741. doi: 10.1109/JBHI.2013.2297030

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, X., Li, H., and Zhou, X. (2006). Nuclei Segmentation Using Marker-Controlled Watershed, Tracking Using Mean-Shift, and Kalman Filter in Time-Lapse Microscopy. IEEE Trans. Circuits Syst. Regul. Pap. 53, 2405–2414. doi: 10.1109/TCSI.2006.884469

CrossRef Full Text | Google Scholar

You, Z., Vandenberghe, M. E., Balbastre, Y., Souedet, N., Hantraye, P., Jan, C., et al. (2016). “Automated cell individualization and counting in cerebral microscopic images,” in Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ.

Google Scholar

Zhang, C., Sun, C., and Pham, T. D. (2013). Segmentation of clustered nuclei based on concave curve expansion. J. Microsc. 251, 57–67. doi: 10.1111/jmi.12043

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Xing, F., Su, H., Yang, L., and Zhang, S. (2015). High-throughput histopathological image analysis via robust cell segmentation and hashing. Med. Image Anal. 26, 306–315. doi: 10.1016/j.media.2015.10.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Zucker, S. W. (1976). Region Growing: childhood and adolescence. Comput. Graph. Image Process. 5, 382–399. doi: 10.1016/s0146-664x(76)80014-7

CrossRef Full Text | Google Scholar

Keywords: neuron individualization, touching neurons, size-varying neurons, microscopic images, macaque brain

Citation: You Z, Balbastre Y, Bouvier C, Hérard A-S, Gipchtein P, Hantraye P, Jan C, Souedet N and Delzescaux T (2019) Automated Individualization of Size-Varying and Touching Neurons in Macaque Cerebral Microscopic Images. Front. Neuroanat. 13:98. doi: 10.3389/fnana.2019.00098

Received: 30 July 2019; Accepted: 22 November 2019;
Published: 17 December 2019.

Edited by:

Marcello Rosa, Monash University, Australia

Reviewed by:

Piotr Majka, Nencki Institute of Experimental Biology (PAS), Poland
Sam Merlin, Western Sydney University, Australia
Leon Teo, Australian Regenerative Medicine Institute (ARMI), Australia

Copyright © 2019 You, Balbastre, Bouvier, Hérard, Gipchtein, Hantraye, Jan, Souedet and Delzescaux. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Thierry Delzescaux, dGhpZXJyeS5kZWx6ZXNjYXV4QGNlYS5mcg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.