Introduction

Neurons have incredibly complex and diverse shapes. Since Ramón y Cajal, neuroanatomists have studied their morphology1 and have classified them into different types. From a computational point of view, a neuron’s dendritic morphology constrains which inputs it receives, how these inputs are integrated, and, thus, which computations the neuron and the circuit it is part of can learn to perform.

Less than 15% of neocortical neurons are inhibitory, yet they are morphologically the most diverse and can be classified reliably into well-defined subtypes2,3,4. The vast majority of cortical neurons are excitatory. Excitatory cells can be divided into spiny stellate and pyramidal cells5. Although pyramidal cells share a stereotypical overall dendritic layout, they exhibit a large degree of morphological diversity. Recent studies subdivide them into 10–20 cell types using manual classification6 or clustering algorithms applied to dendritic morphological features7,8,9.

Existing studies of excitatory morphologies have revealed a number of consistent patterns, such as the well-known thick-tufted pyramidal cells of layer 56,7,8,9,10. However, a commonly agreed-upon morphological taxonomy of excitatory neuron types is yet to be established. For instance, Markram et al.6 describe two types of thick-tufted pyramidal cells based on the location of the bifurcation point of the apical dendrite (early vs. late). Later studies suggest that these form two ends of a continuous spectrum7,8. Other authors even observe that morphological features overall do not form isolated clusters and suggest an organization into families with more continuous variation within families11. Previous morphological characterizations have two main limitations: First, many rely on relatively small numbers of reconstructed neurons to assess the morphological landscape. Second, they represent the dendritic morphology using summary statistics such as point counts, segment lengths, volumes, density profiles (so-called morphometrics9,12,13), or graph-based topological measures14. These features were handcrafted by humans and may not capture all crucial axes of variation.

Here, we take a data-driven approach, using a recently developed unsupervised representation learning method15 to extract a morphological feature representation directly from the dendritic skeleton. We apply this approach to a large-scale anatomical dataset16 to obtain low-dimensional vector embeddings (“bar codes”) of more than 30,000 neurons in mouse visual areas V1, AL, and RL. Our analysis suggests that excitatory neurons’ morphologies form a continuum, with notable exceptions such as layer 5 thick-tufted cells, and vary with respect to three major axes: soma depth, total apical skeletal length, and total basal skeletal length. Moreover, we observed several morphological trends in the upper layers: Neurons in layer 2/3 showed decreasing dendritic arbor width and smaller tufts with increasing cortical depth. In layer 4, morphologies showed area-specific variation: atufted neurons were primarily located in the primary visual cortex, while tufted neurons were more abundant in higher visual areas. Finally, layer 4 neurons in V1 at the border to layer 5 showed a tendency to avoid layer 5 with their dendrites.

Results

Self-supervised learning of embeddings for 30,000 excitatory neurons from visual cortex

Our goal was to perform a large-scale census of the dendritic morphologies of excitatory neurons without prescribing a priori which morphological features to use. Therefore, we used machine learning techniques15 to learn the features directly from the neuronal morphology.

Our starting point was a 1.3 × 0.87 × 0.82 mm³ volume of tissue from the visual cortex of an adult P75–87 mouse, which has been densely reconstructed using serial section electron microscopy16. This volume has been segmented into individual cells, including non-neuronal types and more than 54,000 neurons whose soma was located within the volume. From these detailed reconstructions, we extracted each neuron’s dendritic tree and represented it as a skeleton (Fig. 1A)17: each neuron’s dendritic morphology was represented as a graph, where each node had a location in 3d space. This means we focused on the location and branching patterns of the dendritic tree, not on fine-grained details of spines or synapses (see companion paper18) or any subcellular structures (see companion paper19).

Fig. 1: Pipeline to generate vector embeddings for large-scale datasets that capture the morphological features of the neurons’ dendritic trees.

A Imaging of brain volume via electron microscopy and subsequent segmentation and tracing to render 3D meshes of individual neurons that are used for skeletonization. B Self-supervised learning of low-dimensional vector embeddings z1, z2 that capture the essence of the 3D morphology of individual neurons using GraphDINO. Two augmented “views” of the neuron are input into the network, where the weights of one encoder (bottom) are an exponential moving average (EMA) of the other encoder (top). The objective is to maximize the similarity between the vector embeddings of both views. Vector embeddings of similar neurons are close to each other in latent space. C An individual neuron is represented by its vector embedding as a point in the 32-dimensional vector space. D Quality control to remove neurons with tracing errors. Figure 1 was adapted from Weis, Hansel, Lüddecke, and Ecker, Self-Supervised Graph Representation Learning for Neuronal Morphologies, Transactions on Machine Learning Research, 899 (2023), https://openreview.net/pdf?id=ThhMzfrd6r under a CC BY license: https://creativecommons.org/licenses/by/4.0/.

Our next step was to embed these graphs into a vector space that defined a measure of similarity, such that similar morphologies were mapped onto nearby points in embedding space (Fig. 1B). To do so, we employed a recently developed self-supervised learning method called GraphDINO15 that learns semantic representations of graphs without relying on manual annotations. The idea of this method is to generate two “views” of the same input by applying random identity-preserving transformations such as rotations around the vertical axis, slightly perturbing node locations, or dropping subbranches (Fig. 1B, top and bottom). Then, both views are encoded using a neural network. The neural network is trained to map both views onto similar vector embeddings. For model training, the data was split into training, validation, and test data to ensure that the model did not overfit (Section “Morphological feature learning using GraphDINO”). The model outputs a 32-dimensional vector for each neuron that captures the morphological features of the neuron’s dendritic tree. Thus, each neuron is represented as a point in this 32-dimensional vector space (Fig. 1C).
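
To make the training objective concrete, the following is a minimal sketch of such a two-view, teacher-student training step in PyTorch. It is an illustration only: `GraphEncoder` and `augment` are hypothetical placeholders for the graph transformer and augmentation pipeline, the EMA decay value is assumed, and the cosine-similarity loss stands in for GraphDINO's cross-entropy over projected prototype distributions.

```python
import copy
import torch
import torch.nn.functional as F

# GraphEncoder and augment are placeholders for the actual graph
# transformer and augmentation pipeline.
student = GraphEncoder(out_dim=32)
teacher = copy.deepcopy(student)       # same architecture, EMA weights
for p in teacher.parameters():
    p.requires_grad = False

optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)
ema_decay = 0.996                      # assumed value

def training_step(graph):
    view1, view2 = augment(graph), augment(graph)  # two random "views"
    z1 = student(view1)
    with torch.no_grad():
        z2 = teacher(view2)
    # Pull the two views' embeddings together (cosine similarity here;
    # GraphDINO matches distributions over projected prototypes instead).
    loss = -F.cosine_similarity(z1, z2, dim=-1).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    # The teacher tracks the student via an exponential moving average.
    with torch.no_grad():
        for ps, pt in zip(student.parameters(), teacher.parameters()):
            pt.mul_(ema_decay).add_(ps, alpha=1 - ema_decay)
    return loss.item()
```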

At this stage, we performed another quality control step: Using the learned embeddings as a similarity metric between neurons, we clustered the neurons into 100 clusters and manually inspected the resulting clusters. We found a non-negligible fraction of neurons whose apical dendrite left the volume or was lost during tracing (see Methods for details). We removed neurons whose somata were in close proximity to the boundary of the imaged volume (Fig. 2A). Additionally, we used the clusters containing fragmented neurons as examples of broken neurons and trained a classifier to predict whether a neuron had reconstruction errors, using the learned morphological embeddings as input features (Fig. 2B, Supplementary Fig. 2A, B). We then removed all neurons from the dataset that were classified as erroneous. At this point, we also removed all interneurons from the dataset, since we focus on excitatory neurons in this paper (Fig. 2C, Supplementary Fig. 2C, D). We further removed neurons with cut apical dendrites (Section “Supervised classifiers”).

Fig. 2: Visualization of soma depths and cortical layer assignments of excitatory neuronal morphologies showing mostly a continuum with distinct clusters only in deeper layers.

A Top view of the EM volume with approximate visual areas indicated. All neurons with their soma origin within the red boundary were used for analysis. B Distribution of complete neurons (N) and fragments (F) along cortical depth, as determined by our classifier based on the morphological embeddings. C Distribution of excitatory neurons (E) and interneurons (I) along cortical depth. D Classifier prediction for cortical layer origin based on the learned morphological embeddings. E t-SNE embedding (perplexity = 300) of the vector embeddings of excitatory neuronal morphologies colored by the respective soma depth (in μm) of the neurons relative to the pia (n = 32,571). F t-SNE embedding colored by cortical layer assignments as predicted by a cross-validated classifier trained on the morphological embeddings as features and a subset of manually labeled excitatory neurons (n = 922). G Cross-section of the brain volume depicting soma positions of neurons colored by their assigned cortical layer. Cortical layer thicknesses for primary visual cortex (V1) (left) and higher visual areas (HVA) (right) are given as mean ± standard deviation. H t-SNE embedding of excitatory neuronal morphologies colored by expert-defined cell types. I Example morphologies of the expert-defined cell types. Source data are provided as a Source Data file.

The vector embeddings of the remaining 32,571 excitatory neurons in the dataset were organized by cortical depth (Fig. 2E) and, as a consequence, could distinguish well between different cortical layers (Fig. 2F, G; note that there is no 1:1 correspondence between cortical depth and layer, as the layer boundaries varied across the volume). The learned embeddings could also distinguish between broad cell types (Fig. 2H, I) that were assigned by expert neuroanatomists18 based on the cortical origin of the somata and their long-range projection type (IT: intratelencephalic or intracortical; ET: extratelencephalic or subcortical projecting; NP: near projecting; CT: cortico-thalamic). Note that neither the location of the soma nor the projection type was provided to the model, showing that the dendritic morphology by itself provides information on these broad cell types. One exception was the pair of 6P-CT and 6P-IT cells, which were partly intermingled in the embedding space. 6P-IT cells show a high variance in their dendritic morphology, which in some cases is indistinguishable from that of 6P-CT cells when no information about the projection type is used (Fig. 2H, I).

To demonstrate that the learned embedding is generally applicable beyond EM datasets and the MICrONS dataset specifically, we used the GraphDINO model trained on MICrONS to embed 61 neurons from mouse visual cortex20 that had been recorded using PatchSeq21. The model generalized well to this other dataset and recording technique (Supplementary Fig. 10; Supplementary Note 2).

Dendritic morphologies mostly form a continuum with distinct clusters only in deeper layers

We noticed that the embedding space appeared to form largely a continuum, with only a few fairly distinct clusters, such as the layer 5 ET cells (Fig. 2H, purple). Previous papers have characterized excitatory morphologies by categorizing neurons into morphological types (m-types), with the number of types varying between nine and nineteen6,7,9,14,18,22. But is categorization into discrete types the best way of describing the landscape of morphologies, or is it rather characterized by continuous variation? The answer depends on the structure of the data. Consider the following toy example where the data is generated by a mixture of two normal distributions (Fig. 3A): If the two components are well separated, it makes sense to define each one as a distinct type (Fig. 3A, left). However, if they are strongly overlapping such that the resulting data distribution is not even bimodal (Fig. 3A, right), describing the distribution by two types is not useful, and identifying the two types by clustering will not work reliably, either. But there are also scenarios in between, where the distinction is not as straightforward (Fig. 3A, middle). Thus, the question of whether a distribution is discrete or forms a continuum does not have a binary answer – it is rather a matter of degree.
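
This toy example can be reproduced in a few lines. The sketch below samples from an equal-weight mixture of two Gaussians with means −1 and 1 (as in Fig. 3A) and compares the empirical density at the component means with the density at the midpoint; analytically, such a mixture is bimodal only while σ < 1.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 30_000

# Equal-weight mixture of N(-1, sigma^2) and N(+1, sigma^2), as in Fig. 3A.
# Analytically, this mixture is bimodal only while sigma < 1.
for sigma in (0.25, 0.5, 1.0, 1.5):
    comp = rng.integers(0, 2, size=n)
    x = rng.normal(np.where(comp == 0, -1.0, 1.0), sigma)
    hist, edges = np.histogram(x, bins=100, density=True)
    centers = (edges[:-1] + edges[1:]) / 2
    peak = hist[np.abs(np.abs(centers) - 1.0).argmin()]   # density near +/-1
    valley = hist[np.abs(centers).argmin()]               # density near 0
    print(f"sigma={sigma}: peak - valley = {peak - valley:+.3f}")
```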

Fig. 3: Cluster versus continuum analysis.

A Histograms of samples from a 1d Gaussian mixture (n = 30,000, number of components = 2) in green and the underlying mixture components with means μ1 = −1 and μ2 = 1 in yellow. Data distributions evolve from discrete to continuous by increasing the standard deviation (SD) from left to right. B t-SNE representation of synthetic data (n = 32,571, perplexity = 300). Synthetic data is sampled from Gaussian mixtures with 20 components. Cluster means and weights are estimated from neuronal data. Isotropic variance is set to obtain data evolving from discrete clusters to uniform distributions. Grey insets (1–6) show histograms of two sample clusters (12 and 1) and their nearest neighbors (0 and 17, respectively) projected onto the direction connecting their cluster means (left), as well as the cumulative distribution of the samples assigned to these two clusters along this direction (right). The “dip” value represents the dip statistic, a measure of bimodality of a distribution (higher = more bimodal). C Mean adjusted Rand index (ARI) of 100 GMMs with an increasing number of components fit to the synthetic datasets. The correct number of underlying components can be identified as long as the variance in the data is not too high (< 0.5 for 20 components). D t-SNE representation (n = 32,571, perplexity = 300) of neuronal data colored by cluster membership (GMM with 20 components). Grey insets (7 & 8) show 1d projections of clusters 12 and 1 onto the line connecting their means with their nearest neighbors (0 and 17, respectively). Cumulative distributions show that while there is a gap between cluster 12 and its neighbors, there is none between cluster 1 and its neighbors. E Cluster analysis as in (C) for neuronal data. No specific number of components can be recovered. F t-SNE representation of neuronal data overlaid with the nearest-neighbor graph between clusters. The line width indicates the dip statistic (thicker = more connected). G Maximum dip statistic between all clusters and their nearest neighbor for the synthetic data with 20 components and varying variance (yellow curve) and for the neuronal data clustered with 20 components (red dashed line). Source data are provided as a Source Data file.

To understand to what degree our dataset forms a continuum, we devised a simple procedure based on synthetic data that emulates the real data to some extent but allows us to manipulate the degree of separation. The synthetic data was generated from a Gaussian mixture model (GMM) fit to our morphological embeddings, from which we kept the cluster means and weights but replaced the covariance matrices with spherical ones of varying variance (σ²). Following previous estimates of the number of excitatory cell types in the rodent sensory cortex6,7,9,14,18,22, we generated synthetic data distributions with 20 clusters (Fig. 3B), as well as with 10 and 40 clusters as controls (Supplementary Fig. 5). When the variance was small (σ² = 0.05), all clusters were clearly distinct (Fig. 3B, left). As we increased the variance to 1, the distribution became more and more continuous (Fig. 3B, right). At intermediate values of 0.3 or 0.5, the synthetic data distribution qualitatively resembled the real data (Fig. 3D).
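
The construction of the synthetic data can be sketched as follows, assuming the 32-dimensional embeddings are available as an array `Z` (a placeholder name): fit a GMM, keep its means and weights, and resample with isotropic noise of a chosen variance.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

# Fit a GMM to the 32d embeddings Z (placeholder), keep means and weights.
gmm = GaussianMixture(n_components=20, covariance_type="full",
                      random_state=0).fit(Z)

def sample_synthetic(var, n=32_571, seed=0):
    """Resample from the fitted means/weights with isotropic variance."""
    rng = np.random.default_rng(seed)
    comp = rng.choice(len(gmm.weights_), size=n, p=gmm.weights_)
    return gmm.means_[comp] + rng.normal(scale=np.sqrt(var),
                                         size=(n, gmm.means_.shape[1]))

X_discrete = sample_synthetic(var=0.05)   # clearly separated clusters
X_continuum = sample_synthetic(var=1.0)   # near-continuous distribution
```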

To make the comparison more quantitative, we asked two questions, which can be answered using the synthetic data, for which the ground-truth generating process is known. First, we asked under which conditions we could reliably identify the underlying clusters that generated the data (Fig. 3C). To do so, we assumed we did not know the generative process and clustered the synthetic data repeatedly by fitting GMMs with varying numbers of components and random initial conditions. We found that in the extreme scenario, when all clusters were clearly separated, the result of the clustering was highly consistent across runs when the number of clusters matched the ground truth (Fig. 3C; ARI ≥ 0.85 for σ² ≤ 0.1 and number of ground-truth components equal to number of GMM components). As the degree of overlap between the clusters increased, the consistency of the clustering result decreased and the optimal number of clusters became less clearly defined. For a larger degree of overlap (σ² > 0.5), the consistency of clusterings decreased monotonically with the number of clusters, and no optimal number of clusters could be determined. The same was true for the real data (Fig. 3E): There was no noticeable peak in the ARI across different numbers of clusters, suggesting that the scenario with σ² ≥ 0.5 is realistic in this regard (neuronal data: ARI = 0.63 for 20 clusters, compared to ARI = 0.62 for 20 clusters and σ² = 0.5 for the synthetic data).
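
The consistency analysis amounts to refitting GMMs under random initializations and averaging pairwise ARIs, for example (a sketch with scikit-learn; the paper used 100 GMM fits, here reduced for brevity):

```python
import numpy as np
from sklearn.metrics import adjusted_rand_score
from sklearn.mixture import GaussianMixture

def clustering_consistency(X, n_components, n_runs=10):
    """Mean pairwise ARI across GMM fits with random initializations;
    a peak over n_components indicates a recoverable number of clusters."""
    labels = [GaussianMixture(n_components, covariance_type="diag",
                              random_state=seed).fit_predict(X)
              for seed in range(n_runs)]
    aris = [adjusted_rand_score(labels[i], labels[j])
            for i in range(n_runs) for j in range(i + 1, n_runs)]
    return float(np.mean(aris))

# Example: scan candidate numbers of components on embeddings X.
# scores = {k: clustering_consistency(X, k) for k in (10, 15, 20, 25, 30, 40)}
```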

Next, we investigated the degree to which individual clusters were distinct from their neighboring clusters. Even though certain parts of the distribution appeared continuous, there could still be individual clusters that are well separated. To address this question, we built a k-nearest-neighbor graph from the clustering output, connecting each cluster to its k = 3 nearest neighbors. We then quantified, for each pair of neighboring clusters, how separated they were. To do so, we projected all data points assigned to the pair onto the direction connecting the two cluster means (Fig. 3B, insets left) and computed the dip statistic23. The dip statistic measures how bimodal a distribution is by computing how much its empirical cumulative distribution deviates from that of the closest unimodal distribution (Fig. 3B, insets right). It is close to zero for unimodal distributions and increases with increasing separation of the two modes of a bimodal distribution. This analysis confirmed the qualitative impression from the t-distributed stochastic neighbor embedding (t-SNE24) that the layer 5 ET cluster (purple cluster 12 in Fig. 3B, D) was separated more from its nearest neighbor (cluster 0, green) than two representative example clusters from layer 2/3 (clusters 1 and 17, red and teal), which were not separated and appeared to divide a continuum more or less arbitrarily. These two patterns in the neuronal data were reproduced well by the synthetic data with a standard deviation of 0.5 (Fig. 3B, insets 5 & 6). Examination of the entire nearest-neighbor graph showed that layers 2–4, including the upper part of layer 5, formed a continuum with no neighboring clusters being well separated; that clusters in layer 5 were more distinct; and that two clusters in layer 6 (inverted and subplate neurons) stood out from a larger clique of layer 6 clusters (Fig. 3F). Over the entire dataset, the maximum dip statistic (maximally separated clusters) of the neuronal data lay between the maximum dip statistics for the synthetic data with σ² = 0.3 and σ² = 0.5 (Fig. 3G), again suggesting that the qualitative visualization by t-SNE captures the underlying structure of the data well.
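
The pairwise separation measure can be sketched as follows, using the third-party `diptest` package as one publicly available implementation of Hartigan's dip statistic:

```python
import numpy as np
import diptest  # third-party implementation of Hartigan's dip statistic

def cluster_pair_dip(X, labels, a, b):
    """Dip statistic of clusters a and b projected onto the line
    connecting their means (higher = more bimodal = better separated)."""
    Xa, Xb = X[labels == a], X[labels == b]
    direction = Xb.mean(axis=0) - Xa.mean(axis=0)
    direction /= np.linalg.norm(direction)
    projection = np.concatenate([Xa, Xb]) @ direction
    dip, pval = diptest.diptest(projection)
    return dip
```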

The analyses presented so far established that our learned morphological embeddings form mostly a continuum. Could this result be caused by our learning methods? We found no evidence that this was the case, as using a different contrastive learning objective (Supplementary Fig. 7) to train GraphDINO and varying model hyperparameters (Supplementary Fig. 6) produced the same result. Similarly, using handcrafted morphometrics from earlier studies7,18 on our data did not change our conclusions (Supplementary Fig. 9, Supplementary Note 1). Additionally, we employed alternative dimensionality reduction techniques with varying settings (Supplementary Fig. 8) to ensure that our interpretation is not dependent on t-SNE for visualization.

The landscape of morphological variation across layers

Given the results from the previous section, we conclude that excitatory morphologies were mostly organized along a continuum, with only a few distinct clusters in the deeper layers. Therefore, we did not base our subsequent analyses on a set of m-types as previous studies did but instead investigated the major axes of variation within the morphological embedding space. The cortical organization into layers is well established, so we separated cells by cortical layer. We determined the layer boundaries by training a classifier using our 32-dimensional morphological embeddings and a set of 922 neurons manually assigned to layers by experts (Fig. 2D, F, G). As expected, the inferred layer boundaries indicated that layer 4 was approximately 20% thicker in V1 than in higher visual areas RL and AL (Fig. 2G; mean ± SD: 118 ± 6 μm in V1 vs. 97 ± 6 μm in HVA), the difference being compensated for by layers 2/3 and 6 each being approximately 10 μm thinner. In the following, we proceed by assigning neurons to layers based on their soma location relative to these inferred boundaries.

To visualize the main axes of morphological variation within each layer, we performed nonlinear dimensionality reduction using t-SNE and identified morphological features that formed gradients within the t-SNE embedding space. Based on visual inspection, we found the following six morphological metrics to account well for a large fraction of the dendritic morphological diversity in our dataset (see Fig. 4 for an illustration): (1) depth of the soma relative to the pia, (2) height of the cell, (3) total length of the apical dendrites, (4) width of the apical dendritic tree, (5) total length of the basal dendrites, and (6) location of the basal dendritic tree relative to the soma ("basal bias”).

Fig. 4: Schematic of morphometric descriptors computed from neuronal skeletons and their labeled compartments.

Soma depth: Depth of the centroid of the soma relative to the pia. Height: Extent of the cell along the y-axis. Total apical length: Total length of the skeletal branches of the apical dendrites. Apical width: Maximum extent of the apical dendritic tree in the xz-plane. Total basal length: Total length of the skeletal branches of the basal dendrites. Basal bias: Depth along the y-axis of the center of mass of the basal dendrites relative to the soma.
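
For illustration, the descriptors could be approximated from labeled skeleton nodes roughly as below. This is a simplified sketch: it treats consecutive nodes as a path when summing lengths, whereas the actual computation sums the lengths of the skeleton's edges, and all input names are placeholders.

```python
import numpy as np

def morphometrics(nodes, labels, soma_xyz, pia_y=0.0):
    """Approximate the six descriptors of Fig. 4 from skeleton nodes.

    nodes: (n, 3) coordinates with y as the cortical-depth axis;
    labels: per-node compartment labels; soma_xyz: soma centroid.
    """
    apical = nodes[labels == "apical"]
    basal = nodes[labels == "basal"]

    def path_length(pts):
        # Simplification: sums consecutive-node distances; the actual
        # computation sums the lengths of the skeleton's edges.
        return np.linalg.norm(np.diff(pts, axis=0), axis=1).sum()

    return {
        "soma_depth": soma_xyz[1] - pia_y,
        "height": np.ptp(nodes[:, 1]),
        "total_apical_length": path_length(apical),
        # horizontal (xz-plane) extent of the apical tree
        "apical_width": max(np.ptp(apical[:, 0]), np.ptp(apical[:, 2])),
        "total_basal_length": path_length(basal),
        # center of mass of basal dendrites relative to the soma (y-axis)
        "basal_bias": basal[:, 1].mean() - soma_xyz[1],
    }
```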

Layer 2/3: Width and length of apical dendrites decrease with depth

In layer 2/3 (L2/3), we found a continuum of dendritic morphologies that formed a gradient from superficial to deep, with deeper neurons (in terms of soma depth) becoming thinner and less tufted (Fig. 5A L2/3 a,b,c). The strongest predictors of the embeddings were the depth of the soma relative to the pia and the total height of the cell (coefficient of determination R² > 0.9; Fig. 5B L2/3; Supplementary Table 3). These two metrics were also strongly correlated (Spearman’s rank correlation coefficient, ρ = 0.93; Fig. 5C L2/3; Supplementary Table 4), since nearly all L2/3 cells had an apical dendritic tree that reached the pial surface (see example morphologies in Fig. 5A L2/3, top). L2/3 cells varied in terms of their degree of tuftedness: both the total length and the width of their apical tuft decreased with the depth of the soma relative to the pia (Fig. 5A L2/3 b,c). L2/3 cells also varied along a third axis: the skeletal length of their basal dendrites (Fig. 5A L2/3 d); however, this property was not strongly correlated with either soma depth or the shape of the apical dendrites (Fig. 5C L2/3).

Fig. 5: t-SNE visualization of vector embeddings per cortical layer reveal axes of variation in neuronal morphologies.

A t-SNE embeddings per layer colored by percentiles of various morphometric descriptors, with example neuronal morphologies along the axis of variation displayed above the embedding. B R² scores of the six morphometric descriptors (see Fig. 4) per layer, showing their strength as predictors of the 32d embeddings. C Spearman’s rank correlation coefficient between morphometric descriptors per layer. Layer 2/3 (blue) Continuum of dendritic morphologies with thinner and less tufted neurons with increasing distance from the pia. Layer 4 (turquoise) Continuation of L2/3 trends with shorter apical dendrites and more atufted cells. Many cells avoid reaching their dendrites into L5 (basal bias). Layer 5 (green) Clustering of thick-tufted ET and NP cells. Upper L5 cells resemble L4 cells that avoid reaching into L5, suggesting that the inferred laminar borders are too strict. Layer 6 (orange) Continuum with a large morphological diversity, e.g. in cell heights, and existence of horizontal and inverted pyramidal neurons. Source data are provided as a Source Data file.

Layer 4: Small or no tufts and some cells’ basal dendrites avoid layer 5

The dendritic morphology of layer 4 (L4) was again mostly a continuum and appeared to be a continuation of the trends from L2/3: The skeletal length of the apical dendrites was shorter, on average, than that of most L2/3 cells (Fig. 5A L4 b) and approximately 20% of the cells were atufted. Within L4, the total apical skeletal length was not correlated with the depth of the soma (ρ = 0.0; Fig. 5C L4; Supplementary Table 4), suggesting that it forms an independent axis of variation. Considerable variability was observed in terms of the total length of the basal dendritic tree, but – as in L2/3 – it was not correlated with any of the other properties.

Our data-driven embeddings revealed another axis of variation that had previously not been considered important: the location of the basal dendritic tree relative to the soma ("basal bias”; Fig. 4). We found that many L4 cells avoided reaching into L5 with their dendrites (Fig. 5A L4 c). As a result, the depth of the basal dendrites was anticorrelated with the depth of the soma (ρ = −0.29; Fig. 5A L4 c and Fig. 5C L4; Supplementary Table 4). We will come back to this observation later (see Section “Layer 4 cells avoiding layer 5 are located primarily in primary visual cortex”).

Layer 5: Thick-tufted cells stand out

Layer 5 (L5) showed a less uniformly distributed latent space than L2/3 or L4 (Figs. 5A L5, 3F). Most distinct was the cluster of well-known thick-tufted pyramidal tract (PT) cells6,7,8,9,10 on the bottom right (Fig. 5A L5 d, light green points), also known as extratelencephalic (ET) projection neurons. These cells accounted for approximately 17% of the cells within L5 (based on a classifier trained on a smaller, manually annotated subset of the data; see Methods). They were restricted almost exclusively to the deeper half of L5 (Fig. 5A L5 a and d, inset 2; C inset top right), and, compared to other L5 cells, they had the longest skeletons across all three dendritic compartments: apical, basal, and oblique.

Another morphologically distinct type of cell was apparent at the end of the layer 5 spectrum: the near-projecting (NP) cells7,25 with their long and sparse basal dendrites (Fig. 5A L5 d, inset 3). These cells accounted for approximately 4% of the cells within L5. They tended to send their dendrites deeper (relative to the soma), had little or no obliques, and tended to have small or no apical tufts.

The remaining roughly 80% of the cells within L5 varied continuously in terms of the skeletal length of the different dendritic compartments. While there was a correlation between apical and basal skeletal length (apical vs. basal: ρ = 0.43; Fig. 5C L5; Supplementary Table 4), there was also a substantial degree of diversity. Within this group, there was no strong correlation of morphological features with the location of the soma within L5 (depth vs. apical length ρ = 0.2, depth vs. basal ρ = 0.06; Fig. 5C L5; Supplementary Table 4).

In upper L5, we found a group of cells that resembled the L4 cells whose dendrites avoid L5 (Fig. 5A L5 d, inset 1). These cells were restricted to the uppermost portion of L5 and morphologically resembled L4 cells in being mostly atufted and exhibiting upwards-curved basal dendrites. We refer to these cells as displaced L4 cells. Their presence could be caused by our piece-wise linear estimation of the layer boundaries not being precise enough. Alternatively, it could suggest that there are no precise laminar boundaries based on morphological features of neurons, but that, instead, different layers blend into one another, as observed in previous studies9,19.

Layer 6: Long and narrow, oblique and inverted pyramidal neurons

Dendritic morphology in layer 6 (L6) also formed a continuum with a large degree of morphological diversity. The dominant feature of L6 was the large variety of cell heights (R² > 0.9; Fig. 5B L6; Supplementary Table 3). Overall, the height of a cell was not strongly correlated with its soma’s location within L6 (ρ = −0.13; Fig. 5C L6; Supplementary Table 4). Unlike in other layers, where the apical dendrites usually reach all the way up to layer 1, many cells in L6 have shorter apical dendrites. However, due to tracing errors, our analysis overestimated the number of such short cells. We therefore manually inspected 183 putative atufted early-terminating neurons within L6 and found that, among those, 45% were incompletely traced, whereas 55% were true atufted cells whose apical dendrite terminated clearly below L1 (Section “Manual validation of apical skeletons”).

As described previously7,9, the dendritic tree of L6 cells is narrower than in the layers above. Also consistent with previous work, we found a substantial number of horizontal and inverted pyramidal neurons, whose apical dendrite points sideways or downwards, respectively (Fig. 5A L6 d, insets 1 & 6). However, the apicals of inverted and horizontal cells are currently not detected by the automatic compartment identification (see companion paper17), rendering an automatic analysis of the apical dendrites in layer 6 unreliable at present. This does not affect the learned embeddings, as GraphDINO is trained without knowledge of the differentiation into dendritic compartments.

Pyramidal neurons are less tufted in V1 than in higher visual areas

After our layer-wise survey of excitatory neurons’ morphological features, we next asked whether there are inter-areal differences between primary visual cortex (V1) and higher visual areas (HVAs). The total length of the apical dendrites of neurons in V1 was significantly shorter than for neurons in HVA (Fig. 6A): For L2/3, neurons in V1 had, on average, 16% shorter apical branches than in HVA (mean ± SD: 1,423 ± 440 μm in V1 vs. 1,688 ± 554 μm in HVA; t-test: p < 10⁻¹⁰, Cohen’s d = 0.53). Similarly, L4 neurons in V1 had, on average, 16% shorter apical branches than in HVA (851 ± 264 μm vs. 1,019 ± 313 μm; p < 10⁻¹⁰, d = 0.58). In L5, neurons in V1 had, on average, 14% shorter apical branches than L5 neurons in HVA (1,326 ± 661 μm vs. 1,549 ± 745 μm; p < 10⁻¹⁰, d = 0.32). While the trend continued in L6, the difference in apical length between V1 and HVA neurons was smaller: only a 4% increase in apical length in HVA compared to V1 (1,112 ± 383 μm vs. 1,159 ± 397 μm; p = 1.8 × 10⁻⁶, d = 0.12). For this analysis, only neurons with identified apical dendrites were taken into account (see companion paper17).
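
The reported statistics correspond to a standard two-sample comparison, sketched here (array names are placeholders):

```python
import numpy as np
from scipy import stats

def compare_areas(v1_lengths, hva_lengths):
    """Two-sample t-test plus Cohen's d with pooled standard deviation."""
    t, p = stats.ttest_ind(v1_lengths, hva_lengths)
    n1, n2 = len(v1_lengths), len(hva_lengths)
    pooled_sd = np.sqrt(((n1 - 1) * np.var(v1_lengths, ddof=1)
                         + (n2 - 1) * np.var(hva_lengths, ddof=1))
                        / (n1 + n2 - 2))
    d = (np.mean(hva_lengths) - np.mean(v1_lengths)) / pooled_sd
    return t, p, d
```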

Fig. 6: Inter-areal differences between primary visual cortex (V1) and higher visual areas (HVAs).

A Side view of the cortical volume. Each point represents the soma location of one neuron and is colored by the apical skeletal length of the respective neuron (dark = no apical, bright = maximal apical skeleton length). Projection from the side orthogonal to the V1/HVA border after a 14-degree rotation around the y-axis (vertical dashed line); top: pia; bottom: white matter. B Top view of the volume showing the density of atufted (left), small tufted (middle), and tufted (right) L4 cells. Atufted neurons are mostly confined to V1, while tufted neurons are more abundant in HVA. Dashed lines: area borders between primary visual cortex (V1), anterolateral area (AL), and rostrolateral area (RL), estimated from reversal of the retinotopic map measured using functional imaging16. Source data are provided as a Source Data file.

Upon closer inspection, we observed that L4 contained substantially more atufted neurons in V1 than in the higher visual areas RL and AL (Fig. 6A). We clustered each layer’s morphological embeddings into 15 clusters using a Gaussian Mixture Model and looked for clusters that were restricted to particular brain areas. Clusters that were clearly confined to V1 or HVAs were primarily found in L4. When we manually classified the L4 clusters into atufted, small tufted, and tufted, we observed that atufted neurons were almost exclusively located in V1, while tufted neurons were more frequent in HVAs (Fig. 6B).

Layer 4 cells avoiding layer 5 are located primarily in the primary visual cortex

We observed a second area difference related to the morphological trait of L4 neurons described above. Recall that these cells’ dendrites avoid reaching into L5. Interestingly, these cells were located in a very narrow strip of approximately 50 μm above the border between L4 and L5 (Fig. 7A). Moreover, they were atufted and almost exclusively located in V1 (Fig. 7B).

Fig. 7: Basal bias neurons in primary visual cortex (V1).

A Side view of the cortical volume. Each point represents the soma location of one neuron and is colored by its respective basal bias (dark = negative basal bias: center of mass of basal dendrites is above the soma; bright = positive basal bias: center of mass of basal dendrites is below the soma). B Example neuronal morphologies of basal bias neurons (top) and top view of the volume (as in Fig. 6B) showing the horizontal density distribution of L4 cells whose dendrites avoid reaching into L5 and which are mostly located in V1 (bottom). C Functional digital twins can predict the functional response of the neurons to input stimuli such as natural movies. The input-output function of each neuron is described by a functional bar code fi26. Schematic adapted from “Functional connectomics reveals general wiring rule in mouse visual cortex”, Ding et al., bioRxiv 2023.03.13.531369; https://doi.org/10.1101/2023.03.13.531369 under a CC BY license: https://creativecommons.org/licenses/by/4.0/. D Predictions of the basal bias metric from the functional bar code fi using linear regression. Source data are provided as a Source Data file.

The morphological property of avoiding layer 5 has a functional correlate

Lastly, we asked whether morphological variation can be linked to the neurons’ functional properties. While an extensive investigation of the structure–function relationship is beyond the scope of this study, we took one morphological aspect revealed by our study as a proof of principle: We investigated whether L4 neurons that avoided reaching into layer 5 with their dendrites differ in their tuning to visual stimuli from other neurons in layer 4. To address this question, we made use of the fact that for many of the neurons in our dataset, we have measurements of how they respond to natural stimuli16. We leveraged a functional digital twin – a model that accurately predicted the response of a neuron to arbitrary visual stimuli26 – to extract a functional bar code – a vector embedding fi that describes the input-output function of a neuron analogous to how our morphological bar codes describe their morphology (Fig. 7C). From this functional bar code of each neuron, we predicted one of its morphological properties: the basal bias metric. We found that the basal bias of L4 neurons could be predicted reasonably well from the neurons’ response functions to visual stimuli (Fig. 7D; Pearson correlation ρ = 0.41, p < 10⁻¹⁰). This analysis could be confounded by cortical depth being predictive of the basal bias. However, a model predicting the basal bias from cortical depth and the functional bar code explained significantly more variance in the basal bias metric than one using only cortical depth as a predictor (R² = 0.28 for both predictors vs. 0.21 for depth only; ρ = 0.53 and ρ = 0.46, respectively; Fisher’s z-test of the difference between the correlation coefficients: p = 0.0015).
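
A sketch of this nested-model comparison is shown below. The arrays `F` (functional bar codes), `depth`, and `y` (basal bias) are placeholders, and the Fisher z-test is written in its simple independent-samples form, although correlations estimated on the same neurons are strictly speaking dependent:

```python
import numpy as np
from scipy import stats
from sklearn.linear_model import LinearRegression

def prediction_correlation(X, y):
    """Correlation between a linear model's in-sample predictions and y."""
    pred = LinearRegression().fit(X, y).predict(X)
    return stats.pearsonr(pred, y)[0]

r_depth = prediction_correlation(depth.reshape(-1, 1), y)
r_full = prediction_correlation(
    np.column_stack([depth.reshape(-1, 1), F]), y)

# Fisher z-test for the difference between the two correlations
# (simplified independent-samples form).
n = len(y)
z = (np.arctanh(r_full) - np.arctanh(r_depth)) / np.sqrt(2 / (n - 3))
p = 2 * stats.norm.sf(abs(z))
```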

Discussion

In summary, our data-driven unsupervised learning approach identified the known morphological features of excitatory cortical neurons’ dendrites and enabled us to make four main observations: (1) Superficial L2/3 neurons are wider than deep ones; (2) L4 neurons in V1 are less tufted than those in HVAs; (3) the basal dendrites of a subset of atufted L4 neurons in V1 avoid reaching into L5; (4) excitatory cortical neurons form mostly a continuum with respect to dendritic morphology, with some notable exceptions.

First, our finding that superficial L2/3 neurons are wider than deeper ones is clearly visible in the data both qualitatively and quantitatively. A similar observation has been made recently in concurrent work27.

Second, in L4, a substantial number of cells are completely atufted. Here we see a differentiation with respect to brain areas: completely atufted cells are mostly restricted to V1 while HVA neurons in L4 tend to be more tufted. Why would V1 neurons be less tufted than those in higher visual areas? V1 – as the first cortical area for visual information processing – and L4 – as the input layer, in particular – might be less modulated by feedback connections than other layers and higher visual areas. Therefore, these neurons might sample the feedback input in L1 less than other neurons.

Third, we found that some neurons at the bottom of L4 of V1 avoid reaching into L5 with their dendrites. To our knowledge, this morphological pattern has not been described before in the visual cortex. Retrospectively, it can be observed in Gouwens and colleagues’ data: their spiny m-types 4 and 5, which are small- or atufted L4 neurons, show a positive basal bias (assuming their “basal bias y” describes the same property; Gouwens et al.7; Supplementary Fig. 15). Whether such cells are restricted to the bottom of layer 4 or are simply morphologically indistinguishable from other cells when located more superficially cannot be answered from our data. Interestingly, however, this morphological pattern correlated with the functional properties of the neurons. While this is by no means an exhaustive characterization of how morphology and function are related, this result shows that they are related and that such relationships can be identified by data-driven methods. What function could avoiding L5 have? Similarly to the missing tuft of these neurons, avoiding L5 could help these neurons focus on the thalamic input (which targets primarily L4) and, thus, represent and distribute the feedforward drive within the local circuit. It is, therefore, tempting to speculate that these atufted, L5-avoiding L4 neurons might be precursors of spiny stellate cells, which are nearly absent in the mouse visual cortex28 but exist in somewhat more developed sensory areas such as the barrel cortex, as well as in cat and primate V1.

Fourth, except for the well-known L5 extratelencephalic (ET) projection neurons and some characteristic morphologies in L6 (subplate and inverted cells), our data and methods suggest that excitatory neurons in the mouse visual cortex form mostly a continuum with respect to dendritic morphology.

Previous studies, in contrast, work on the premise that discrete cell types exist and categorize neurons into up to 20 m-types6,7,8,9,18,19,22, most of them using clustering methods on morphological features7,9,22,29. While they assume that each cluster corresponds to a distinct m-type, they report the presence of variability within their proposed m-types. Furthermore, their visualizations of morphometrics per m-type depict further intra-class variability7,8,18. Thus, we believe that our data is consistent with previous work, but our data-driven, quantitative approach suggests that the morphological landscape of cortical excitatory neurons is better described as a continuum, with a few notable exceptions in deeper layers. This notion has also been brought up recently by transcriptomics studies, which observe continuous variation among cell types in cortex11,30,31,32,33 as well as subcortical areas34,35. Furthermore, variation within transcriptomic types found in several of the studies aligns with variation observed in other modalities11,32. Scala et al.11 suggest that neurons are organized into a small number of distinct and broad “families”, each of which exhibits substantial continuous variation among its family members. In their case, a substantial degree of morphological variation was evident among excitatory neurons of the IT type, and this variation correlated with transcriptomic variation within the type as well as the cortical depth of the neuron – resembling the gradual decrease in the width of the apical tuft with increasing cortical depth we observed. Our analysis supports the notion of broad “families” with intrinsic variation: excitatory cells can be mostly separated by layers into roughly a handful of families, each of which contains a substantial degree of variation in terms of morphology, which might also co-vary with other modalities.

This result does not rule out the possibility that there are in fact distinct types; it simply suggests that features beyond dendritic morphology need to be taken into account to clearly identify these types. For instance, the results of ref. 18 suggest that the 5P-NP cells can be separated from other layer 5 pyramidal neurons by considering the class of interneurons that target them. It is also not guaranteed that our data-driven method identifies all relevant morphological features. Every method has (implicit or explicit) inductive biases. We tried to avoid explicit human-defined features, but by choosing a graph-based input representation, we provided different inductive biases than, for instance, a voxel-based representation or one based on point clouds. However, the fact that we could reconcile known morphological features, discover novel ones, and achieve good classification accuracy on an annotated subset of the data suggests that our learned embeddings indeed contain a rich and expressive representation of a neuron’s dendritic morphology.

Our study was done on a single animal, which presents both advantages and disadvantages. The main advantage of this design is that our dataset is not contaminated by variability across animals (e.g., “batch effects” due to data processing or variation across animals). Such variability could blur otherwise distinct boundaries between cell types and make a discrete organization appear more continuous than it actually is. By sampling within one animal, we control for this potential confound. However, this design comes with the obvious disadvantages of N = 1: We cannot assess the variability across animals and some of the conclusions may be specific to this one individual rather than the population of mice in general.

In summary, recent studies of morphological as well as transcriptomic characteristics of cortical excitatory neurons suggest the presence of a few broad families of cell types, each exhibiting considerable intrinsic variation11,32,33. Due to this continuous variation, a separation into finer cell types within these families is ambiguous. This raises the question of whether it is feasible to establish a comprehensive atlas of cortical excitatory cell types. We suggest that we should rather think of the variability across cells as axes of variation, understand how these axes of variation correlate between modalities, and whether they are just insignificant biological heterogeneity or indeed functionally relevant.

Methods

Dataset

The dataset consists of a 1.3 × 0.87 × 0.82 mm³ volume of tissue from the visual cortex of an adult P75–87 mouse, which has been densely reconstructed using serial section electron microscopy (EM)16. We used subvolume 65, which covers approximately 1.3 × 0.56 × 0.82 mm³. It includes all layers of the cortex and spans the primary visual cortex (V1) and two higher visual areas, the anterolateral area (AL) and the rostrolateral area (RL). We refer to the original paper on the dataset16 for details on the identification and morphological reconstruction of individual neurons.

Skeletonization and cell compartment label assignment

The EM reconstructions yielded neuronal meshes. These meshes can be incomplete or exhibit different kinds of errors, including merges of other neuronal or non-neuronal compartments onto the neurons. Therefore, an automatic proofreading pipeline was run to produce cleaned neuronal skeletons (companion paper; Celii et al.17).

To detect skeletons from the reconstructed meshes, the meshes were first downsampled to 25% of their resolution and made watertight. Then, glia and nuclei meshes were identified and removed. For the remaining meshes, the locations of the somata were identified using a soma detection algorithm36. Each neurite submesh was then skeletonized using a custom skeletonization algorithm that transformed axonal and dendritic processes into a series of line segments to obtain the skeleton (companion paper; Celii et al.17). For each skeleton, the highest-probability axon subgraph was determined, and all other non-soma nodes were labeled as dendrites. A final heuristic algorithm classified subgraphs of dendritic nodes into compartments, such as apical trunks, which generally project from the top half of the soma with an overall upward trajectory, and obliques, which project off the apical trunk at an approximately 90-degree angle. For further details on the compartment label assignment, please see the companion paper17.

Coordinate transformations

The EM volume is not perfectly aligned. First, the pial surface is not a horizontal plane parallel to the (xz)-plane but is slightly tilted. Second, the thickness of the cortex varies across the volume, such that the distance from the pia to the white matter is not constant. Without any pre-processing, an unsupervised learning algorithm would pick up these differences and, for instance, find differences among layer 6 neurons across the volume simply because in some parts of the volume they tend to be located deeper than in others, and their apical dendrites that reach to layer 1 tend to be larger. Using relative coordinates solves such issues if pia and white matter correspond to planes (approximately) parallel to the (xz)-plane. To transform our coordinate system into such standardized coordinates, we first applied a rotation about the z-axis of 3.5 degrees. This transformation removed the systematic rotation with respect to the native axes (Supplementary Fig. 1B). To standardize measurements across depth (y-axis) and to account for the differential thickness of the cortex, we estimated the best linear fit for both the pial surface and the white-matter boundary using a set of manually placed points located on a regular grid along (xz) with a spacing of 25 μm. For each (xz)-coordinate, the y-coordinate was normalized such that the pia’s y-coordinate corresponded to the average depth of the pia, and analogously for the white matter. This transformation resulted in an approximation of the volume in which the pia and white-matter boundaries are horizontal planes orthogonal to the y-axis and parallel to the (xz)-plane. Supplementary Fig. 1C shows example neurons before and after normalization. All training and subsequent analyses were performed on this pre-processed data.
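
The depth standardization can be sketched as follows; `pia_fit` and `wm_fit` are placeholder callables returning the locally fitted boundary depths, and `pia_mean`/`wm_mean` are the volume-average boundary depths:

```python
import numpy as np

def normalize_depth(xyz, pia_fit, wm_fit, pia_mean, wm_mean, theta_z_deg=3.5):
    """Rotate about the z-axis, then rescale y so that the locally fitted
    pia and white-matter depths map onto the volume-average depths."""
    t = np.deg2rad(theta_z_deg)
    rot = np.array([[np.cos(t), -np.sin(t), 0.0],
                    [np.sin(t),  np.cos(t), 0.0],
                    [0.0,        0.0,       1.0]])
    xyz = xyz @ rot.T
    x, y, z = xyz[:, 0], xyz[:, 1], xyz[:, 2]
    pia_y, wm_y = pia_fit(x, z), wm_fit(x, z)      # local boundary depths
    rel = (y - pia_y) / (wm_y - pia_y)             # relative cortical depth
    y_new = pia_mean + rel * (wm_mean - pia_mean)  # map to flat planes
    return np.column_stack([x, y_new, z])
```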

Expert cell type labels

For a subset of the neurons in the volume, experts labeled neurons according to the following cell types: layer 2/3 and 4 pyramidal neurons; layer 5 near-projecting (NP), extratelencephalic (ET), and intratelencephalic (IT) neurons; layer 6 intratelencephalic (IT) and cortico-thalamic (CT) neurons; Martinotti cells (MC), basket cells (BC), bipolar cells (BPC), and neurogliaform cells (NGC). Cell types were assigned based on visual inspection of individual cells, taking into account morphology, synapses and connectivity, and nucleus features and their (xyz)-location. All neurons were taken from one 100-μm column in the primary visual cortex (see companion paper, Schneider-Mizell et al.18). We did not use neurons with expert labels to train GraphDINO, but used them only for evaluation.

Morphological feature learning using GraphDINO

For learning morphological features in an unsupervised, purely data-driven way, we used a recently developed machine learning method called GraphDINO15. GraphDINO maps the skeleton graph of a neuron onto a 32-dimensional feature vector, which we colloquially refer to as the neuron’s “bar code”. For training GraphDINO, each neuron’s skeleton was represented as an undirected graph G = (V, E), where V is the set of nodes \({\{{v}_{i}\}}_{i=1}^{N}\) and E the set of undirected edges \(E=\{{e}_{ij}=({v}_{i},{v}_{j})\}\) that connect two nodes \({v}_{i}\) and \({v}_{j}\). Each node has a feature vector attached to it that holds the 3d Cartesian coordinates of the node relative to the soma of the neuron. The soma has the coordinate (0, 0, 0), i.e., it is at the origin of the coordinate system. Because axons have not yet been reconstructed well in the data, we focused on the dendritic skeleton only and removed segments labeled as axon. We trained GraphDINO on a subset of the dataset, retaining 5113 neurons for validation and 2941 neurons for testing. The test set was chosen to contain the 1011 neurons that were labeled by expert anatomists into morphological cell types (Section “Expert cell type labels”18), while the other 1930 neurons were i.i.d. sampled. The training and validation sets were i.i.d. sampled from the remaining neurons with a 90%−10% split (Supplementary Fig. 4).

GraphDINO is trained by generating two “views” of the same input graph by applying random identity-preserving transformations (described below). These two views are both encoded by the same neural network. The training objective is to maximize the similarity between the embeddings of these two views. To obtain the two views of one input graph, we subsampled the graph, randomly rotated it around the y-axis (orthogonal to pia), dropped subbranches, and perturbed node locations. When subsampling the graph, we randomly dropped all but 200 nodes, always retaining the branching points. Rotations around the y-axis were uniformly distributed around the circle. During subbranch deletion we removed n = 5 subbranches. For node location jittering, we used σ = 1. In addition, the entire graph was randomly translated with σ = 1. For further details on the augmentation strategies, see Weis et al.15.
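
A simplified version of this augmentation pipeline might look as follows (subbranch deletion is omitted because it requires the graph topology; all names are placeholders):

```python
import numpy as np

def augment(nodes, branch_idx, rng, n_keep=200, sigma=1.0):
    """One random 'view': rotate uniformly around the y-axis (orthogonal
    to the pia), subsample to n_keep nodes while always retaining
    branching points, jitter node locations, and translate the graph."""
    phi = rng.uniform(0, 2 * np.pi)
    rot = np.array([[np.cos(phi), 0.0, np.sin(phi)],
                    [0.0,         1.0, 0.0],
                    [-np.sin(phi), 0.0, np.cos(phi)]])
    pts = nodes @ rot.T

    other = np.setdiff1d(np.arange(len(pts)), branch_idx)
    n_extra = max(n_keep - len(branch_idx), 0)
    keep = np.concatenate([branch_idx,
                           rng.choice(other, size=min(n_extra, len(other)),
                                      replace=False)])
    pts = pts[keep]

    pts = pts + rng.normal(scale=sigma, size=pts.shape)   # jitter nodes
    pts = pts + rng.normal(scale=sigma, size=(1, 3))      # translate graph
    return pts
```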

The Adjacency-Conditioned Attention network architecture had seven AC-Attention layers with four attention heads each. The dimensionality of the latent representation \({\bf{z}}\in {{\mathbb{R}}}^{{d}_{1}}\) was set to d1 = 32, and the dimensionality of the projection \({\bf{p}}\in {{\mathbb{R}}}^{{d}_{2}}\) was d2 = 5000. All other architecture details are as described in the original paper15. For training, we used the Adam optimizer37 with a batch size of 128 for 50,000 iterations. The learning rate was linearly increased to 10⁻³ during the first 1000 iterations and then decayed using an exponential schedule with a decay rate of 0.5.
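
One plausible reading of this learning-rate schedule is sketched below; the exact parameterization of the exponential decay is an assumption, since only the decay rate is specified:

```python
def learning_rate(step, base_lr=1e-3, warmup=1000, total=50_000, decay=0.5):
    """Linear warmup to base_lr over `warmup` iterations, then exponential
    decay. Decaying by a total factor of `decay` over the remaining
    iterations is an assumption; the text specifies only the rate (0.5)."""
    if step < warmup:
        return base_lr * step / warmup
    progress = (step - warmup) / (total - warmup)
    return base_lr * decay ** progress
```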

We ran ablation experiments using different dimensionalities for the latent space d1 ∈ {16, 32, 64, 128} and varied the number of training iterations i ∈ {25,000, 50,000, 100,000, 200,000} (Supplementary Fig. 6). Additionally, we replaced the cross-entropy loss with the contrastive SimCLR loss38 and trained variants with different mini-batch sizes b ∈ {128, 1024, 2048} (Supplementary Fig. 7), as contrastive losses have been shown to be sensitive to the number of negative samples used in the loss38. Training with b = 2048 diverged.

Morphological clustering

For qualitative inspection of the data and the analyses in Figs. 6B and 7B, we clustered the neurons using the learned vector embedding of each neuron’s morphological features. We fit a Gaussian Mixture Model (GMM) with a diagonal covariance matrix using scipy39 on the whole dataset as well as per cortical layer, using 60 clusters and 15 clusters, respectively. As we found no evidence that these clusters (or any other clustering with fewer or more clusters) represent distinct cell types, we did not use this clustering to define cell types, but rather think of the clusters as modes representing groups of neurons with similar morphological features.

Data quality control steps

The dataset was generated by automatic segmentation of EM images and subsequent automatic processing into skeletons. As a consequence, not all cells are reconstructed perfectly: a substantial fraction of cells is wrongly merged or incompletely segmented. We used a combination of our learned GraphDINO embeddings and supervised classifiers trained on a subset of the neurons (n = 1011) that were manually proofread and annotated by experts (see Section “Expert cell type labels” and companion paper, Schneider-Mizell et al.18). Our quality control pipeline was as follows: First, we computed GraphDINO embeddings on the full dataset of 54,192 neurons (including both excitatory and inhibitory neurons). Next, we removed neurons close to the boundaries of the volume, as these neurons are only partly reconstructed. After this step, we were left with 43,666 neurons. Within this dataset, we identified incorrectly reconstructed neurons using a supervised classifier described in the next section, reducing the dataset to 37,362 neurons. Subsequently, we identified interneurons using a supervised classifier described in the next section, reducing the dataset to 33,997 excitatory neurons. Finally, on this dataset, we manually proofread around 480 atufted neurons. As a result, we identified and removed another set of 2684 neurons whose reconstructions were incomplete, leaving us with a final sample size of 31,313 putative excitatory and correctly reconstructed neurons for our main analyses.

Supervised classifiers

To identify reconstruction errors and interneurons, we used a subset of the dataset (n = 1011) that was manually proofread and annotated with cell type labels by experts (see Section “Expert cell type labels” and companion paper, Schneider-Mizell et al.18). Based on these and additional neurons that we identified as segmentation errors, we trained classifiers to detect segmentation errors and inhibitory cells and to infer cortical layer membership, using our learned 32-dimensional vector embeddings of the neurons’ skeletons as input (see Section “Morphological feature learning using GraphDINO”). In our subsequent analyses, we focused on neurons that were identified as complete and excitatory by our classifiers. We used the inferred cortical layer labels to perform layer-specific analyses.

For all classifiers, we performed a grid search with ten-fold cross-validation to find the best hyperparameters. We tested logistic regression with the following hyperparameters: type of regularization (none, L1, L2, or elastic net), regularization weight C ∈ {0.5, 1, 3, 5, 10, 20, 30}, and whether to use class weights that are inversely proportional to class frequencies or no class weights. In addition, we tested support vector machines (SVMs) with the following hyperparameters: type of kernel (linear, RBF, or polynomial), L2 regularization weight C ∈ {0.5, 1, 3, 5, 10, 20, 30}, degree of polynomial d ∈ {2, 3, 5, 7, 10, 20} for the polynomial kernel, and whether to use class weights or no weights. After determining the optimal hyperparameters using cross-validation, we retrained the classifier with these hyperparameters on the entire labeled set.
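
With scikit-learn, this grid search could be set up roughly as follows (embeddings `Z` and labels `y` are placeholders; the scoring metric and solver settings are assumptions). `GridSearchCV` with `refit=True` (the default) also performs the final retraining on the entire labeled set:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

# Hyperparameter grids mirroring the ranges listed above.
logreg_grid = {
    "penalty": [None, "l1", "l2", "elasticnet"],
    "C": [0.5, 1, 3, 5, 10, 20, 30],
    "class_weight": [None, "balanced"],
}
svm_grid = {
    "kernel": ["linear", "rbf", "poly"],
    "C": [0.5, 1, 3, 5, 10, 20, 30],
    "degree": [2, 3, 5, 7, 10, 20],   # only used by the polynomial kernel
    "class_weight": [None, "balanced"],
}

# saga supports all listed penalties; l1_ratio is only used for elasticnet.
logreg = LogisticRegression(solver="saga", l1_ratio=0.5, max_iter=5000)
svm_search = GridSearchCV(SVC(), svm_grid, cv=10, scoring="balanced_accuracy")
logreg_search = GridSearchCV(logreg, logreg_grid, cv=10,
                             scoring="balanced_accuracy")

svm_search.fit(Z, y)                   # Z: embeddings, y: labels (placeholders)
best_clf = svm_search.best_estimator_  # refit on the entire labeled set
```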

Removal of fragmented neurons

To remove fragmented neurons prior to analysis, we trained a classifier to differentiate between the manually proofread neurons from all layers (n = 1011) and fragmented cells (n = 240). We identified fragmented cells by clustering the vector embeddings of the whole dataset without boundary neurons (n = 43,666) into 25 clusters per layer and manually identifying clusters that contained fragmented cells (2–3 clusters per layer). We then sampled 60 fragmented cells per layer as training data for our classifier.

We trained a support vector machine (SVM) using cross-validation as described above. Its cross-validated accuracy was 94% (Supplementary Fig. 3A). The best hyperparameters were: polynomial kernel of degree 3 and C = 3. We used those hyperparameters to retrain the classifier on the full training set of 1251 neurons. Using this classifier, we inferred whether a neuron is fragmented for the entire dataset (n = 43,666). We then removed cells predicted to be fragmented (n = 6304) from subsequent analyses.

To validate the classification into fragmented and whole cells, we manually inspected ten neurons that were not in “fragmented” clusters before classification but were flagged as fragmented by the classifier. Nine out of ten had missing segments due to segmentation errors or due to apical dendrites leaving the volume.

Removal of inhibitory neurons

Analogously, we trained a classifier to predict whether a neuron is excitatory or inhibitory using the manually proofread and annotated neurons (n = 1011) (Section “Expert cell type labels”). As input features to the classifier, we used our learned embeddings and, additionally, two morphometric features: synaptic density on apical shafts (number of synapses per micrometer of skeletal length, excluding synapses located on spines) and spine density (number of spines per micrometer of skeletal length). These two features have been shown to separate excitatory from inhibitory neurons well in previous work (see companion paper, Celii et al.17). The annotated dataset contains 922 excitatory and 89 inhibitory neurons.

We trained a logistic regression model. Its cross-validated accuracy was 99% (Supplementary Fig. 3B). The best hyperparameters were: L2 regularization (C = 5) with class weights. We used those hyperparameters to retrain on the full training set of 1011 neurons. Using this classifier, we inferred whether a neuron is excitatory or inhibitory for the entire dataset after removing fragmented cells and 227 neurons for which spine and synapse densities were not available (n = 37,135). We then removed all inhibitory cells (n = 3138) from subsequent analyses.

Inference of cortical layers

To determine cortical layer labels for the entire dataset, we followed a two-stage procedure. First, we inferred the layer of each neuron using a trained classifier. Then, we determined anatomical layer boundaries based on the optimal cortical depth that separates adjacent layers.

We first trained an SVM classifier for excitatory cells on the 922 manually annotated excitatory neurons by pooling the cell type labels per layer. Its cross-validated balanced accuracy was 90% (Supplementary Fig. 3C). The best hyperparameters were: polynomial kernel of degree 5, C = 3. Using this classifier, we inferred the cortical layer of all excitatory neurons (n = 33,997; Fig. 2).

The spatial distribution of inferred layer assignments was overall well confined to the respective layers. As expected, there was some spatial overlap of labels at the boundaries, since layer boundaries are not sharp. We nevertheless opted to assign neurons to layers based on their anatomical location rather than their inferred label. To do so, we determined, for each pair of consecutive layers, the optimal piece-wise linear function separating them. Thus, the layer assignments used for subsequent analyses were based purely on the soma depth of each neuron relative to the inferred layer boundaries, not on the classifier output.
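
The text does not specify how the piece-wise linear boundary was fit; one plausible implementation, given here as an assumption-laden sketch, chooses, within each lateral bin, the depth that best separates the neurons the classifier assigned to the upper vs. the lower of two adjacent layers:

import numpy as np

def fit_boundary(x, depth, is_upper, n_bins=10):
    # x: lateral position; depth: soma depth below pia (pia = 0);
    # is_upper: True if the classifier assigned the neuron to the upper layer
    edges = np.linspace(x.min(), x.max(), n_bins + 1)
    pts = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        sel = (x >= lo) & (x <= hi)
        up = is_upper[sel]
        if up.sum() == 0 or up.sum() == up.size:
            continue  # bin lacks neurons from one of the two layers
        d = depth[sel]
        cands = np.unique(d)
        # balanced accuracy of "upper layer above threshold, lower below"
        acc = [((d[up] <= t).mean() + (d[~up] > t).mean()) / 2 for t in cands]
        pts.append(((lo + hi) / 2, cands[int(np.argmax(acc))]))
    # (bin center, boundary depth) pairs define the piece-wise linear boundary
    return np.array(pts)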

Inference of cell type labels

In Fig. 5, we show cell type labels for layer 5. These were determined by training an SVM to classify the excitatory neurons into cell types using the 922 manually annotated neurons. The cross-validated balanced accuracy of this classifier was 83% (Supplementary Fig. 3D). The best hyperparameters were: polynomial kernel of degree 2, C = 20, using class weights. Using this classifier, we inferred cell type labels for all excitatory neurons after the removal of neurons with cut apical dendrites (see next section) (n = 32,571).

Manual validation of apical skeletons

We found a significant fraction of atufted neurons across layers 4–6. To determine the extent to which these cells are actually atufted rather than artifacts of incomplete reconstruction, we manually inspected 479 neurons in Neuroglancer40 with respect to the validity of their apical termination. During manual inspection, we annotated each neuron’s reconstruction as “naturally terminating,” “out-of-bounds,” “reconstruction issue,” or “unsegmented region.” Reconstruction issues were cases in which the EM slice was segmented correctly, but the tracing failed to connect two parts of the same neuron. Unsegmented regions were cases in which one or multiple EM images, or parts thereof, were not segmented correctly, so that the neuron could not be traced correctly. In addition, we classified the neurons as either “atufted,” “small tufted,” or “tufted,” both before validation and after correcting reconstruction errors.

For layer 4, we inspected 120 atufted neurons. Of those, 64% had missing segments on their apical dendrites, and 36% had a natural termination. Note, however, that 74% of the neurons had a consistent tuft classification before and after validation: even though parts of the apical dendrite were missing, the degree of tuftedness did not change qualitatively. For atufted neurons, this means that their apical dendrite merely terminated early; the reconstruction error did not change their classification as atufted. In layer 4, neurons with a natural termination ended more superficially than neurons with missing segments. We therefore excluded from the analysis L4 neurons whose apicals ended more than 96 μm below the pia, in order to remove neurons with reconstruction errors. This threshold was chosen on the 120 manually validated neurons so as to maximize the F1-score, i.e., to retain as many atufted neurons with a natural termination as possible while removing as many neurons with missing segments as possible. This process excluded 557 neurons from layer 4.
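
A sketch of this threshold selection (variable names are hypothetical):

import numpy as np
from sklearn.metrics import f1_score

def best_cutoff(apical_end_depth, natural_termination):
    # apical_end_depth: depth (micrometers below pia) at which each apical
    # ends; natural_termination: manual annotation (True = natural)
    cands = np.unique(apical_end_depth)
    # a neuron is kept if its apical ends at most t below the pia;
    # maximize F1 of "kept" against "natural termination"
    f1 = [f1_score(natural_termination, apical_end_depth <= t) for t in cands]
    return cands[int(np.argmax(f1))]  # ~96 micrometers in our data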

For layer 5, we inspected 176 neurons with early-terminating apical dendrites. Of those, 59 showed a natural apical termination, while 117 had reconstruction issues or left the volume. We found no clear quantitative metric, such as the depth of the apical dendrite, by which to exclude neurons with unnatural terminations. Therefore, we excluded neurons from further analysis based on their cluster membership: if a cluster contained more than 50% neurons with unnatural terminations, all of its neurons were excluded. Of the 15 clusters, we excluded four, corresponding to 1258 of 5858 L5 neurons.

For layer 6, we inspected 183 neurons with early-terminating apicals. Of those, 100 showed a natural apical termination, while 83 had reconstruction issues or left the volume. Due to the slant of the volume, long, narrow L6 cells near the volume boundary have a high likelihood of leaving the volume with their apical dendrite. Therefore, we excluded all L6 neurons whose apical dendrite left the volume (n = 867) prior to our analysis. We considered a neuron as leaving the volume if the most superficial point of its apical tree was within a few micrometers of the volume boundary.

Overall, we excluded 2684 neurons as a result of this manual validation step, resulting in a final sample size of 31,313 neurons used in our analysis (Figs. 5–7).

Cortical area boundaries

Cortical area boundaries were manually drawn from retinotopic maps of visual cortex taken before EM imaging. For further details, see companion paper16.

Dimensionality reduction

For visualization of the learned embeddings, we reduced the dimensionality of the 32d embedding vectors to 2d using t-distributed stochastic neighbor embedding (t-SNE;24), as implemented in the openTSNE package41, with cosine distance and a perplexity of 30 for t-SNE plots of individual cortical layers and a perplexity of 300 for the whole dataset.

The perplexity of t-SNE needs to be set depending on the dataset size. We followed the recommendation of Kobak and Berens42 to set it to perplexity p = n/100, which yields an approximate perplexity of 300 for our dataset of around 30,000 excitatory cells. However, to show that our interpretation is not restricted to this specific perplexity, we visualized additional runs with p ∈ {30, 100, 1000} (Supplementary Fig. 8).

Additionally, we used UMAP43 and PaCMAP44 with different numbers of neighbors p ∈ {30, 100, 300, 1000} to show that our interpretation is not dependent on the use of t-SNE (Supplementary Fig. 8).
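
A sketch of the embedding calls, using the openTSNE package named above (the remaining arguments are our own choices, and the placeholder array stands in for the learned embeddings):

import numpy as np
from openTSNE import TSNE

rng = np.random.default_rng(0)
X = rng.normal(size=(30_000, 32))  # placeholder for the 32-d embeddings

# t-SNE with cosine distance; perplexity 300 for the whole dataset,
# 30 for per-layer plots
xy = TSNE(perplexity=300, metric="cosine", n_jobs=8, random_state=0).fit(X)

# control embeddings (parameter names per the respective packages):
# import umap
# xy_umap = umap.UMAP(n_neighbors=300, metric="cosine").fit_transform(X)
# import pacmap
# xy_pacmap = pacmap.PaCMAP(n_neighbors=300).fit_transform(X)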

Visualization

For all plots displaying continuous morphometrics, the continuous variable was discretized into ten percentile bins for coloring.

Morphometric descriptors

We computed morphometrics based on the neuronal skeletons for the analysis of the learned latent space; morphometrics were not used for learning the morphological vector embeddings. Morphometrics were computed based on compartment labels: soma, apical dendrites, basal dendrites, and oblique dendrites (Section “Skeletonization and cell compartment label assignment”). They are visualized in Fig. 4. Total apical length is defined as the total length of all skeleton segments classified as apical dendrites; total basal length is computed analogously. Depth refers to the depth of the soma centroid relative to the pia after volume normalization (Section “Coordinate transformations”), where the pia has depth zero. Height is the absolute difference between the highest and the lowest skeleton node of a neuron in the y-direction. Apical width refers to the widest extent of the apical dendrites in the (xz)-plane. Basal bias describes the difference between the soma depth and the center of mass of the basal dendrites along the y-axis. Due to the dataset size, compartment labeling was done automatically (see companion paper17). However, rule-based identification of apical dendrites does not work well for all neurons; for instance, it fails for the inverted L6 neurons17. For Fig. 5, we removed neurons for which the automatic morphometric pipeline failed. For layer 2/3, 10,196 of 10,564 neurons are included in the analysis; for layer 4, 7751 of 7775; for layer 5, 4443 of 4600; and for layer 6, 8274 of 8374. The GraphDINO feature space has the advantage of not requiring knowledge of which branches are apical and which are basal dendrites; parts of our downstream analysis, however, do rely on these compartment labels (Figs. 5–7).
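
A minimal sketch of these definitions, assuming a neuron is given as arrays of skeleton-node coordinates (y = depth below the pia) and compartment labels; all names are hypothetical, and details such as the sign convention of the basal bias or the exact notion of "widest extent" may differ from our pipeline:

import numpy as np

def morphometrics(nodes, node_labels, edge_lengths, edge_labels, soma_y):
    # nodes: (n, 3) array of coordinates (x, y, z), pia at y = 0;
    # node_labels / edge_labels: compartment label per node / edge
    apical = nodes[node_labels == "apical"]
    basal = nodes[node_labels == "basal"]
    return {
        "total_apical_length": edge_lengths[edge_labels == "apical"].sum(),
        "total_basal_length": edge_lengths[edge_labels == "basal"].sum(),
        "depth": soma_y,                # soma centroid depth below the pia
        "height": np.ptp(nodes[:, 1]),  # highest minus lowest node along y
        # widest extent of the apical tree in the (x, z)-plane
        "apical_width": max(np.ptp(apical[:, 0]), np.ptp(apical[:, 2])),
        # center of mass of the basal dendrites relative to the soma along y
        "basal_bias": basal[:, 1].mean() - soma_y,
    }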

Statistics

Apical lengths in Section “Pyramidal neurons are less tufted in V1 than in higher visual areas” were compared between V1 and HVA per cortical layer with four independent two-tailed Student’s t-tests. The single-test significance level of 0.01 was Bonferroni-corrected to 0.0025 for the four tests. Only neurons with at least one node labeled as apical were included in this analysis. In L2/3, n = 6760 neurons were taken into account from V1 and n = 3436 from HVA; for L4, n = 5217 (V1) and n = 2534 (HVA); for L5, n = 3708 (V1) and n = 1924 (HVA); and for L6, n = 3959 (V1) and n = 2618 (HVA).
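
A sketch of the comparison, assuming scipy and placeholder arrays in place of the per-layer apical lengths:

import numpy as np
from scipy.stats import ttest_ind

rng = np.random.default_rng(0)
layers = ["L2/3", "L4", "L5", "L6"]
apical_v1 = {l: rng.normal(1000, 200, size=500) for l in layers}   # placeholder
apical_hva = {l: rng.normal(1100, 200, size=300) for l in layers}  # placeholder

alpha = 0.01 / 4  # Bonferroni-corrected significance level: 0.0025
for layer in layers:
    t, p = ttest_ind(apical_v1[layer], apical_hva[layer])  # two-tailed
    print(f"{layer}: t={t:.2f}, p={p:.2g}, "
          f"{'significant' if p < alpha else 'n.s.'}")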

Cluster analysis

Generation of synthetic data

To obtain synthetic data distributions close to the neuronal data, we first fit Gaussian mixture models (GMMs) with n ∈ {10, 20, 40} components and diagonal covariance matrices to the neuronal embeddings, extracting the means and weights of the fitted mixture components. Using these, we subsequently generated synthetic data from Gaussian mixtures with isotropic covariance matrices with increasing variances, spanning the space from distinctly separated clusters to continuous distributions (Fig. 3B & Supplementary Fig. 5). We used variances σ² ∈ {0.005, 0.01, 0.03, 0.05, 0.07, 0.1, 0.3, 0.5, 0.7, 1.0} for each number of components n ∈ {10, 20, 40}, resulting in 30 synthetic datasets. For each Gaussian mixture, we drew 32,571 samples, matching the number of analyzed excitatory neurons. Samples were 32-dimensional, like the morphological embeddings.
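
A sketch of this generation procedure, assuming scikit-learn for the GMM fit (the placeholder array stands in for the neuronal embeddings):

import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
Z = rng.normal(size=(32_571, 32))  # placeholder for the neuronal embeddings

synthetic = {}
for n_comp in (10, 20, 40):
    # fit a diagonal-covariance GMM to obtain realistic means and weights
    gmm = GaussianMixture(n_components=n_comp, covariance_type="diag",
                          random_state=0).fit(Z)
    for var in (0.005, 0.01, 0.03, 0.05, 0.07, 0.1, 0.3, 0.5, 0.7, 1.0):
        # sample from an isotropic mixture with the fitted means and weights
        comp = rng.choice(n_comp, size=len(Z), p=gmm.weights_)
        samples = gmm.means_[comp] + rng.normal(scale=np.sqrt(var),
                                                size=(len(Z), 32))
        synthetic[(n_comp, var)] = samples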

ARI analysis

To judge whether the correct number of clusters can be recovered, we split the data (both the synthetic datasets and the neuronal data) into training and validation sets (90%–10% split). For each synthetic dataset and the neuronal data, we fit 100 GMMs with isotropic covariance matrices for each number of components k ∈ {7, 10, 15, 20, 40, 60, 80}. We then computed the pairwise adjusted Rand index (ARI) between clustering runs with the same number of components and report the average ARI on the validation set (Fig. 3B & Supplementary Fig. 5). All visualizations show the clustering run with the best log-likelihood score on the validation set (Fig. 3).
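
A sketch of this stability analysis for one number of components, assuming scikit-learn (with fewer fits than the 100 used above, to keep the example fast):

import numpy as np
from itertools import combinations
from sklearn.metrics import adjusted_rand_score
from sklearn.mixture import GaussianMixture
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 32))  # placeholder embeddings
X_tr, X_val = train_test_split(X, test_size=0.1, random_state=0)

# fit several GMMs with different seeds ("spherical" = isotropic covariance)
labels = [GaussianMixture(n_components=20, covariance_type="spherical",
                          random_state=s).fit(X_tr).predict(X_val)
          for s in range(10)]

# mean pairwise ARI of the validation-set cluster assignments
ari = np.mean([adjusted_rand_score(a, b)
               for a, b in combinations(labels, 2)])
print("mean pairwise ARI:", ari)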

Unimodality versus bimodality of neighboring clusters

To examine whether two neighboring clusters (neighboring in terms of smallest Euclidean distance between cluster means) form a unimodal or bimodal distribution, we first projected the samples of the two clusters onto the line connecting the two cluster means. We then visualized the 1d histogram as well as the cumulative distribution function (CDF) of the samples from both clusters. Additionally, we computed the dip statistic23 to quantify how close two neighboring clusters are to forming a unimodal distribution. The dip statistic was computed using the Python package diptest (https://pypi.org/project/diptest/). We scaled the dip statistic by a factor of 4 such that the extreme case of two delta distributions at x_i and x_j with i ≠ j results in dip = 1. As exemplified by the synthetic data, when neighboring clusters evolve from discrete clusters towards a continuum, the dip statistic decreases and the CDF becomes a smooth curve (Fig. 3B, grey insets 1–6).
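
A sketch of the projection and dip computation using the diptest package named above (the function and variable names are ours):

import numpy as np
import diptest

def scaled_dip(samples_a, samples_b, mean_a, mean_b):
    # project the samples of both clusters onto the line connecting the means
    direction = (mean_b - mean_a) / np.linalg.norm(mean_b - mean_a)
    proj = np.concatenate([samples_a, samples_b]) @ direction
    dip, _ = diptest.diptest(proj)  # Hartigan's dip statistic and p-value
    return 4 * dip                  # scaled so two delta peaks give dip = 1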

Connectivity graph

For each cluster of the 20-component Gaussian mixture model of the neuronal data, we computed the dip statistic to its three nearest neighbors based on Euclidean distance in the 32-dimensional embedding space. We thresholded the neighbor selection at the average distance of all clusters to their third-nearest neighbor, to avoid including spurious connections between clusters that do not have any close neighbors (threshold = 2.38 in latent-space Euclidean distance). The line width in the graph (Fig. 3F) was determined as the inverse dip statistic between the nearest neighbors. Additionally, we computed the maximum dip statistic between all clusters and their nearest neighbor for the neuronal data and the synthetic datasets (Fig. 3G).
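
A sketch of the neighbor selection, assuming scipy; the placeholder array stands in for the 20 cluster means:

import numpy as np
from scipy.spatial.distance import cdist

means = np.random.default_rng(0).normal(size=(20, 32))  # placeholder means

D = cdist(means, means)            # pairwise Euclidean distances
np.fill_diagonal(D, np.inf)
nn = np.argsort(D, axis=1)[:, :3]  # three nearest neighbors per cluster
threshold = np.sort(D, axis=1)[:, 2].mean()  # avg 3rd-NN distance (~2.38)
edges = [(i, j) for i in range(len(means)) for j in nn[i]
         if D[i, j] <= threshold]  # cluster pairs kept for the dip statistic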

Prediction of morphological features from functional bar codes

The MICrONS dataset encompasses EM images as well as calcium imaging of the same portion of the visual cortex of one mouse16. The companion paper by Wang et al.26 created a digital twin of the functional properties of the neurons from the calcium imaging data (Fig. 7C). We used the resulting functional embeddings of the neurons as input features to a linear regression model predicting the basal bias metric of layer 4 neurons, i.e., predicting a morphological feature from the neurons’ functional properties. There are 2347 L4 neurons in V1 with both functional and morphological data available. We performed nested cross-validation to select hyperparameters and report test-set performance, using 10-fold cross-validation for both the inner and the outer loop. Hyperparameters were selected via grid search over the regularization strength α ∈ {0.01, 0.1, 0.5, 1, 5, 10} and the L1-to-L2 ratio ∈ {0, 0.25, 0.5, 0.75, 1.0}. The best model had an R²-score of 0.17, and ground-truth and predicted basal bias had a Pearson correlation of 0.41 (Fig. 7D, p < 10⁻¹⁰). To control for soma depth as a confounder, we repeated the analysis predicting the basal bias from soma depth alone, as well as from the functional embeddings in addition to soma depth, resulting in R² = 0.28 for both predictors vs. 0.21 for depth only (ρ = 0.53, p < 10⁻¹⁰ and ρ = 0.46, p < 10⁻¹⁰, respectively). We tested the difference between the two correlation coefficients using a two-tailed Fisher’s z-test, finding a significant difference (p = 0.0015).
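
A sketch of the nested cross-validation, assuming scikit-learn’s ElasticNet (placeholder data; the dimensionality of the functional embeddings is our assumption):

import numpy as np
from sklearn.linear_model import ElasticNet
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score

rng = np.random.default_rng(0)
F = rng.normal(size=(2347, 64))  # placeholder functional embeddings
y = rng.normal(size=2347)        # placeholder basal bias values

# inner 10-fold loop selects hyperparameters; outer 10-fold loop
# estimates test-set performance (default regression scoring is R²)
grid = {"alpha": [0.01, 0.1, 0.5, 1, 5, 10],
        "l1_ratio": [0, 0.25, 0.5, 0.75, 1.0]}
inner = GridSearchCV(ElasticNet(max_iter=10_000), grid,
                     cv=KFold(10, shuffle=True, random_state=0))
scores = cross_val_score(inner, F, y,
                         cv=KFold(10, shuffle=True, random_state=1))
print("mean test R²:", scores.mean())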

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.