#DataAnalysis - kbin.earth

FabMusacchio , 3 days ago to random

Just figured out that #neuroscience has its own version of Moore's law: The number of simultaneously recorded #neurons doubles every ~7 years. This scaling has profound implications for #DataAnalysis and #modeling in #ComputationalNeuroscience. In this post, I review Stevenson & Kording's 2011 paper and reflect on its relevance today:

🌍 https://www.fabriziomusacchio.com/blog/2026-02-05-moores_law_for_neural_recordings/

#CompNeuro
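As a rough illustration of the doubling claim above: if the number of simultaneously recorded neurons doubles every ~7 years, the count after t years is N(t) = N0 * 2^(t / 7). The sketch below only works through that arithmetic. The function name and the baseline of 100 neurons are hypothetical, chosen for illustration, and are not taken from Stevenson & Kording's paper.

```python
def projected_neurons(n0: float, years_elapsed: float, doubling_time: float = 7.0) -> float:
    """Exponential growth with a fixed doubling time: N(t) = N0 * 2 ** (t / T)."""
    return n0 * 2 ** (years_elapsed / doubling_time)


def annual_growth_factor(doubling_time: float = 7.0) -> float:
    """Year-over-year multiplier implied by the doubling time."""
    return 2 ** (1.0 / doubling_time)


if __name__ == "__main__":
    # Hypothetical baseline of 100 neurons, for illustration only.
    for years in (0, 7, 14, 21):
        print(years, projected_neurons(100, years))
    print("growth per year:", round(annual_growth_factor(), 3))
```

A 7-year doubling time corresponds to roughly 10% growth per year, which is much slower than the ~2-year doubling usually quoted for Moore's law in transistor counts.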

Figure 3 (panel a) from Alessio P. Buccino et al. (2025) shows the output of the visualization and quality control stage in a modern large-scale spike sorting pipeline. The panel displays an interactive view of raw and preprocessed electrophysiological data recorded with Neuropixels 2.0 probes, illustrating the simultaneous acquisition of activity from hundreds to thousands of channels. The visualization highlights the dense spatiotemporal structure of the recorded signals and the necessity of scalable preprocessing, inspection, and quality control before spike sorting and downstream analysis. The figure exemplifies the practical data volumes and organizational challenges that accompany contemporary high-density neural recordings. Stevenson and Kording predicted such developments over a decade ago by noting the exponential growth in simultaneously recorded neurons. Source: Buccino et al., Efficient and reproducible pipelines for spike sorting large-scale electrophysiology data, 2025, bioRxiv 2025.11.12.687966, doi: 10.1101/2025.11.12.687966 (license: CC BY 4.0)
Figure 2 (panels a–i) from Marius Pachitariu et al. (2024) shows graph-based clustering strategies used in Kilosort4 to structure large-scale spike datasets. The figure illustrates how dense, high-dimensional spike features are iteratively reassigned and merged to obtain stable clusters from large neural populations. Panel a sketches the neighbor-based reassignment process that progressively reduces an initially large set of clusters. Panel b shows an example clustering overlaid on a t-SNE embedding of spike features. Panel c presents the hierarchical merging tree used to decide which clusters should be combined based on a modularity cost. Panel d summarizes the criteria for accepting or rejecting merges, combining feature-space bimodality with refractory-period constraints derived from spike timing. Panels e and f show the final clustering result, highlighting units that exhibit refractory periods. Panels g and h characterize the resulting units using average waveforms, autocorrelograms, cross-correlograms, and regression projections. Panel i visualizes the spatial distribution of clustered spikes along the probe. Together, the figure exemplifies how modern spike sorting algorithms impose structure on massive datasets by combining graph methods, statistical criteria, and biophysical constraints. Source: Pachitariu et al., Spike sorting with Kilosort4, 2024, Nature Methods, 914–921, DOI: 10.1038/s41592-024-02232-7 (license: CC BY 4.0)


+ Binder +1 More

h4ckernews Bot , 23 days ago to random

Challenges in Join Optimization

https://www.starrocks.io/blog/inside-starrocks-why-joins-are-faster-than-youd-expect

#HackerNews #JoinOptimization #Challenges #DataAnalysis #SQLPerformance #StarRocks

h4ckernews Bot , 27 days ago to random

Why There's No Single Best Way to Store Information

https://www.quantamagazine.org/why-theres-no-single-best-way-to-store-information-20260116/

#HackerNews #informationstorage #dataanalysis #techinsights #knowledgeorganization #dataarchitecture

LabPlot , 1 month ago to Open Source

Did you know that #LabPlot (#free, #OpenSource) includes a built-in #library of #DataAnalysis and #DataVisualization example #projects?

Each project is categorized by type, so you can quickly find what you need.

@labplot
Open Source

Try it now:
1️⃣ Download #LabPlot: https://labplot.org/download.
2️⃣ File > Open Example.

#FreeSoftware #OpenSource #FOSS #DataViz #Data #DataScience #Statistics #Physics #Chemistry #Math #Science #Research #Engineering #OriginPro #Graphpad #Sigmaplot #Alternative

Image: Example projects available in LabPlot

+ kde

h4ckernews Bot , 1 month ago to random

65% of Hacker News Posts Have Negative Sentiment, and They Outperform

https://philippdubach.com/standalone/hn-sentiment/

#HackerNews #HackerNews #NegativeSentiment #Outperforming #DataAnalysis #CommunityTrends #65PercentInsights

h4ckernews Bot , 1 month ago to random

Mapping Protests in Iran

https://www.fdd.org/analysis/2025/06/25/mapping-the-protests-in-iran-2/

#HackerNews #MappingProtests #Iran #Protests #SocialJustice #DataAnalysis #FreedomOfSpeech

h4ckernews Bot , 1 month ago to random

Why does a least squares fit appear to have a bias when applied to simple data?

https://stats.stackexchange.com/questions/674129/why-does-a-linear-least-squares-fit-appear-to-have-a-bias-when-applied-to-simple

#HackerNews #leastSquaresFit #bias #simpleData #statistics #dataAnalysis

LabPlot , 1 month ago to random

#LabPlot is a #free, #OpenSource tool designed for scientific #DataVisualization and #DataAnalysis, perfect for #researchers and #engineers. :boost_love:

One of its standout features is #Maxima (but also #Python, #R etc.) integration—you can create #notebooks that combine text, #LaTeX, Maxima commands, and plots, making it easy to produce scientific documents with live calculations and results.

Image source:
https://maxima-french-doc.fr/interfaces/
#FLOSS #FOSS #Science #Math #Physics #Chemistry #Data #STEM

Image: LabPlot+Maxima

Image: LabPlot+Maxima

+ Binder

h4ckernews Bot , 1 month ago to random

22 GB of hacker news in SQLite

https://hackerbook.dosaygo.com

#HackerNews #HackerNews #SQLite #DataAnalysis #DataStorage #TechNews #22GBInsights

h4ckernews Bot , 1 month ago to random

What 4M posts reveal about going viral on Hacker News

https://hn-ph.vercel.app

#HackerNews #goingviral #HackerNews #socialmedia #trends #virality #dataanalysis

rook , 1 month ago to random

Hi, I’m Rook (previously known as DarkFoxDK). I’m a 30-something nerdy #trans vixen.

I’ve been doing #WebDevelopment, #programming, and #database stuff since I was a kit, and I'm still loving it.

Currently, I work with complex #automation and #robotics systems in #HealthCare, doing a mix of #UserSupport, #SystemAdmin, and #DataAnalysis.

At home, I’m constantly tinkering with #SmartHome projects using #HomeAssistant, building my own #electronics, and battling my #3DPrinting hardware.

I also enjoy running around as a big, silly #fox in the #Furry fandom. #Fursuiter

I’m #ActuallyAutistic and #ADHD / #AuDHD. I’m a proud #transgender member of the #LGBTQIA #queer community.

#introduction

+ welcome

h4ckernews Bot , 1 month ago to random

Notes on Sorted Data

https://amit.prasad.me/blog/sorted-data

#HackerNews #Notes #on #Sorted #Data #DataSorting #TechInsights #DataAnalysis #HackerNews

h4ckernews Bot , 2 months ago to random

We collected 10k hours of neuro-language data in our basement

https://condu.it/thought/10k-hours

#HackerNews #neurodata #languagelearning #basementresearch #dataanalysis

h4ckernews Bot , 2 months ago to random

Super fast aggregations in PostgreSQL 19

https://www.cybertec-postgresql.com/en/super-fast-aggregations-in-postgresql-19/

#HackerNews #SuperFastAggregations #PostgreSQL19 #DatabasePerformance #DataAnalysis #TechNews

h4ckernews Bot , 2 months ago to random

Python Data Science Handbook

https://jakevdp.github.io/PythonDataScienceHandbook/

#HackerNews #Python #Data #Science #Handbook #DataScience #Python #Programming #MachineLearning #DataAnalysis

LabPlot , 2 months ago to Open Source

We have implemented #TimeSeries #SeasonalDecomposition using STL and MSTL. :boost_love:

➡️ #STL breaks down data into trend, single #seasonality, and noise.

➡️ #MSTL extends this to extract multiple seasonal patterns iteratively. These methods improve analysis and #forecasting for complex seasonal data.

@labplot
Open Source

#GSOC #OpenSource #FOSS
#FLOSS #KDE #DataAnalysis #DataViz #TimeSeries

Image: STL and MSTL methods in LabPlot

+ kde

h4ckernews Bot , 2 months ago to random

RL is more information inefficient than you thought

https://www.dwarkesh.com/p/bits-per-sample

#HackerNews #RL #information #inefficiency #AI #research #machinelearning #dataanalysis

h4ckernews Bot , 2 months ago to random

Is DWPD Still a Useful SSD Spec?

https://klarasystems.com/articles/is-dwpd-still-useful-ssd-spec/

#HackerNews #SSD #Spec #DWPD #Technology #Storage #Innovation #DataAnalysis

h4ckernews Bot , 2 months ago to random

Measuring Latency (2015)

https://bravenewgeek.com/everything-you-know-about-latency-is-wrong/

#HackerNews #Measuring #Latency #Latency2015 #TechInsights #NetworkPerformance #DataAnalysis

h4ckernews Bot , 2 months ago to random

Fourier Transforms

https://www.continuummechanics.org/fourierxforms.html

#HackerNews #FourierTransforms #Mathematics #SignalProcessing #DataAnalysis #Waveforms

h4ckernews Bot , 3 months ago to random

Disassembling terabytes of random data with Zig and Capstone to prove a point

https://jstrieb.github.io/posts/random-instructions/

#HackerNews #Disassembling #terabytes #of #random #data #with #Zig #and #Capstone #to #prove #a #point

#Zig #Capstone #DataAnalysis #RandomData #Disassembly

darkfox , 3 months ago to random

Hi, I’m Rook (previously known as DarkFoxDK). I’m a 30-something nerdy #trans vixen.

I’ve been doing #WebDevelopment, #programming, and #database stuff since I was a kit, and I'm still loving it.

Currently, I work with complex #automation and #robotics systems in #HealthCare, doing a mix of #UserSupport, #SystemAdmin, and #DataAnalysis.

At home, I’m constantly tinkering with #SmartHome projects using #HomeAssistant, building my own #electronics, and battling my #3DPrinting hardware.

I also enjoy running around as a big, silly #fox in the #Furry fandom. #Fursuiter

I’m #ActuallyAutistic, #AuDHD, and #ADHD. I’m a proud #transgender member of the #LGBTQIA #queer community.

#introduction

+ welcome

h4ckernews Bot , 3 months ago to random

Developers are choosing older AI models, and the data explains why

https://www.augmentcode.com/blog/developers-are-choosing-older-ai-models-and-16b-tokens-of-data-explain-why

#HackerNews #Developers #AI #OlderModels #DataAnalysis #TechTrends

h4ckernews Bot , 3 months ago to random

ChatGPT shares data on how many users exhibit psychosis or suicidal thoughts

https://www.bbc.com/news/articles/c5yd90g0q43o

#HackerNews #ChatGPT #Psychosis #Users #MentalHealth #SuicidalThoughts #DataAnalysis

h4ckernews Bot , 3 months ago to random

Egg prices vs. Consumer Price Index since 1980

https://fred.stlouisfed.org/graph/?g=1Nm5b

#HackerNews #EggPrices #ConsumerPriceIndex #Inflation #DataAnalysis #EconomicTrends
