The data newsletter by @puntofisso.
Hello, regular readers and welcome new ones :) This is Quantum of Sollazzo, the newsletter about all things data. I am Giuseppe Sollazzo, or @puntofisso. I've been sending this newsletter since 2012 to be a summary of all the articles with or about data that captured my attention over the previous week. The newsletter is and will always (well, for as long as I can keep going!) be free, but you're welcome to become a friend via the links below.
The most clicked link last week was South China Morning Post's excellent graphical take on King Charles' coronation.
Some of you asked how easy to reach Wolverhampton is. It's just a 15-20 minutes ride from Birmingham New Street, which is one of the best connected train stations in the country. And the campus itself is a short walk from Wolverhampton station. So you have no excuses :-)
If you've never heard of Open Data Camp, it's a volunteer-run unconference for all things data and transparency (not just open data, but the name has stuck). I helped co-found it in 2014 and we've been running a few events since. It is well attended by data wranglers, advocates, public sector data officials (including, once, a Permanent Secretary!), and data journalists. The message is: if you are interested in any aspect of data, this is the event for you. We had 120 attendees in Belfast. Can you help us beat that record?
Last but not least, Open UK has launched their State of Open: The UK in 2023 Survey, which will feed into their State of Open: The UK in 2023 report.
'till next week,
You're receiving this email because you subscribed to Quantum of Sollazzo, a weekly newsletter covering all things data, written by Giuseppe Sollazzo (@puntofisso). If you have a product or service to promote and want to support this newsletter, you can sponsor an issue.
Quite a lot in this article by Reuters, including some music analysis.
"Some societies in the former Soviet bloc managed to build functioning democracies, but others didn't; why it happened has lessons for the West too".
Francesco Piccinelli's DaNumbers returns with a look at politics, which includes the intriguing chart below.
One of those great interactives by the Washington Post, using occupational exposure data by Edward W. Felten, Princeton University; Manav Raj, University of Pennsylvania; and Robert Seamans, New York University, “Occupational Heterogeneity in Exposure to Generative AI.” Salary data is from the “2022 Occupational Employment and Wage Statistics” by the U.S. Bureau of Labor Statistics.
"Do you understand their true scale?"
Featuring an excellent use of scrollytelling.
"As plastic waste increases in Slovenia, the European Union, and the world, so do the difficulties in disposing of and recycling it."
The first of a 4-part data-driven investigation with the collaboration of the European Data Journalism Network. More detail about the investigation, including links to data and methodology, can be found here.
Good Tools. 100% free, hard-to-find, zero-cost digital tools that offer time and money savings to indie entrepreneurs.
A browser-based library that claims to be able to visualize up to 1 million nodes, more than a million edges, "with a reasonable trade-off in force graph simulation accuracy".
"An Interactive Visualization Interface for Multidimensional Datasets."
In a sense, similar to Cosmograph, but for more complex data.
Great music tutorial by Ableton. Not quite data, but it might be useful to the data sonifiers among you.
"I demonstrate a widely applicable pattern which integrates Prolog as a critical component in a data science analysis. Analytic methods are used to generate properties about the data under study and Prolog is used to reason about the data via the generated properties. The post includes some examples of piece-wise regression on timeseries data by symbolic reasoning. I also discuss the general pattern of application a bit."
"Simulated Hospital is a tool that generates realistic and configurable hospital patient data in HL7v2 format."
"This book aims to lift the curtain on reproducible finance by providing a fully transparent code base for many common financial applications."
A pdf on archive.org of a 1980s Russian book by Lev Tarasov.
"This text is divided into two major parts. The aim of the first part is to convince the reader that the random world begins directly in his or her own living room because, in fact, all modern life is based on probability. The first part is on the concept of probability and considers making decisions in conflict situations, optimizing queues, games, and the control of various processes, and doing random searches. The second part of this text shows how fundamental chance is in nature using the probabilistic laws of modern physics and biology as examples. Elements of quantum mechanics are also involved, and this allows the author to demonstrate how probabilistic laws are basic to microscopic phenomena. The idea is that the reader, passing from the first part of the book to the second one, would see that probability is not only around us but it is at the basis of everything."
This is more an invite to research and visualize than anything :-) But it reminded me that aged about 10 I had made an agreement with my dad on where I could go on my own or with friends, walking or cycling, which included a map of my hometown and nearby areas. It was definitely ok to get to the nearest town (circa 10km).
Datawrapper's Rose Mintzer-Sweeney: "Instead of traps and poisons, the squirrels of New York get an adorable citizen-science project: the Squirrel Census."
Rose highlights how to create a custom choropleth in Datawrapper, shaped after and overlayed onto a map of Central Park.
Alasdair Rae (profile in Quantum issue #461) looks at how many people live within the boundaries of US Internst, using MapTiler and QGIS.
"The Eras Tour encompasses 44 songs across Taylor Swift's 10 albums over a three-hour set."
"DiffusionDB is the first large-scale text-to-image prompt dataset. It contains 14 million images generated by Stable Diffusion using prompts and hyperparameters specified by real users."
quantum of sollazzo is supported by ProofRed's excellent proofreading. If you need high-quality copy editing or proofreading, head to http://proofred.co.uk. Oh, they also make really good explainer videos.
[*] this is for all $5+/months Github sponsors. If you are one of those and don't appear here, please e-mail me