The data newsletter by @puntofisso.
Hello, regular readers and welcome new ones :) This is Quantum of Sollazzo, the newsletter about all things data. I am Giuseppe Sollazzo, or @puntofisso. I’ve been sending this newsletter since 2012 to be a summary of all the articles with or about data that captured my attention over the previous week. The newsletter is and will always (well, for as long as I can keep going!) be free, but you’re welcome to become a friend via the links below.
I managed to survive my road-trip of Sicily, despite not one but two car breakdowns in 24 hours. That didn’t take away from the adventure, and I now know how to replace a tyre, skill I truly hope I won’t need ever again.
The most clicked link last week was MoveMap. A few of you asked whether we could do something like this for the UK. I suspect that the area is too small for some of the differences to be meaningful, but maybe at a European level? Not sure.
That also prompts me to ask a question: is there any other way you’d like to engage with one another? I tried setting up a subreddit for Quantum of Sollazzo, but it never caught on. Any suggestions would be welcome, as are general ideas on how to make the newsletter better.
Last week I attended Journocoders, one of my favourite meetups. We had a great session on how to use Sentinel-2 images and I managed to animate a visualisation of the Etna eruption of 2022 using false colours.
‘till next week,
You're receiving this email because you subscribed to Quantum of Sollazzo, a weekly newsletter covering all things data, written by Giuseppe Sollazzo (@puntofisso). If you have a product or service to promote and want to support this newsletter, you can sponsor an issue.
mySociety’s Alex Parsons has just published a blog post about climate/local gov data. The list of available data is pretty impressive and they are keen to chat to anyone who might find their data useful, or who would like to use it and don’t know how.
“FiveThirtyEight drew on news reports, debate footage, campaign materials and social media and reached out to every single Republican nominee for the House, Senate, governor, secretary of state and attorney general to determine their position on the 2020 election.“
A wealth of different data powers this incredible website by ITS Leeds academic Dr Malcolm Morgan. Carbon.place is “a free tool which estimates the per-person carbon footprint for every Lower Super Output Area (LSOA) in England.“
“Worldwide tree cover has grown by 2.24 million square kilometers — the size of Texas and Alaska combined — in the last 35 years, according to a paper in the science journal “Nature.”“
Some good news (hopefully).
A good thread by @depthsofwiki.
(via Lucilla Piccari)
Splink is a open source library from the Ministry of Justice for probabilistic record linkage at scale which is gaining adoption in UK Government and beyond(https://t.co/itJn3qPqiM). This is a Python library for fast, accurate, and scalable probabilistic data linkage using a variety of SQL backends. Splink has had 3m+ downloads and users in OGDs, academia and the private sector.
Great set of resources by the Society of Professional Journalists’ toolbox.
A handy tool to play with chart creation online using Pandas, Matplotlib, Seaborn, and Plotly Express.
“A missing data pattern is the structure of observed and missing values in a data set. This is not to be confused with a missing data mechanism, which describes possible relationships between data and one’s propensity for missing values. Patterns describe where the gaps in the data are, whereas mechanisms explain why the values are missing.“
“Seamlessly visualize your JSON data instantly into graphs.“
A guide on how to create BBC style graphics in R.
Check out PostApex
PostApex is the premium email advertising platform that lets you reach over 100 million readers across the world. You can discover email.
“And no (public) resources for working with the data yet”. A lot has been said about this release of several hundreds of TeraBytes of price data – a great resource, or obfuscation-by-overwhelm? – and this blog post has a good set of tech thoughts.
Very interesting report by charity data legends Data Orchard.
“A knowledge graph about queer history“ by Katharina Brunner.
Not sure how I missed this but it’s pretty cool for those who wish to travel by train through Europe.
(via Nicola del Monaco)
“Explore the Marvel Cinematic Universe like never before!“.
“Each main character and actor is connected to the movies they appear in, allowing you to see the connections between your favorite Marvel heroes (or villains). Of course, if you haven’t seen the movie, spoiler ahead!“
CC-BY visuals by Professor Ed Hawkins, including the famous “Global Warming Stripes”.
A summary of animated dataviz that designer Krisztina Szucs created for UEFA Euro 2020, with many interesting ideas discussed.
You might have seen the series of tweets and articles about Loab, the creepy face that keeps appearing in images produced by an AI model, which also seems to “infect” any image that gets mixed with it.
I found this visualization by Prof James Teo rather good at understanding conceptually what’s happening: “Here is an example of a UMAP of a latent space. If there is a cluster apart from the main cluster of pretty art, then a negative prompt from most embeddings in the main cluster will decode to embeddings in the purple cluster.”
“Using open cameras and AI to find how an Instagram photo is taken.“. I won’t ask if this is legal…
Song Exploder is one of my favourite non-tech podcasts. In each episode, they pretty much eviscerate a song. Episode 239 interviews Madonna and producer Stuart Price on their famous hit “Hung up”, which I had always considered a blasphemous appropriation of ABBA… but there’s actually quite a lot to it :)
quantum of sollazzo is supported by ProofRed’s excellent proofreading. If you need high-quality copy editing or proofreading, head to http://proofred.co.uk. Oh, they also make really good explainer videos.
[*] this is for all $5+/months Github sponsors. If you are one of those and don’t appear here, please e-mail me