The data newsletter by @puntofisso.
Hello, regular readers and welcome new ones :) This is Quantum of Sollazzo, the newsletter about all things data. I am Giuseppe Sollazzo, or @puntofisso. I’ve been sending this newsletter since 2012 to be a summary of all the articles with or about data that captured my attention over the previous week. The newsletter is and will always (well, for as long as I can keep going!) be free, but you’re welcome to become a friend via the links below.
The most clicked link last week was the Washington Post’s look at what food is better for the planet.
If you’re reading this: what would you like me to create next? Just reply back :)
‘till next week,
You're receiving this email because you subscribed to Quantum of Sollazzo, a weekly newsletter covering all things data, written by Giuseppe Sollazzo (@puntofisso). If you have a product or service to promote and want to support this newsletter, you can sponsor an issue.
“A nonpartisan, data-driven snapshot by USA Facts”.
“James, 38, keeps scoring as if he is still in his 20s. This season is tied for his third-best mark (30.2 points per game), trailing only the 30.3 points he averaged last season and his 31.4 in 2005-06.“
“In Senegal, rising seas have led to devastating coastal erosion. If there is a war against climate change, the UNESCO World Heritage city of Saint-Louis is on the front lines. And the ocean is winning.“
Brilliant inclusion of a highly-technical illustration in the middle of a great story. Well done NPR team.
A searchable archive of “documented incidents involving potential war crimes in Ukraine” by PBS.
“The second edition of The State of Data Journalism Survey results is now public! Explore the insights from the largest data journalism survey of the year.“
Mostly nicely visualized with Flourish.
Quite a lot of interesting features in matplotlib’s newest release, including error bars on polar plots (themselves drawn in polar coordinates),
Something to watch, as it hasn’t launched yet.
“Blocktype is an online tool, that gives you a fast and accurate way of evaluating a sites development potential, without needing to hire an architect or a planner. “
Euan Mills, one of the co-founder, tweets: “We’ve added some perimeter blocks to the @BlocktypeApp library, and it’s pretty fun to play around with. Apparently, if we replaced Palace of Westminster with #housing at only 6 storeys we could house 2,200 people, and provide over 17,000 sqm of open space!! “
Source code for a “knowledge-graph based chatbot using GPT3 and Neo4j”.
Data engineer Tomaz Bratanic says on LinkedIn: “a repository that constructs a knowledge graph based chatbot using GPT3 and Neo4j that can search and summarize the information in news articles.“
Watch out for a blog post.
“Have you ever found yourself in a situation where you were trying to analyze a complex and highly correlated data set and felt overwhelmed by the amount of information? This is where Independent Component Analysis (ICA) comes in. ICA is a powerful technique in the field of data analysis that allows you to separate and identify the underlying independent sources in a multivariate data set.“
A useful tutorial on getting started with Tableau and Google Sheets.
Published as part of the Knowing Machines research projects, this is an… oddly formatted web page with a lot of interesting content.
A collection of Python packages related to sports analytics and visualizations. (Is chess a sport?)
Basically, an end-to-end practical look at what QGIS is for.
Anyone can start speaking a new language in just three weeks with Babbel
If you’ve always wanted to learn a new language, Babbel will be the most productive 10 minutes of your day. Trusted by over 10 million subscribers worldwide, the subscription-based language learning platform can get you confidently conversing in a new tongue in just three weeks. This isn’t your grade school textbook: the bite-sized lessons, available in 14 languages, teach you localized vocabulary you’ll actually use in the real world. Plus, speech recognition technology helps you perfect your accent. Sign up today and get up to 60 percent off your subscription.
“For more than a decade now, the fact that people have a hard time gaining actionable insights from their data has been blamed on its size. “Your data is too big for your puny systems,” was the diagnosis, and the cure was to buy some new fancy technology that can handle massive scale. Of course, after the Big Data task force purchased all new tooling and migrated from Legacy systems, people found that they still were having trouble making sense of their data. They also may have noticed, if they were really paying attention, that data size wasn’t really the problem at all.“
“ESRI Shapefile is a file format for storing geospatial vector data. It has been around since the early 1990s and is still the most commonly used vector data exchange format.
While Shapefiles have enabled many successful activities over the years, they also have a number of limitations that complicate software development and reduce efficiency.
We, members of the geospatial IT industry, believe that it is time to stop using Shapefiles as the primary vector data exchange format and to replace them with a format that takes advantage of the huge advances that have been made since Shapefile was introduced.“
The shortlist for this recent competition has been published.
Quite a bit of downloadable data in this interactive site.
“This crazy, beautiful chart illustrates the incredible complexity of managing one of our nation’s most crucial – and invisible – national assets: the radio spectrum.“
“OpenAI’s chatbot offers paraphrases, whereas Google offers quotes. Which do we prefer?“
quantum of sollazzo is supported by ProofRed’s excellent proofreading. If you need high-quality copy editing or proofreading, head to http://proofred.co.uk. Oh, they also make really good explainer videos.
[*] this is for all $5+/months Github sponsors. If you are one of those and don’t appear here, please e-mail me